-
gingeropolous
anyone getting one of these RISCV miners?
-
plowsof
They "sold out"(tm) in a few seconds
-
kico
you mean in bitmains website?
-
kico
-
kico
sigh 3,299.87
-
nioc
They don't say how loud it is :D
-
nioc
Also needs 220v service
-
nioc
200-240v
-
m-relay
<hbs:matrix.org> I am surprised by the weight
-
sech1
plowsof so GPU scalpers jumped on this too?
-
kico
iBet these "resellers" just bought the whole batch
-
hyc
that'd be most likely
-
hyc
hell they may have been preordered, which is why it "sold out in seconds"
-
sech1
so Bitmain's price has never been the real price
-
tevador
When are they shipping? Would be interesting if someone could post some pics of the board without heatsinks.
-
kico
word wish I could get one just to tear it apart :P
-
kico
tevador, mineshop.eu says "In stock shipping in 7-10 days"
-
kico
"Available Sept 10 – 30th sept 2023"
-
kico
but god knows they might be taking pre orders to then order or smth
-
sech1
"naked board without heatsinks" lol, sounds like porn
-
sech1
someone will eventually publish the pictures, with all part numbers
-
tevador
It's probably something based on XuanTie C910.
-
sech1
probably, but it has to be heavily customized for RandomX
-
sech1
XuanTie C910 doesn't have crypto instructions (?)
-
tevador
I don't think it would be worth it to design a custom chip just for Monero. It's most likely a stock C910 with 4 cores/8 MB of cache per cluster.
-
tevador
At most they could have added a coprocessor for the scratchpad expansion, but most likely they just dedicated a few cores for that.
-
sech1
They probably have multiple chips on the board. My nonce analysis showed 52 mining threads in each of the 10 horizontal lines, so around 520 mining cores per device
-
sech1
so probably 16x4 core chips, but only 13 are active
-
sech1
These chips are TSMC 12nm by the way
-
hyc
12nm? how do you know? I think even my car radio's ARM chip is on 8nm
-
sech1
-
hyc
cool thanks
-
m-relay
<xfedex:matrix.org> 12 nm is really a lot. I guess that if RISC-V CPUs are underdeveloped.
-
hyc
configs up to 8MB L2 cache. that'd be what they'd need for 4 cores
-
hyc
any idea what the XIE instruction extensions do?
-
hyc
there is also a C920 64-core chip, same instruction features
-
tevador
-
hyc
yes, I already saw that
T-head-Semi/openc910 #17
-
hyc
while looking for the XIE docs.
-
hyc
but doesn't seem like XIE has anything to help crypto
-
sech1
If they really use it "as is", the only way to nerf it is to have moooore AES :D
-
sech1
but AES is already like 5-8% of hash time on Ryzen
-
tevador
AES is too ASIC friendly
-
tevador
AFAIK there are no chips that support the crypto extension yet
-
sech1
-
sech1
because next come steps 1-2 with 30-50 clock cycles wait for read from L3
-
sech1
which is not used by anything yet
-
hyc
C series instruction manual:
-
hyc
-
sech1
for example, registers f0-f3 could go through a few AES round before writing them back to scratchpad
-
sech1
just generating ideas now
-
sech1
30-50 clock cycles window means we could have 8 AES rounds done on 4 registers (32 instructions in total) on each loop iteration
-
sech1
Assuming AES latency is 4 cycles
-
sech1
hyc yep, I don't see any crypto instructions
-
sech1
32 AES instructions per loop => 524288 additional AES instructions per hash
-
sech1
Right now one RandomX hash does 262144 AES instructions
-
sech1
so it's 3x increase on AES side, almost for free
-
sech1
I like it
-
tevador
We'd have to benchmark the impact on light mode perf.
-
sech1
Why light mode? Step 11 is the same in light mode
-
sech1
"The values of registers f0-f3 are written to the Scratchpad (L3) at address spAddr0 (64-byte aligned)."
-
sech1
This is where I suggest to add AES
-
sech1
and the L3 read delay in steps 1-2 will also be the same
-
sech1
AES will interleave with this L3 read delay
-
tevador
If it fits in L3 cache delay, it's probably fine.
-
sech1
also, it will improve scratchpad entropy
-
sech1
because f0-f3 are not totally random when we store them
-
tevador
The only complication is the soft AES mode, which would probably need to call a subroutine there. Currently we don't have any AES in VM code.
-
sech1
yes, but this subroutine would be just a bunch of AES calls, easy to write
-
sech1
soft AES becomes less and less relevant (for x64 and ARM)
-
sech1
newer RISC-V CPUs will also have it
-
tevador
For example, Raspberry Pi 4 doesn't have AES, so it would be impacted. But if the verification time doesn't increase by more than a few percent, it's probably fine.
-
sech1
IIRC soft AES mode spends around 30% of time calculating AES
-
sech1
so we could add 4 AES rounds per loop (16 instructions), doubling the total AES instruction count
-
sech1
which would mean ~50% of time for AES in soft AES mode
-
sech1
or 30% slowdown for CPUs without AES, if my math is right
-
sech1
"CPUs without AES" = "Bitmain professional miners without AES"
-
sech1
Raspberry Pi, I hate them. They cheaped out on crypto extensions. Literally all other SBCs have them.
-
sech1
also, RPi4 was released in 2019, and we're discussing tweaks for future CPUs, tweaks that will go live in 2024 (RPi4 will be 5 years old by then).
-
elucidator
they left it out for cost reasons tbh, not everybody gets to use free instruction sets :P
-
sech1
Yes, but RPi4 is more like an exception, not a rule
-
sech1
I like this AES tweak also because it improves scratchpad entropy
-
tevador
What is relevant is the impact on light mode verification time for low end hardware that might be running a Monero node. We don't want to increase the hardware requirements for running a node.
-
sech1
I have RPi 3b+ for testing
-
sech1
It already takes almost a second to verify a hash (light mode + soft AES mode)
-
sech1
It doesn't have enough RAM to run full dataset mode
-
sech1
Or my memory betrays me
-
sech1
I remember it could do something around 20 h/s when mining
-
sech1
so 5 h/s per core, or 200 ms to verify a hash
-
sech1
so with this AES tweak, it would be 30% more - 260 ms
-
elucidator
18:06:10 <@sech1> Yes, but RPi4 is more like an exception, not a rule => yeah ofc. i got all generations of rpis i can test with, rpi4 is my home node. can test it and report
-
elucidator
welp we don't have aarch xmrig handy, got it and testing now
-
elucidator
> miner speed 10s/60s/15m 102.5 n/a n/a H/s max 103.0 H/s
-
elucidator
qu.ax/bDmY.png well now it's worse than that, i think i should shutdown monerod
-
elucidator
yeah barely 90h/s
qu.ax/Dko.png
-
sech1
You need to enable huge pages
-
elucidator
tried with sudo, let me check other options
-
elucidator
i think this needs a kernel recompile
-
sech1
it should be more than 100 h/s with huge pages
-
gingeropolous
so riscv CPUs are unwanted in the monero mining ecosystem?