-
sech1
There is still sense to tweak RandomX to make it align better with modern and future CPUs
-
sech1
irrespective of the current development
-
tevador
I think that this "ASIC" by Bitmain proves that RandomX has achieved its goal.
-
m-relay
<k4r4b3y:karapara.net> Can you expand on that?
-
sech1
Antminer X3 (Cryptonight): 220 kh/s, 465W
-
sech1
Antminer X5 (RandomX): 212 kh/s, 1350W
-
sech1
Are they going backwards?
-
tevador
And you can get ~150 H/J with an ordinary CPU.
-
sech1
Yes, well tuned 7950X can get 150 H/J
-
sech1
they use weird "J/K"
-
sech1
which is joules per 1000 hashes?
-
sech1
150 h/J = 6.67 "J/K"
-
sech1
Server CPUs (64 core EPYCs) can achieve even higher efficiency
-
tevador
This ASIC only has a better space density. It also depends on how much they will charge for it.
-
sech1
My bet is they haven't decided on the price yet
-
sech1
They're making twitter poll now to see what people expect and how much they can milk out of it
-
sech1
they've been mining with these devices since December 2021, so they for sure returned at least part of their investment
-
tevador
Coincidentally, I was going to propose 2 new instructions that use the carry and overflow flag, which RISC-V does not have.
-
m-relay
<k4r4b3y:karapara.net> +1
-
sech1
I was thinking about another instruction which makes very good sense to be added in the next RandomX version
-
m-relay
<xfedex:matrix.org> Fun fact: In December 2021 there already were x86 CPUs more energy efficient than that ASIC
-
m-relay
<k4r4b3y:karapara.net> Should we favor against risc-v though? Some other anon had posed the same question here before.
-
tevador
I don't think we need to react to this ASIC at all. We can just implement the CFROUND optimization and the double hashing.
-
m-relay
<k4r4b3y:karapara.net> +1 on "no need to react yet" .
-
tevador
Since it's not actually an ASIC, it's a general pupose compute platform with custom firmware.
-
sech1
They have ~500 RISC-V cores there by my calculations
-
sech1
split into 10 boards
-
sech1
good density
-
sech1
yes, CFROUND has bugged me almost since the beginning when I started optimizing for real CPUs
-
sech1
no real programs use it that often
-
sech1
so it makes sense that this instruction is slow
-
hyc
jst as long as it's never possible for it to randomly get zero uses
-
hyc
otherwise we have to rethink whether the easy/hard program problem is an issue again
-
tevador
BTW, if anyone is wondering how x86 can still compete in efficiency against RISC designs, the answer is here in the third bullet point:
github.com/tevador/RandomX/blob/master/doc/design.md#22-program
-
tevador
hyc: 1/16 frequency for CFROUND should still be safe because 2048/16 = 128, i.e. rounding mode would change on average 96x per program (32x it would stay the same)
-
hyc
ISTR thre are other RISC families that don't have carry or overflow flags, but can't recall specifics right now
-
tevador
MIPS
-
hyc
ah yes
-
tevador
The carry/overflow instruction would be an option in case this ASIC was an actual ASIC implementing the RandomX ISA in silicon. In that case, it would be bricked as carry/overflow can't be emulated using the remaining instructions.
-
sech1
I just did a quick test on Ryzen 5 3600: 6498 h/s stock, 6979 h/s just by setting CFROUND frequency to 0 and increasing ISTORE frequency to 17 (from 16).
-
sech1
so 7.4% speedup just from not running CFROUND
-
hv-bridge
<\x> ah yeah, just an update on iot though
-
hv-bridge
<\x> new router/AP platforms now are 64 bit ARM mostly cortex A53 + aes with up to 2GB of RAM
-
hv-bridge
<\x> thats the wifi 6/6e stuff
-
hv-bridge
<\x> next gen will be like cortex a73 with up to 4GB ram
-
hv-bridge
<\x> wifi 7 stuff
-
hv-bridge
<\x> its like this for next gen
-
hv-bridge
-
tevador
2 GB of RAM is not enough to mine RandomX efficiently
-
hv-bridge
<\x> yup
-
hv-bridge
<\x> but on next gen it likely will be
-
hv-bridge
<\x> pretty common to see aarch64 everywhere on iot hits nowadays, yes people still leave their passwords default on ssh
-
tevador
I don't see a problem if IoT botnets start helping to secure the network, unless they can reach 50%, which seems unlikely
-
hv-bridge
<\x> for those its just a ram tweak yeah
-
hv-bridge
<\x> just ask for more ram
-
hv-bridge
<\x> and youll lose most iot
-
tevador
Adding more RAM is not an easy tweak, see the design docs.
-
hv-bridge
<\x> mainstream computers are also heading towards having more ram
-
hv-bridge
<\x> its like the norm for ddr5 is uhh 2x 16G setups
-
tevador
yeah but we still want to be able to verify hashes on a Raspberry PI with 512 MB
-
hyc
I was thinking of writing another VM for mining in 2GB of RAM
-
hyc
just keep the expanded dataset for 1GB, and use the cache for the other half
-
sech1
I remember that the slowdown from the other half is still too bad
-
hyc
I guess it would be. but still an improvement over lightmode