-
DataHoarder
hmm sech1, but each one currently does get different initialization entropy
-
DataHoarder
and each slot does get xor'd along the way with register data, which means you still have to store 3 values total across iterations
-
eureka
speaking of zen-c, is there any major performance difference w/ RX on zen-c cores? after accounting for clock speed differences, that is
-
eureka
clocks are lower and caches are smaller, but possibly cache latency?
-
eureka
on 4/4c I see it's ~5ghz/3.7ghz so definitely some raw hashrate difference, but IPC is similar no?
-
DataHoarder
afaik they left the other stuff equal in zen5c besides lowering l3
-
DataHoarder
it's not similar to intel efficiency cores that change implementation
-
DataHoarder
chipsandcheese.com/p/testing-amds-bergamo-zen-4c-spam < this says that L3 gets also a hit to latency on zen4c
-
hyc
just finished building gcc/g++ 14.3.0, still failed to build. Maybe gas is also out of date
-
hyc
debian paste refused the output: Could not add your entry to the paste database:
-
hyc
Spam detected: Content is primarily a list of links or hashes.
-
sech1
Yes, risc-v vector aes instructions are very new, so gas must be new too
-
hyc
-
hyc
I'll try to get that built...
-
sech1
Orange Pi RV2 has gcc 15 out of the box (in the distro from their official site), so it's not that bad for risc-v boards
-
hyc
Licheepi appears to be abandoned, they haven't updated their OS image in 2 years
-
hyc
a shame, since I got 16GB of RAM in mine
-
hyc
ok. latest gas built, got past there
-
hyc
all tests passed, SSSE/AVX skipped
-
hyc
and fast reciprocal skipped
-
hyc
-
sech1
Nice
-
sech1
I see you're testing the final v2 code, not just that RV64 vector PR?
-
sech1
No large pages? You have 16 GB RAM, you can enable them
-
hyc
ah yes, I thought this branch was just the PR merged on
-
hyc
do I still need to check just the PR itself?
-
hyc
-
sech1
The PR itself is included in v2 branch, so if it works, it works
-
hyc
ok then I'm going to approve the PR
-
sech1
And large pages?
-
hyc
checking that now
-
sech1
That's a more powerful board than RV2
-
sech1
better hashrate
-
hyc
lemme see, how many hugepages do I need here
-
sech1
I always set it to 1280
-
sech1
1200 should be enough too, but 1280 is a more round number :)
-
hyc
ok
-
hyc
I set 1200
-
hyc
-
sech1
Definitely faster than RV2
-
sech1
Nice to see it working
-
hyc
too bad no vector AES but all good
-
sech1
RV2 could do 110.7 h/s max on v1 (with XMRig)
-
sech1
And I think 99 h/s on v1 with randomx-benchmark
-
sech1
and 67 h/s or so on v2
-
sech1
RV2 doesn't have AES too
-
sech1
I don't know of any boards that have it now
-
sech1
yes, found in the chat log: "Orange Pi RV2 (Ky X1 CPU, software AES): v1 99.3 h/s, v2 67.1 h/s"
-
hyc
have you tried PR#174, monsterpages?
-
hyc
seems decent enough
-
sech1
No, I haven't tried it yet. But I can try it when I get back home, in a week or so
-
hyc
cool
-
sech1
"Memory initialized in 29.9764 s"
-
sech1
RV2 initializes dataset in 15 seconds
-
sech1
Thanks to vector code
-
hyc
nice
-
sech1
And maybe also because it's 8-core
-
hyc
did you look at all at the v0.7 vector instructions? I wonder if the stuff you used is all present in there
-
hyc
granted, only THead's patched gas supports it
-
sech1
No, I downloaded v1.0 vector specs and only looked at that documentation
-
sech1
v0.7 was an intermediary standard, all new boards will have v1.0
-
hyc
ok. I have both v0.7 and v1.0 docs around here but haven't looked at any of it in a while
-
hyc
bitmain also trying to contact me on reddit
paste.debian.net/hidden/014a2749
-
DataHoarder
> We want to discuss/clarify on the new updates on Monero.
-
hyc
personally I don't see why anything needs to be discussed, everything is there in the github repos
-
DataHoarder
and the irc channels and matrix rooms are public
-
sech1
-
sech1
Bitmain can't IRC, apparently
-
sech1
Is it blocked in China?
-
DataHoarder
matrix :)
-
sech1
They can make a Github issue, ffs :)
-
sech1
-
sech1
What does Bitmain tech team want to discuss anyway, we even have RISC-V code for them to use now :)
-
hyc
yeah they don't even have to lift a finger
-
DataHoarder
just send them the irc logs
-
DataHoarder
-
hyc
btw, SG2044 is most likely what they're using now
browser.geekbench.com/v6/cpu/8661173
-
sech1
Does SG2044 have hardware AES, or do they use some kind of SoC with AES accelerator on it - that is the question...
-
sech1
SG2044 is a powerful beast, especially in the memory department
-
sech1
-
DataHoarder
> 1 Processor, 1 Core, 64 Threads
-
hyc
SG2042 4 memory controllers, 4 channels. SG2044 32 memory controllers, 32 channels. What a beast.
-
DataHoarder
-
DataHoarder
-
DataHoarder
cannot find AES mention there (?)
-
DataHoarder
C920v2
-
DataHoarder
if it implements 1.0 it should have the vector crypto no?
-
plowsof
ask them for details via the provided email :D
-
hyc
interesting, gcc 14 adopted support for THeadVector, so basically vector0.7
-
DataHoarder
-
DataHoarder
interesting comparisons against Zen2
-
hyc
yeahI get the feeling it'd be a great database machine too
-
DataHoarder
if it has aes, the x9 would still be a pretty nice platform if they don't become DoA like previous
-
hyc
-
hyc
apparently the vector unit can access any memory, independent of MMU permissions
-
DataHoarder
that is afaik about the C920v1(?)
-
DataHoarder
ahahahah
-
DataHoarder
spectre is inevitable, just offer instructions to do this directly
-
hyc
lol
-
DataHoarder
email-to-irc service when :)
-
hyc
-
hyc
so probably no crypto extensions
-
sech1
Zk are scalar crypto extensions
-
sech1
Zvk* are vector crypto extensions
-
sech1
They are different
-
hyc
either way, not referenced here
-
sech1
In addition to the standard RV64GCB[V] ISA, C907 has also implemented the XIE (XuanTie Instruction Extension). The XIE consists of extended instructions optimized for load/store, arithmetic, bitwise and cache/TLB operations. When enabled, these instructions improve the performance significantly.
-
sech1
Interesting
-
sech1
SG2044 has XIE too
-
sech1
Looking for the list of instructions now
-
hyc
I think current gcc with -march=native should already turn those on
-
hyc
though perhaps some of these only make sense in asm
-
hyc
-
hyc
So yeah, Zvk doesn't get mentioned anywhere. and I guess it's separate from RVV1.0?
-
sech1
-
sech1
I don't see any crypto instructions mentioned
-
sech1
Those extensions add instructions similar to the regular RISC-V zba/zbb/zbc instructions, we already have support for those
-
hyc
-
DataHoarder
without AES in-line, that'd be pretty devastating
-
DataHoarder
I wonder if they have an accelerator for scratchpad AES
-
sech1
hyc "According to the latest RVA profile, vector crypto should be preferred:
github.com/riscv/riscv-profiles/releases/tag/rva23-rvb23-ratified" - tevador
-
hyc
what the hell are folks running on. Linux kernel already supports it all.
lwn.net/Articles/952854
-
sech1
I don't know what they're running on, but I debugged my vector aes code in qemu
-
sech1
They did the same, probably, since it was in 2023
-
hyc
I guess
-
hyc
-
sech1
zvkt = make certain instructions constant time
-
sech1
it's not a specific instruction set
-
hyc
ok. so spacemit x60 claims RVA22 support and RVV1.0 but crypto is still optional in RVA22 and it sounds like they don't support it
-
sech1
X5 used SG2042R which must be a custom chip, so maybe X9 uses a custom SG2044