05:20:38 hmm sech1, but each one currently does get different initialization entropy 05:22:12 and each slot does get xor'd along the way with register data, which means you still have to store 3 values total across iterations 07:41:13 speaking of zen-c, is there any major performance difference w/ RX on zen-c cores? after accounting for clock speed differences, that is 07:42:17 clocks are lower and caches are smaller, but possibly cache latency? 07:43:12 on 4/4c I see it's ~5ghz/3.7ghz so definitely some raw hashrate difference, but IPC is similar no? 08:01:02 afaik they left the other stuff equal in zen5c besides lowering l3 08:01:28 it's not similar to intel efficiency cores that change implementation 08:04:34 https://chipsandcheese.com/p/testing-amds-bergamo-zen-4c-spam < this says that L3 gets also a hit to latency on zen4c 13:17:09 just finished building gcc/g++ 14.3.0, still failed to build. Maybe gas is also out of date 13:17:28 debian paste refused the output: Could not add your entry to the paste database: 13:17:29 Spam detected: Content is primarily a list of links or hashes. 13:19:11 Yes, risc-v vector aes instructions are very new, so gas must be new too 13:21:20 https://pastebin.com/D5QpcLvh 13:21:30 I'll try to get that built... 13:44:58 Orange Pi RV2 has gcc 15 out of the box (in the distro from their official site), so it's not that bad for risc-v boards 13:52:34 Licheepi appears to be abandoned, they haven't updated their OS image in 2 years 13:53:11 a shame, since I got 16GB of RAM in mine 13:56:49 ok. latest gas built, got past there 13:58:37 all tests passed, SSSE/AVX skipped 13:58:43 and fast reciprocal skipped 14:03:48 mining with 1 thread https://paste.debian.net/hidden/6c5e26ee 14:06:30 Nice 14:06:48 I see you're testing the final v2 code, not just that RV64 vector PR? 14:07:18 No large pages? You have 16 GB RAM, you can enable them 14:12:42 ah yes, I thought this branch was just the PR merged on 14:12:56 do I still need to check just the PR itself? 14:13:43 4 threads mining https://paste.debian.net/hidden/fc6d9c0d 14:14:52 The PR itself is included in v2 branch, so if it works, it works 14:15:07 ok then I'm going to approve the PR 14:15:08 And large pages? 14:15:14 checking that now 14:15:29 That's a more powerful board than RV2 14:15:32 better hashrate 14:15:58 lemme see, how many hugepages do I need here 14:16:35 I always set it to 1280 14:16:59 1200 should be enough too, but 1280 is a more round number :) 14:17:17 ok 14:17:25 I set 1200 14:19:30 largepages https://paste.debian.net/hidden/84f42e24 14:19:54 Definitely faster than RV2 14:19:58 Nice to see it working 14:20:13 too bad no vector AES but all good 14:20:17 RV2 could do 110.7 h/s max on v1 (with XMRig) 14:20:24 And I think 99 h/s on v1 with randomx-benchmark 14:20:36 and 67 h/s or so on v2 14:20:49 RV2 doesn't have AES too 14:20:57 I don't know of any boards that have it now 14:22:32 yes, found in the chat log: "Orange Pi RV2 (Ky X1 CPU, software AES): v1 99.3 h/s, v2 67.1 h/s" 14:28:42 have you tried PR#174, monsterpages? 14:28:55 seems decent enough 14:35:10 No, I haven't tried it yet. But I can try it when I get back home, in a week or so 14:35:30 cool 14:35:39 "Memory initialized in 29.9764 s" 14:35:45 RV2 initializes dataset in 15 seconds 14:35:48 Thanks to vector code 14:35:55 nice 14:38:12 And maybe also because it's 8-core 14:42:06 did you look at all at the v0.7 vector instructions? I wonder if the stuff you used is all present in there 14:42:57 granted, only THead's patched gas supports it 14:44:20 No, I downloaded v1.0 vector specs and only looked at that documentation 14:44:32 v0.7 was an intermediary standard, all new boards will have v1.0 14:44:49 ok. I have both v0.7 and v1.0 docs around here but haven't looked at any of it in a while 15:02:34 bitmain also trying to contact me on reddit https://paste.debian.net/hidden/014a2749 15:06:07 > We want to discuss/clarify on the new updates on Monero. 15:15:28 personally I don't see why anything needs to be discussed, everything is there in the github repos 15:18:37 and the irc channels and matrix rooms are public 15:20:58 PR merged, https://github.com/SChernykh/RandomX/tree/v2 rebased 15:21:21 Bitmain can't IRC, apparently 15:21:26 Is it blocked in China? 15:21:40 matrix :) 15:22:10 They can make a Github issue, ffs :) 15:22:49 Damn, that FreeBSD packaging bug is still not fixed... https://github.com/SChernykh/RandomX/actions/runs/21253931716/job/61162948713 15:23:39 What does Bitmain tech team want to discuss anyway, we even have RISC-V code for them to use now :) 15:24:59 yeah they don't even have to lift a finger 15:26:12 just send them the irc logs 15:26:27 https://libera.monerologs.net/monero-pow/20260122 15:28:01 btw, SG2044 is most likely what they're using now https://browser.geekbench.com/v6/cpu/8661173 15:28:42 Does SG2044 have hardware AES, or do they use some kind of SoC with AES accelerator on it - that is the question... 15:29:08 SG2044 is a powerful beast, especially in the memory department 15:29:51 https://arxiv.org/html/2508.13840v1 15:31:40 > 1 Processor, 1 Core, 64 Threads 15:33:02 SG2042 4 memory controllers, 4 channels. SG2044 32 memory controllers, 32 channels. What a beast. 15:34:40 dataset for the core is afaik https://www.xrvm.com/product/xuantie/C920 https://www.xrvm.com/community/download?id=4240183866976964608 15:35:10 https://github.com/sophgo/sophgo-doc/blob/main/SG2042/T-Head/XuanTie-C910-C920-UserManual.pdf 15:36:11 cannot find AES mention there (?) 15:37:39 C920v2 15:38:19 if it implements 1.0 it should have the vector crypto no? 15:38:30 ask them for details via the provided email :D 15:38:40 interesting, gcc 14 adopted support for THeadVector, so basically vector0.7 15:40:41 damn https://arxiv.org/html/2508.13840v1/graphs/streams.png 15:41:20 interesting comparisons against Zen2 15:41:21 yeahI get the feeling it'd be a great database machine too 15:43:27 if it has aes, the x9 would still be a pretty nice platform if they don't become DoA like previous 15:44:12 https://www.hackster.io/news/ghostwrite-a-serious-flaw-in-the-t-head-xuantie-c910-and-c920-cores-hits-popular-risc-v-sbcs-14833c98e33d 15:44:38 apparently the vector unit can access any memory, independent of MMU permissions 15:44:41 that is afaik about the C920v1(?) 15:44:43 ahahahah 15:45:21 spectre is inevitable, just offer instructions to do this directly 15:45:28 lol 15:47:23 email-to-irc service when :) 15:54:23 I don't see any Zkr / Zkt references here https://gcc.gnu.org/pipermail/gcc-patches/2025-March/677772.html 15:54:30 so probably no crypto extensions 15:54:52 Zk are scalar crypto extensions 15:54:58 Zvk* are vector crypto extensions 15:55:01 They are different 15:55:13 either way, not referenced here 16:01:50 In addition to the standard RV64GCB[V] ISA, C907 has also implemented the XIE (XuanTie Instruction Extension). The XIE consists of extended instructions optimized for load/store, arithmetic, bitwise and cache/TLB operations. When enabled, these instructions improve the performance significantly. 16:01:52 Interesting 16:02:02 SG2044 has XIE too 16:02:24 Looking for the list of instructions now 16:02:45 I think current gcc with -march=native should already turn those on 16:03:05 though perhaps some of these only make sense in asm 16:03:25 crypto specs https://github.com/riscv/riscv-crypto/releases 16:04:01 So yeah, Zvk doesn't get mentioned anywhere. and I guess it's separate from RVV1.0? 16:06:48 Probably this: https://github.com/XUANTIE-RV/thead-extension-spec/releases/tag/2.3.0 16:07:07 I don't see any crypto instructions mentioned 16:09:44 Those extensions add instructions similar to the regular RISC-V zba/zbb/zbc instructions, we already have support for those 16:10:00 Zvk is still pretty new https://dl.acm.org/doi/abs/10.1145/3658644.3691394 16:11:44 without AES in-line, that'd be pretty devastating 16:12:13 I wonder if they have an accelerator for scratchpad AES 16:15:38 hyc "According to the latest RVA profile, vector crypto should be preferred: https://github.com/riscv/riscv-profiles/releases/tag/rva23-rvb23-ratified" - tevador 16:15:55 what the hell are folks running on. Linux kernel already supports it all. https://lwn.net/Articles/952854/ 16:16:23 I don't know what they're running on, but I debugged my vector aes code in qemu 16:17:16 They did the same, probably, since it was in 2023 16:17:46 I guess 16:18:49 zvkt here https://www.rt-rk.com/gcc-tuning-for-spacemit-x60-building-an-in-order-dual-issue-scheduler-model-part-i/ 16:20:43 zvkt = make certain instructions constant time 16:20:49 it's not a specific instruction set 17:18:23 ok. so spacemit x60 claims RVA22 support and RVV1.0 but crypto is still optional in RVA22 and it sounds like they don't support it 17:29:45 X5 used SG2042R which must be a custom chip, so maybe X9 uses a custom SG2044