-
sech1
We can use XMRig (dev branch) now for testing:
xmrig/xmrig #3769
-
sech1
-
sech1
16 cores, hardware AES support and 256-bit vectors
-
sech1
This one is interesting
-
jpk68
Has anyone tested the Milk-V Mars?
-
jpk68
It's not a super interesting board but I can post benchmarks
-
jpk68
I keep getting a build error when trying to compile on the board. It seemed to work last time
-
jpk68
Error finding "<riscv_vector.h>"
-
jpk68
-
sech1
This is a standard header for risc-v gcc
-
sech1
double check that you're using a recent gcc version
-
jpk68
Thanks. Looks like I'll have to compile it from source.
-
jpk68
Looking back at older chat logs, it's gonna take a while haha
-
hyc
SpacemiT_K3 definitely sounds promising
-
hyc
no availability before April
-
hyc
Sipeed has a SpacemIT K1 module LicheePi 3A that can replace the LicheePi 4A CPU module
-
DataHoarder
-
hyc
Would be cool if they release a new CPU module with the K3
-
hyc
-
hyc
still no sign of hardware AES in SG2044
-
sech1
It doesn't have hardware AES
-
sech1
Whatever they did in X9, it has to be either a separate accelerator on the same package (kind of like an APU), or a special version of SG2044
-
tevador
It has an on-die "Security Protocol Accelerator" that can do AES. Source:
old.reddit.com/r/RISCV/comments/1lv…phgo_technology_newsletter_20250709
-
hyc
ah nice find. have never seen their API
-
sech1
"Because encryption, decryption and hashing are performed entirely in the accelerator’s DMA pipeline, throughput is significantly higher"
-
sech1
"SPACC streams data over a 128-bit AXI interface"
-
sech1
So SG2044 is basically screwed with RandomX v2
-
sech1
This crypto accelerator is designed for throughput, not latency. And low latency is required in the main loop
-
hyc
the only hits I get on security protocol accelerator / spacc are from Synopsis
-
tevador
It's perfect for scratchpad fill/hash, but not so much for v2.
-
sech1
My tests on Orange Pi RV2 showed 2x slowdown with v2 because of software AES
-
sech1
and because of bigger program size, but software AES contributed the most
-
sech1
I googled a little about AXI interfaces - their typical latencies are 50-100 AXI clock cycles, and AXI clock frequency is in the range 100-250 MHz.
-
sech1
Best case scenario (250 MHz and 50 cycles) is 200 ns latency to get anything back from the SPACC
-
sech1
well, it has access to L3, so CPU cores can read processed data from L3 directly, but they still need to know when it's safe to read the data - they need to wait for SPACC's "ok" to read the data
-
sech1
nevermind, I was looking at the wrong numbers. It is much faster (5-10 cycles), and frequency can be as high as CPU cores
-
sech1
But SPACC can only exchange data with CPU cores through the L3 cache, as far as I understand, so the whole thing can't be faster than L3 access (+SPACC's internal latency)