#monero-pow

00:20

crypto_grampy[m]

<gingeropolous> "im betting a second p2pool is..." <- We should come up with a name for og-p2pool soon
00:23

nioc

sechistheone
00:27

crypto_grampy[m]

I like that. If someone creates a new pool, would it just use a different port? How do you know which p2pool you're on if there are multiple?
00:43

gingeropolous

yeah i think its just different port
00:48

crypto_grampy[m]

I don't understand this stuff at all, but would it make any sense to add a unique id? So everyone who wants to start their own p2pool uses the same port, but you connect with peers that run with the same unique id... I.e. pool name which could be sechistheone
00:49

crypto_grampy[m]

If you want to switch pools, you set the pool name and connect to an IP of a known peer in that pool?
00:50

crypto_grampy[m]

s/set/change/
05:39

DataHoarder

There is already a unique id created based on consensus parameters crypto_grampy[m]
05:40

DataHoarder

the issue with more p2pool is 51% attacks that just grab rewards gingeropolous (they can’t do anything Monero wise, just change what shares get paid)
12:41

minereni_d

Hey everyone
12:41

minereni_d

I'm noticing the number of upstreams on my xmrig proxy keeps increasing, while the number of miners remains the same
12:41

minereni_d

[2021-11-04 09:27:28.308] proxy 150.00 kH/s, shares: 4/0 +0, upstreams: 24, miners: 24 (max 24) +0/-0
12:41

minereni_d

[2021-11-04 09:28:28.568] proxy 100.00 kH/s, shares: 4/0 +0, upstreams: 27, miners: 24 (max 24) +3/-3
12:42

minereni_d

Then I also notice that p2pool actually sends work items 1 for each upstream
12:43

minereni_d

Is it normal that I receive more upstreams than workers? are these upstreams garbage collected somehow?
12:44

hv-bridge

<sech1> if you run xmrig-proxy in simple mode then 1 upstream = 1 miner
12:44

hv-bridge

<sech1> it probably keeps an upstream for a while after miner disconnects
12:48

minereni_d

yes, I use it in simple mode, as I understand this is the way to run it with p2pool
12:48

minereni_d

I've never seen the upstream number go down eventually
13:01

\x

hyc sech1 everyone
13:01

\x

xmrig.com/benchmark/762KY3
13:01

pauliouk

damn
13:02

pauliouk

not bad - expensive bit of kit I'm guessing?
13:02

\x

needs optimization or prolly latency too high
13:02

\x

pauliouk: 5400 is pretty low end for ddr5
13:02

\x

7000 at the topend with an xOC 2 dimmer board
13:02

\x

they say 6400 is attainable by most chips on the daily
13:14

hv-bridge

<sech1> phoronix.com/scan.php?page=article&item=intel-12600k-12900k&num=3
13:15

\x

hwbot.org/submission/4846628_hocayu…ory_frequency_ddr5_sdram_4352.3_mhz
13:15

\x

sech1
13:18

hv-bridge

<sech1> so ~8.9 kh/s on Monero and ~10.4 kh/s on Wownero
13:18

hv-bridge

<sech1> just as I expected
13:26

\x

sech1: okay man, done with that bench, maybe youll get more in a few days heh
13:26

\x

gonna bother the guy for other stuff now
13:26

\x

sech1: did it run properly?
13:44

hv-bridge

<sech1> looks like it did
13:46

\x

sech1: p-cores have avx512 but for now intel disabled it since windows cant handle it
13:46

\x

like e-cores dont have it
13:47

\x

so when running certain stuff it uhhhh
13:47

\x

illegal instruction
13:47

\x

intel will likely re-enable it idk
13:47

hv-bridge

<sech1> avx512 is a waste
13:47

\x

an early asus bios enables you to toggle it and it loads an older microcode and it disables the e-cores
13:47

hv-bridge

<sech1> better have more smaller cores without avx512
13:47

\x

intel is still gonna try to enable it
13:48

\x

but yeah, needs software support
13:48

\x

just a heads up
13:48

\x

sech1: i cant promise a 1t run as it takes too much time and everyone is asking that guy to run meme benchmarks
13:48

\x

maybe tomorrow ill get the guy to run 1t
13:55

\x

i.imgur.com/Pewg31V.png
13:55

\x

sech1: interested on an avx512 run or is it really useless for you?
13:56

hv-bridge

<sech1> useless
13:56

hv-bridge

<sech1> also: 3dnews.ru/assets/external/illustrations/2021/11/04/1052960/memlat.png
13:56

\x

kk
13:56

hv-bridge

<sech1> huge L3 latency
13:56

hv-bridge

<sech1> 1T RandomX will be bad
13:56

\x

how many minutes do you expect for 1M?
13:56

\x

1t 1M
13:57

\x

i just cant bother the guy to run a long benchmark for now, every fucker on the room is asking for 3dmark stuff with weird settings
13:57

hv-bridge

<sech1> probably ~900 h/s, so 17-19 minutes
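The 17-19 minute estimate follows from simple division. A minimal sketch, assuming "1M" means a fixed benchmark of 1,000,000 hashes and that the single-thread hashrate stays roughly constant for the whole run; the function name is made up for illustration:

```python
def benchmark_minutes(total_hashes: int, hashrate_hs: float) -> float:
    """Minutes needed to compute total_hashes at a constant hashrate (h/s)."""
    return total_hashes / hashrate_hs / 60.0


if __name__ == "__main__":
    # Assumption: the "1M" benchmark is exactly 1,000,000 hashes.
    for rate in (880, 900, 980):
        print(f"{rate} h/s -> {benchmark_minutes(1_000_000, rate):.1f} min")
```

At about 900 h/s this lands in the middle of the quoted range; the edges of the range correspond to roughly 880 and 980 h/s.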
13:57

\x

lmao
13:57

\x

so rocketlake still 1t king
13:57

hv-bridge

<sech1> 1t can wait
13:57

\x

kk
13:58

\x

yeah good thing this guy woke up on time now
13:58

\x

and i got the first benchmark after nda lift
13:58

hv-bridge

<sech1> 40 cycles on Zen3 vs 65 cycles on Alder Lake
14:01

\x

sech1: im still impressed on the performance though
14:01

\x

not bad for the first try of big little on x86
14:02

\x

im sure there will be growing pains but yeah, not bad
14:02

\x

intel is back man
14:02

\x

intel is back
14:04

nioc

So 12900 = 3700
14:05

\x

for mining ye
14:05

\x

intel still a cachelet
14:08

hyc

sech1: are you able to reproduce the core_tests crash on arm64/jit ?
14:37

hv-bridge

<sech1> I only have RPi4 and it overheats under prolonged load and then reboots or hangs
14:57

pauliouk

sech1, want me to send you a heat sink kit and 3v fan? :) keeps mine running
14:58

hv-bridge

<sech1> I have a heat sink there. It only works fine if I open the window and put my RPi4 under cold air from the street 🙂
14:59

pauliouk

fresh air is always a plus I guess :)
16:09

\x

sech1: you seem spot on, guy said its doing 900 h/s on 1t
16:09

\x

based sech1
16:09

\x

anyway, we will finish the bench
16:28

\x

sech1 xmrig.com/benchmark/7Kp9ny
17:20

hyc

ok I'm running under gdb again, will grab disassembly when it dies
17:21

hyc

folks please use #monero-mining or something for general p2pool usage stuff
17:21

hyc

this channel should be development focused
17:29

hyc

sech1: paste.debian.net/1218230
17:29

nioc

there is also #p2pool-log for p2pool chatter
17:30

hyc

let me know if you want more context before/after there
17:32

hyc

paste.debian.net/1218231 same thing, with hex bytes
17:40

sech1

so it was a call to randomx_calc_dataset_item_aarch64 that landed in unmapped memory
17:40

sech1

it looks like linker bug to me
17:42

sech1

or no, it's not a linker bug. randomx_calc_dataset_item_aarch64 is copied to allocated memory to construct super scalar hash code
17:45

sech1

hyc you need to print out CodeSize and CalcDatasetItemSize to see how much it allocates. It uses pointer differences between function pointers, maybe it's actually some linker magic that breaks things
17:48

hyc

randomx::CodeSize = 13464
17:48

hyc

randomx::CalcDatasetItemSize = 66308
17:54

hyc

the memory region is 81920 bytes, should have been large enough for all of that
17:55

hyc

the target address is far below the beginning of the region
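The size check can be redone by hand. A small sketch using the two values printed above, assuming (not confirmed from the RandomX source here) that the JIT buffer is the sum of both sizes rounded up to a 4 KiB page:

```python
PAGE_SIZE = 4096  # assumption: 4 KiB pages on this arm64 box


def round_up(n: int, align: int) -> int:
    """Round n up to the next multiple of align."""
    return (n + align - 1) // align * align


CODE_SIZE = 13464               # randomx::CodeSize as printed by hyc
CALC_DATASET_ITEM_SIZE = 66308  # randomx::CalcDatasetItemSize as printed by hyc

total = CODE_SIZE + CALC_DATASET_ITEM_SIZE
region = round_up(total, PAGE_SIZE)

if __name__ == "__main__":
    print(f"needed: {total} bytes, page-rounded region: {region} bytes")
```

The sum is 79772 bytes, which rounds up to the 81920-byte region seen in the debugger, so the allocation itself is not what went wrong.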
17:55

hyc

+++++++++++++
17:57

hyc

sech1 what values do you get in your build?
18:05

sech1

these values look correct. I need to remember how it's actually JITted
18:16

sech1

hyc that "bl 0xffffe1f70438" is definitely wrong, it should jump to 0x0000ffffe1f95... because JIT doesn't overwrite jump destination there. It only overwrites the previous two "add" instructions
18:18

sech1

maybe linker error
18:19

sech1

can you try compiling and running xmrig there? It has the same code there
18:41

hyc

afaik this only fails in core_tests
18:42

hyc

this box is currently running both monerod and p2pool, so this code is already live on the box
18:44

hyc

I have previously run xmrig on here without any problems either
18:45

hyc

maybe I can figure out how to set a watchpoint on that branch instr and trap when it gets overwritten
18:46

hyc

or I wonder if there's just garbage in there left over from a previous testcase
18:49

wfaressuissia

What's the minimal reproduction you have ?
18:49

wfaressuissia

`tests/core_tests/core_tests --generate_and_play_test_data --filter gen_block_big_major_version` does it fail with this filter ?
18:49

wfaressuissia

did you try to reduce it to single rx_slow_hash call ?
18:50

sech1

if it's a linker error, just calling rx_slow_hash from anywhere within that binary should cras
18:50

sech1

*crash
18:51

sech1

actually, it's better to search for the machine code "ea 03 00 91" (mov x10, sp) and check what's before it in working binary and crashing binary
18:52

sech1

this instruction should be unique enough to find RandomX code
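One way to do that search without a disassembler is a plain byte scan. A rough sketch; the helper names are made up, and it scans the raw file, so matches in non-code sections are possible:

```python
import sys

# "mov x10, sp" on AArch64 is 0x910003EA, stored little-endian in the file.
MOV_X10_SP = bytes.fromhex("ea030091")


def find_all(data: bytes, pattern: bytes) -> list:
    """Return every offset at which pattern occurs in data."""
    offsets = []
    start = 0
    while True:
        i = data.find(pattern, start)
        if i < 0:
            return offsets
        offsets.append(i)
        start = i + 1


def context_before(data: bytes, offset: int, n: int = 16) -> str:
    """Hex dump of up to n bytes immediately preceding offset."""
    return data[max(0, offset - n):offset].hex(" ")


if __name__ == "__main__":
    # Usage: python find_mov.py <working-binary> <crashing-binary>
    for path in sys.argv[1:]:
        with open(path, "rb") as f:
            blob = f.read()
        for off in find_all(blob, MOV_X10_SP):
            print(f"{path} @ {off:#x}: {context_before(blob, off)}")
```

Comparing the bytes printed for the working and the crashing binary shows whether the instructions just before the marker differ.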
18:58

hyc

running now with the filter
18:58

hyc

SEGv'd again
18:58

hyc

different stacktrace tho
18:59

hyc

#0 0x0000ffffe1d6f438 in ?? ()
18:59

hyc

only 1 stack frame, nothing else
18:59

wfaressuissia

can you set preliminary breakpoint on rx_slow_hash, it must be the first call
19:00

hyc

ok trying again with breakpoint
19:00

hyc

hit breakpoint, now what?
19:01

wfaressuissia

`finish`
19:01

hyc

segv
19:02

wfaressuissia

set breakpoint on randomx_calculate_hash
19:02

wfaressuissia

and step manually
19:02

wfaressuissia

there will be few steps before actual jit
19:02

wfaressuissia

and then likely few instructions within jit and segv
19:02

hyc

ok running
19:02

wfaressuissia

cache init stage is not interesting, it can be even commented probably
19:05

sech1

the actual JIT code execution starts at "compiler.getProgramFunc()(reg, mem, scratchpad, RANDOMX_PROGRAM_ITERATIONS);" in vm_compiled.cpp
19:07

hyc

yeah I'm stepping thru asm code now
19:08

hyc

this could take a while, one instr at a time
19:12

hyc

ok, segv
19:13

hyc

gahh. the process disappeared, can't inspect memory*////
19:15

hyc

trying again. cat was on kbd
19:18

wfaressuissia

`one instr at a time` you could do `99999 si` and `1 si` and then bisect actual number of instructions before seg
19:18

wfaressuissia

but `si` is useful only within jit, c++ can be stepped with simple `step`
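The bisection idea is an ordinary binary search over the step count. A sketch, assuming the run is deterministic (same instruction count before the fault on every restart); `crashes_within` is a hypothetical callback that in practice would mean restarting at the breakpoint and issuing `si` with that count by hand:

```python
def first_crashing_step(crashes_within, upper: int) -> int:
    """Smallest n such that stepping n instructions hits the fault.

    crashes_within(n) must return True if the process faults within n
    single-steps. Precondition: crashes_within(upper) is True.
    """
    lo, hi = 0, upper  # invariant: no crash within lo steps, crash within hi
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if crashes_within(mid):
            hi = mid
        else:
            lo = mid
    return hi


if __name__ == "__main__":
    # Stand-in for the real debugger session: pretend the fault is at step 2873.
    print(first_crashing_step(lambda n: n >= 2873, 99999))
```

With an upper bound of 99999 this needs about 17 restarts instead of thousands of manual steps.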
19:23

hyc

ok I have it
19:23

hyc

paste.debian.net/1218250
19:27

sech1

well, the jump offset is the same in both cases
19:27

hyc

yes. the execution was completely linear from 91000 to 91b38
19:28

hyc

then it branched to 94430
19:28

sech1

I mean the bl instruction that jumps to unmapped memory
19:28

hyc

yeah it's the same bytes as before
19:29

sech1

it's probably linker bug
19:29

sech1

I blame binutils again
19:29

hyc

hmmm
19:29

sech1

IIRC we had problem with it on ARM before
19:29

selsta

does ARM on Mac use JIT?
19:29

hyc

yes
19:29

sech1

tevador/RandomX #128
19:31

hyc

so this is gnu ld 2.34
19:31

hyc

ubuntu 20.04.2 lts
19:34

wfaressuissia

"github.com/tevador/RandomX/blob/mas…/src/jit_compiler_a64_static.S#L434" it's this place in jit, right ?
19:35

sech1

yes
19:36

sech1

here's where the previous 2 add instructions are updated: github.com/tevador/RandomX/blob/master/src/jit_compiler_a64.cpp#L224
19:36

sech1

but jump instruction is not touched there
19:39

hyc

hmmmm
19:39

hyc

that symbol is not present in the .o file
19:39

hyc

oh there it is
19:40

hyc

paste.debian.net/1218253
19:42

hyc

maybe reordering chunks in the source file would avoid the problem
19:47

wfaressuissia

disassemble this randomx_program_aarch64_light_dataset_offset, what is the destination of relative jump there ? is it randomx_calc_dataset_item_aarch64 ?
19:48

wfaressuissia

you can disassemble executable with objdump / gdb / anything else
19:49

hyc

sure
19:49

hyc

it is <randomx_calc_dataset_item_aarch64@plt>
19:49

hyc

it's treating it as an external global, not a local reference
19:50

hyc

that would explain the problem, it needs to jump thru the plt to get fixed up to the correct address
19:50

wfaressuissia

and jit is doing stupid memcpy of asm with assumption that it's PIE
19:50

hyc

and that's not happening here
19:51

wfaressuissia

PIC (position independent code)
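Why a memcpy breaks a branch to the PLT can be shown with the AArch64 `bl` encoding, which stores a PC-relative offset rather than an absolute address. A minimal sketch; all addresses below are made up for illustration and are not the ones from the pastes:

```python
def encode_bl(pc: int, target: int) -> int:
    """Encode AArch64 'bl target' for an instruction located at pc."""
    offset = target - pc
    assert offset % 4 == 0 and -(1 << 27) <= offset < (1 << 27)
    return 0x94000000 | ((offset >> 2) & 0x03FFFFFF)


def bl_target(pc: int, insn: int) -> int:
    """Destination of a 'bl' instruction word when executed at pc."""
    assert insn & 0xFC000000 == 0x94000000, "not a bl instruction"
    imm26 = insn & 0x03FFFFFF
    if imm26 & 0x02000000:  # sign-extend the 26-bit immediate
        imm26 -= 1 << 26
    return pc + (imm26 << 2)


if __name__ == "__main__":
    # Hypothetical layout: code in the shared library, a PLT stub outside the
    # copied block, a local function inside it, and the JIT buffer far away.
    orig_pc, plt_stub, local_fn = 0x401000, 0x400200, 0x402000
    jit_pc = 0xFFFFE1F91000
    delta = jit_pc - orig_pc

    via_plt = encode_bl(orig_pc, plt_stub)
    via_local = encode_bl(orig_pc, local_fn)

    # The local target is copied along with the code, so it still lines up.
    print(hex(bl_target(jit_pc, via_local)), "==", hex(local_fn + delta))
    # The PLT stub is not copied, so the branch now lands in unrelated memory.
    print(hex(bl_target(jit_pc, via_plt)), "!=", hex(plt_stub))
```

A branch to a label inside the copied block survives relocation because both ends move together; a branch to a PLT entry does not, which matches the crash seen only with the shared librandomx.so.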
19:51

hyc

yes
19:51

hyc

dunno how to force the asm to emit a relative branch here
19:51

sech1

it should be PIC
19:51

wfaressuissia

`as -fPIC ...` ?
19:52

sech1

it's a local jump within the same .S file
19:52

hyc

well, it has assembled as a global reference, not a local
19:53

sech1

probably because randomx_calc_dataset_item_aarch64 is used in jit_compiler_a64.cpp
19:53

sech1

but it should've optimized it to relative jump during linking
19:54

hyc

still, it would be better to just emit it as local and leave it alone. I wonder if reordering so it's not a forward reference would make any difference
19:55

hyc

or just give randomx_calc_dataset_item_aarch64 2 labels, one local
19:55

sech1

yeah, but how you can be sure it's the only place like this
19:55

sech1

maybe it's first of many
19:56

hyc

hmmm. ok will objdump the .o
19:56

sech1

actually they're all declared as global (see first lines of jit_compiler_a64_static.S)
19:59

hyc

they appear to be relocatable references in the .o file instead of relative
19:59

wfaressuissia

"github.com/tevador/RandomX/blob/mas…r/src/jit_compiler_x86_static.S#L94"
20:00

wfaressuissia

"paste.debian.net/hidden/9831a12f" build with clang and shared libs failed on some jump within jit too
20:00

wfaressuissia

and it was x86_64
20:01

hyc

I'm going to insert a bunch of local labels in here and see if it makes a difference
20:02

wfaressuissia

"libera.monerologs.net/monero-pow/20210819" steps to reproduce
20:09

hyc

yes, inserting and using local labels works, the global references are gone
20:09

hyc

test succeeds
20:11

wfaressuissia

can you patch x86_64 jit, i'll repeat the above test with clang
20:11

wfaressuissia

* ... write patch for x86_64 jit and share it ...
20:12

hyc

ok, gimme a couple minutes
20:14

hyc

fyi github.com/hyc/RandomX/tree/relocs
20:14

hyc

I'll add the x86 patch to this branch
20:18

wfaressuissia

works
20:19

wfaressuissia

paste.debian.net/hidden/e1081be5
20:20

hyc

I'm fixing up one more reference
20:20

hyc

line 151
20:20

hyc

rx_dataset_init
20:21

hyc

that seems to be all
20:21

hyc

I presume the .asm file should get the same change
20:22

wfaressuissia

maybe
20:25

hyc

I guess randomx_dataset_init was safe because it's within the same function
20:25

hyc

but whatever, I changed it already, will leave it in
20:26

hyc

ok, branch updated. we need someone to build on windows to test the .asm file
20:26

wfaressuissia

not me
20:27

hyc

yeah, I don't have recent msvc here either
20:32

selsta

is running randomx-tests enough?
20:33

hyc

probably
20:33

selsta

(don't know if that uses jit)
20:34

selsta

I can run it on Windows CI
20:35

hyc

cool. but is it actually building with msvc or with gnu toolchain?
20:40

wfaressuissia

the last unanswered question, why did it fail only within core_tests ?
20:40

hyc

or mebbe I should just leave the .asm source untouched, since nobody has reported a crash in windows
20:40

hyc

probably because it's dynamic linking librandomx.so
20:41

hyc

wfaressuissia: in a static build there would be no PLT reference
20:41

hyc

dunno
20:41

wfaressuissia

can you verify this hypothesis somehow quickly ? via build logs maybe
20:42

hyc

I didn't build tests on my release build, lemme check
20:47

hyc

release build def builds static library, debug build uses shared library
20:53

wfaressuissia

core_tests fails independently on build type (debug/release), right ?
20:54

sech1

actually I had a lot of similar problems with debug MSVC builds
20:54

sech1

it always replaced pointers to functions with pointers to jmp instructions that jumped to actual functions
20:58

hyc

ok, then the .asm patch should prob help
20:59

hyc

wfaressuissia: I suppose it would still fail on release build if it assembled an absolute addr reference instead of a relative one
21:00

wfaressuissia

why monerod works independently on build type then ?
21:01

hyc

link time optimization?
21:01

wfaressuissia

core_tests and monerod are both executables in the end, what's the difference ?
21:01

hyc

dunno
21:02

hyc

the release build definitely assembled absolute references on master
21:06

hyc

but monerod binary has relative references there
21:08

hyc

gdb monerod ; disass/r randomx_program_aarch64_light_dataset_offset
21:08

hyc

so monerod works because the linker did the right thing there
21:09

wfaressuissia

what bit to flip in order to get the same problem in monerod as in core_tests ?
21:09

wfaressuissia

* which bit ...
21:09

hyc

good question
22:27

hyc

my release build of core_tests doesn't crash. the code has a relative reference
22:28

hyc

so it's only a problem with dynamic librandomx.so
22:28

wfaressuissia

then all questions are answered
