-
br-m
<preland> @rucknium: I should be ready to present some more of my findings next Wednesday
-
br-m
<rucknium> MRL meeting in this room in two hours.
-
br-m
<rucknium> Meeting time!
monero-project/meta #1321
-
br-m
<rucknium> 1. Greetings
-
br-m
<vtnerd> hi
-
rbrunner
Hello
-
br-m
<gingeropolous> howdy
-
plowsof
👋
-
br-m
<rucknium> 2. Updates. What is everyone working on?
-
br-m
<rucknium> me: Fixed the database lock issue for the Monero net scanner. Revising the webapp (Not linking to it now since it has a big error message :D )
-
br-m
<vtnerd> me: re-worked the lwsf<->lws<->monerod protocols for fcmp++ tree building. still works! currently looking into a bug about 0-conf reporting and fcmp++
-
br-m
<gingeropolous> me: working on monerosim. claude code is wow. might have just one-shotted the whole mining shim thing. although i got the generate blocks one working great as well, doesn't require any monero mods. waiting on MRC resources freeing up to test 1000 node scale up. can currently get 85 agents on 32GB ram. nodes sync, txs generate and relay, included in blocks.
-
br-m
<articmine> Hi
-
br-m
<jbabb:cypherstack.com> knee-deep in reviewing the carrot impl
-
DataHoarder
implemented carrot tx proofs (generate, verify) on my libraries, doing some benchmarks on RandomX V2
-
br-m
<rucknium> 3. Spy nodes (
monero-project/meta #1124).
-
br-m
<rucknium> I had planned to have the spy nodes webapp revised by now, but I'm getting an unexpected launch error on the server at the moment.
-
br-m
<rucknium> The network scanner seems to be working well again after I switched to a different Rust crate for connected with SQLite databases. It allows specifying a longer connection timeout so the database doesn't appear fatally locked to the tokio threads.
-
br-m
<rucknium> For finalizing the new ban list, we need to have the original signers, at least, sign it. IIRC, SethForPrivacy's Docker image requires that the signatures are valid.
-
br-m
<rucknium> The signers would be @boog900:monero.social , @syntheticbird:monero.social , @jeffro256:monero.social , and myself.
-
br-m
<rucknium> I usually don't like to do announcements on Friday or Monday. So we could aim to announce tomorrow or next on Tuesday.
-
br-m
<rucknium> We would want to hit Reddit, monero.town, Twitter at least with the announcement.
-
br-m
<rucknium> And the GitHub issue
-
br-m
<gingeropolous> +1
-
br-m
<rucknium> Anything more on spy nodes?
-
br-m
<rucknium> 4. FCMP alpha stressnet (
monero.town/post/6763165).
-
br-m
<rucknium> @gingeropolous:monero.social suggested that tx spam volume be decreased so that he can have all the RAM available on the big-RAM Monero Research Computing machine. The one with 1TB of RAM.
-
br-m
<rucknium> I think that would be fine
-
br-m
<rucknium> So that @gingeropolous:monero.social can run the scaled-up monerosim
-
br-m
<gingeropolous> it should be like 5-8 hrs is all i need
-
br-m
<articmine> With the new scaling parameters we need to consider a significantly higher penalty free for stressnet instead of just relying on spam.
-
br-m
<articmine> Otherwise it will take years to test for many of the identified issues with scaling
-
br-m
<rucknium> Yes. I think initially we thought that the actual FCMP hard fork scaling parameters should be used for beta stressnet, but now I think differently.
-
br-m
<articmine> The simplest is to increase the penalty free zone ZM
-
br-m
<rucknium> By the way, I checked the current long term median block weight for this stressnet. It is 664 KB. The long term median before stressnet started was 176 KB.
-
br-m
<articmine> Then keep the other parameters 8x on MN and 1.2x on ML the same
-
br-m
<rucknium> AFAIK, get_info doesn't give you the long term median, which is what I thought before. You have to query a block header.
-
br-m
<gingeropolous> @articmine:monero.social: , im hopeful monerosim allows for faster testing. so far i can run 6 hours of monero network in about 20 minutes.... faster if the hardware is better
-
DataHoarder
-
DataHoarder
Indeed comes from block headers
-
br-m
<articmine> @gingeropolous: So a factor of 18. That will certainly help
-
br-m
<gingeropolous> gonna need moar ram
-
br-m
<boog900> @rucknium: I think that value would just be that blocks long term weight not the median long term weight
-
br-m
-
br-m
<rucknium> @boog900:monero.social: What's the difference?
-
br-m
<articmine> The long term block weight can be significantly higher than the long term median, if growth lags behind the long term median.
-
br-m
<articmine> It can also be lower
-
br-m
<rucknium> Is there an RPC query that can get the median long term weight?
-
br-m
<boog900> A blocks long term weight is the value that is used in the long term blocks list to get the median long term weight
-
br-m
<boog900> In the same way a blocks weight is used in the short term list to get the short term median block weight
-
br-m
<boog900> You could get the long term median weight under some circumstances if the block is outside a certain range by multiplying or dividing by 1.7. For example before stressnet the median was almost certainly 176 * 1.7
-
br-m
<rucknium> @boog900:monero.social: Your answer is "no, not an easy way to get this info"
-
br-m
<rucknium> unless Mercury is in retrograde
-
br-m
<articmine> If the long term block weight is under the penalty free zone, then the long term median is the penalty free zone.
-
br-m
<articmine> On it has been given enough time to reset
-
br-m
<boog900> @rucknium: Well the blocks long term weight is bounded to the long term median divided by 1.7 and multiplied by 1.7 so you can get the value if you have a very small or very large block.
-
br-m
<boog900> But yeah I'm not sure if there is an API that just tells you the value
-
br-m
<articmine> With the new proposed changes this bound is 1.2
-
br-m
<rucknium> In the 26 November meeting @jberman:monero.social said
-
br-m
<rucknium> > personally I'd like to complete the alpha stressnet (reach a point where alpha stressnet is running smooth under reasonable conditions), ideally within the next 4 weeks. and then reopen a conversation on target dates
-
br-m
<rucknium> @jberman:monero.social and/or @jeffro256:monero.social , how close do you think is the end of alpha and beginning of beta stressnet?
-
br-m
<jberman> apologies, I messed up timing on this meeting
-
br-m
<jeffro256> Me too, hi
-
br-m
<jbabb:cypherstack.com> A "scalenet" with modifered parameters to make spamming much easier to spam in order to get better data on the scaling question makes sense to me, but as a subsequent testnet following this stressnet with its current params
-
br-m
<jbabb:cypherstack.com> s/spam/do
-
br-m
<articmine> @boog900: No the block weight can still be up to 100x ML under the current scaling and up to 16x ML under the proposed scaling
-
br-m
-
br-m
<boog900> @articmine: The long term weight is a different value.
-
br-m
<jberman> Re: end of alpha: We have 1 more open PR for v1.5 that prevents OOM's during initial block download. It is an upstream issue, but the stressnet is triggering it for some. That PR is:
seraphis-migration/monero #275
-
br-m
<articmine> I've how many cycles of ML
-
br-m
-
br-m
<articmine> Over how many cycles of ML is this long term weight calculated
-
br-m
<articmine> MzL can lag
-
br-m
<jeffro256> We've got the weight func modified in the staging branch, Berman is currently working on some changes to tx relay v2 with 0xfffc, we're not going to wait for carrot-derived wallets, hot/cold wallets is ready on PR #52, Unbiased hash-to-point is implemented and ready to merged into the staging branch, the other points are mostly resolved on the staging branch AFAIK
-
br-m
<jberman> I also PR'd changes to tx relay v2 that would be nice to have tested in alpha, since tx relay v2 is a fairly significant change affecting the daemon:
0xFFFC0000/monero #62
-
br-m
<jeffro256> So we basically just need to integrate the updated scaling parameters and tx relay v2 and it should be ready to go I think
-
br-m
<jeffro256> Yeah true, we need to upstream the span changes
-
br-m
<jeffro256> But we don't have to wait for them to be upstreamed to start the beta, yeah?
-
br-m
<jberman> Sorry I wasn't clear. We need that PR specfically for v1.5 of the alpha stressnet. I was just highlighting how that is an upstream issue (and a separate upstream PR is warranted as well)
-
br-m
<jberman> I was thinking we verify that v1.5 solves all the major outstanding alpha stressnet issues people have reported during testing (like OOM's) before moving to beta
-
br-m
<jberman> A few stressnet users have also reported new crashes even with a pre-release of v1.5, and I'm still working my way through those, starting with this segfault:
seraphis-migration/monero #258
-
br-m
<jberman> I'm almost done with a fix for that issue, and then it's on to this issue:
seraphis-migration/monero #277
-
br-m
<jeffro256> What path should we take if people keep experiencing some of these same errors? Is it not still worth moving to the beta stressnet and continue debugging there?
-
br-m
<jberman> It's hard to prioritize changes like scaling algo for beta when there are triggerable outstanding segfaults
-
br-m
<articmine> @jberman: I would suggest keeping the existing scaling algo until these issues are fixed
-
br-m
<jeffro256> I absolutely wouldn't push for production with outstanding segfault issues, but ostensibly the errors, if not fixed, will also occur on beta for similar reasons.
-
br-m
<jeffro256> If they occur for similar reasons, then debugging on these issues isn't impacted by moving to beta.
-
br-m
<jberman> I also think we need a solution this PR is addressing first and foremost before we should move on to beta, because OOM's have plagued alpha and this should take care of the last major known cause:
seraphis-migration/monero #275
-
br-m
<articmine> @jeffro256: If we change the scaling algo not right away
-
br-m
<jeffro256> If anything, the scaling issues triggering segfaults more often would only make it easier to debug.
-
br-m
<jberman> @jeffro256: these aren't issues with the FCMP++ integration i.e. they are issues with current production code is also why I think they deserve priority
-
br-m
<jeffro256> @jberman: Also, sorry I meant to review this yesterday, will do today.
-
br-m
<jberman> np thank you 🙏
-
br-m
<jeffro256> @jberman: That's fair, I understand why youbut also I think that we can do this largely in parallel since as you're saying FCMP++ scaling and these OOM errors
-
br-m
<jeffro256> oops
-
br-m
<rucknium> Anything more on stressnet?
-
br-m
<jeffro256> I understand why prioritizing this issue over FCMP++ scaling, but I just think that they're mostly orthogonal . But optimistically, if #275 closes up the OOM issues, then we can go full speed ahead :)
-
br-m
<jeffro256> Let's hope that happens
-
br-m
<jberman> I agree they're definitely orthogonal. It's just a matter of manpower really. I don't want to divert attention away from dealing with reliability issues like the OOM's / segfault. if we had someone else (like perfect daemon :) ) focusing on them, I would be content moving on to beta tasks like scaling
-
br-m
<rucknium> Has anyone heard anything from perfect_daemon? I assume, given his M.O., that he's working on a huge PR in secret.
-
br-m
<jeffro256> Nope
-
br-m
<jeffro256> I've tried to DM unsuccessfully
-
br-m
<jberman> I assume the same
-
br-m
<jbabb:cypherstack.com> FCMP testing itself is definitedly a priority over scaling questions. a separate "scalenet" that makes spamming much easier is a orthogonal/tangential to the goal of "just" getting FCMPs working, but it allows an avenue for those that're concerned about scaling issues to prove their point without so much effort invested into spambots.
-
br-m
<vtnerd> @ruckinum I was about to say if perfect-daemon can’t look at this, I probably could given that my fcmp++ changes are going well
-
br-m
<vtnerd> the question is whether we should re-write the span code. probably needs it, otoh its semi-working in production so
-
br-m
<articmine> @jbabb:cypherstack.com: This is where a significantly higher penalty free zone can be used
-
br-m
<boog900> I think the syncing code is up there with the worst parts of the daemon.
-
br-m
<jberman> fwiw, I was primarly referring to issues like
seraphis-migration/monero #258 and
seraphis-migration/monero #277 , but span code also ya
-
br-m
<jbabb:cypherstack.com> I agree overall that they're separate questions. And maybe it is too much effort just to investigate scaling, but if we want to lower the parameters that we've apparently achieved consensus on, a "scalenet" seems like a way to do that more easily than with the current spam approach
-
br-m
<jberman> I'm pretty close to done on 258 though
-
br-m
<rucknium> @jbabb:cypherstack.com: What do you mean? Would there be no spam on scalenet?
-
br-m
<jbabb:cypherstack.com> @rucknium: I mean that the current spam efforts are doing well to increase blocksize, but if we really want to break things, opening up the scaling params would make that a lot easier to do. I agree that it's basically tangential to FCMP work, tho
-
br-m
<articmine> @jbabb:cypherstack.com: I agree.
-
br-m
<jbabb:cypherstack.com> since we already have loose consensus on params that can be paired with the FCMP HF, if we don't want to lower them anymore, a "scalenet" may be unnecessary.
-
br-m
<jberman> one benefit to the current approach is that it's showing cracks in the system as they appear (and would appear in production conditions), and we tackle them in that priority order
-
br-m
<rucknium> I think we just need to decide what's the target block size to hit in the next stressnet. Then make it easy to hit it.
-
br-m
<jberman> e.g. issues 4mb blocks showed up fairly early, got tackled fairly early because of that
-
br-m
<jbabb:cypherstack.com> it may be unnecessary or something that's safe to defer until after the FCMP HF / beta FCMP work
-
br-m
<articmine> @jbabb:cypherstack.com: The advantage of a scalenet is to accelerate the process.
-
br-m
<articmine> Then we can see in say a few months an issue that may take several years
-
br-m
<rucknium> Anything else on stressnet?
-
br-m
<jberman> My take on status: release v1.5 (hopefully within the next days), see what if any major reliability issues remain, either work on those reliability issues or continue toward beta. Tx relay v2 at this point may be fine to kick to beta
-
br-m
<rucknium> We can end the meeting here. Thanks everyone.
-
br-m
<articmine> Thanks
-
DataHoarder
As a note to the long term weights I should be able to display these on the explorer for the current tip, given I'm getting the miner data and calculating the expected growth inline
-
DataHoarder
Scalenet sounds great plus a few tricks of ours, if the point is to specifically stress that part. Might be good to expect resetting it regularly (so we don't end with TiB of state) and can test new changes iteratively
-
br-m
<rucknium> The new version of
moneronet.info is working now 😁
-
br-m
<monerobull:matrix.org> > 1 TB ram > <@rucknium> @gingeropolous:monero.social suggested that tx spam volume be decreased so that he can have all the RAM available on the big-RAM Monero Research Computing machine. The one with 1TB of RAM.
-
br-m
<monerobull:matrix.org> > And I always thought we are a poor project!
-
br-m
-
br-m
<rucknium> Thank you donors!
-
br-m
<321bob321> Ram only node inbound