15:29:09 Meeting in this room in 1.5 hours. 17:00:43 Meeting time! https://github.com/monero-project/meta/issues/908 17:00:54 1. Greetings 17:01:50 Hello 17:02:23 Hi 17:03:52 2. Updates. What is everyone working on? 17:04:57 Me: back on subaddressses. No ETA, but been contacted by multiple people about it now, so it's hopefully done soon^tm 17:05:22 me: Writing down some early thoughts about analyzing PocketChange privacy. I also read the Ciphertrace employee's statement for the defense in the Bitcoin Fog case. 17:06:19 Anything really surprising there, in that statement? 17:07:27 Yes. It is interesting to see Chainalysis and Ciphertrace fight. I can discuss in the discussion section. 17:09:19 3. Discussion. What do we want to discuss? 17:10:31 If nothing else pressing, please do report about that fight :) 17:11:11 Here is the filing from the Ciphertrace employee: https://storage.courtlistener.com/recap/gov.uscourts.dcd.232431/gov.uscourts.dcd.232431.159.1.pdf 17:12:21 This was filed after a short filing from Chaianlysis that responded to some questions by the court. The Chainalysis statement said that their methodology was not peer reviewed and it did not have a defined false positive rate (error rate). 17:14:31 My opinion is that the problems with Chaianlysis's methodology do not mean that blockchain surveillance is completely ineffective (i.e. that bitcoin privacy is "good enough"), but that Chainalysis's methodology is unscientific and unfit for producing the evidence up to the standard of proof in a legal system with strong protections for the accused. 17:15:11 If blockchain surveillance is more scientific, then yes bitcoin still has major privacy problems. 17:16:44 The Ciphertrace filing criticized the lack of false positive rates and Chainalysis's use of expanded change output classification rules. They call it "heuristic 2 (behavioral)", which I guess has some meaning in the original case documents that I have not read. 17:17:13 "Therefore, the discrepancy rate between Ciphertrace and Chainalysis Bitcoin Fog attribution is roughly 67%." 17:17:25 Those pages and pages of documented errors are pretty depressing, and damning. 17:18:04 The filing also has discrepancy rates for other entities like AlphaBay 17:18:46 Found it. 17:19:35 This is a quite rare look into the workings of these companies, right? Probably so far only their customers get such details. 17:19:55 The filing from the Ciphertrace employee says that maybe the defendant (Roman Sterlingov) was accused since he had the only KYC'ed account among all the suspected Bitcoin Fog operator withdrawals. 17:21:30 Did you find any heuristic in there that surprised you personally? That you never heard about, or thought that it would not work well enough? 17:21:31 Consider if the defendant was targeted by a different entity in a different place. Would a dictatorship allow him to review the evidence against him? What about a criminal gang coming to rob him? False accusations are as big of a threat, if not more, to users of transparent coins. 17:23:42 rbrunner: Mostly the filing refers to papers that have been posted publicly. One "new" statement is that Ciphertrace said that they do try to link bitcoin addresses/txs to IP addresses: "Ciphertrace collects IP addresses via our own node operation and links them to bitcoin addresses." 17:24:08 That's not surprising, but I had not seen a public statement from them that this is part of their surveillance 17:24:18 Ok 17:25:05 After reading this diagonally, Ciphertrace looks anyway much more "dangerous" than their competitor there ... 17:25:19 "Blockchain forensics should only be used to generate investigatory leads. Standing alone, they are insufficient as a primary source of evidence. What is striking about this case is the conclusions reached without any corroborating evidence for the blockchain forensics." 17:27:45 My note on Monero real spend classification using tx fungibility defects has a formula for the false positive rate, so I am ahead of Chainalysis in that regard: https://github.com/Rucknium/misc-research/tree/main/Monero-Fungibility-Defect-Classifier/pdf 17:28:04 :) 17:28:56 Ciphertrace was purchased by MasterCard, so their business outlook may be different from Chainalysis now. 17:30:10 Maybe they looked at both before buying and reached a good conclusion 17:31:22 Standards of evidence are going to be different for a court of law and advertising targeting, to take the two extremes. 17:32:19 The court case will go on, based on these filings, I assume? 17:32:27 MasterCard is probably doing regulatory compliance based on risks scores. Maybe some ad targeting too 17:34:13 I guess so. I think the prosecution is pushing for a plea deal like usual, so maybe they are in plea deal negotiations or if the defense thinks their side is strong enough they could take it to an actual trial. 17:36:34 Somehow amazing how all those cryptocurrency funds and ETFs that spring up now risk loss of some of their funds because they turn out to be "dirty" 17:37:00 No wonder MasterCard tries to find something solid 17:37:10 Ironically Ciphretrace has permitted no peer review of their Monero "tracing" methodology. I would have guessed with the Mastercard purchase that they would stop working on it, but IIRC they released a statement at the August 2022 hard fork saying you can run but you can't hide, Monero users. 17:38:49 Anyway, I have been thinking about the privacy impact of users of Monerujo's PocketChange (and any similar system) to split wallet contents into multiple outputs to get around the 10 block lock. 17:39:12 There are two components: privacy before the PC tx and privacy after. 17:40:29 The biggest privacy impact in the post-PC txs would be if transactions consolidate outputs from the PC tx in a single tx later. 17:40:56 You would see two or more rings in the later tx that reference multiple outputs from the original PC tx 17:41:40 So we would want to know how often that ring pattern would occur as a coincidence when a tx actually has nothing to do with a PC tx 17:41:43 I guess such a consolidation can happen quite easily 17:42:35 I think that the false positive rate is going to be a function of the number of txs on the blockchain with more than two outputs, their age, and the decoy selection algorithm. 17:42:54 All roads lead back to the decoy selection algorithm and questions about what wallet2 actually does. 17:43:57 The DSA wasn't very important for the analysis of guessibility of real spends with tx fungibility defects because....well, just because 17:44:47 This consolidation problem does not get smaller if more and more wallets start to implement such a feature, right? 17:45:18 But how densely the DSA selects from certain regions of the output set matters a lot for PC analysis because the probability of selecting multiple outputs from the same exact transaction in a large sea of transactions is going to depend a lot of how big that sea is. 17:47:23 rbrunner: I think if there are more txs with greater than two outputs on chain, then a classification rule that uses multiple outputs from the same tx would have a higher false positive rate. How much more? The math has to be written. 17:49:26 I don't think I will develop the PocketChange analysis much further at this time unless I get a sudden lightning bolt of insight. I would probably put it in a list of possible projects to work on in a CCS later. 17:50:28 Priorities would be set partially based on community input. 17:51:43 I am giving advice to a research group that is planning to recreate the BTC transaction graph with Monero rules. gingeropolous thought it would be a good idea to do that. 17:52:05 A lot of things need to be modified. I am wondering if I am missing anything: 17:52:30 I don't understand. BTC transaction graph with Monero rules - what rules? 17:53:01 You need to eliminate the 10 block lock and even allow child txs within the same block. I guess you could do the idea of referencing by TXID instead out sequential output index 17:53:28 Have a private testnet where you broadcast and confirm the same txs in the BTC blockchain, but with monerod. 17:54:04 tx verification for mining blocks is going to take a very long time. I wonder if there is a way to just turn it off. 17:54:12 Sound a bit crazy, at first read. 17:54:31 They will need a lot of SSD space. I guess 3 TB at least 17:54:41 Why is it crazy? 17:55:05 Maybe I don't understand yet how that could possibly be useful. 17:55:09 You need coin emission to match BTC's emission. 17:55:29 Should this demonstrate that we could handle the same amount of transactions? 17:56:08 There are databases with entity labels on BTC transactions and addresses. Like centralized exchanges. The BTC tx graph can give you the behavioral parameters to simulate what EAE attacks could look like. 17:57:03 The main purpose of doing this would be to analyze the privacy of Monero when you have a ground truth of what an actual tx graph would be. 17:57:35 You would need to eliminate the limit on the number of tx outputs. 17:58:47 You mean you could let loose some heuristics on that new Monero-fied blockchain and then could see exactly how good they were? 17:59:04 Yes 17:59:29 Clever. But probably is anything but trivial. 18:00:00 But already the dropped 10 block limit would disturb things, I guess 18:00:28 Why not just create a simulation in Python or R? There would be pros and cons to doing a simulation. A real blockchain is what the research group wants to do. 18:01:10 So let them research, I would say :) 18:01:19 Methods based on timing analysis wouldn't apply in the exact same way, right. 18:02:07 Right, but I want to warn them if there's things that need to be modified so they are not surprised later. 18:02:48 Good idea. I think mining is no problem, you can easily accept arbitrarily low difficulties. 18:03:08 There is a switch in monerod already I think for that, to create a large blockchain easily 18:03:28 Yes. I pointed them to https://github.com/moneroexamples/private-testnet 18:04:13 Yup, " --fixed-difficulty 100 " as a daemon start flag 18:05:25 Maybe the growth limits of the blocks need to get relaxed 18:05:51 Or even dropped, if you want the same transactions in the same blocks? 18:06:17 We can end the meeting. Let me know if you think of anything major that they would have to modify. 18:06:39 Alright. Thanks for all the interesting info. 18:07:15 I think the BTC block size growth may be smooth enough to fit within the dynamic limits. I'm not sure.