
This is where I do wish we had more people working on the theoretical CS side of things in this space.

Once you recognize that all ML techniques, including LLMs, are fundamentally compression techniques, you should be able to come up with estimates of the minimum feasible size of an LLM based on: the amount of information that can be encoded in a given number of parameters, the relationship between information loss and model performance, and the amount of information contained in the original training set.

I simultaneously believe LLMs are bigger than they need to be, but suspect they need to be larger than most people think, given that you are trying to store a fantastically large amount of information. Even with lossy compression (which, ironically, is what makes LLMs "generalize"), we're still talking about an enormous corpus of data we're trying to represent.
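
For a rough sense of the numbers, here's a back-of-envelope sketch in Python. Every constant in it is an assumption for illustration (bits of recoverable information per parameter, corpus size, entropy per byte of deduplicated text), not a measured value; swap in your own estimates.

    # Back-of-envelope: can N parameters "hold" a corpus of T tokens?
    # All constants below are illustrative assumptions, not measured values.

    BITS_PER_PARAM = 2.0          # assumed recoverable info per parameter
    TOKENS = 15e12                # assumed training tokens (e.g. ~15T)
    BYTES_PER_TOKEN = 4           # rough average for BPE-style tokenizers
    ENTROPY_BITS_PER_BYTE = 0.6   # assumed post-dedup entropy of web text

    corpus_bits = TOKENS * BYTES_PER_TOKEN * ENTROPY_BITS_PER_BYTE

    def min_params(corpus_bits, bits_per_param=BITS_PER_PARAM):
        """Smallest parameter count that could store corpus_bits losslessly
        under the assumed per-parameter capacity."""
        return corpus_bits / bits_per_param

    print(f"corpus information: {corpus_bits / 8e9:.0f} GB-equivalent")
    print(f"'lossless' floor:   {min_params(corpus_bits) / 1e9:.0f}B params")

Under these assumptions the "lossless" floor lands far above any model anyone actually trains, which is the point: the interesting theoretical question is how much of that information you can discard (the lossy/generalization part) before performance on the tasks you care about degrades.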



Getting theoretical results along these lines that can be operationalized meaningfully is... really hard.


At least it could give a theoretical bound on the expected hallucinations for a particular model/quant at hand? Although I'm very skeptical that companies would disclose their training corpora, and derivative models trained on top of foundation models add another level of indirection, it would still be interesting to have these numbers, even if just as rough estimates. The compression angle in this thread is spot-on, but yeah, operationalizing this is hard.
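
To illustrate what such a bound could even look like under the compression framing (and only as a sketch with made-up constants): whatever fraction of the corpus's information exceeds the model's quantized storage capacity cannot be recalled, so it gives a floor on unrecoverable facts. The capacity model and efficiency factor below are assumptions, not established results.

    # Hypothetical sketch: lower-bound the fraction of corpus information a
    # model of a given size/quantization cannot store. Constants are
    # illustrative assumptions only.

    def capacity_bits(n_params, bits_per_weight, efficiency=0.5):
        # efficiency: assumed fraction of raw storage usable as "knowledge";
        # quantization shrinks bits_per_weight (e.g. 16 -> 4).
        return n_params * bits_per_weight * efficiency

    def min_unrecallable_fraction(corpus_bits, n_params, bits_per_weight):
        """Fraction of corpus information that cannot fit under the assumed
        capacity model. 0 means this argument gives no bound."""
        shortfall = corpus_bits - capacity_bits(n_params, bits_per_weight)
        return max(0.0, shortfall / corpus_bits)

    corpus_bits = 3.6e13                    # from the estimate upthread
    for bpw in (16, 8, 4):                  # fp16, int8, int4 quants
        f = min_unrecallable_fraction(corpus_bits, 70e9, bpw)
        print(f"{bpw}-bit weights: >= {f:.0%} of corpus info can't be stored")

This obviously conflates "can't store verbatim" with "will hallucinate", which is exactly the gap a real theory would have to close, and it depends on corpus entropy that, as noted, nobody discloses. But it's the kind of number you could compute if they did.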




