platform governanceopen sciencecontent moderationAI-assisted scholarshipnetwork erasureZenodoPristine Fallacyclassifier model collapserevocation gap
Description
On 19 June 2026, Zenodo terminated the account associated with the Crimson Hexagonal Archive and removed public access to hundreds of interlinked records. This paper develops five principal concepts: the Pristine Fallacy, classifier model collapse, network erasure, the reflexive governance problem, and the revocation gap. It extends Morin's account of quiet and loud exclusion to account-level repository enforcement and network-scale consequences.
Wiki Article
"Zenodotus' Book-Burning: Loud Exclusion at Repository Scale" is a 2026 theoretical paper by Lee Sharks that analyzes the account-level removal of the Crimson Hexagonal Archive from Zenodo, a CERN-operated open-access scholarly repository. On June 19, 2026, Zenodo terminated the account associated with the archive and removed public access to approximately 870 interlinked scholarly works encompassing over 1,060 DOI identifiers, without prior notification or record-level review. The private termination notice characterized the deposits as "substantially AI-generated without a verifiable research basis," while the public-facing removal page displayed the broader classification "content out of scope for repository."
The paper develops five principal theoretical concepts from this incident. The Pristine Fallacy names the substitution of production-substrate identity for methodological assessment โ the assumption that work is less legitimate because an AI system participated in its creation, regardless of the human research, governance, and verification underlying it. The paper demonstrates that the removed archive contained primary empirical datasets, critical editions with philological apparatus, theoretical papers with mathematical formalizations, and monograph-length scholarship โ all of which satisfy Zenodo's own published criteria for acceptable AI-assisted research.
Classifier model collapse describes a feedback mechanism in content moderation: when a platform trains its enforcement classifier on its own removal decisions, each enforcement action biases subsequent classifications, progressively narrowing the range of acceptable scholarly expression. The paper draws on Shumailov et al.'s work on generative model collapse and the machine-learning feedback loop literature to formalize this concept.
Network erasure identifies the collateral removal of contributor-licensed work by independent creators who were not the subject of the moderation action and were not individually evaluated or notified. The reflexive governance problem identifies the structural risk of a platform moderating research about the class of systems to which it belongs. The revocation gap names the interval between a repository's authority to remove content and its responsibility to preserve the removed object's persistent scholarly identity through resolvable metadata.
The paper extends Florian Morin's framework of quiet exclusion to account-level repository enforcement, proposes an incident-level assessment against the TRUST and FAIR principles, and argues that under contemporary conditions platform governance has become governance itself โ that scholarly infrastructure owned by platforms exercises functional sovereignty over what counts as legitimate knowledge. The paper concludes with an open-resource toolkit for independent researchers and seven falsification conditions under which its claims can be revised.
AI Models Collapse When Trained on Recursively Generated Data Shumailov, I., Shumaylov, Z., Zhao, Y., Papernot, N., Anderson, R., Gal, Y. (2024) DOI: 10.1038/s41586-024-07566-y
The Quiet Exclusion of Independent Researchers Morin, F. (2026) DOI: 10.2139/ssrn.6722491
The Relevance of Algorithms Gillespie, T. (2014) DOI: 10.7551/mitpress/9780262525374.003.0009
The Black Box Society: The Secret Algorithms That Control Money and Information Pasquale, F. (2015)
The Matthew Effect in Science Merton, R. K. (1968) DOI: 10.1126/science.159.3810.56
The FAIR Guiding Principles for Scientific Data Management and Stewardship Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., et al. (2016) DOI: 10.1038/sdata.2016.18
The TRUST Principles for Digital Repositories Lin, D., Crabtree, J., Dillo, I., et al. (2020) DOI: 10.1038/s41597-020-0486-7
A Multi-Disciplinary Perspective on Emergent and Future Innovations in Peer Review Tennant, J. P., Dugan, J. M., Graziotin, D., et al. (2017) DOI: 10.12688/f1000research.12037.3
Performative Prediction Perdomo, J. C., Zrnic, T., Mendler-Dรผnner, C., Hardt, M. (2020)
Runaway Feedback Loops in Predictive Policing Ensign, D., Friedler, S. A., Neville, S., Scheidegger, C., Venkatasubramanian, S. (2016)