TECH

arXiv one-year ban for unchecked AI-generated paper submissions

34+

Signals

Strategic Overview

01.
On May 14-15, 2026, arXiv computer science chair Thomas Dietterich announced a one-year submission ban for authors whose papers contain unchecked AI-generated content, with re-entry conditional on first clearing a reputable peer-reviewed venue.
02.
Penalties trigger on 'incontrovertible evidence' like hallucinated references or embedded LLM meta-comments ('here is a 200 word summary; would you like me to make any changes?'), not on LLM use itself.
03.
The rule operates as a one-strike policy: moderators flag, section chairs confirm, and authors retain appeal rights, with all listed authors held responsible regardless of who produced the offending text.
04.
It extends an October 31, 2025 policy that already required CS review articles and position papers to be accepted at a peer-reviewed journal or conference before posting, explicitly excluding workshop review.
05.
Coverage by 404 Media, TechCrunch, and Decrypt framed the move as arXiv cracking down on 'AI slop' after a record September 2025 month of about 26,000 submissions concentrated among new submitters.

Deep Analysis

What Counts As 'Incontrovertible Evidence' Of AI Slop

arXiv has formalized a one-strike enforcement regime for visibly unchecked AI-generated content.

The new arXiv rule is narrow on paper and sweeping in practice. The trigger is not LLM use, but visible carelessness: hallucinated references, fabricated citations, and embedded model meta-comments such as 'here is a 200 word summary; would you like me to make any changes?' or 'the data in this table is illustrative, fill it in with the real numbers from your experiments' ^[1]. Chair Thomas Dietterich's logic is unforgiving — if the authors did not even check the LLM output, nothing else in the paper can be trusted either ^[1]. Procedurally, it is a one-strike rule, but with safeguards: arXiv moderators flag the issue, the section chair must confirm before the ban applies, and authors retain appeal rights ^[2]. Crucially, responsibility attaches to every listed author, regardless of who actually pasted the prompt response into the manuscript ^[1]. That design choice converts a content-moderation problem into a collective-liability regime: junior coauthors inherit the same one-year exile as the senior author who shipped the broken draft, and the appeal process becomes the only buffer against guilt-by-association.

The Moderator Math: A Capacity Crisis Disguised As A Quality Rule

Strip away the rhetoric and arXiv's policy is fundamentally a queuing problem. By September 2024, nearly a quarter of all arXiv computer science abstracts already showed evidence of LLM modification ^[3]. One year later, arXiv hit a record of roughly 26,000 monthly submissions, with the surge concentrated among new submitters ^[4]. The CS section now absorbs hundreds of AI-generated survey submissions every month, and overall rejection rates have climbed to roughly 10-12% as 'slop' grew exponentially through early 2025 ^[3]^[5]. Dietterich is explicit that moderator capacity, not AI use itself, is the binding constraint — arXiv simply lacks the volunteer hours to manually evaluate every LLM-assisted survey paper ^[3]. The ban-plus-peer-review-gating structure is a cost-shifting mechanism: it pushes screening labor from arXiv's unpaid moderators onto journals and conference program committees. That makes the policy less a principled stand on scientific integrity than a triage protocol, dressed up in the language of trust.

Collateral Damage: Ghost Coauthors And A De Facto Permanent Ban

The policy's edges are where the pain concentrates. Because banned authors must clear formal peer review before posting anything new, economist Joy Buchanan argues the one-year ban 'effectively functions as a permanent preprint ban' — researchers cannot share working drafts during the restriction, and peer review timelines often dwarf the nominal ban length ^[6]. AI safety researcher Stephen Casper notes the related October 2025 rule already disadvantages 'young people, people without access to lots of compute, and people who are not at institutions that have lots of publishing,' because review and position papers are exactly the formats junior scholars use to enter a field ^[3]. Worse, AI-generated papers can list real researchers as coauthors without their knowledge, raising the prospect that innocent academics get swept into a ban they had no role in causing — a scenario Dietterich did not fully address ^[5]. Reddit discussion across r/PhD, r/math and r/MachineLearning was overwhelmingly supportive of the underlying ban, but the same threads flagged exactly these coauthor-liability and detection-bias risks, including the obvious exploit of weaponizing the rule by listing rivals as coauthors on poisoned drafts.

The Coming Arms Race And The Inversion Of The Polish Signal

arXiv's enforcement model relies on visible artifacts — the dangling 'would you like me to make any changes?', the placeholder table, the fabricated DOI. As detection tightens, bad actors are already moving to adversarial techniques: a Nikkei investigation found 17 arXiv preprints with hidden prompts engineered to coerce favorable AI-assisted peer reviews ^[8]. Hallucinated citations themselves are also slipping past human review at the high end: a Columbia study of 2.5 million biomedical papers found hallucinated citations rose tenfold since 2023, reaching one in every 277 papers in early 2026, and GPTZero identified over 100 hallucinated citations across 53 accepted NeurIPS papers that had each cleared at least three human reviewers ^[5]. Founder Paul Ginsparg captures the deeper problem: LLMs invert academia's oldest heuristic. 'The abstract could look reasonable, even isolated paragraphs, but you couldn't see that something was very wrong until you actually tried to follow the logic' ^[4]. When prose polish stops correlating with thinking quality, reviewers lose their cheapest filter — and arXiv loses the moderator economics that made open preprints viable in the first place.

The Precedent: A Template Other Preprint Servers Will Borrow

arXiv is the most influential preprint server in computer science, so its enforcement choices act as a template. The October 31, 2025 rule already pushed review articles and position papers behind a peer-review wall, explicitly disqualifying workshop review as sufficient ^[7]. The May 2026 ban layers personal sanctions on top of that content gate. Together they redefine the preprint server from a fast, open posting venue into something closer to a gated mirror of formally reviewed work — at least for the categories most prone to LLM abuse. Pressure on bioRxiv, medRxiv, SSRN and other preprint servers struggling with the same dynamics now rises, because they can either adopt arXiv-style sanctions or absorb the displaced submissions of authors arXiv has banned ^[7]. The deeper bet is institutional: arXiv is wagering that the cost of being slightly less open is smaller than the reputational cost of becoming a known dumping ground for unverified AI text. If that bet pays off, expect peer-review re-entry gates to become the default enforcement primitive across scientific preprint infrastructure.

Historical Context

2024-09-01

A study found roughly one quarter of all computer science abstracts on arXiv showed evidence of LLM modification, foreshadowing the moderation crisis.

2025-09-01

arXiv hit a monthly record of about 26,000 submissions, with the surge driven primarily by new submitters rather than established authors.

2025-10-31

arXiv tightened its CS category by requiring review articles and position papers to be peer-reviewed and accepted at a journal or conference before submission, explicitly excluding workshops.

2025-11-01

Japanese newspaper Nikkei reported that 17 arXiv preprints contained hidden adversarial prompts intended to manipulate AI-powered peer reviewers.

2026-05-14

Dietterich publicly posted the clarified enforcement policy on X, including the one-year ban and the peer-review re-entry requirement.

2026-05-15

Mainstream tech outlets reported the policy on May 15-16, 2026, framing it as arXiv cracking down on AI-generated slop.

Power Map

Key Players

Subject

arXiv one-year ban for unchecked AI-generated paper submissions

arXiv

Cornell-affiliated preprint server (transitioning to independent nonprofit) that owns and enforces the new policy while facing volume pressure from AI-generated submissions.

Thomas G. Dietterich

Chair of arXiv's computer science section and Distinguished Professor Emeritus at Oregon State University; announced and articulated the one-year ban on X.

Paul Ginsparg

Founder of arXiv; publicly framed LLM-generated submissions as an existential challenge to the platform's mission of rapid, trustworthy preprint sharing.

Steinn Sigurdsson

Scientific Director of arXiv; describes the volume of egregious AI submissions moderators already reject that motivated the formal policy.

Computer science researchers and authors

Direct targets of the policy; CS is the section where AI-generated submissions are most concentrated and where the rules apply first.

Early-career and under-resourced researchers

Subgroup that critics say is disproportionately exposed because review and position papers are often authored by junior scholars.

Fact Check

8 cited

Source Articles

Top 4

THE SIGNAL.

Analysts

"Argues unchecked LLM output destroys trust in an entire paper, justifying a one-year ban followed by peer-review gating, and ties the policy directly to a moderator-capacity crunch driven by LLM-assisted survey papers."

Thomas G. Dietterich

Chair, arXiv Computer Science section; Distinguished Professor Emeritus, Oregon State University

"Views LLM-generated content as existentially threatening because superficially polished abstracts and paragraphs disguise major logical and substantive gaps that only emerge when readers try to follow the argument."

Paul Ginsparg

Founder of arXiv

"Defends the policy by pointing to the volume of egregiously bad submissions the public never sees because moderators already reject them, treating the new ban as formalizing an existing screening burden."

Steinn Sigurdsson

Scientific Director, arXiv

"Warns that survey and position-paper restrictions disproportionately harm early-career researchers and people outside well-resourced institutions who rely on those formats to enter discourse."

Stephen Casper

AI safety researcher

"Argues the one-year ban effectively functions as a permanent preprint ban because banned researchers cannot share working papers during the restriction and must clear formal peer review before re-entry."

Joy Buchanan

Economist, Economist Writing Every Day blog

The Crowd

"⚠️Do you use arXiv? Don't miss this important note on their policy regarding LLM-generated papers: "The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue.""

@@keenanisalive0

"🛑 arXiv finally had enough of the AI paper spam — it's changing the rules to keep the bots out. arXiv's computer science category will only accept review and position papers that are already peer-reviewed and accepted by a journal or conference, a move aimed at curbing low-effort AI-written surveys."

@@rohanpaul_ai0

"arXiv's ban is a warning shot. Universities need comprehensive AI literacy programs NOW. Not more bans. Not detection software. Actual education about ethical AI use in research. Before every platform implements emergency restrictions. Is your institution developing AI literacy?"

@@PhDtoProf0

"ArXiv to Ban Researchers for a Year if They Submit AI Slop"

@u/404mediaco1400

Broadcast

arXiv bans academic authors for AI slop papers

arXiv's New Ban: Hallucinated References Now Cost You a Year

Prof. Ramin Zabih | Challenges and opportunities for arXiv in the Age of AI