TECH

Anthropic warns of recursive self-improvement as Claude writes 80% of its code

63+

Signals

Strategic Overview

01.
On June 4, 2026, Anthropic researchers Marina Favaro and Jack Clark published 'When AI builds itself,' a post arguing the world needs a coordinated mechanism to slow or pause frontier AI development before recursive self-improvement erodes meaningful human oversight.
02.
The headline metric: Claude now authors more than 80% of the code merged into Anthropic's own codebase, up from low single digits when Claude Code launched in February 2025, while engineers ship roughly 8x more code per quarter than in 2024.
03.
Anthropic's internal next-generation model, Mythos Preview, hit a 52x mean speedup on a CPU-only LM training-optimization benchmark, compared with about 3x for experienced humans who typically take four to eight hours on the same task.
04.
Anthropic stresses recursive self-improvement has not yet happened and is not inevitable, but warns that even rare misalignment today could compound across AI-built successor generations until oversight is lost.

Deep Analysis

What the 80%, 8x and 52x numbers actually measure

Anthropic's case for taking recursive self-improvement seriously rests on a stack of internal metrics that look much sharper when unpacked individually. The headline figure is that Claude now authors more than 80% of code merged into Anthropic's codebase, up from low single digits before Claude Code launched in research preview in February 2025 ^[1]. Sitting beneath that is an 8x quarter-over-quarter increase in code shipped per engineer between 2024 and Q2 2026, plus 800 internal API fixes that Anthropic estimates would have taken about four years of human effort to complete ^[1]^[6]. None of these are public benchmarks; they are productivity readings from a single lab on its own product, so the absolute level matters less than the slope.

The more load-bearing number is Mythos Preview's 52x mean speedup on a CPU-only LM training-optimization task, where skilled human engineers manage roughly 3x and take four to eight hours to do it ^[4]^[8]. That benchmark is the closest thing in the post to an actual recursive-self-improvement signal: an AI system optimizing the code that trains AI systems. Outside Anthropic, METR independently places Claude Mythos Preview's 50% time horizon at 'at least 16 hours' — the ceiling of METR's current task suite — with a 95% confidence interval from 8.5 to 55 hours, and notes the horizon has been roughly doubling every four months ^[2]. Put together, the picture is less 'AI replacing engineers' and more 'AI compressing the iteration loop that produces better AI', which is the specific mechanism Anthropic is asking regulators to think about.

The brake politics and the regulatory-capture critique

The reason Anthropic is talking about a pause now, rather than after recursive self-improvement clearly arrives, is laid out by Jack Clark on BBC Newsnight: 'You want the option to be able to take your foot off the gas and put your foot on the brake' ^[9]. Co-authors Favaro and Clark argue that 'rare occurrences of misalignment present in today's models could compound as the models build their successors, growing more frequent but less understood until we lose control of them' ^[6]. The post explicitly concedes the political problem: training runs are easier to hide than missile silos, defection incentives are enormous, and Anthropic says it would only pause if peers verifiably did the same — 'whoever continues while others pause could inherit the lead' ^[4].

That acknowledgment has not blunted the regulatory-capture read. Constellation Research's Holger Mueller asks whether Anthropic is 'trying to freeze the status quo so it can catch up, or simply retain its lead' ^[5]. Enderle Group's Rob Enderle goes further, calling a coordinated pause 'practically impossible, because the economic and national security stakes are simply too high for any superpower to willingly hit the brakes now' ^[5]. Community reception on r/singularity has been similarly suspicious, with the most-upvoted threads framing the post as pre-IPO marketing and the 'coordinated global slowdowns' framing as a textbook regulatory-capture pitch from the lab that benefits most if smaller competitors have to file the same paperwork. The disagreement is not really about whether the productivity numbers are real; it is about whether 'we need a brake' is a safety argument or a moat argument.

The skeptical read: LeCun, hardware reality, and the in-house regressions

Yann LeCun, who has built a public posture around dismissing LLM-scaling claims, called the post a mix of 'navel-gazing, some marketing, and a lot of very sincere beliefs' and reiterated that LLM-based systems cannot reach human-level intelligence in his view ^[7]. That is the high-profile critique, but the more interesting skepticism is structural. The r/accelerate thread on Jack Clark's roughly 60% odds of RSI by 2028 includes a widely upvoted reality check on what closing the loop actually costs: running a Flash-tier model in inference mode is roughly $5k of hardware at 170W, but running the same model in a true self-improvement configuration — generation, verification and optimization concurrently — requires an 8x NVIDIA B200 node with 1.5TB of HBM3e VRAM and 3.2 Tbps InfiniBand, around $650k of hardware drawing 17 kW. Even if the algorithm works, the per-iteration energy and capital bill puts a ceiling on how quickly anyone can iterate.

The second strand of skepticism comes from inside Anthropic's own customer base. Reddit threads on r/singularity surface a stubborn complaint that Claude Opus 4.7 and 4.8 feel like regressions in everyday coding work, alongside a 55k+ GitHub backlog of closed-but-unresolved issues that users keep pointing to as counter-evidence to the 'engineers ship 8x more code' framing. The tension is that the 80% figure measures lines merged, not bugs prevented or reviewer hours saved — a point WinBuzzer makes explicit, arguing the real story is that review and integration discipline now matters more, not less ^[8]. The skeptical read is not that the numbers are fake; it is that 'AI writes the code' and 'AI improves itself' are different claims, and Anthropic's own customers are surfacing failure modes that complicate the second one.

Engineers as reviewers, and the 4.5%-of-GitHub second-order effect

If the productivity numbers hold even directionally, the labor story is not 'engineers replaced' but 'engineers reorganized into reviewers and orchestrators.' Decrypt highlights that Claude Code now accounts for roughly 4.5% of all public GitHub commits, generating about 2.6 million per week — a footprint big enough that Anthropic's internal workflow shifts ripple into how the broader open-source ecosystem reviews, merges and audits code ^[2]. An internal March 2026 survey of 130 Anthropic employees reported a 4x productivity output bump from working with Mythos Preview, and Claude's success rate on complex engineering tasks hit 76% in May 2026, a 50-point jump in six months ^[1]^[2]. That is the regime in which a human engineer's day starts looking less like writing and more like triaging diffs.

The second-order question is what 'human in the loop' even means at that point. The r/accelerate discussion around Anthropic's 60%-by-2028 RSI estimate landed on a sharp definition fight: 'by recursive self improvement we mean no human in the loop' — and the most-upvoted technical comment argued the loop is already half-closed because today's frontier models are routinely used to develop, train and test their own successors. Anthropic's own framing tries to thread that needle by reporting hockey-stick capability charts on its blog while simultaneously saying recursive self-improvement 'is not inevitable' ^[1]^[3]. The contradiction is doing a lot of work: the same post is both a productivity flex and a regulatory ask, and the reason the safety community is paying attention is that those two arguments are now coming from the same place at the same time.

Historical Context

2024-03

Claude's measured task horizon sat at roughly 4 minutes for Opus 3, the baseline of the doubling trend Anthropic later used to argue that horizons are doubling every four months.

2025-02

Claude Code launched in research preview; at that point Claude authored only low single-digit percentages of merged code at Anthropic.

2025-11

Internal evaluation found Claude matched human researcher judgment 51% of the time in November 2025, marking the point where Anthropic began describing the model as a credible research collaborator rather than just a tool.

2026-03

Claude task horizon reached approximately 12 hours, with Anthropic projecting week-long autonomous tasks by 2027 if the doubling trend continues.

2026-04

Anthropic's CPU-only LM training-optimization benchmark showed Mythos Preview achieving a 52x mean speedup, up from about 3x for Claude Opus 4 a year earlier.

2026-06-04

Favaro and Clark publish 'When AI builds itself,' formally calling for a coordinated mechanism to slow or pause frontier AI development.

Power Map

Key Players

Subject

Anthropic warns of recursive self-improvement as Claude writes 80% of its code

Anthropic

Frontier AI lab publishing the 'When AI builds itself' research post and the call for coordinated slowdowns; the same company reporting that Claude now writes the majority of its production code.

Jack Clark

Anthropic co-founder and head of policy; co-author of the post and the public face arguing on BBC Newsnight that the industry needs a brake on frontier development.

Marina Favaro

Researcher at the Anthropic Institute and Clark's co-author on the post; helped frame the compounding-misalignment thesis.

METR

Independent evaluator whose benchmark places Claude Mythos Preview's 50% time horizon at the upper bound of the current task suite, around 16 hours of sustained autonomous work.

Yann LeCun

AI pioneer and former Meta chief AI scientist; prominent critic dismissing the post as a mix of navel-gazing and marketing while maintaining LLM-based systems cannot reach human-level intelligence.

Rob Enderle

Principal analyst at Enderle Group arguing a global pause is impractical because the economic and national-security stakes are simply too high for any superpower to willingly hit the brakes.

Holger Mueller

VP and principal analyst at Constellation Research questioning whether Anthropic's call is genuine safety advocacy or strategic positioning to freeze a competitive landscape it might want to catch up in.

Fact Check

9 cited

Source Articles

Top 5

THE SIGNAL.

Analysts

"Argues the world needs the option to brake frontier AI development before recursive self-improvement removes human oversight, and warned that reaching 100% Claude-written code is possible within two years. 'You want the option to be able to take your foot off the gas and put your foot on the brake.'"

Jack Clark

Anthropic co-founder, head of policy

"Warn that the rare occurrences of misalignment present in today's models could compound as the models build their successors, growing more frequent but less understood until humans lose control."

Marina Favaro and Jack Clark

Anthropic researchers, post co-authors

"Dismisses the post as overclaiming: 'There is a bit of navel-gazing, some marketing, and a lot of very sincere beliefs about what Anthropic thinks is likely in the near future of AI.' Maintains LLM-based systems cannot reach human-level intelligence."

Yann LeCun

AI pioneer, former Meta chief AI scientist

"Believes a coordinated global pause is practically impossible: 'This would be practically impossible, because the economic and national security stakes are simply too high for any superpower to willingly hit the brakes now.'"

Rob Enderle

Principal analyst, Enderle Group

"Reads the call as strategic positioning, asking whether Anthropic is 'trying to freeze the status quo so it can catch up, or simply retain its lead' rather than acting purely on safety grounds."

Holger Mueller

VP and principal analyst, Constellation Research

The Crowd

"Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It's happening faster than we thought, and the implications deserve greater attention."

@@AnthropicAI22258

"We just published internal data on how much of Claude's development is already being done by Claude: - Over 80% of all code merged into our codebase is now written by Claude - It's been months since many researchers at Anthropic hand-wrote code - The typical Anthropic engineer"

@@alexalbert__2663

"Recursive self-improvement post by Anthropic: "Each time we release a model, we give it code that trains a small AI model, ask the new model to speed it up. In May 2024, Claude Opus 4 averaged a ~3x speedup. This April, Mythos Preview achieved ~52x." RSI is happening, and I"

@@Yuchenj_UW514

"Anthropic - Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor."

@u/Educational_Grab_473874

Broadcast

Zvi's Mic Works! Recursive Self-Improvement, Live Player Analysis, Anthropic vs DoW + More!

AI Can Improve Itself Now. We're Sure That's Fine.

Is Anthropic a Cult? AI Beats ER Doctors & Recursive Self-Improvement | This Week in AI E12