Anthropic Mythos: the private gatekeeper of zero-days
TECH

Anthropic Mythos: the private gatekeeper of zero-days

40+
Signals

Strategic Overview

  • 01.
    Anthropic unveiled Claude Mythos Preview on April 7, 2026 as a frontier model that can autonomously find and exploit zero-day vulnerabilities across every major operating system and web browser when directed to do so.
  • 02.
    Rather than release Mythos publicly, Anthropic launched Project Glasswing, a closed coalition of 12 founding partners — AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, Linux Foundation, Microsoft, NVIDIA, Palo Alto Networks, and Anthropic itself — authorized to use the model exclusively for defensive vulnerability research.
  • 03.
    On June 1, 2026, Anthropic informed the European Commission that ENISA would be granted Mythos access, making it the first EU agency admitted to Glasswing after months of EU pressure over asymmetric, US-only access.
  • 04.
    Palo Alto Networks ran Mythos and OpenAI's GPT-5.5-Cyber across 130+ of its products in roughly one month and reported 75 legitimate vulnerabilities, with three weeks of model-driven analysis matching a full year of manual penetration testing.

Deep Analysis

Who Gets to See the Bugs

The story buried under the Mythos launch is not capability — it is governance. By refusing to ship Mythos as a normal API product and instead routing access through Project Glasswing, Anthropic has effectively made itself the gatekeeper deciding which companies and which governments get to see what may be the most productive zero-day discovery system ever built [1][2]. The founding coalition reads like a list of US infrastructure incumbents: AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, Linux Foundation, Microsoft, NVIDIA, Palo Alto Networks, and Anthropic itself [2]. There is no European telco, no European bank, no European agency on that list. For two months, the practical security map of the internet was being redrawn inside a private US coalition.

That is what makes the June 1 ENISA announcement consequential rather than ceremonial. Anthropic communicated the decision to the European Commission over the weekend, and ENISA becomes the first EU agency authorized to use the model exclusively for defensive purposes [3][4]. The path to that decision ran through pressure: EU lawmakers called the asymmetry 'extremely worrying,' Commission representatives flew to San Francisco for briefings, and an invited Anthropic hearing was declined for short notice [5]. The deeper question — whether a private lab should be the entity adjudicating sovereign access to defensive cyber capability at all — has not been answered. It has just been deferred behind a one-off accommodation. Other agencies in other jurisdictions will now be measuring their own positions against ENISA's, and Anthropic will be measuring each request against its own internal policy. Self-regulation, as Gary Marcus put it bluntly, is the operating model whether anyone signed off on it or not [6].

What the 23,000 Number Actually Means

Anthropic's headline statistic is a deliberately large one: Mythos Preview has surfaced 23,019 potential issues across more than 1,000 open-source projects [7]. The 30-day Glasswing update touts more than 10,000 high or critical-severity findings and a true-positive rate above 90% on validated samples [8]. Read carefully, the qualifier matters. As of late May 2026, only 1,900 of the 23,000 issues had been reviewed by external security firms, and 1,726 of those were confirmed [7]. The cybersecurity community on Reddit has been making exactly that point, and they are not wrong: the gap between 'potential vulnerabilities surfaced' and 'verified, patched, gone' is the gap where most security tooling collapses.

The other number worth scrutinizing is disclosure throughput. Anthropic reports 1,596 vulnerabilities disclosed across 281 open-source projects as of May 22, with only 97 patched [7]. That is roughly a 6% patch rate on disclosed bugs, even with a $2.5M grant to Alpha-Omega/OpenSSF and $1.5M to Apache to absorb the workload [2]. Triage capacity is the binding constraint, not detection. This is the part of the story Anthropic's marketing flattens: the model is fast enough at finding bugs to overwhelm the human and institutional pipelines that have to verify and fix them. Cloudflare's Grant Bourzikas was right that the qualitative leap is exploit-chain construction, not enumeration [9]. But on the defensive side, enumeration is what the world is currently drowning in.

The $25 Per Million Token Economics

Mythos post-preview pricing is set at $25 per million input tokens and $125 per million output tokens, roughly 5x the rate of Opus 4.6. Concrete workloads make that pricing tangible: about $20,000 for 1,000 OpenBSD scans, around $10,000 for the FFmpeg work that surfaced a 16-year-old H.264 bug missed by five million automated tests, and $1,000 to $2,000 per individual Linux exploit [1]. Those numbers explain why Anthropic is underwriting Glasswing partners through a $100M usage-credit pool — at list price, even well-resourced security teams would burn through hundreds of thousands of dollars per audit, and the model's value proposition would collapse under its own cost curve [2].

The subsidy reshapes the market in ways worth naming. Glasswing partners get effectively unlimited Mythos against their own codebases; non-partners do not. Open-source maintainers get scans paid for by Anthropic but receive a deluge of disclosures with a 6% patch rate. Security consultancies pricing manual penetration tests now compete against a tool whose marginal cost to its sanctioned users is approximately zero. And the pricing itself — 5x Opus — sets a floor for what frontier-class cyber AI is allowed to cost, which is the kind of signal competitors read carefully. XBOW's positioning of OpenAI's GPT-5.5 as the open alternative to Mythos's invite-only model is not just competitive marketing; it is a wager that openness will be a more durable distribution strategy than gatekeeping once the capability becomes table stakes.

Three Weeks vs. A Year

The Palo Alto Networks bake-off is the most concrete data point about what Mythos actually does to the security industry. Palo Alto ran Mythos and GPT-5.5-Cyber across more than 130 of its own products in roughly one month, surfaced 75 legitimate vulnerabilities — well above its typical 5 to 10 per month baseline — and published 26 CVEs [10][11]. Three weeks of model-driven analysis matched what would normally be a full year of manual penetration testing, with broader coverage [10][12]. Chief Product Officer Lee Klarich's framing was unsentimental: these models are much better at writing working exploits than anything seen before [10].

The defensive-window claim that followed is the part the security industry has not metabolized. Palo Alto is telling customers that attackers are 3 to 5 months from comparable capability [11]. If that timeline is correct — and there is no way to verify it externally — the implication is that every organization which currently relies on annual or semi-annual third-party penetration tests is on a clock. The economics of the third-party pentest itself are now in question: a one-month engagement against a Glasswing-subsidized tool will plausibly find more in three weeks than a quarterly retainer finds in a year. Cloudflare's internal test of 50+ repositories, Mozilla's 271 surfaced Firefox vulnerabilities, and the 27-year-old OpenBSD TCP/SACK flaw that survived decades of human review all point in the same direction [1][7][9]. Manual penetration testing as a primary line of defense is not dead, but it is no longer the most productive use of senior security headcount.

The Safety Narrative on a Seven-Week Clock

Anthropic's safety framing has worked harder than its technical framing. The company's April 7 disclosure positioned Mythos as too dangerous to release as a normal product, and Project Glasswing was the structural answer to that framing [1][2]. Seven weeks later, ENISA is in, the partner list has grown, and the institutional logic of 'controlled defensive access' is being stretched to cover what is starting to look like a fast onboarding curve. The Institute for Security and Technology's reading is the one to take seriously: Anthropic itself acknowledged in its model card that the risk of significantly harmful outcomes substantially enabled by Mythos Preview's misaligned actions is very low, but higher than for previous models [13].

That sentence is the safety narrative's real load-bearing claim, and it cuts both ways: it concedes the elevated risk, and it asks the world to trust Anthropic's process for managing it. Gary Marcus's response — that Anthropic showed admirable restraint but that competitors such as OpenAI and xAI may not, and that self-regulation is too little, too late — is the most clarifying take in the public record [6]. Mythos is the first concrete test of whether a private AI lab's safety judgment can serve as durable governance for a capability with national-security implications, and the answer is going to come from how the next ten access decisions are made, not from how the first one was.

Historical Context

2026-04-07
Announced Claude Mythos Preview and launched Project Glasswing with 12 founding partners, declining to release the model publicly while granting defensive-only access to a closed coalition.
2026-04-29
Published analysis arguing Mythos exposes new loss-of-control risks beyond traditional misuse framing, citing Anthropic's own acknowledgment of elevated misaligned behavior.
2026-05-06
EU lawmakers scheduled scrutiny of Mythos risks; Anthropic declined a hearing invitation citing short notice, intensifying transatlantic friction over US-only access.
2026-05-08
Mythos disclosures triggered widespread alarm in cybersecurity circles, even as researchers like Vidoc Security's Klaudia Kloc argued similar zero-day capabilities had already existed in older models.
2026-05-13
Disclosed it had used Mythos and GPT-5.5-Cyber to find 75 vulnerabilities across 130+ products in roughly one month and warned that attackers were 3-5 months from comparable access.
2026-05-18
Published Project Glasswing findings highlighting exploit-chain construction and working-proof generation as Mythos's standout capabilities, distinct from prior bug-listing tools.
2026-05-26
Published a Project Glasswing 30-day update reporting more than 10,000 high/critical-severity findings and a >90% true-positive rate on validated samples.
2026-06-01
Anthropic informed the European Commission that ENISA would receive Mythos access, making it the first EU agency in Project Glasswing and the first crack in the US-only access wall.

Power Map

Key Players
Subject

Anthropic Mythos: the private gatekeeper of zero-days

AN

Anthropic

Creator of Mythos Preview and operator of Project Glasswing; controls who gets access, sets the $25/$125 per million token pricing, and underwrites partners through a $100M usage-credit pool plus grants to OpenSSF and Apache.

EN

ENISA (EU Agency for Cybersecurity)

First EU agency admitted to Glasswing; will test Mythos for defensive cyber work on European critical infrastructure after months of public pressure from MEPs and the European Commission.

PA

Palo Alto Networks

Headline Glasswing partner; used Mythos to surface 75 bugs and publish 26 CVEs across 130+ products, and warns customers attackers are 3-5 months from comparable capability.

EU

European Commission

Briefed by Anthropic on Mythos's cyber capabilities and brokered ENISA's access after Commission officials traveled to San Francisco to press the case.

CL

Cloudflare

Glasswing partner that tested Mythos on 50+ internal repositories and publicly highlighted its exploit-chain construction and proof-generation strengths as the qualitative leap over prior tools.

UK

UK AI Security Institute (AISI)

Received early access to evaluate Mythos's cyber capabilities prior to the broader Glasswing rollout, providing an independent government assessment of the model's offensive and defensive cyber profile.

Fact Check

13 cited
  1. [1] Claude Mythos Preview
  2. [2] Project Glasswing
  3. [3] EU cybersecurity agency gains access to Anthropic Mythos
  4. [4] Anthropic to grant EU cybersecurity agency ENISA access to Mythos
  5. [5] EU presses Anthropic for access to Mythos as cyber concerns grow
  6. [6] What should we take from Anthropic's Mythos disclosure
  7. [7] Anthropic Mythos Detected 23,000 Potential Vulnerabilities Across 1,000 OSS Projects
  8. [8] Anthropic Project Glasswing update
  9. [9] Cloudflare on cyber frontier models
  10. [10] Palo Alto Networks says Mythos and GPT-5.5 found 75 bugs in weeks
  11. [11] Palo Alto on AI cyberattacks: Mythos and GPT-5.5
  12. [12] Palo Alto Networks says Mythos finds vulnerabilities in 3 weeks matching a year of manual penetration testing
  13. [13] What Anthropic's Mythos Preview tells us about AI loss-of-control risk

Source Articles

Top 4

THE SIGNAL.

Analysts

"Frontier models like Mythos and GPT-5.5-Cyber now write working exploits more reliably than anything Palo Alto had previously tested, which is why the company is warning customers that defenders have only a narrow window before attackers gain comparable access."

Lee Klarich
Chief Product Officer, Palo Alto Networks

"Mythos's distinguishing capability isn't enumerating bugs — earlier tools could do that — but chaining multiple small attack primitives together into a working exploit with a usable proof, which is what changes the defensive calculus."

Grant Bourzikas
Chief Security Officer, Cloudflare

"The models that we have right now are powerful enough to detect zero-days at large scale, meaning the framing of Mythos as singularly dangerous understates a broader capability shift that has already been underway for months."

Klaudia Kloc
CEO, Vidoc Security

"Anthropic deserves credit for not publicly shipping a model that can write working zero-day exploits, but voluntary corporate restraint cannot be the operating model for capabilities this consequential when competitors are unlikely to follow the same playbook."

Gary Marcus
Cognitive scientist; AI critic, NYU emeritus

"Mythos marks a shift from pure misuse-risk framing toward loss-of-control framing, because Anthropic itself acknowledges residual misaligned behavior at a level higher than prior models even if absolute harm probability remains low."

Ritika Verma & Mariami Tkeshelashvili
Researchers, Institute for Security and Technology
The Crowd

"Introducing Project Glasswing: an urgent initiative to help secure the world's most critical software. It's powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans."

@@AnthropicAI43985

"Mythos Preview has already found thousands of high-severity vulnerabilities—including some in every major operating system and web browser."

@@AnthropicAI7274

"Anthropic's Mythos raised the bar for AI vuln detection but kept it invite-only. GPT-5.5 is OpenAI's answer, and it's open to all. We had early access. Ran the benchmarks. Blackbox GPT-5.5 already beats whitebox GPT-5. Best pentesting model we've tested."

@@Xbow677

"Anthropic says Mythos has already found more than 10,000 vulnerabilities"

@u/Steap-Edit497
Broadcast
Explained: Why Indian govt is warning banks about Anthropic's 'Mythos' AI | Claude Mythos

Explained: Why Indian govt is warning banks about Anthropic's 'Mythos' AI | Claude Mythos

Why Anthropic's Mythos Is Sparking Alarm

Why Anthropic's Mythos Is Sparking Alarm

The AI Model TOO POWERFUL to be Released to the Public...

The AI Model TOO POWERFUL to be Released to the Public...