Anthropic's Mythos AI vulnerability detection model under Project Glasswing
TECH

Anthropic's Mythos AI vulnerability detection model under Project Glasswing

31+
Signals

Strategic Overview

  • 01.
    Anthropic's Claude Mythos Preview model, gated behind Project Glasswing, scanned more than 1,000 open-source projects and surfaced 23,019 vulnerabilities, 6,202 of them rated high or critical severity.
  • 02.
    Anthropic's Glasswing initial update reports that of a 1,752-finding sample, 90.6% were confirmed valid, with 62.4% rated high or critical severity.
  • 03.
    Mythos uncovered a certificate-forgery flaw in the wolfSSL cryptography library, CVE-2026-5194 at CVSS 9.1, that could let attackers forge certificates for legitimate services; a patch is rolling.
  • 04.
    Anthropic says no company, including itself, has yet built safeguards strong enough to prevent misuse, and will keep Mythos-class models gated until those safeguards mature.

Deep Analysis

The Bottleneck Moved Overnight

For two decades, finding a software vulnerability has been the hard part. Disclosure, triage, and patching all queued behind a small population of skilled researchers turning over rocks. Mythos Preview broke that queue. In one month under Project Glasswing, defensive partners surfaced more than 10,000 high or critical severity vulnerabilities, with Cloudflare alone finding roughly 2,000 bugs across its codebase and 400 of those rated high or critical [1]. Mozilla reported 271 vulnerabilities in Firefox 150, more than ten times what the prior Claude Opus 4.6 found in Firefox 148 [1]. And the headline number is even larger: 23,019 total findings across more than 1,000 open-source projects, with 6,202 rated high or critical [1][2].

The interesting move is what Anthropic itself said in the initial update: "Progress on software security used to be limited by how quickly we could find new vulnerabilities. Now it's limited by how quickly we can verify, disclose, and patch the large numbers of vulnerabilities found by AI" [1]. The number that proves the point is downstream of discovery. Of approximately 530 high or critical bugs Anthropic and partners actually reported upstream to maintainers, only 75 have been patched with 65 public advisories [1]. That is roughly a 14% remediation rate against a steadily refilling pipeline. The bottleneck did not disappear; it migrated to a system that was already capacity-constrained — small open-source maintainer teams, regression test suites, coordinated-disclosure email threads, change-management boards. Defenders now have an oracle that returns flaws faster than the rest of their stack can absorb them.

Cloudflare's Heresy: Stop Patching Faster

Cloudflare's CSO Grant Bourzikas spent a month with Mythos and came back arguing the opposite of what the industry expected him to argue. The conventional reaction to AI-accelerated bug discovery is to compress patch SLAs — find faster, ship faster, pray the gap stays small. Bourzikas says that is exactly the wrong response. His piece on the Cloudflare blog records direct evidence: "We learned a version of this when we tried letting the model write its own patches and watched a few go out that fixed the original bug while quietly breaking something else" [3]. AI-generated patches are not always safe to ship blind, and the regression risk grows with SLA pressure because aggressive deadlines force teams to skip the tests that catch silent breakage.

His alternative framing is architectural: "The principle is to make exploitation harder for an attacker even when a bug exists, so that the gap between when a vulnerability is disclosed and when it is patched matters less" [3]. In other words, treat the disclosure-to-patch interval as permanent, then design the system so a known unpatched bug is hard to weaponize anyway — segmented networks, capability-restricted runtime sandboxes, default-deny service meshes, hardware-backed memory tagging, assumed-compromise telemetry. This is heretical because it concedes ground that the security industry has spent years denying: defenders cannot win the discovery race anymore, so they have to win a different race. R&D World captured the strategic horizon by reporting that defenders now have months, not years, to prepare for Mythos-class capabilities to proliferate beyond the Glasswing partner list [4]. If Bourzikas is right, the rest of the industry's patch-velocity dashboards are measuring the wrong thing.

The $1,000 Bug Versus the $10,000 Bug

The most uncomfortable counter-evidence against Glasswing's framing is not coming from skeptics or critics; it is coming from a competing model. On May 12, the startup Depthfirst announced that its task-specialized AI uncovered critical infrastructure flaws Mythos had missed — including an NGINX flaw dating to 2008 — at roughly one-tenth the cost [5]. CEO Qasim Mithani put it bluntly: "by optimizing the model architecture for specific tasks, the company can accomplish work that would cost $10,000 with Mythos using just $1,000" [5]. The same KuCoin report notes Depthfirst is matching its claim with a $5M Open Defense Initiative routing credits to smaller enterprises and open-source projects [5].

That reframes the whole story. Anthropic priced Mythos Preview at roughly $25 per million input tokens and $125 per million output tokens during the research preview, frontier-tier pricing well above current Claude rates [6]. The Glasswing report acknowledges that finding a 27-year-old OpenBSD vulnerability cost under $20,000 in autonomous runs [7]. Those are reachable numbers for a Fortune 500 security team and out of reach for the maintainers of widely-used libraries that ship with consumer devices at scale. If Depthfirst's economics generalize, the comparative-advantage story flips: the moat is not frontier capability, it is task-specialized cost structure, and the small projects that most need scanning are the ones most likely to be reachable by the cheaper option. Cybernews and others picked up the implicit critique that Anthropic's "too powerful to release" framing is partly a moat-building narrative [8]; the Depthfirst result puts a price tag on that suspicion. The same suspicion is the dominant frame on developer YouTube — the most-engaged critical review of the launch argues that the real moat is money rather than model capability and that Mythos's per-bug economics are uneconomical compared with bug-bounty programs. The institutional read on X has been less skeptical and more financial, treating Glasswing as a corporate-consortium story alongside named partners; that gap between developer skepticism and institutional acceptance is itself worth watching.

The Three-Month Headstart That Could Backfire

Beneath the headlines about new bugs is a structural problem that defenders are only starting to talk about. The Glasswing partners — twelve founding organizations including AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan, Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks, plus other defensive partners such as Cloudflare and Mozilla — are using Mythos right now to patch vulnerabilities that survived decades of human review (including the 27-year-old OpenBSD bug and a 16-year-old FFmpeg flaw that lived through millions of automated test runs) [7]. They will harden their stacks before Mythos-class capabilities go broad. Everyone else starts the race only when the model — or an equivalently capable open-weight successor — becomes generally available, and at that point attackers gain access at exactly the same moment defenders do.

This asymmetry is the loudest concern coming back from the developer community, and it is concrete. Anthropic has committed to eventually releasing Mythos-class models publicly once safeguards mature [9]. The Register report notes that, in the meantime, some open-source maintainers have asked Anthropic to slow disclosure pace, and that maintainers are also fielding low-quality AI-generated bug reports alongside legitimate Mythos findings, complicating triage [9]. The downstream effect is a long-tail patching problem: industrial controllers, smart-home hubs, and the wide install base of consumer devices that ship affected wolfSSL versions [2][10]rarely receive coordinated security updates, so upstream fixes do not translate into deployed protection. The cybersecurity subreddit captured the structural worry exactly: contributors there framed Glasswing as a 90-day head start for a few dozen organizations against a clock that starts the same day for attackers, with little capacity in the rest of the ecosystem to absorb the disclosure wave. OWASP founder Jeff Williams summarized the upstream doubt: "It's highly questionable that Anthropic will be able to limit the malicious uses of this model" [11]. Glasswing buys a head start for a few dozen organizations and hands the rest of the ecosystem a clock that is already counting down.

Historical Context

2026-02
Began running an early snapshot of Claude Mythos Preview against open-source projects, partnering with external security firms for triage and coordinated disclosure.
2026-04-07
Publicly launched Project Glasswing with twelve founding partners, $100M in Mythos usage credits, and $4M in open-source security donations.
2026-05-12
Announced its task-specialized AI found critical infrastructure bugs Mythos missed at roughly one-tenth the cost and launched a $5M Open Defense Initiative.
2026-05
Cloudflare's Grant Bourzikas published a detailed post on testing Mythos against more than 50 internal repositories and the limits of AI-written patches.
2026-05-25
Published the Glasswing initial update reporting 23,019 vulnerabilities across 1,000+ open-source projects, a 90.6% true-positive rate, and a planned eventual public release.

Power Map

Key Players
Subject

Anthropic's Mythos AI vulnerability detection model under Project Glasswing

AN

Anthropic

Builds and operates Mythos Preview, runs Project Glasswing, controls who gets access, and decides when Mythos-class models become public. Backed the launch with $100M in usage credits and $4M in open-source security donations.

CL

Cloudflare

A Glasswing partner that tested Mythos against more than 50 internal repositories and publicly argued the right defensive response is architectural hardening, not faster patch SLAs. Its skepticism is the most influential counter-narrative inside the program.

DE

Depthfirst

An AI-security startup challenging Mythos's economics, claiming its task-specialized model finds critical infrastructure bugs Mythos missed at roughly one-tenth the cost. It launched a $5M Open Defense Initiative to put credits in the hands of smaller defenders.

LA

Launch partners (AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, Linux Foundation, Microsoft, NVIDIA, Palo Alto Networks)

Twelve founding organizations granted Mythos Preview access for defensive use. They are patching vulnerabilities right now that the rest of the internet does not know exist, which is the structural advantage Glasswing creates.

OP

Open-source maintainers

On the receiving end of Mythos disclosures with average two-week patch cycles and limited capacity. Several have asked Anthropic to slow disclosure pace; only 75 of roughly 530 high or critical bugs reported upstream have been patched so far.

Fact Check

11 cited
  1. [1] Project Glasswing: Initial update
  2. [2] Anthropic Mythos Detected 23,000 Potential Vulnerabilities Across 1,000 OSS Projects
  3. [3] Project Glasswing: what Mythos showed us
  4. [4] Post-Mythos, defenders have months, not years, to prepare for AI-powered hacking
  5. [5] AI Security Firm Depthfirst Discovers Critical Internet Vulnerabilities at 10x Lower Cost Than Anthropic
  6. [6] Anthropic's Project Glasswing Exposes the Next Challenge for Vulnerability Management
  7. [7] Project Glasswing
  8. [8] Cloudflare warns Mythos AI is too powerful for public release
  9. [9] Anthropic to release Mythos-class models to the public
  10. [10] Anthropic's Glasswing-Mythos uncovers 10,000 critical vulnerabilities
  11. [11] Anthropic launches Project Glasswing to secure critical software

Source Articles

Top 4

THE SIGNAL.

Analysts

"Argues defenders should architect for assumed-compromise rather than racing patches: "The principle is to make exploitation harder for an attacker even when a bug exists, so that the gap between when a vulnerability is disclosed and when it is patched matters less.""

Grant Bourzikas
Chief Security Officer, Cloudflare

"Reports direct evidence that AI-written patches are not safe to ship blind: "We learned a version of this when we tried letting the model write its own patches and watched a few go out that fixed the original bug while quietly breaking something else.""

Grant Bourzikas
Chief Security Officer, Cloudflare

"Claims task-specialized model architectures can match or exceed Mythos at a fraction of the price, undermining a 'bigger frontier model always wins' framing for security scanning."

Qasim Mithani
CEO, Depthfirst

"Skeptical that Anthropic can credibly contain misuse of a model this capable: "It's highly questionable that Anthropic will be able to limit the malicious uses of this model.""

Jeff Williams
Founder of OWASP; CTO, Contrast Security

"Frames the new bottleneck explicitly: "Progress on software security used to be limited by how quickly we could find new vulnerabilities. Now it's limited by how quickly we can verify, disclose, and patch the large numbers of vulnerabilities found by AI.""

Anthropic
Project Glasswing operator
The Crowd

"Introducing Project Glasswing: an urgent initiative to help secure the world's most critical software. It's powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans."

@@AnthropicAI44093

"Anthropic just dropped the first Project Glasswing update Claude Mythos found 10,000+ critical vulnerabilities in ONE month: > Cloudflare: 2,000 bugs, 400 high/critical severity > Mozilla: 271 vulnerabilities in Firefox 150 — 10x more vulnerabilities found in Firefox 148"

@@ns123abc2300

"Anthropic is launching Claude Mythos Preview for defensive security partners under Project Glasswing with $NVDA, $GOOGL, AWS, Apple, and Microsoft. Anthropic says the model has flagged thousands of high-severity vulnerabilities and won't be released publicly - The Verge"

@@wallstengine90

"Project Glasswing: Anthropic says Claude found 10,000 critical software flaws in a month"

@u/sksarkpoes3521
Broadcast
Claude Mythos Preview in 6 Minutes

Claude Mythos Preview in 6 Minutes

Project Glasswing/Claude Mythos: Anthropic's $x00 Million Marketing Stunt

Project Glasswing/Claude Mythos: Anthropic's $x00 Million Marketing Stunt

What Mythos & Glasswing by Anthropic mean for devs

What Mythos & Glasswing by Anthropic mean for devs