TECH

Claude Mythos: Anthropic's autonomous vulnerability hunter

45+

Signals

Strategic Overview

01.
Anthropic unveiled Claude Mythos Preview on April 7, 2026, a frontier general-purpose model capable of autonomously discovering and exploiting zero-day vulnerabilities across major operating systems and web browsers when directed by a user.
02.
Through Project Glasswing, Anthropic gave roughly 50 partners — including AWS, Apple, Google, Microsoft, Cisco, CrowdStrike, JPMorgan Chase, the Linux Foundation, NVIDIA and Palo Alto Networks — restricted early access, backed by $100M in usage credits and $4M in OSS donations.
03.
Anthropic scanned more than 1,000 open-source projects with Mythos and surfaced 23,019 potential vulnerabilities, of which 6,202 were rated high or critical severity.
04.
Anthropic and six independent security research firms assessed 1,752 high or critical findings and validated more than 90% as true positives, but only 75 have been patched and 65 advisories published across the >1,100 reported so far.
05.
Internal Anthropic source strings reference 'claude-mythos-1-preview', signaling a coming Mythos 1 integration into Claude Code and Claude Security — moving the capability from invite-only research preview toward broader commercial availability.
06.
Downstream, BNP Paribas extended its partnership with Mistral AI for three years to build a sovereign European cybersecurity AI model as a Mythos hedge — a sign enterprises are now planning around Mythos-class capability, not just consuming it.

The patching bottleneck is now the real cybersecurity story

The most consequential number out of the May 26 Glasswing update isn't 23,019 flagged or 6,202 critical — it's 75 patched against more than 1,100 reported, with 65 advisories published ^[1]. Anthropic itself frames this as the structural problem, writing that 'the relative ease of finding vulnerabilities compared with the difficulty of fixing them amounts to a major challenge for cybersecurity' ^[1]. Independent validation reinforces the asymmetry: Anthropic and six external research firms reviewed 1,752 high or critical findings and confirmed more than 90% as true positives ^[1], so the patch gap isn't a false-positive problem — it's a maintainer-bandwidth problem. Mozilla is the outlier with 271 Firefox vulnerabilities patched in Firefox 150 ^[2], but Mozilla has paid security staff; the median open-source project pinged by Mythos does not. The implication for defenders is uncomfortable: an attacker with similar-class capability doesn't need every flaw — just one in the long tail that no maintainer got to.

Hype vs. ledger: only one CVE is publicly attributable to Glasswing

Two months in, the publicly traceable CVE ledger for Project Glasswing is much thinner than the press release suggests. VulnCheck's Patrick Garrity flagged that 'Anthropic's Project Glasswing has generated significant attention — but very little concrete data' ^[3], with only CVE-2026-4747 — the 17-year-old FreeBSD NFS remote code execution bug — explicitly attributed to the program ^[3]. wolfSSL produced 8 CVEs and a 5.9.1 release, but most other partner findings remain under embargo or unattributed ^[2]. More damaging to the frontier-tier framing: Vidoc Security Lab and AISLE researchers published 'We Reproduced Anthropic's Mythos Findings With Public Models' ^[4], showing smaller, cheaper open-source models could rediscover several showcased bugs ^[5]. The CyberGym score gap Anthropic published — 83.1% for Mythos vs 66.6% for Opus 4.6 ^[6]— is real, but the marginal capability premium may not justify the marketing tier when 'good-enough' models are catching the same low-hanging fruit.

Sovereign-AI hedging: BNP Paribas isn't waiting

The BNP Paribas / Mistral AI announcement on May 26 is the first clear sign that Mythos has shifted enterprise procurement, not just security tooling ^[7]. BNP extended its Mistral partnership for three years specifically to build a European, sovereign cybersecurity AI model — a hedge against being dependent on a US frontier lab for the model class that could break your bank. BNP CIO Marc Camus deliberately reframed the discussion: 'The focus has been a lot on is Mythos accessible or not accessible? but let's not forget there are other models from other firms that exist' ^[7]. Translation: regulated European institutions cannot bet defense on whether Anthropic chooses to grant them Glasswing access, and the next 18 months will likely see parallel sovereign efforts — French, EU-wide, possibly Japanese and UK — to avoid that single-vendor exposure ^[8]. CNBC reported similar reorientation among US banks in early May ^[9].

The harness matters more than the model — what practitioners with access are saying

Reddit threads from practitioners with Glasswing access converge on a less-flattering picture than the keynote demos. The macOS exploit Anthropic showcased required a custom harness — Mythos didn't act alone — and reportedly consumed roughly $40k in compute tokens for the Apple chain. False-positive triage is heavy: even with 90%+ true-positive validation on the curated 1,752 sample ^[1], the long tail of 23,019 findings ^[1]still requires human reviewers to sort. One practitioner reported Mythos failed to rediscover a known patched vulnerability in old code, suggesting the model's strength is novel pattern synthesis, not exhaustive recall. The takeaway for security teams evaluating Mythos 1 when it lands in Claude Code: budget for harness engineering and triage headcount, not just API spend. The $25/$125 per million token pricing signaled in YouTube coverage compounds the cost-per-finding question.

The dual-use admission Anthropic put in writing

Buried in the May 26 Glasswing update is a sentence that will be quoted in policy hearings for years: 'At present, no company — including Anthropic — has developed safeguards strong enough to prevent such models from being misused and potentially causing severe harm' ^[1]. That's a frontier lab conceding, on the record, that the offensive cybersecurity capability of its own released model exceeds its alignment toolkit. Mythos's own technical demonstrations make the stakes concrete: it autonomously chained four vulnerabilities into a full browser exploit including a JIT heap spray that escaped both renderer and OS sandboxes ^[10], and developed working Firefox 147 JIT exploits 181 times in evaluations versus Opus 4.6's two ^[10]. The same capability profile that lets a Glasswing partner harden wolfSSL also lets a determined adversary — once Mythos 1 ships to Claude Code ^[11]— find and weaponize the next FreeBSD-class flaw before any maintainer sees it.

Historical Context

2026-02-01

Launched Claude Security as a limited research preview powered by Claude Opus 4.7 for vulnerability scanning — the precursor product that Mythos 1 will fold into.

2026-03-26

Fortune reported a data leak that revealed Mythos's existence, framing it as 'a step change in capabilities' before Anthropic was ready to disclose it.

2026-04-07

Officially unveiled Claude Mythos Preview and Project Glasswing with the initial ~12 launch partners and the autonomous FreeBSD/Firefox exploit demonstrations.

2026-04-15

Independent press began questioning the gap between Anthropic's vulnerability claims and publicly verifiable CVEs, surfacing that only CVE-2026-4747 was explicitly attributed.

2026-05-08

Reported on cybersecurity 'hysteria' around Mythos and how US banks were reorienting AI security strategies in response.

2026-05-26

Anthropic published its Glasswing update with the 23,019/6,202/75-patched numbers; same day, BNP Paribas formally extended its Mistral partnership for a European Mythos defense.

Power Map

Key Players

Subject

Claude Mythos: Anthropic's autonomous vulnerability hunter

Anthropic

Developer and owner of Claude Mythos Preview; operator of Project Glasswing; committed $100M in usage credits and $4M in OSS donations.

Project Glasswing launch partners

AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, Linux Foundation, Microsoft, NVIDIA, Palo Alto Networks plus ~40 additional infrastructure organizations granted restricted early access.

wolfSSL

Open-source cryptography library used by billions of devices; Mythos surfaced 8 CVEs, triggering wolfSSL 5.9.1 release.

FreeBSD Project

Affected vendor for CVE-2026-4747, the 17-year-old NFS remote code execution flaw Mythos autonomously identified and exploited.

Mozilla

Patched 271 Firefox vulnerabilities surfaced through Glasswing partner work, shipped in Firefox 150.

BNP Paribas

French bank extending its Mistral AI partnership for three years to prepare a European, sovereign cybersecurity AI alternative to Mythos.

Mistral AI

Building a dedicated cybersecurity AI model for European banks as a sovereign alternative to Mythos.

VulnCheck (Patrick Garrity)

External skeptic auditing the gap between Anthropic's marketing claims and publicly traceable CVEs.

AISLE / Vidoc Security Lab

Independent researchers who reproduced several showcased Mythos findings using smaller, cheaper open-source models, undercutting the 'frontier-only' framing.

Fact Check

12 cited

Source Articles

Top 4

THE SIGNAL.

Analysts

"Argues the cybersecurity narrative shouldn't fixate only on Mythos because comparable models exist from other vendors — quote: 'The focus has been a lot on is Mythos accessible or not accessible? but let's not forget there are other models from other firms that exist.'"

Marc Camus

Chief Information Officer, BNP Paribas

"Skeptical of headline numbers — argues the publicly attributable impact of Glasswing is small relative to its marketing footprint: 'Anthropic's Project Glasswing has generated significant attention — but very little concrete data.'"

Patrick Garrity

Vulnerability intelligence researcher, VulnCheck

"Concedes the safeguards problem in writing: 'At present, no company — including Anthropic — has developed safeguards strong enough to prevent such models from being misused and potentially causing severe harm.'"

Anthropic (corporate position)

Model developer

"Names the structural asymmetry: 'The relative ease of finding vulnerabilities compared with the difficulty of fixing them amounts to a major challenge for cybersecurity.'"

Anthropic (Glasswing report)

Model developer

"Reproduced Anthropic's showcased findings with smaller, cheaper open-source models — published under the title 'We Reproduced Anthropic's Mythos Findings With Public Models', challenging the frontier exclusivity claim."

Vidoc Security Lab / AISLE researchers

Independent security researchers

The Crowd

"We released Claude Opus 4.6 just two months ago. Today we're sharing some info on our new model, Claude Mythos Preview. https://t.co/Dz6um6HAWZ"

@@alexalbert__17630

"NEWS: Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Instead, it is starting a 40-company coalition, Project Glasswing, to allow cybersecurity defenders a head start in locking down critical software. https://t.co/1ehWqYi4iy"

@@kevinroose5355

"Claude Mythos: everything you need to know (tl;dr) Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Anthropic: "Mythos is only the beginning" Everything you need to know: The tl;dr with all key facts: Mythos found zero-day https://t.co/fSELD04BkD"

@@kimmonismus2223

"Claude Mythos really just vibe-checked the M5 in a week."

@u/CosmicParadox_1017

Broadcast

Claude Mythos is too dangerous for public consumption...

Claude Mythos Preview in 6 Minutes

Claude Mythos: Highlights from 244-page Release