TECH

Anthropic accuses Alibaba of distilling Claude

22+

Signals

Strategic Overview

01.
Anthropic sent a June 10, 2026 letter to the U.S. Senate Banking Committee accusing Alibaba and its Qwen lab of brazenly and illicitly attempting to extract Claude's AI capabilities.
02.
Anthropic alleges operators linked to Alibaba used nearly 25,000 fraudulent accounts to generate roughly 28.8 million exchanges with Claude over about six weeks.
03.
According to the letter, the activity ran from April 22 to June 5, 2026 and targeted Claude's most commercially valuable capabilities, software engineering and agentic reasoning.
04.
Anthropic says the operators used commercial proxy services to circumvent geographic restrictions that bar Chinese entities from accessing Claude; Alibaba declined to comment on the allegations.

Deep Analysis

Compute Laundering Through Cognition

The reason this accusation matters far beyond one corporate grievance is the mechanism behind it. Distillation is the practice of copying a model's behavior by querying it at scale and training a smaller model on its answers - no source code, no model weights, and no breached servers are involved. In Anthropic's telling, operators linked to Alibaba generated roughly 28.8 million exchanges with Claude using nearly 25,000 fraudulent accounts over about six weeks ^[1], deliberately steering the queries toward Claude's most commercially valuable capabilities, software engineering and agentic reasoning ^[2]. One industry analysis estimates that volume of conversation could yield on the order of 14.4 billion tokens of teacher output ^[1]- effectively a training corpus assembled one prompt at a time.

The strategic punchline is that this route sidesteps the entire architecture of U.S. export controls. Those rules govern the sale of advanced chips and model weights; they say nothing about asking a publicly available model a few million questions. As one commentary put it, current controls focus on hardware and model weights, but distillation extracts capability through API access alone ^[1]. Anthropic says the operators even used commercial proxy services to evade the geographic restrictions that bar Chinese entities from Claude ^[4]. If you think of frontier capability as something that normally requires enormous compute to create, distillation is a way to import that capability as finished cognition rather than as regulated silicon - compute laundering through the API.

The Hypocrisy Indictment and the Legitimacy Line

The loudest reaction to Anthropic's letter has not been agreement - it has been a counter-accusation. Developer communities on Reddit skewed strongly skeptical of Anthropic and sympathetic to Alibaba, with the dominant framing being that a company built on scraping the open internet and ingesting copyrighted books is poorly placed to complain that its own outputs were copied. The recurring refrain is some version of you stole it first. That same skepticism shows up among high-profile voices: Hugging Face CEO Clement Delangue has called distillation a very common practice that everyone is using and suggested Anthropic itself may have relied on it, and Elon Musk has publicly accused Anthropic of stealing data to train Claude in the first place.

Underneath the point-scoring is a genuine legal and ethical boundary that the technical community keeps drawing. Distillation as a machine-learning technique is legitimate and widely used; deep-dive coverage of the dispute lands on the view that the real issue is not distillation but the alleged fraudulent access - fake accounts, proxy services, and terms-of-service violations. In other words, learning from a model's public outputs may be defensible, while deceiving your way past access controls to do it at industrial scale is the part that looks indefensible. Anthropic's framing collapses those two things into a single act of theft ^[2]; many of its critics insist they must be kept apart.

Why the Letter Went to Congress, Not the Courtroom

Anthropic did not sue - it wrote to the Senate Banking Committee, and the choice of venue is itself an argument. The letter, authored by policy head Sarah Heck, casts the campaign as an effort to harvest US AI capabilities across frontier labs and repackage them as China's own without incurring the training and R&D costs required to train US frontier models ^[3], and it asks lawmakers to treat large-scale distillation as a distinct category of IP theft, paired with tighter export controls, penalties, and runtime-access governance such as identity gating, query-volume thresholds, and cross-provider reporting ^[3]. Anthropic also alleged Chinese government complicity and warned that, if successful, such attacks could pose an existential threat to the U.S. and its allies ^[4]- and that the Alibaba campaign continued even after administration warnings to Chinese labs ^[2].

This is where the contrarian read gets sharp. Several of the same Reddit threads characterize Anthropic's policy asks as anti-competitive - a bid to use national-security language to lock rivals out of a capability that is, technically, just very good text. The proposed remedies (volume caps, identity gating, cross-provider reporting) would raise the cost of competing with Claude for everyone, not only for accused bad actors. Whether you read the letter as a legitimate security alarm or as regulatory moat-building depends largely on whether you accept Anthropic's framing that learning from Claude's outputs is theft at all.

If It Copies This Cheaply, How Big Was the Moat?

The most uncomfortable question for Anthropic comes not from Beijing but from within the skeptics' own camp. If a frontier model's hardest-won capabilities - agentic reasoning, software engineering - can be substantially reconstructed by reading a few thousand dollars' worth of paid outputs, then the accusation doubles as an admission about how thin the moat actually is. One commenter distilled the irony directly: if your product can be copied by reading a few thousand paid outputs, maybe it is not really worth that much. The point lands harder because Anthropic is simultaneously arguing the capability is so valuable that copying it threatens national security and so exposed that 25,000 fake accounts could extract it.

The market gave its own ambivalent verdict. Anthropic's allegations are unverified and Alibaba has denied wrongdoing, yet Alibaba's American depositary receipts still fell more than three percent, dropping below $100, with some reporting noting a 16-month low ^[2]. That is a reaction to legal and reputational risk, not to any confirmed wrongdoing. The deeper signal worth watching is structural: if capability really can be siphoned through inference, the long-run defensibility of a closed frontier lab may rest less on the model itself and more on distribution, trust, and the very runtime controls Anthropic is now asking Congress to mandate.

Historical Context

2026-02

Anthropic previously accused three smaller Chinese AI startups of distillation, alleging they collectively generated more than 16 million exchanges using about 24,000 fraudulent accounts.

2026-06-10

Anthropic sent its letter to the Senate Banking Committee accusing Alibaba of the largest known distillation attack to date.

2026-06-24

The accusation became public through CNBC reporting, escalating into a wider U.S.-China AI dispute.

Power Map

Key Players

Subject

Anthropic accuses Alibaba of distilling Claude

Anthropic

Accuser and alleged victim; maker of Claude, now lobbying Congress to penalize distillation attacks and tighten controls on U.S. AI access by foreign rivals.

Alibaba (and its Qwen AI lab)

Accused party; alleged to have run the campaign to accelerate its competing Qwen models, but has declined to comment and the allegations are not independently verified.

Sarah Heck

Anthropic's head of policy and author of the letter; framed the campaign as an effort to harvest and repackage U.S. AI capabilities without the R&D cost.

Sen. Tim Scott and Sen. Elizabeth Warren

Leaders of the Senate Banking Committee and recipients of the letter; positioned to decide whether large-scale distillation becomes a new regulated category of IP theft.

Fact Check

4 cited

Source Articles

Top 3

THE SIGNAL.

Analysts

"Frames distillation as a deliberate effort to harvest US AI capabilities across frontier labs and repackage them as China's own without incurring the training and R&D costs required to build frontier models."

Sarah Heck

Head of Policy, Anthropic

"Argues distillation differs from traditional IP theft because no files are copied and no servers are breached - the theft happens entirely through inference, creating detection and legal challenges."

Sesame Disk

Industry commentary

"Notes that current controls focus on hardware and model weights, while distillation attacks extract capability through API access alone and therefore slip past existing safeguards."

Sesame Disk

Industry commentary

The Crowd

"SITUATION EXPLAINED: Anthropic accused Alibaba of conducting a "distillation attack" on Claude. We asked @ClementDelangue his thoughts: "Distillation is a very common practice that everyone is using. I wouldn't be surprised if Anthropic used distillation in the past for some of"

@@MTSlive66

"Anthropic has just accused Alibaba of launching the largest AI theft attack in history. The goal: extract Claude's capabilities... without paying a single dollar. How they did it: → They created nearly 25,000 fake accounts → 28.8 million conversations with Claude → All"

@@anuragdotdev13

"#NFSCIntelligence： Alibaba was accused of stealing from Claude through AI distillation at an industrial scale. Anthropic says nearly 25,000 fake accounts and 29 million interactions fed Alibaba's weaker AI model with Claude's output。。。 #MilesGuo #NFSC #TakeDownTheCCP"

@@dclibertyfarmEn7

"Anthropic says Alibaba must be punished for largest Claude cloning attack"

@u/deraser5100

Broadcast

US: Anthropic Accuses Alibaba Of Massive AI 'Theft' Campaign | Firstpost Live | N18G

CRISIS - Anthropic Claude Vs. the Chinese Distillers | Mentor Sandy explains | Billion Hopes

Elon Musk Slams Anthropic Over Data Theft Claims Amid Chinese AI Distillation Row