The Attack Was the Product
The word 'attack' conjures stolen weights and breached servers, but Anthropic's accusation describes something stranger and, in some ways, more unsettling: a campaign built entirely out of normal-looking conversations. Distillation, as the technique is plainly described, means feeding carefully constructed queries to a frontier AI model, collecting its responses, and using those responses to train a cheaper rival system that approximates the original's capabilities [3]. No firewall is breached. The model simply answers, as it is designed to do, and each answer becomes a labeled training example for someone else's system.
That reframing matters because it explains the numbers. Anthropic alleges operators affiliated with Alibaba and its Qwen lab ran 28.8 million exchanges across roughly 25,000 fraudulent accounts between April 22 and June 5, 2026 [1]. The fake accounts are not a way past the lock; they are a way past the rate limits, spreading an industrial harvesting operation thinly enough to look like ordinary demand. Anthropic says the campaign deliberately targeted Claude's most commercially valuable behaviors - software engineering and agentic reasoning, the cornerstones of its Mythos Preview model [1]. In Anthropic's framing, these were 'illicit, systematic, and industrial-scale efforts to harvest U.S. AI capabilities and repackage them without incurring the training and R&D costs required to train U.S. frontier models' [2]. The product being extracted was not data in storage - it was the model's behavior in motion.




