How a Debugging Session Becomes a Capability Leak
The fear driving this policy is mechanical, not abstract. When a Meta engineer asks Claude Code to help debug a model training script, chunks of that proprietary codebase potentially travel outside Meta's walls to an external server [1]. That is already a data-exposure problem on its own. But the deeper worry is the return trip: the suggestions, fixes and reasoning traces that come back are themselves the output of a rival frontier model.
This is where distillation enters. Distillation is the practice of training a 'student' model on the outputs of a stronger 'teacher' model, so the student inherits the teacher's behavior without ever seeing the teacher's weights. Meta's concern is that if Claude's or Codex's code suggestions, debugging logic and reasoning get absorbed into internal codebases or synthetic training data, those competitor capabilities effectively transfer into Llama [3]. It does not require anyone to deliberately copy a model - routine engineering, repeated across a large team, can quietly fold a competitor's intelligence into your own training pipeline. That is precisely why the rules single out using AI outputs to create test tasks or for code analysis [2], the exact paths by which a model's behavior gets captured and reused.
