How the detection pipeline actually decides
YouTube's new system stacks three signals on top of the existing self-disclosure toggle. The first is C2PA provenance metadata embedded in the media file itself [2]: if the upload carries a manifest indicating the content is fully AI-generated, the label is applied automatically and made permanent, with no creator override [3]. The second is first-party watermarking from Google's own generative stack - anything produced via Veo or Dream Screen is permanently labeled regardless of how it is edited or re-uploaded [1]. The third is a heuristic detector for what YouTube calls 'significant photorealistic AI use' [1]; that classifier is opaque, runs on every upload, and is the source of contestable, non-permanent labels. Crucially, none of these signals are intended to be exhaustive - YouTube still leans on creators to self-disclose synthetic content, and the auto-label is positioned as backfill for the disclosure gap that opened up as generative video tools proliferated [3].



