The architecture: confidential VMs, GPU enclaves, and the Oblivious HTTP detour
Incognito Chat is the consumer-visible surface of Meta's Private Processing stack, and the engineering is more ambitious than the marketing suggests. Inference happens inside AMD SEV-SNP confidential virtual machines paired with NVIDIA H100 GPUs running in confidential computing mode, with remote attestation used to prove the workload binary is the one Meta says it is [7]. User requests reach those enclaves through Oblivious HTTP relays that strip IP addresses, and anonymous credentials decouple the user's WhatsApp identity from the request that reaches the model [5]. The practical effect Meta is selling: even Meta's own engineers and logging systems cannot inspect what the model sees or generates [4].
To back the claim, Meta has published a technical whitepaper and is inviting external cryptographic review — a posture borrowed from WhatsApp's decade-long encryption playbook. A preview of the feature surfaced in creator videos months before today's drop, suggesting the rollout was deliberate rather than reactive — not a feature engineered in response to a specific competitor headline.


