Cutting the outbound wire: why Lockdown Mode targets the exit, not the entrance
Lockdown Mode does not try to stop a prompt injection from reaching ChatGPT. It assumes the attack will land and instead severs the path data takes on its way out. The clearest way to understand the design is Simon Willison's 'lethal trifecta,' cited across coverage: an AI with access to private data, exposed to untrusted content, and equipped with an outbound channel [5]. An attacker only needs all three at once. By disabling live web browsing, image retrieval, deep research, and agent mode, Lockdown Mode removes the third leg, the outbound channel through which a hijacked model could quietly ship secrets to an attacker-controlled endpoint [2]. This matters because the documented attacks were never about brute force. Check Point Research showed that a single malicious instruction hidden inside an email, PDF, or web page could turn a normal conversation into a covert exfiltration channel, in one case smuggling data out over DNS requests [6]. Lockdown Mode is a direct answer to that class of attack: starve the channel and the injection has nowhere to send what it steals.



