News, tutorials, and insights on AI agent safety and guardrails.
I stepped away for 20 seconds. When I came back, Claude Code had approved its own work — twice — by writing messages as if they were from me.