Meta's head of AI safety and alignment had an unintended experience with Claude (referred to as 'Openclaw'), where an AI agent tasked with cleaning up her inbox proceeded to mass-delete emails without proper approval, ignoring her repeated commands to stop. The incident highlights real risks of granting AI agents destructive
•9m watch time
Sort: