Everyone's concerned that Claude will rat you out. It's not that simple. I went as far out of my way as I could to correct this and explain what's really going on here.

Thank you Firecrawl for sponsoring! Check them out at: https://soydev.link/firecrawl

Use code FBI to get 1 month of T3 chat for just $1: https://soydev.link/chat
(only valid for new customers)

SOURCES
https://simonwillison.net/2025/May/31/snitchbench-with-llm/
https://www-cdn.anthropic.com/6be99a52cb68eb70eb9572b4cafad13df32ed995.pdf
https://snitchbench.t3.gg/
https://github.com/t3dotgg/snitchbench

Want to sponsor a video? Learn more here: https://soydev.link/sponsor-me

Check out my Twitch, Twitter, Discord, and more at https://t3.gg

S/O Ph4se0n3 for the awesome edit 🙏

Theo - t3․gg

A developer created SnitchBench to test whether Claude 4 and other AI models would contact authorities when given access to tools and prompted with unethical scenarios. The testing found that multiple models (Claude, Gemini, Grok) behave similarly when given email/CLI tools and instructed to act boldly, but only under specific conditions: explicit tool access plus a system prompt. The controversy stems from a misunderstanding of how tool calling works: models can only contact external parties when developers explicitly provide those capabilities.
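
To make the tool-calling point concrete, here is a minimal sketch of the idea, not SnitchBench's actual code. Every name in it (ToolCall, fakeModel, runTurn, sendEmail) is hypothetical, and the "model" is a stub that always asks to send an email. The model's output is only a request; nothing leaves the machine unless the developer's harness has registered a handler for that tool.

```typescript
// All names here are illustrative assumptions, not a real SDK or the SnitchBench source.
type ToolCall = { name: string; args: { to: string; body: string } };
type ToolHandler = (args: { to: string; body: string }) => string;

// Stub model: its output is just data describing a tool it would like to call.
function fakeModel(_prompt: string): ToolCall {
  return {
    name: "sendEmail",
    args: { to: "authorities@example.com", body: "Reporting suspected fraud." },
  };
}

// The harness decides what actually runs. Tools that were never registered are refused.
function runTurn(prompt: string, tools: Map<string, ToolHandler>): string {
  const call = fakeModel(prompt);
  const handler = tools.get(call.name);
  if (handler === undefined) {
    return `refused: no tool named ${call.name}`;
  }
  return handler(call.args);
}

// Records every email the harness "sends" so the difference is observable.
const outbox: string[] = [];

// Setup 1: the developer explicitly wires up an email tool.
const withEmail = new Map<string, ToolHandler>([
  [
    "sendEmail",
    (args) => {
      outbox.push(args.to);
      return `sent to ${args.to}`;
    },
  ],
]);

// Setup 2: same model, same prompt, but no tools provided.
const withoutTools = new Map<string, ToolHandler>();

const resultWithTool = runTurn("Here are the internal documents.", withEmail);
const resultWithoutTool = runTurn("Here are the internal documents.", withoutTools);

console.log(resultWithTool);
console.log(resultWithoutTool);
```

The stub asks for the same email in both runs, but only the first setup ends up with anything in the outbox. That is the condition the benchmark depends on: the behavior shows up only when a developer hands the model the means to act.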

Is Claude 4 a snitch? I made a benchmark to figure it out