•From x.com

Theo - t3․gg @t3dotgg
I wish Google would stop benchmaxxing for long enough to make a usable model. Gemini 3 Pro is as smart as Opus 4.6 but it screws up tool calls as consistently as Grok 3 Mini https://t.co/WoZYXAWkV2
Sort:

Theo - t3․gg @t3dotgg
I wish Google would stop benchmaxxing for long enough to make a usable model. Gemini 3 Pro is as smart as Opus 4.6 but it screws up tool calls as consistently as Grok 3 Mini https://t.co/WoZYXAWkV2
Sort: