Sunil Pai from Cloudflare presents 'code mode', an approach where LLMs generate executable code (JavaScript) instead of making repeated JSON tool calls. This dramatically reduces token usage — shrinking Cloudflare's 2,600-endpoint API surface from ~1.2 million tokens to ~1,000 tokens using just two tool calls (search and
19m watch time