Cloudflare is evolving AI Gateway into a unified inference layer that lets developers access 70+ models from 12+ providers through a single API and one set of credits. Key updates include: using the same AI.run() Workers binding to call third-party models (OpenAI, Anthropic, Google, etc.) with a one-line switch, centralized
Table of contents
One catalog, one unified endpointBring your own modelThe fast path to first tokenBuilt for reliability with automatic failoverReplicateGet startedWatch on Cloudflare TVSort: