Cloudflare is evolving AI Gateway into a unified inference layer that lets developers access 70+ models from 12+ providers through a single API and one set of credits. Key updates include: using the same AI.run() Workers binding to call third-party models (OpenAI, Anthropic, Google, etc.) with a one-line switch, centralized

7m read timeFrom blog.cloudflare.com
Post cover image
Table of contents
One catalog, one unified endpointBring your own modelThe fast path to first tokenBuilt for reliability with automatic failoverReplicateGet startedWatch on Cloudflare TV

Sort: