Vercel AI Gateway now supports a `sort` option on `providerOptions.gateway` that lets you rank providers behind a model by cost, time to first token (TTFT), or throughput (TPS). Sorting is computed at request time, so price changes and latency shifts are reflected automatically. The feature composes with other routing controls like Zero Data Retention and manual `order` overrides. Every response also includes routing metadata showing which providers were considered, their metric values, the execution order, and any deprioritized providers.
Sort: