The Kubernetes community has formed the AI Gateway Working Group (WG AI Gateway), a new initiative to develop standards and best practices for the networking infrastructure that supports AI workloads. In the Kubernetes context, an AI Gateway is network infrastructure that implements the Gateway API and adds AI-specific capabilities such as token-based rate limiting, fine-grained access controls, payload inspection, and AI-protocol routing. Active proposals include payload processing (covering security features such as prompt injection protection and optimization features such as semantic routing and caching) and egress gateways (for securely routing traffic to external AI services such as OpenAI, Vertex AI, and Bedrock). The group is presenting at KubeCon Europe 2026 in Amsterdam and welcomes contributions from gateway implementers, platform operators, and AI developers via GitHub, weekly meetings, and Slack.
Table of contents
What is an AI Gateway?
Working group charter and mission
Active proposals
Upcoming events
Get involved