Amazon SageMaker Inference now supports OpenAI-compatible APIs, allowing developers to connect existing tools like the OpenAI SDK, LangChain, and Strands Agents directly to SageMaker endpoints by simply changing the endpoint URL. No custom integration code or SDK rewrites are needed. The feature preserves existing streaming logic and framework integrations while giving users control over GPU instances, VPC data residency, open source or fine-tuned models, and auto-scaling. Authentication uses existing AWS credentials. Available now across 14 AWS regions.
Sort: