Amazon SageMaker now supports sticky session routing for inference. This feature allows requests in the same session to be routed to the same instance, reducing latency and improving user experience by leveraging previously processed information. It is beneficial for applications requiring large data payloads or seamless interactive experiences. The feature is available in all regions where SageMaker is available.
Sort: