A solution architect at Swiss Post presents how they evolved their ML fraud detection system (IRIS) from manual deployments to a fully automated, reliable pipeline. The talk covers their AWS-based architecture using SageMaker for training and inference, EKS for serving, and Kafka for streaming. The core technique is shadow model deployment: running a new candidate model in parallel with the production model on real traffic, with zero user impact. This enables validation of operational metrics (latency), data drift, concept drift, and model-to-model drift using real production data. Synthetic requests with known labels are used to confirm drift direction. Once the shadow model proves superior, it is promoted to production via a riskless switch using Lambda and EventBridge, with no downtime.
Sort: