Instacart built a two-phase ML pipeline to automate digitization of grocery flyers, reducing processing time from 3-4 hours to 30 minutes. Phase 1 uses Meta's Segment Anything Model with custom algorithms for image segmentation, achieving 75-90% accuracy in extracting product bounding boxes. Phase 2 combines OCR, LLMs, and Instacart's existing search infrastructure to match segmented images to catalog products with 95% recall. The system handles complex flyer layouts through techniques like Weighted Boxes Fusion, model ensembling, and ML-based filtering.

9m read timeFrom tech.instacart.com
Post cover image
Table of contents
IntroductionOur Approach: A two phase pipelinePhase 1: Image SegmentationGet Prithvi Srinivasan ’s stories in your inbox

Sort: