Instacart built a two-phase ML pipeline to automate digitization of grocery flyers, reducing processing time from 3-4 hours to 30 minutes. Phase 1 uses Meta's Segment Anything Model with custom algorithms for image segmentation, achieving 75-90% accuracy in extracting product bounding boxes. Phase 2 combines OCR, LLMs, and

9m read timeFrom tech.instacart.com
Post cover image
Table of contents
IntroductionOur Approach: A two phase pipelinePhase 1: Image SegmentationGet Prithvi Srinivasan ’s stories in your inbox

Sort: