DoorDash developed DashCLIP, a multimodal embedding framework that combines text and image encoders to generate semantic representations of products and user queries. The system uses contrastive learning on 400,000 products, domain adaptation from BLIP-14M, and LLM-augmented relevance datasets to improve ad ranking and
Sort: