Yuki and Sho, machine learning engineers at Mercari, discuss fine-tuning the SigLIP Vision Language model for Mercari's product catalog to enhance the 'Visually Similar Items' feature. Through A/B testing, the fine-tuned model significantly improved tap rates and purchase counts. They detail the fine-tuning process, model

12m read timeFrom engineering.mercari.com
Post cover image
Table of contents
Fine-tuning of the SigLIP model using product dataDeployment ArchitectureConclusionReferences

Sort: