Mercari automated their search results quality monitoring by replacing manual review processes with LLM-based evaluation. Using Gemini 2.5 Pro, they implemented a system that scores search result relevance on a 0.0-1.0 scale based on Amazon's ESCI criteria (Exact, Substitute, Complement, Irrelevant). The solution provides both

5m read time From engineering.mercari.com
Post cover image
Table of contents
Mercari’s Product Search and Its Quality ManagementChallenges and Requirements in Search Results Quality ReviewAchieving Objective and Stable Monitoring with LLMs and Evaluation CriteriaHow the Quality Monitoring Tools workPossibilities for Further ExpansionConclusion

Sort: