Pinterest Search scaled its relevance assessment by fine-tuning an open-source multilingual LLM (XLM-RoBERTa-large) on human-annotated data to predict search result relevance. This approach reduced labeling cost and turnaround time while achieving 73.7% exact match with human labels and strong rank correlation (Kendall's τ > 0.5). By enabling stratified sampling designs over much larger query sets, it cut the minimum detectable effect from 1.3–1.5% to ≤0.25%, primarily through variance reduction. The system now evaluates A/B experiments across multiple languages and query popularity segments, producing sDCG@K metrics for ranking quality assessment.
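The sDCG@K metric mentioned above can be sketched in a few lines. The post does not spell out the exact formula, so this is a minimal sketch under common assumptions: graded relevance labels on a 0–4 scale (as LLM-predicted judgments often are) and normalization by an ideal ranking in which every slot carries the maximum label. The function names `dcg_at_k` and `sdcg_at_k` are illustrative, not from the source.

```python
import math

def dcg_at_k(relevances, k):
    # Discounted cumulative gain over the top-k results:
    # higher-graded results count more, and earlier positions
    # are discounted less (log2 position discount).
    return sum((2 ** rel - 1) / math.log2(i + 2)
               for i, rel in enumerate(relevances[:k]))

def sdcg_at_k(relevances, k, max_rel=4):
    # Scaled/normalized DCG (assumption): divide by the DCG of a
    # hypothetical ideal ranking where every top-k slot holds the
    # maximum relevance label, yielding a score in [0, 1].
    ideal = dcg_at_k([max_rel] * k, k)
    return dcg_at_k(relevances, k) / ideal if ideal else 0.0

# Example: a ranking with the best result first scores higher than
# the same labels with the best result last.
top_heavy = sdcg_at_k([4, 2, 0], k=3)
bottom_heavy = sdcg_at_k([0, 2, 4], k=3)
```

With per-result relevance predicted by the fine-tuned model, averaging sDCG@K over a stratified query sample gives the experiment-level ranking quality score described in the post.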

9 min read · From medium.com
Table of contents
- Introduction
- Methodology
- Fine-tuned LLMs as Relevance Model
- Stratified Sampling Design
- Relevance Measurement with LLMs
- Results
- Summary
- Future Work
- Acknowledgement