Learning From Pairwise Preferences: An Introduction to the Bradley Terry Model

A thorough introduction to the Bradley-Terry model for learning probabilistic rankings from pairwise comparisons. Covers the core mathematical formulation (latent strength parameters, log-likelihood, gradient interpretation), fitting methods (gradient ascent, Newton methods, MM updates), and identifiability constraints. Extends into contextual Bradley-Terry (equivalent to logistic regression on feature differences), with application to LMSYS Chatbot Arena for LLM evaluation. Also covers CrowdBT for handling noisy annotators via EM-based joint estimation of item strengths and annotator reliabilities, plus Bayesian extensions like TrueSkill.

#machine-learning

#llm

May 27•27m read time•From towardsdatascience.com

Table of contents

A Simple Example Fitting the Model From Data A Deeper Look at Bradley-Terry Model Fitting From Local Judgments to Global Structure Why Pairwise Comparisons Are Often Better Than Direct Scores Going Deeper: Identifiability, Curvature, and Optimization Contextual Bradley-Terry: When Strength Depends on Setting Accounting for Noisy Raters: When Not All Comparisons Are Equal Summary Further Reading

Comment

Bookmark

Copy

Sort: