Trusting the Untestable: Validation and Diagnostics for the Doubly Robust Models written by Ross Chu and Shima Nassiri The Causal Frontier: Measurement Beyond Randomization The gold standard for …

LyftEng's platform is a central hub for engineering insights and technology updates from Lyft's engineering team. Through articles, tech talks, and open-source contributions, LyftEng offers insights into engineering challenges, innovation projects, and best practices in software development. Readers can learn about Lyft's engineering culture, technology stack, and contributions to the broader engineering community.

Lyft Engineering

Lyft's data science team developed validation methods for Augmented Inverse Propensity Weighting (AIPW), a doubly robust causal inference model used when A/B testing isn't feasible. The platform requires rigorous confounder management with hundreds of features, applies propensity score corrections for downsampled data, and provides diagnostic scorecards checking propensity overlap and covariate balance. Validation against experimental ground truth from ride challenge programs revealed AIPW understates effects by 16% due to propensity trimming creating non-representative samples. The team added marginal sensitivity models and covariate comparison diagnostics to detect when hidden confounders or trimming compromise estimate reliability.

Trusting the Untestable: Validation and Diagnostics for the Doubly Robust Models

Get Shima Nassiri ’s stories in your inbox