Towards Data Science is a community-powered publication that showcases work in data science, machine learning and artificial intelligence. Every day newcomers, seasoned researchers and industry practitioners publish tutorials, research notes and real-world case studies that help the field move forward.

A practical guide to causal inference with propensity score matching (PSM) in Python. PSM addresses selection bias in observational data by finding 'statistical twins' (control-group members whose characteristics are nearly identical to those of treated subjects), enabling a fair comparison without a randomized experiment. The tutorial walks through the full pipeline: computing propensity scores via logistic regression, matching pairs with nearest neighbors and a caliper threshold, evaluating balance using the standardized mean difference (SMD), and measuring the treatment effect with t-tests and Cohen's d.
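To make the pipeline concrete, here is a minimal sketch of the four steps on synthetic data. Everything specific in it is an assumption for illustration rather than the tutorial's own code: the simulated `age` and `activity` covariates, the true effect of 2.0, matching with replacement, a caliper of 0.2 standard deviations of the logit propensity score (a common rule of thumb), and the choice of Welch's t-test.

```python
import numpy as np
from scipy import stats
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import NearestNeighbors


def standardized_diff(a, b):
    """Difference in means divided by the pooled standard deviation.

    Used as SMD for covariates and as Cohen's d for the outcome
    (the two coincide when both groups have the same size).
    """
    pooled_sd = np.sqrt((a.var(ddof=1) + b.var(ddof=1)) / 2)
    return (a.mean() - b.mean()) / pooled_sd


# Synthetic observational data (illustrative only): older and more active
# users are more likely to be treated, which creates selection bias.
rng = np.random.default_rng(0)
n = 2000
age = rng.normal(40, 10, n)
activity = rng.normal(0, 1, n)
X = np.column_stack([age, activity])
p_treat = 1 / (1 + np.exp(-(0.05 * (age - 40) + 0.8 * activity - 0.5)))
treated = rng.random(n) < p_treat
outcome = 2.0 * treated + 0.1 * age + 1.5 * activity + rng.normal(0, 1, n)

# 1. Propensity score: P(treated | covariates) from a logistic regression.
model = LogisticRegression(max_iter=1000).fit(X, treated)
ps = model.predict_proba(X)[:, 1]
logit_ps = np.log(ps / (1 - ps))

# 2. One-nearest-neighbor matching on the logit score, with a caliper.
#    Controls may be reused (matching with replacement).
t_idx = np.flatnonzero(treated)
c_idx = np.flatnonzero(~treated)
nn = NearestNeighbors(n_neighbors=1).fit(logit_ps[c_idx].reshape(-1, 1))
dist, pos = nn.kneighbors(logit_ps[t_idx].reshape(-1, 1))
caliper = 0.2 * logit_ps.std()
keep = dist[:, 0] <= caliper          # drop treated units with no close twin
matched_t = t_idx[keep]
matched_c = c_idx[pos[keep, 0]]

# 3. Balance check: SMD of each covariate before and after matching.
smd_before = {}
smd_after = {}
for name, col in {"age": age, "activity": activity}.items():
    smd_before[name] = standardized_diff(col[treated], col[~treated])
    smd_after[name] = standardized_diff(col[matched_t], col[matched_c])
    print(f"{name}: SMD before={smd_before[name]:.3f}, after={smd_after[name]:.3f}")

# 4. Treatment effect on the matched sample: mean difference, Welch's
#    t-test, and Cohen's d.
naive_diff = outcome[treated].mean() - outcome[~treated].mean()
att = outcome[matched_t].mean() - outcome[matched_c].mean()
t_stat, p_value = stats.ttest_ind(
    outcome[matched_t], outcome[matched_c], equal_var=False
)
cohens_d = standardized_diff(outcome[matched_t], outcome[matched_c])
print(f"naive difference={naive_diff:.2f}, matched estimate={att:.2f}")
print(f"t={t_stat:.2f}, p={p_value:.3g}, Cohen's d={cohens_d:.2f}")
```

Because the data are simulated with a known effect, the sketch makes it easy to compare the naive treated-versus-control difference with the matched estimate. On real data the true effect is unknown, so the balance check in step 3 is the main evidence that the matched groups are comparable on the observed covariates.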

Correlation vs. Causation: Measuring True Impact with Propensity Score Matching