Part 3 of a credit scoring model series focuses on data preprocessing: handling outliers and missing values in borrower data using Python. It covers creating a synthetic time variable for train/test/OOT splits, applying stratified splitting to preserve default rate and temporal structure, treating outliers with the IQR method
Sort: