Part 3 of a credit scoring model series focuses on data preprocessing: handling outliers and missing values in borrower data using Python. It covers creating a synthetic time variable for train/test/OOT splits, applying stratified splitting to preserve default rate and temporal structure, treating outliers with the IQR method

18m read timeFrom towardsdatascience.com
Post cover image
Table of contents
Creating a Time VariableImputing Missing ValuesConclusionReferences

Sort: