Decision trees often create perpendicular split conditions which can lead to overfitting, particularly with diagonal decision boundaries. Running PCA before fitting a decision tree can project data into orthogonal space, potentially reducing the tree's depth and improving performance. However, PCA components are not interpretable, which can be a limitation in some cases. Proper feature engineering might be necessary for better model performance.

5m read timeFrom blog.dailydoseofds.com
Post cover image
Table of contents
For those who want to build a career in DS/ML on core expertise, not fleeting trends:SPONSOR US
1 Comment

Sort: