This project goals is to predict whether the customer will be defaulted or not. This dataset consist of 307511 records and 112 features.
We are using several methods:
- Quasi constant method
- Chi square
- Univariate feature selection
- Univariate AUC
we are using AUC score as evaluation metrics, because in credit card default usually AUC is proper evaluation metric for this case.
- Logistic Regression
- Decision Tree
- Random Forest
- Adaboosting
- Gradient Boosting
After implementing hyperparameter tuning, our best fit model is using Random Forest Classifier model with :
- train auc score: 0.7067893531568168
- test auc score: 0.7065710624972077