Path: blob/master/14_imbalanced/handling_imbalanced_data_exercise.md
1141 views
Exercise: Handling imbalanced data in machine learning
Use this notebook but handle imbalanced data using simple logistic regression from skelarn library. The original notebook using neural network but you need to use sklearn logistic regression or any other classification model and improve the f1-score of minority class using,
Undersampling
Oversampling: duplicate copy
OVersampling: SMOT
Ensemble
Take this dataset for bank customer churn prediction : https://www.kaggle.com/barelydedicated/bank-customer-churn-modeling
Build a deep learning model to predict churn rate at bank
Once model is built, print classification report and analyze precision, recall and f1-score
Improve f1 score in minority class using various techniques such as undersampling, oversampling, ensemble etc
Thanks https://github.com/src-sohail for providing this solution.