- The dataset contains 15 columns and 48882 rows.
- Of the 15 features, 6 are continuous and 9 are categorical.
- The dataset is highly imbalanced.
We need to predict income from a dataset that is highly imbalanced.
We applied SMOTE up-sampling and NearMiss down-sampling to address the class imbalance.
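To illustrate the idea behind SMOTE up-sampling, here is a minimal standard-library sketch. It is not the code from this repository, which presumably relies on the `imblearn` implementations listed in the install steps. The function name `smote_like_oversample`, the toy data, and the choice of `k` are all assumptions made for illustration. NearMiss down-sampling, which instead drops majority samples, is not shown.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (the core SMOTE idea).
    Hypothetical helper for illustration only."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` among the other minority samples
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy imbalanced data (made-up values): 20 majority points, 4 minority points.
rng = random.Random(42)
majority = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(20)]
minority = [(rng.gauss(3, 0.5), rng.gauss(3, 0.5)) for _ in range(4)]

# Generate enough synthetic minority points to match the majority class size.
new_points = smote_like_oversample(
    minority, n_new=len(majority) - len(minority), k=3
)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))
```

Because each synthetic point lies on a segment between two real minority samples, the up-sampled class stays inside the region the minority class already occupies rather than duplicating rows exactly.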
The LGBM model achieved an F1 score of 65.154%.
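A short sketch of why F1 is a more informative metric than accuracy on imbalanced data. The labels below are made-up toy values, not results from this project, and `f1_score_binary` is a hypothetical helper written for this example (in practice `sklearn.metrics.f1_score` would likely be used).

```python
def f1_score_binary(y_true, y_pred, positive=1):
    """F1 = harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy imbalanced labels: 90 negatives followed by 10 positives.
y_true = [0] * 90 + [1] * 10

# Predictor A always predicts the majority class.
pred_a = [0] * 100

# Predictor B flags 4 negatives wrongly and finds 6 of the 10 positives.
pred_b = [1] * 4 + [0] * 86 + [1] * 6 + [0] * 4

print(accuracy(y_true, pred_a), f1_score_binary(y_true, pred_a))
print(accuracy(y_true, pred_b), f1_score_binary(y_true, pred_b))
```

Predictor A reaches 90% accuracy while never identifying a positive case, so its F1 is zero. Predictor B is only slightly more accurate but has a much higher F1, which is why F1 is reported for this imbalanced task.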
- NumPy
- Pandas
- Scikit-Learn
- Matplotlib
- Seaborn
- Pickle
pip install pandas
pip install numpy
pip install pickle5
pip install scikit-learn
pip install matplotlib
pip install seaborn
pip install imblearn
https://github.com/secrakib/Guess-My-Income.git
Apache License 2.0