Skip to content

secrakib/Guess-My-Income

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Guess-My-Income

images

Guessing Income Based On 15 Parameters

About Dataset

  • The dataset Contains 15 Columns and 48882 rows
  • Out of 15 features 6 features are Continuos and 9 are Catagorial
  • The Dataset is Highly Imbalanced

Problem

We need to predict income Based on a Dataset which is Highly Imbalanced

download (2)

Solution Approach

We Applied SMOTE Upsampling And Near Miss DownSampling to address the issue

Balanced Data After Preproceesing

download (3)


Best Model

LGBM Mdoel Achieved a f1 Score of 65.154 % Screenshot (76)

Stack Used

  • Numpy
  • Pandas
  • Scikit-Learn
  • Matplotlib
  • Seaborn
  • Pickle

Workflow Diagram

Diagram_page-0001

Diagram.pdf

Install Libraries

Pandas

pip install pandas

Numpy

pip install numpy

Pickle

pip install pickle5

Scikit-learn

pip install scikit-learn

matplotlib

pip install matplotlib

Seaborn

pip install seaborn

Imblearn

pip install imblearn

Clone Repo

https://github.com/secrakib/Guess-My-Income.git

License

Apache License 2.0

About

Handled Highly Imbalanced Data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published