Skip to content

DeepakKumarGS/Categorical-Encoding

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Categorical-Encoding

Repository for Categorical Feature Encoding Challenge in Kaggle 2019

Achieved rank of 687 out of 1342 participants

Techniques tried out :

One hot encoding ( OHE) columns bin_0 to bin_4

Frequency encoding colums with high cardinality - nom_0 to nom_9 , ord_0 to ord_5,day,month

Model

LGBM with 5 fold cross validation .Parameters optimized through bayesian optimization

Techniques which did not work

1.Feature engineering - Split ord_5 into two columns and frequency encode

2.Remove variables which are present in 0.1% of the total data for columns with high cardinality

About

Repository for Categorical Feature Encoding Challenge in Kaggle 2019

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published