In this repository, I have created a project for automated feature engineering for machine learning. You give it the path to any Excel or CSV file, it applies the feature engineering operations listed below to that file, and you get back one processed file that you can use directly in your machine learning project.
- Data Cleaning --> handling missing values, dealing with outliers, and correcting data entry errors.
- Data Integration --> merging multiple datasets into a single dataset.
- Data Transformation --> normalization, standardization, and log or power transformation.
- Handling Text and Categorical Data --> tokenization, one-hot encoding, label encoding.
- Handling Date and Time Data --> converting date and time values into a numerical format.
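A few of the operations above can be sketched as follows. This is only an illustration, not the code in main.py: the helper name `engineer_features`, the demo columns, and the specific choices (median imputation, IQR clipping for outliers, standardization, one-hot encoding, and year/month/day extraction) are assumptions made for the example. It also assumes every column has at least one non-missing value.

```python
import pandas as pd
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler


def engineer_features(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: clean, transform, and encode one DataFrame."""
    out = df.copy()

    # Split the columns by kind before changing anything.
    num_cols = list(out.select_dtypes(include="number").columns)
    date_cols = [c for c in out.columns
                 if pd.api.types.is_datetime64_any_dtype(out[c])]
    cat_cols = [c for c in out.columns if c not in num_cols + date_cols]

    if num_cols:
        # Data cleaning: fill missing numeric values with the column median.
        values = SimpleImputer(strategy="median").fit_transform(out[num_cols])
        nums = pd.DataFrame(values, columns=num_cols, index=out.index)

        # Data cleaning: clip outliers to 1.5 * IQR beyond the quartiles.
        q1, q3 = nums.quantile(0.25), nums.quantile(0.75)
        iqr = q3 - q1
        nums = nums.clip(lower=q1 - 1.5 * iqr, upper=q3 + 1.5 * iqr, axis=1)

        # Data transformation: standardize to zero mean and unit variance.
        out[num_cols] = StandardScaler().fit_transform(nums)

    # Categorical data: fill missing values with the most frequent value.
    for col in cat_cols:
        out[col] = out[col].fillna(out[col].mode().iloc[0])

    # Date and time data: replace each datetime column with numeric parts.
    for col in date_cols:
        out[f"{col}_year"] = out[col].dt.year
        out[f"{col}_month"] = out[col].dt.month
        out[f"{col}_day"] = out[col].dt.day
    out = out.drop(columns=date_cols)

    # Categorical data: one-hot encode the remaining non-numeric columns.
    if cat_cols:
        out = pd.get_dummies(out, columns=cat_cols, dtype=int)
    return out


# Small made-up dataset, used only to exercise the helper.
demo = pd.DataFrame({
    "age": [20, 30, None, 40],
    "city": ["A", "B", "A", None],
    "joined": pd.to_datetime(
        ["2021-01-05", "2021-06-10", "2022-03-15", "2022-12-01"]
    ),
})
result = engineer_features(demo)
print(result.columns.tolist())
```

The column kinds are detected once at the start, so the numeric parts extracted from dates are not standardized along with the original numeric columns.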
command: streamlit run main.py
- Python.
- Streamlit.
- pandas Python Module.
- sklearn Python Module.
- sklearn.impute For Handling Missing Values.
- sklearn.preprocessing For Normalization Of Data.
- pandas.get_dummies For Handling Categorical Data.
- pandas.read_csv or pandas.read_excel To Read Dataset.
- pandas.merge For Merging Datasets.
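
As a rough sketch of how the reading and merging functions listed above fit together, the example below picks a pandas reader from the file extension and joins two small tables. The helper name `load_dataset`, the sample tables, and the `id` key are hypothetical and are not taken from main.py.

```python
from pathlib import Path

import pandas as pd


def load_dataset(path: str) -> pd.DataFrame:
    """Hypothetical helper: choose a pandas reader from the file extension."""
    suffix = Path(path).suffix.lower()
    if suffix == ".csv":
        return pd.read_csv(path)
    if suffix in {".xlsx", ".xls"}:
        # read_excel needs an Excel engine (for example openpyxl) installed.
        return pd.read_excel(path)
    raise ValueError(f"Unsupported file type: {suffix!r}")


# Two made-up tables that share an "id" column.
customers = pd.DataFrame({"id": [1, 2, 3], "city": ["A", "B", "C"]})
orders = pd.DataFrame({"id": [1, 1, 3], "amount": [10.0, 5.0, 7.5]})

# A left join keeps every customer, including those without orders.
merged = pd.merge(customers, orders, on="id", how="left")
print(merged)
```

A left join is used here so that rows without a match stay in the result with missing values, which a later cleaning step can then handle.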
Dhrumil Patel