Generic Machine Learning Pipeline

This repository contains a versatile machine learning pipeline, exemplified with a housing price prediction task. While the current implementation is tailored to housing price prediction, the structure is designed to be adaptable for various other prediction tasks with minor modifications.

Overview

The pipeline follows these main steps:

Data Loading: Loads the dataset. In the example, housing datasets such as California and Ames are used.
Data Exploration: Explores the dataset to understand its characteristics.
Data Preprocessing: Processes the data to ensure it is suitable for modeling.
Train-Test Split: Divides the dataset into training and testing subsets.
Feature Selection: Uses RandomForestRegressor to identify significant features. This step can be adapted for other feature selection methods.
Model Building: Constructs a predictive model. The example uses the LightGBM algorithm, but other algorithms can be substituted.
Hyperparameter Tuning: Optimizes model parameters. GridSearchCV is employed in the example.
Model Evaluation: Assesses the model's performance using various metrics.
Model Saving: Serializes the trained model for deployment or future use.

Libraries Utilized

pandas
scikit-learn
LightGBM
joblib

Usage

To execute this pipeline, run:

python pipeline.py

For adapting this pipeline to other tasks, users may need to adjust data loading, preprocessing, and the choice of machine learning algorithm as per the specific requirements.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
pipeline.py		pipeline.py
readme.md		readme.md
ui.py		ui.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Generic Machine Learning Pipeline

Overview

Libraries Utilized

Usage

About

Releases

Packages

Languages

jasonjiang8866/tabularML

Folders and files

Latest commit

History

Repository files navigation

Generic Machine Learning Pipeline

Overview

Libraries Utilized

Usage

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages