Classic ML models

A set of classic ML models written from scratch using numpy

About The Project

This project serves an educational purpose of showing how models in Machine Learning work under the hood. I've made them using exclusively Python code library and numpy.
Where are several classification models:

Logistic Regression
K nearest neighbors
Support Vector Machine
Bayesian Classificator
Decision tree

Prerequisites

Python >= 3.9
Jupyter Notebook to run code in the notebook

Setup and run

Clone the repository by running

git clone https://github.com/dvarkless/Classic-ML-Models.git

Create a python virtual environment:

cd Classic-ML-Models
python -m venv venv

If you are using Linux or Mac:

source ./venv/bin/activate

If you are using Windows:

./venv/Scripts/activate.ps1

Create an IPython kernel if you want to run it in Jupyter Notebook:

python -m ipykernel install --user --name=classic-ml-models

Usage

Prepare a dataset, split it into training data, evaluation input and evaluation answers:

training_data = np.genfromtxt("datasets/light-train.csv", delimiter=",", filling_values=0)
evaluation_data = np.genfromtxt("datasets/medium-test.csv", delimiter=",", filling_values=0)
evaluation_input = evaluation_data_lite[:, 1:]
evaluation_answers = evaluation_data_lite[:, 0]

datapack = (training_data, evaluation_input, evaluation_answers)

Pass hyperparameters into two dicts:

The first one is used to create a class instance
The second dict passes parameters into model one-by-one. It is used to show how different parameters affect the model's prediction quality

hp = {
    'data_converter': get_plain_data,
    'normalization': True,
    'shift_column': True,
    'learning_rate': 0.05,
    'batch_size': 300,
    'epochs': 300,
    'num_classes': 26,
    'reg': 'l1',
    'reg_w': 0.05,
}

params_to_change = {
    'learning_rate': [0.01, 0.02, 0.05],
}

Run the model using a ModelRunner class:

MultilogRunner = ModelRunner(MultilogRegression, defaults=hp, metrics=my_metrics, responsive_bar=True)
MultilogRunner.run(*datapack, params_to_change, one_vs_one=True)

License

Distributed under the MIT License. See LICENSE.txt for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
datasets		datasets
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
image_feature_detector.py		image_feature_detector.py
metrics.py		metrics.py
model_base.py		model_base.py
model_bayes.py		model_bayes.py
model_decisiontree.py		model_decisiontree.py
model_knn.py		model_knn.py
model_multilog.py		model_multilog.py
model_runner.py		model_runner.py
model_svm.py		model_svm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Classic ML models

About The Project

Prerequisites

Setup and run

Usage

License

About

Languages

License

dvarkless/Classic-ML-Models

Folders and files

Latest commit

History

Repository files navigation

Classic ML models

About The Project

Prerequisites

Setup and run

Usage

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages