Breast Cancer Classifier Using Machine Learning

This project is a Machine Learning-based breast cancer classifier that predicts whether a tumor is malignant or benign using key clinical data. The model is built with Python and leverages popular ML libraries.

Features

Data Preprocessing: Handles missing values, normalizes data, and prepares it for model training.
Model Training: Uses supervised learning algorithms, including Logistic Regression, Support Vector Machines, and Random Forests.
Evaluation Metrics: Includes accuracy, precision, recall, and F1 score for model performance evaluation.

Dataset

The dataset used in this project is sourced from the Breast Cancer Wisconsin (Diagnostic) dataset. It includes the following features:

Mean radius, texture, perimeter, area, and more.
Diagnosis: M (Malignant) or B (Benign).

Installation

Clone the repository:

git clone https://github.com/s3bu7i/ML-Breast-Cancer-Classifier.git

Navigate to the project directory:
```
cd ML-Breast-Cancer-Classifier
```
Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Run the preprocessing script:
```
python preprocess.py
```
Train the model:
```
python train.py
```
Evaluate the model:
```
python evaluate.py
```
Predict new samples:
```
python predict.py
```

Model Performance

The classifier achieves high accuracy and reliability in distinguishing between malignant and benign cases. Below are the results of key evaluation metrics:

Accuracy: 97%
Precision: 96%
Recall: 95%
F1 Score: 95%

Project Structure

ML-Breast-Cancer-Classifier/
├── data/                 # Dataset and preprocessing scripts
├── models/               # Saved models
├── notebooks/            # Jupyter notebooks for exploratory data analysis
├── scripts/              # Training and evaluation scripts
├── requirements.txt      # Python dependencies
└── README.md             # Project documentation

Future Enhancements

Implement deep learning models for improved performance.
Explore feature selection and optimization techniques.
Build a web application for real-time classification.

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
.github		.github
data		data
notebooks		notebooks
others		others
output		output
report		report
scripts		scripts
tests		tests
.coverage		.coverage
.flake8		.flake8
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
app.py		app.py
best_lgbm_model.joblib		best_lgbm_model.joblib
best_logreg_model.joblib		best_logreg_model.joblib
best_mlp_model.joblib		best_mlp_model.joblib
best_tree_model.joblib		best_tree_model.joblib
best_xgb_model.joblib		best_xgb_model.joblib
coverage.xml		coverage.xml
data_processing.log		data_processing.log
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Breast Cancer Classifier Using Machine Learning

Features

Dataset

Installation

Usage

Model Performance

Project Structure

Future Enhancements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

s3bu7i/ML-Breast-Cancer-Classifier

Folders and files

Latest commit

History

Repository files navigation

Breast Cancer Classifier Using Machine Learning

Features

Dataset

Installation

Usage

Model Performance

Project Structure

Future Enhancements

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages