This project is a Machine Learning-based breast cancer classifier that predicts whether a tumor is malignant or benign using key clinical data. The model is built with Python and leverages popular ML libraries.
- Data Preprocessing: Handles missing values, normalizes data, and prepares it for model training.
- Model Training: Uses supervised learning algorithms, including Logistic Regression, Support Vector Machines, and Random Forests.
- Evaluation Metrics: Includes accuracy, precision, recall, and F1 score for model performance evaluation.
The dataset used in this project is sourced from the Breast Cancer Wisconsin (Diagnostic) dataset. It includes the following features:
- Mean radius, texture, perimeter, area, and more.
- Diagnosis:
M(Malignant) orB(Benign).
- Clone the repository:
git clone https://github.com/s3bu7i/ML-Breast-Cancer-Classifier.git
- Navigate to the project directory:
cd ML-Breast-Cancer-Classifier - Install the required dependencies:
pip install -r requirements.txt
- Run the preprocessing script:
python preprocess.py
- Train the model:
python train.py
- Evaluate the model:
python evaluate.py
- Predict new samples:
python predict.py
The classifier achieves high accuracy and reliability in distinguishing between malignant and benign cases. Below are the results of key evaluation metrics:
- Accuracy: 97%
- Precision: 96%
- Recall: 95%
- F1 Score: 95%
ML-Breast-Cancer-Classifier/
├── data/ # Dataset and preprocessing scripts
├── models/ # Saved models
├── notebooks/ # Jupyter notebooks for exploratory data analysis
├── scripts/ # Training and evaluation scripts
├── requirements.txt # Python dependencies
└── README.md # Project documentation
- Implement deep learning models for improved performance.
- Explore feature selection and optimization techniques.
- Build a web application for real-time classification.