Pedal Detector (YOLOv8)

Object detection pipeline for identifying guitar pedals in pedalboard photos using YOLOv8n.

This project focuses on a practical end-to-end ML workflow in Python: dataset preparation, reproducible training, CLI inference, and model export.

Why This Project

Real-world computer vision task on custom data.
Complete workflow from raw labels to inference artifacts.
Lightweight model choice for fast inference.

Dataset Summary

Total labeled images: 164 (images/: 165 files, labels/: 164 files).
Train/val split: 139 / 25 (84.8% / 15.2%).
Class setup: single class guitar_pedal (YOLO TXT detection format).
Labeled objects: 1,652 total (1,413 train, 239 val), about 10.1 pedals per labeled image.
Raw training data (images/, labels/, datasets/) remains private due to licensing constraints.

Results Snapshot

Reference validation run: YOLOv8n, 50 epochs, batch 8, imgsz 640, seed 0, CPU.

Checkpoint	Precision	Recall	F1	mAP@50	mAP@50-95
Baseline (epoch 1)	0.7358	0.7082	0.7217	0.7543	0.4421
Best (epoch 39)	0.9407	0.9411	0.9409	0.9751	0.7343
Final (epoch 50)	0.9042	0.9596	0.9311	0.9759	0.7157

Baseline to best improvement:

mAP@50-95: +0.2922 (+66.1%)
F1: +0.2192 (+30.4%)

Error Analysis

Method:

Evaluated runs/detect/runs/detect/pedal_detector/weights/best.pt on datasets/val/images (25 images, 239 GT boxes).
Analysis thresholds: inference conf=0.25, NMS IoU 0.45, TP match IoU >=0.5.
Outputs saved to runs/detect/error_analysis_summary.json.

Aggregate detection errors at the selected operating point:

TP: 231
FP: 51
FN: 8
Precision: 0.8191
Recall: 0.9665
F1: 0.8868

Failure mode breakdown:

Failure mode	Count	Share	Mitigation priority
False positives on background/hardware	41	80.4% of FP	Add hard negatives (boards/cables/knobs without pedals), raise confidence threshold for deployment profile
Localization errors (0.1 <= IoU < 0.5 vs nearest GT)	10	19.6% of FP	Add tighter-box label QA on dense scenes, increase dense-scene samples
Missed medium objects (area ratio 1%-5%)	6	75.0% of FN	Targeted augmentations for medium-scale crowded layouts
Missed small objects (area ratio <1%)	1	12.5% of FN	Add higher-resolution crops / close-up samples
Missed large objects (area ratio >=5%)	1	12.5% of FN	Add lighting/occlusion diversity for large pedals

Additional quality signal:

22 / 231 matched detections (9.5%) had IoU < 0.75, indicating localization can still be tightened in cluttered scenes.

Hardest validation images by total errors (FP + FN):

images (82).jpg: FP 5, FN 1
ceOiSsG.jpg: FP 5, FN 0
A-Bass-Pedal-Setup-130720-small-finish.jpg: FP 1, FN 3
images (32).jpg: FP 3, FN 1
images (31).jpg: FP 4, FN 0

Example Prediction

Original	YOLOv8 Prediction

Generated with: python predict.py --source my_pedals.jpg --weights runs/detect/pedal_detector/weights/best.pt --conf 0.25 --iou 0.45 --save.

Quick Start

git clone https://github.com/<your-user>/pedal_detector.git
cd pedal_detector
python -m venv .venv
# Windows PowerShell
.venv\Scripts\Activate.ps1
pip install -r requirements.txt

Download pretrained weights from GitHub Releases:

curl -L -o best.pt https://github.com/escobar-david/pedal_detector/releases/download/yolo-pedalboards/best.pt
python predict.py --source my_pedals.jpg --weights best.pt --save

Reproduce Training

Reference run used for the metrics above:

python split_dataset.py --source-images images --source-labels labels --output-dir datasets --train-ratio 0.85 --seed 42
python train.py --data data.yaml --epochs 50 --batch 8 --imgsz 640 --device cpu --seed 0 --project runs/detect --name pedal_detector
python predict.py --source my_pedals.jpg --weights runs/detect/pedal_detector/weights/best.pt --save

Reference run training time: 7271.99 seconds (~2h 1m 12s).

CPU-only training:

python train.py --device cpu

CLI Usage

python train.py --help
python predict.py --help
python split_dataset.py --help
python interactive.py --help

Common CLI examples

Train with explicit run naming and seed:

python train.py --data data.yaml --epochs 50 --batch 16 --seed 123 --project runs/detect --name exp_seed123

Predict on CPU with custom thresholds:

python predict.py --source my_pedals.jpg --weights runs/detect/pedal_detector/weights/best.pt --device cpu --conf 0.3 --iou 0.45 --save

Run interactive mode on CPU and explicit weights:

python interactive.py --device cpu --weights runs/detect/pedal_detector/weights/best.pt

Split data with custom output and deterministic shuffle:

python split_dataset.py --source-images images --source-labels labels --output-dir datasets --train-ratio 0.9 --seed 7

Notes

predict.py now exits with a non-zero code when --source or --weights is invalid.
train.py supports reproducible runs via --seed and portable output paths via --project and --name.

Exit codes

train.py: returns 0 on successful execution.
split_dataset.py: returns 0 on successful split, non-zero for invalid --train-ratio or missing source directories.
predict.py: returns 0 on successful inference, non-zero when input source or weights are missing/invalid.
interactive.py: returns 0 on normal session exit (q/quit/exit), non-zero if weights cannot be resolved at startup.

Project Structure

pedal_detector/
|-- data.yaml
|-- split_dataset.py
|-- train.py
|-- predict.py
|-- interactive.py
|-- requirements.txt
|-- images/                # raw images (local/private by default)
|-- labels/                # raw labels (local/private by default)
|-- datasets/              # generated train/val split
`-- runs/                  # training and inference outputs

Design Choices

Model: YOLOv8n for speed and compact size.
Transfer learning from pretrained weights.
Augmentation tuned for small dataset:
HSV jitter
Horizontal flips
Moderate rotation (+/-10 degrees)
Mosaic and mixup

Limitations

Single class only: guitar_pedal.
Small dataset size may reduce generalization.
No instance segmentation or pedal-type classification yet.

Roadmap

Multi-class labels by pedal type.
Structured error analysis and evaluation reports.
Dockerized inference service.

Dataset and Rights

Before publishing images/labels publicly, confirm you have redistribution rights for every photo and annotation. Raw training data (images/, labels/, datasets/) is kept private in this project due to licensing constraints.

License

MIT. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.claude		.claude
.github/workflows		.github/workflows
runs/detect/predict		runs/detect/predict
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data.yaml		data.yaml
interactive.py		interactive.py
my_pedals.jpg		my_pedals.jpg
predict.py		predict.py
requirements.txt		requirements.txt
split_dataset.py		split_dataset.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pedal Detector (YOLOv8)

Why This Project

Dataset Summary

Results Snapshot

Error Analysis

Example Prediction

Quick Start

Reproduce Training

CLI Usage

Common CLI examples

Notes

Exit codes

Project Structure

Design Choices

Limitations

Roadmap

Dataset and Rights

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Pedal Detector (YOLOv8)

Why This Project

Dataset Summary

Results Snapshot

Error Analysis

Example Prediction

Quick Start

Reproduce Training

CLI Usage

Common CLI examples

Notes

Exit codes

Project Structure

Design Choices

Limitations

Roadmap

Dataset and Rights

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages