This is probably one of the projects I have cared about the most. It fuses my convolutional network from the (currently shelved) diffusion-model experiments with the multilayer perceptron from my earlier neural-network work, creating a playground for building simple computer-vision ideas.
- Python 3.9 or higher
- Webcam
- (Optional) CUDA-capable GPU for acceleration
- Captures webcam frames and trains a model on the fly with user-labeled targets.
- Chains a NumPy/CuPy-style CNN front-end to a hand-built MLP classifier.
- Lets me prototype different spatial resolutions, kernel counts, and output spaces without regenerating scaffolding.
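The CNN-to-MLP chain can be sketched in plain NumPy. The shapes, kernel count, and layer sizes below are illustrative stand-ins, not the project's actual settings:

```python
import numpy as np

def conv2d_forward(x, kernels):
    """Valid 2D convolution of a single-channel image with a stack of kernels."""
    kh, kw = kernels.shape[1], kernels.shape[2]
    oh, ow = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    out = np.empty((kernels.shape[0], oh, ow))
    for k, kern in enumerate(kernels):
        for i in range(oh):
            for j in range(ow):
                out[k, i, j] = np.sum(x[i:i + kh, j:j + kw] * kern)
    return out

def mlp_forward(x, w1, b1, w2, b2):
    """Two-layer perceptron: ReLU hidden layer, linear output (logits)."""
    h = np.maximum(0, x @ w1 + b1)
    return h @ w2 + b2

rng = np.random.default_rng(0)
frame   = rng.random((8, 8))                # stand-in for a preprocessed webcam frame
kernels = rng.standard_normal((4, 3, 3))    # 4 learnable 3x3 kernels
feats = np.maximum(0, conv2d_forward(frame, kernels)).reshape(-1)  # ReLU + flatten
w1 = rng.standard_normal((feats.size, 16)); b1 = np.zeros(16)
w2 = rng.standard_normal((16, 3));          b2 = np.zeros(3)      # 3 output classes
logits = mlp_forward(feats, w1, b1, w2, b2)
print(logits.shape)  # (3,)
```

Swapping resolutions, kernel counts, or class counts only changes the array shapes, which is what makes the playground cheap to reconfigure.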
- Live training works: the Train screen streams the camera feed, you pick the “correct output,” hit start, and gradients flow every frame.
- Model creation works: tweak kernel sizes, depth, and class counts, then jump straight into training mode.
- Loading/saving is in progress: the UI hooks exist, but serialization still needs to be finished.
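The "gradients flow every frame" idea boils down to one SGD step per captured frame. A minimal sketch with a stand-in model (softmax regression here, in place of the real CNN/MLP stack); `train_step` and its shapes are hypothetical:

```python
import numpy as np

def train_step(w, b, frame, label, lr=0.01):
    """One SGD step of softmax regression on a single flattened frame."""
    x = frame.reshape(-1)
    logits = x @ w + b
    p = np.exp(logits - logits.max()); p /= p.sum()   # stable softmax
    target = np.zeros_like(p); target[label] = 1.0
    grad = p - target                                 # dL/dlogits for cross-entropy
    w -= lr * np.outer(x, grad)
    b -= lr * grad
    return -np.log(p[label])                          # per-frame cross-entropy loss

rng = np.random.default_rng(1)
w = np.zeros((64, 3)); b = np.zeros(3)
# Simulate 50 frames, all labeled with the user-selected class 1.
losses = [train_step(w, b, rng.random((8, 8)), label=1) for _ in range(50)]
print(losses[0], losses[-1])
```

In the real app the frames come from the OpenCV capture loop and the label is whatever class the user has selected in the Train screen.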
python ComputerVisionPlayground.py
- Choose Create New, dial in the conv/MLP settings, and submit.
- Click Submit, aim the webcam, select the class label, and toggle live training.
To use CuPy for GPU acceleration:
# For CUDA 12.x
pip install cupy-cuda12x
# Then edit ConvolutionalNeuralNetwork_numpy.py:
# Change: import numpy as cp
# To:     import cupy as cp
- PySide6: GUI framework
- OpenCV: Webcam capture and image processing
- NumPy: Numerical computing for neural networks
- CuPy (optional): GPU-accelerated computing
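Instead of hand-editing the import, a try/except alias lets the same code run on either backend. This is a sketch of that common pattern, not the project's actual module:

```python
# Optional GPU backend: fall back to NumPy when CuPy isn't installed.
try:
    import cupy as cp
    ON_GPU = True
except ImportError:
    import numpy as cp
    ON_GPU = False

# Everything downstream uses `cp`, so the array code is backend-agnostic.
a = cp.arange(6, dtype=cp.float32).reshape(2, 3)
print(ON_GPU, a.sum())
```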
- Change the training flow to sample videos based on the user's input and train X epochs on those videos. Move live-loop training to loading. (Done)
- Finish the save/load path so experiments aren’t strictly in-memory. (Done)
- Add better telemetry (loss plots, per-class confidence readouts) to understand what the live loop is learning.
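A minimal version of that telemetry could be a rolling loss average plus a softmax confidence readout. `LossMeter` and the class labels below are hypothetical:

```python
from collections import deque
import numpy as np

class LossMeter:
    """Rolling mean of the last `window` per-frame losses."""
    def __init__(self, window=100):
        self.buf = deque(maxlen=window)
    def update(self, loss):
        self.buf.append(float(loss))
        return sum(self.buf) / len(self.buf)

def class_confidences(logits, labels):
    """Softmax over logits -> {label: probability} readout."""
    p = np.exp(logits - np.max(logits)); p /= p.sum()
    return dict(zip(labels, p.round(3)))

meter = LossMeter(window=3)
for loss in (1.0, 0.8, 0.6, 0.4):
    avg = meter.update(loss)
print(avg)  # mean of the last 3 losses ≈ 0.6
conf = class_confidences(np.array([2.0, 0.5, 0.1]), ["cup", "hand", "none"])
print(conf)
```

Feeding `avg` into a plot each frame gives the loss curve; printing `conf` under the camera feed gives the per-class readout.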
- Explore lightweight data augmentation for more stable real-time training.
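A sketch of what lightweight augmentation could look like for normalized frames (random horizontal flip plus brightness jitter); the jitter range is illustrative:

```python
import numpy as np

def augment(frame, rng):
    """Cheap per-frame augmentation: random horizontal flip + brightness jitter."""
    if rng.random() < 0.5:
        frame = frame[:, ::-1]             # mirror left-right
    frame = frame * rng.uniform(0.8, 1.2)  # global brightness scale
    return np.clip(frame, 0.0, 1.0)        # keep values in the normalized range

rng = np.random.default_rng(2)
frame = rng.random((8, 8))
aug = augment(frame, rng)
print(aug.shape)
```

Applying this to each captured frame before the gradient step would give the live loop a slightly wider view of each class for nearly no extra cost.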