Neural network library built from scratch in Mojo
A foundational machine learning library written in Mojo, implementing neural networks and training algorithms from first principles. It combines Python's expressiveness with C-level performance.
Linear Algebra
- Vector operations (add, dot product, elementwise multiply/divide), sketched after this list
- Matrix operations (multiply, transpose, matrix-vector, matrix-matrix)
- Scalar operations and shape validation
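These vector primitives can be written in a handful of lines of pure Mojo. The sketch below is illustrative only, assuming `List[Float64]` vectors; the actual implementations and names in `src/visage.mojo` may differ.

```mojo
# Illustrative only: how two of these primitives can be written over
# List[Float64]; the real implementations in src/visage.mojo may differ.
fn vector_add_sketch(a: List[Float64], b: List[Float64]) raises -> List[Float64]:
    # Shape validation: elementwise ops require equal lengths.
    if len(a) != len(b):
        raise Error("vector_add: length mismatch")
    var result = List[Float64]()
    for i in range(len(a)):
        result.append(a[i] + b[i])
    return result

fn dot_sketch(a: List[Float64], b: List[Float64]) raises -> Float64:
    # Dot product: sum of elementwise products.
    if len(a) != len(b):
        raise Error("dot: length mismatch")
    var total: Float64 = 0.0
    for i in range(len(a)):
        total += a[i] * b[i]
    return total
```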
Neural Networks
- Dense (fully-connected) layers
- Activation functions: ReLU, Sigmoid, Tanh, Softmax
- Forward propagation through multi-layer networks (a single-layer sketch follows this list)
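A dense layer's forward pass is a matrix-vector multiply, a bias add, and an activation. The sketch below reuses `matrix_vector_multiply` and `vector_add` (the two functions imported in the quick-start example further down), assuming `vector_add(a, b)` returns the elementwise sum; `relu_vec` is a local stand-in because the exact activation names exported by `nn.mojo` are not shown in this README.

```mojo
# Sketch of one dense layer's forward pass: output = relu(W.x + b).
# relu_vec is a local stand-in, not the library's own activation.
from visage import matrix_vector_multiply, vector_add

fn relu_vec(x: List[Float64]) -> List[Float64]:
    var out = List[Float64]()
    for i in range(len(x)):
        if x[i] > 0.0:
            out.append(x[i])
        else:
            out.append(0.0)
    return out

fn main() raises:
    var weights: List[List[Float64]] = [
        [0.5, -0.3],
        [0.2, 0.8]
    ]
    var bias: List[Float64] = [0.1, -0.1]
    var inputs: List[Float64] = [1.0, 2.0]

    # Linear transform, bias shift, then the nonlinearity.
    var pre_activation = vector_add(matrix_vector_multiply(weights, inputs), bias)
    var activated = relu_vec(pre_activation)
    print(activated[0], activated[1])
```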
Training
- Backpropagation - Complete gradient computation using the chain rule
- Loss functions: MSE, Binary Cross-Entropy
- Gradient descent optimizer
- Full training loop with real learning (sketches of the loss and update step follow this list)

Coming soon:
- Advanced optimizers (Adam, momentum, RMSprop)
- Regularization (dropout, L2, batch normalization)
- Convolutional layers
- SIMD optimization for performance
- Model save/load
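To make the completed training pieces concrete, here is a rough sketch of an MSE loss and a plain gradient-descent update over `List[Float64]`. The helper names are hypothetical and not the library's API.

```mojo
# Hypothetical helpers illustrating the ideas above; the library's own
# loss and optimizer functions may use different names and signatures.

# Mean squared error: average of (prediction - target)^2 over the batch.
fn mse_sketch(predictions: List[Float64], targets: List[Float64]) raises -> Float64:
    if len(predictions) != len(targets):
        raise Error("mse: length mismatch")
    var total: Float64 = 0.0
    for i in range(len(predictions)):
        var diff = predictions[i] - targets[i]
        total += diff * diff
    return total / Float64(len(predictions))

# One plain gradient-descent step: new_w = w - learning_rate * dL/dw,
# where the gradients come from backpropagation (the chain rule applied
# layer by layer, from the loss back to each weight).
fn sgd_step_sketch(weights: List[Float64], gradients: List[Float64],
                   learning_rate: Float64) raises -> List[Float64]:
    if len(weights) != len(gradients):
        raise Error("sgd: length mismatch")
    var updated = List[Float64]()
    for i in range(len(weights)):
        updated.append(weights[i] - learning_rate * gradients[i])
    return updated
```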
```bash
# Clone repository
git clone https://github.com/itsdevcoffee/mojo-visage.git
cd mojo-visage

# Install dependencies (requires Mojo)
pixi install

# Run tests
pixi run test

# Watch a network learn XOR!
pixi run train-xor
```

Using the core linear algebra from your own code:

```mojo
from visage import matrix_vector_multiply, vector_add

fn main() raises:
    var weights: List[List[Float64]] = [
        [0.5, -0.3],
        [0.2, 0.8]
    ]
    var inputs: List[Float64] = [1.0, 2.0]
    var output = matrix_vector_multiply(weights, inputs)
    print(output)  # Neural network layer computation!
```

Running `pixi run train-xor` produces output like this:

```
Training Neural Network on XOR Problem
======================================
Epoch 500 | Loss: 0.254
Epoch 1000 | Loss: 0.114
Epoch 1500 | Loss: 0.018
...
Epoch 5000 | Loss: 0.001

Testing trained network:
Input: [0, 0] | Target: 0 | Prediction: 0.02 ✓
Input: [0, 1] | Target: 1 | Prediction: 0.96 ✓
Input: [1, 0] | Target: 1 | Prediction: 0.98 ✓
Input: [1, 1] | Target: 0 | Prediction: 0.04 ✓

✓ Training complete! Network learned XOR!
```
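As a sanity check on those numbers: if the demo uses the MSE loss listed above, the four errors of 0.02, 0.04, 0.02, and 0.04 give a mean squared error of (0.0004 + 0.0016 + 0.0004 + 0.0016) / 4 = 0.001, which matches the final reported loss.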
```bash
# Linear algebra operations
pixi run example-basic

# Forward propagation demo
pixi run example-network

# Train on XOR (classic non-linear problem)
pixi run train-xor
```

```
mojo-visage/
├── src/
│   ├── visage.mojo   # Core linear algebra
│   └── nn.mojo       # Neural network components
├── tests/            # Test suite
├── examples/         # Usage examples & training demos
├── learn/            # Educational content (separate from library)
└── docs/             # Documentation
```
Using the library:
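For illustration, a hypothetical `your_code.mojo` (the file name is just the placeholder used in the command below, not a file shipped with the repo) could be as small as this sketch, reusing the two functions from the quick-start example:

```mojo
# your_code.mojo (hypothetical file name, matching the command below)
from visage import matrix_vector_multiply, vector_add

fn main() raises:
    var weights: List[List[Float64]] = [
        [1.0, 0.0],
        [0.0, 1.0]
    ]
    var inputs: List[Float64] = [3.0, 4.0]
    var bias: List[Float64] = [0.5, -0.5]

    # Identity weights plus a bias: expect 3.5 and 3.5.
    var output = vector_add(matrix_vector_multiply(weights, inputs), bias)
    print(output[0], output[1])
```

The `-I src` flag in the command below points the compiler at the src directory so the `from visage import ...` line resolves.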
```bash
mojo -I src your_code.mojo
```

| Component | Status |
|---|---|
| Linear Algebra | ✅ Complete |
| Activations | ✅ Complete |
| Forward Pass | ✅ Complete |
| Backpropagation | ✅ Complete |
| Loss Functions | ✅ Complete |
| Basic Training | ✅ Complete |
| Advanced Optimizers | 🚧 In Progress |
| Regularization | 📋 Planned |
| Conv Layers | 📋 Planned |
| SIMD Optimization | 📋 Planned |
- Python-like syntax - Easy to read and write
- C-level performance - Fast execution for ML workloads
- Built for AI - First-class support for ML primitives
- Zero dependencies - Pure Mojo implementation
This library is built from scratch as a learning exercise. If you're interested in the educational journey:
- learn/ - Step-by-step implementations (blocks 0-11)
- STRUCTURE.md - Repository organization
- Learning tasks: `pixi run learn-*`
Contributions welcome! This project is actively exploring ML library design in Mojo.
- Fork the repository
- Create a feature branch
- Write tests for new functionality
- Submit a pull request
Areas where contributions are especially welcome:
- SIMD optimizations
- Advanced optimizers (Adam, RMSprop)
- Regularization techniques
- Performance benchmarks
v0.1 (Current) - Foundation
- Core linear algebra
- Basic neural network layers
- Backpropagation
- Training loop
v0.2 - Optimization
- Advanced optimizers (Adam, momentum)
- Learning rate schedules
- SIMD acceleration
- Performance benchmarks
v0.3 - Production Features
- Model serialization (save/load)
- Regularization (dropout, L2)
- Batch normalization
- Real dataset support
v0.4+ - Advanced
- Convolutional layers
- Recurrent layers
- Custom autodiff
- GPU support
MIT License - see LICENSE for details.
Built with Mojo 🔥
Inspired by building ML from first principles to understand what's really happening under the hood.