A high-performance, custom implementation of a character-level LSTM language model with an interactive configuration system. Features cross-platform support for Windows, Linux, and macOS with GPU acceleration for both NVIDIA and AMD hardware.
- Interactive Model Configuration: Terminal-based menu system with arrow key navigation
- LSTM Architecture: Long Short-Term Memory networks for modeling long-range character dependencies (a minimal cell sketch follows this feature list)
- Multi-GPU Support:
- NVIDIA CUDA: Optimized for NVIDIA GPUs using CUDA and cuBLAS
- AMD HIP: Optimized for AMD GPUs using ROCm/HIP
- Runtime Configuration: Choose model size and sequence length interactively
- Smart Checkpointing: Save and resume training progress automatically
- Adam Optimizer: Advanced optimization for faster convergence
- Cross-Platform: Windows, Linux, macOS with shell-specific instructions
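At the heart of the model is a standard LSTM cell. The sketch below is a minimal, illustrative single-timestep update in plain C++ (scalar loops, no batching); it shows the math the kernels compute, not the project's actual lightwatch_kernels implementation.

```cpp
#include <cmath>
#include <vector>

// Minimal single-timestep LSTM cell update (illustrative only).
// x: input vector, h/c: hidden and cell state (updated in place),
// W*: [hid x in] input weights, U*: [hid x hid] recurrent weights, b*: biases.
struct LSTMCell {
    int in, hid;
    std::vector<float> Wi, Wf, Wo, Wg;  // input-to-gate weights, row-major
    std::vector<float> Ui, Uf, Uo, Ug;  // recurrent weights, row-major
    std::vector<float> bi, bf, bo, bg;  // gate biases

    static float sigmoid(float z) { return 1.0f / (1.0f + std::exp(-z)); }

    void step(const std::vector<float>& x, std::vector<float>& h, std::vector<float>& c) const {
        std::vector<float> hNew(hid), cNew(hid);
        for (int j = 0; j < hid; ++j) {
            float ai = bi[j], af = bf[j], ao = bo[j], ag = bg[j];
            for (int k = 0; k < in; ++k) {
                ai += Wi[j * in + k] * x[k];
                af += Wf[j * in + k] * x[k];
                ao += Wo[j * in + k] * x[k];
                ag += Wg[j * in + k] * x[k];
            }
            for (int k = 0; k < hid; ++k) {
                ai += Ui[j * hid + k] * h[k];
                af += Uf[j * hid + k] * h[k];
                ao += Uo[j * hid + k] * h[k];
                ag += Ug[j * hid + k] * h[k];
            }
            float i = sigmoid(ai), f = sigmoid(af), o = sigmoid(ao), g = std::tanh(ag);
            cNew[j] = f * c[j] + i * g;        // cell state: forget old, add new candidate
            hNew[j] = o * std::tanh(cNew[j]);  // hidden state: gated output
        }
        h = hNew;
        c = cNew;
    }
};
```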
- Minimum: 64 hidden units
- Mini: 96 hidden units
- NotSoMini: 128 hidden units (default)
- Normaal: 160 hidden units
- Normalish: 192 hidden units
- Big: 224 hidden units
- Bigger: 256 hidden units
- Biggest: 320 hidden units
- LiterallyInsane: 512 hidden units
- Shortest: 1 step
- Short: 8 steps
- NotShortButNotLong: 16 steps
- YesLong: 32 steps (default)
- Longer: 64 steps
- Longest: 128 steps
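The two preset tables above map directly to the runtime parameters that can also be set with `--hidden-size` and `--tbptt`. The snippet below is a hypothetical illustration of that mapping; the names and values come from the tables, but the struct and array names are not the project's actual API.

```cpp
#include <utility>

// Hypothetical preset tables mirroring the menus above; not LightwatchAI's real code.
struct RunConfig {
    int hidden_size = 128;  // "NotSoMini" default, same as --hidden-size
    int tbptt_len   = 32;   // "YesLong" default, same as --tbptt
};

static const std::pair<const char*, int> kModelSizes[] = {
    {"Minimum", 64}, {"Mini", 96}, {"NotSoMini", 128}, {"Normaal", 160},
    {"Normalish", 192}, {"Big", 224}, {"Bigger", 256}, {"Biggest", 320},
    {"LiterallyInsane", 512},
};

static const std::pair<const char*, int> kSequenceLengths[] = {
    {"Shortest", 1}, {"Short", 8}, {"NotShortButNotLong", 16},
    {"YesLong", 32}, {"Longer", 64}, {"Longest", 128},
};
```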
- C++17 compatible compiler
- CMake 3.16+ for building
- Optional: CUDA Toolkit 12.0+ (NVIDIA GPU support)
- Optional: ROCm/HIP (AMD GPU support)
- Optional: OpenBLAS (CPU acceleration)
- Optional: Qt 5.15+ or Qt 6.x (GUI support)
Prerequisites:
- Visual Studio 2019/2022 with C++ tools
- CMake (install via Visual Studio Installer or download from cmake.org)
- Git for Windows
Clone and Build:
# Clone repository
git clone https://github.com/watchthelight/LightwatchAI.git
cd LightwatchAI
# CPU-only build
mkdir build_cpu
cd build_cpu
cmake .. -G "Visual Studio 17 2022" -A x64
cmake --build . --config Release
cd ..
# NVIDIA CUDA build (if CUDA is installed)
mkdir build_cuda
cd build_cuda
cmake .. -G "Visual Studio 17 2022" -A x64 -DUSE_CUDA=ON
cmake --build . --config Release
cd ..
Run Executables:
# Interactive training
.\bin\lightwatch_train.exe dataset.bin
# Interactive generation
.\bin\lightwatch_run.exe checkpoint_latest.bin
# With command line overrides
.\bin\lightwatch_train.exe dataset.bin --hidden-size 256 --tbptt 64
Prerequisites (Ubuntu/Debian):
# Install build tools
sudo apt update
sudo apt install build-essential cmake git
# Optional: CUDA support (NVIDIA)
# Download and install CUDA Toolkit from developer.nvidia.com
# Optional: ROCm support (AMD)
# Follow ROCm installation guide for your distribution
# Optional: OpenBLAS
sudo apt install libopenblas-dev
Clone and Build:
# Clone repository
git clone https://github.com/watchthelight/LightwatchAI.git
cd LightwatchAI
# CPU-only build
mkdir build_cpu && cd build_cpu
cmake ..
make -j$(nproc)
cd ..
# NVIDIA CUDA build
mkdir build_cuda && cd build_cuda
cmake .. -DUSE_CUDA=ON
make -j$(nproc)
cd ..
# AMD HIP build
mkdir build_hip && cd build_hip
cmake .. -DUSE_HIP=ON
make -j$(nproc)
cd ..
# OpenBLAS build
mkdir build_blas && cd build_blas
cmake .. -DUSE_OPENBLAS=ON
make -j$(nproc)
cd ..
Run Executables:
# Interactive training
./lightwatch_train dataset.bin
# Interactive generation
./lightwatch_run checkpoint_latest.bin
# With command line overrides
./lightwatch_train dataset.bin --hidden-size 256 --tbptt 64
Prerequisites:
# Install Xcode Command Line Tools
xcode-select --install
# Install Homebrew (if not already installed)
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
# Install CMake
brew install cmake
# Optional: OpenBLAS
brew install openblas
Clone and Build:
# Clone repository
git clone https://github.com/watchthelight/LightwatchAI.git
cd LightwatchAI
# CPU-only build
mkdir build_cpu && cd build_cpu
cmake ..
make -j$(sysctl -n hw.ncpu)
cd ..
# OpenBLAS build (recommended for macOS)
mkdir build_blas && cd build_blas
cmake .. -DUSE_OPENBLAS=ON
make -j$(sysctl -n hw.ncpu)
cd ..
Run Executables:
# Interactive training
./lightwatch_train dataset.bin
# Interactive generation
./lightwatch_run checkpoint_latest.bin
Build Commands:
# Clone repository
git clone https://github.com/watchthelight/LightwatchAI.git
cd LightwatchAI
# CPU build
mkdir build_cpu; and cd build_cpu
cmake ..
make -j(nproc) # Linux
# make -j(sysctl -n hw.ncpu) # macOS
cd ..
# CUDA build (Linux only)
mkdir build_cuda; and cd build_cuda
cmake .. -DUSE_CUDA=ON
make -j(nproc)
cd ..
Run Executables:
# Interactive training
./lightwatch_train dataset.bin
# Interactive generation
./lightwatch_run checkpoint_latest.bin
# Command line override
./lightwatch_train dataset.bin --hidden-size 256 --tbptt 64
- Download Qt Online Installer from https://www.qt.io/download
- Run the installer and select Qt 5.15+ or Qt 6.x
- Install Qt Creator (optional) and the MinGW or MSVC components
- Ensure Qt bin directory is in PATH
# Install Qt5
sudo apt install qt5-default qtbase5-dev qtchooser
# Or install Qt6
sudo apt install qt6-base-dev
# Install Qt5 via Homebrew
brew install qt5
# Or download Qt6 from qt.io
# GUI build with Qt
mkdir build_gui
cd build_gui
cmake .. -G "Visual Studio 17 2022" -A x64 -DENABLE_GUI=ON
cmake --build . --config Release
# GUI build
mkdir build_gui && cd build_gui
cmake .. -DENABLE_GUI=ON
make -j$(nproc)
# GUI build
mkdir build_gui && cd build_gui
cmake .. -DENABLE_GUI=ON
make -j$(sysctl -n hw.ncpu)
The GUI provides an easy-to-use interface for training and generation:
- Select Training Data: Click "Select Training Data" to choose a .txt file. It will be automatically preprocessed.
- Train: Click "Train" to start training on the selected data.
- Generate: Click "Generate" to open a dialog for generation parameters and run text generation.
The right panel shows live output from the training/generation processes.
Note: Screenshots will be added after GUI implementation.
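The live-output panel follows the usual pattern for Qt front-ends that wrap a command-line tool: the GUI launches the training binary as a child process and appends its stdout to a text widget. The snippet below is a generic Qt sketch of that pattern, not the project's actual gui_mainwindow code; the widget, function, and binary path names are assumptions.

```cpp
// Generic Qt pattern for streaming a child process's output into a text panel.
// Illustrative only; LightwatchAI's gui_mainwindow.cpp may be structured differently.
#include <QPlainTextEdit>
#include <QProcess>

void startTraining(QProcess* proc, QPlainTextEdit* outputPanel, const QString& dataset) {
    proc->setProcessChannelMode(QProcess::MergedChannels);  // merge stderr into stdout
    QObject::connect(proc, &QProcess::readyReadStandardOutput, [proc, outputPanel]() {
        outputPanel->appendPlainText(QString::fromUtf8(proc->readAllStandardOutput()));
    });
    proc->start("./lightwatch_train", {dataset});  // executable path is an assumption
}
```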
LightwatchAI supports custom training data with automatic preprocessing:
- Place your text file as `data.txt` in the project root
- Run training: `./bin/lightwatch_train` (Linux/macOS) or `.\bin\lightwatch_train.exe` (Windows)
- The system automatically preprocesses `data.txt` → `dataset.bin` + `vocab.json`
# Specify custom input file
./lightwatch_train --data my_training_data.txt
# Force re-preprocessing even if files exist
./lightwatch_train --data my_training_data.txt --regen
# Use the standalone preprocessor
./bin/data_preprocess my_training_data.txt dataset.bin vocab.json
Data Format Requirements:
- Plain text file (UTF-8 encoding recommended)
- Any length (larger datasets = better models)
- Characters are tokenized individually
- No special preprocessing needed
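Character-level preprocessing of this kind typically just builds a character vocabulary and rewrites the text as token IDs. The sketch below shows that idea in plain C++; the actual on-disk layouts of dataset.bin and vocab.json are defined by the data_preprocess tool, so the formats here are assumptions for illustration only.

```cpp
#include <cstdint>
#include <fstream>
#include <iterator>
#include <map>
#include <string>
#include <vector>

// Illustrative character-level preprocessing: data.txt -> token ids + vocabulary.
// Assumes at most 256 distinct characters; real dataset.bin/vocab.json formats may differ.
int main() {
    std::ifstream in("data.txt", std::ios::binary);
    std::string text((std::istreambuf_iterator<char>(in)), std::istreambuf_iterator<char>());

    std::map<char, uint8_t> vocab;            // char -> token id (ids assigned in first-seen order)
    std::vector<uint8_t> ids;
    ids.reserve(text.size());
    for (char ch : text) {
        auto it = vocab.find(ch);
        if (it == vocab.end())
            it = vocab.emplace(ch, static_cast<uint8_t>(vocab.size())).first;
        ids.push_back(it->second);
    }

    std::ofstream ds("dataset.bin", std::ios::binary);    // token ids, one byte each (assumed layout)
    ds.write(reinterpret_cast<const char*>(ids.data()), ids.size());

    std::ofstream vj("vocab.json");                        // id -> character code table (assumed layout)
    vj << "{\n";
    bool first = true;
    for (const auto& [ch, id] : vocab) {
        vj << (first ? "" : ",\n") << "  \"" << int(id) << "\": "
           << int(static_cast<unsigned char>(ch));
        first = false;
    }
    vj << "\n}\n";
}
```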
When you run the executables without command line overrides, you'll see an interactive menu:
=== LightwatchAI Configuration ===
Select Model Size (Hidden Layer Size):
Use ↑/↓ arrow keys to navigate, Enter to select:
Minimum (64)
Mini (96)
→ NotSoMini (128)
Normaal (160)
Normalish (192)
Big (224)
Bigger (256)
Biggest (320)
LiterallyInsane (512)
Press 'q' to quit
Navigation:
- ↑/↓ Arrow Keys: Navigate menu options
- Enter: Select current option
- q: Quit application
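Arrow keys arrive as multi-byte ANSI escape sequences (ESC [ A for ↑, ESC [ B for ↓), which is why the menu needs a terminal in raw, ANSI-capable mode. The POSIX sketch below shows the usual way such input is read; it is illustrative only, not the project's lightwatch_config code, and Windows uses a different console API.

```cpp
// Illustrative POSIX raw-mode key reader for an arrow-key menu (not the project's code).
#include <cstdio>
#include <termios.h>
#include <unistd.h>

enum class Key { Up, Down, Enter, Quit, Other };

Key readKey() {
    termios saved{}, raw{};
    tcgetattr(STDIN_FILENO, &saved);
    raw = saved;
    raw.c_lflag &= ~(ICANON | ECHO);              // no line buffering, no echo
    tcsetattr(STDIN_FILENO, TCSANOW, &raw);

    Key k = Key::Other;
    int c = std::getchar();
    if (c == 'q')                     k = Key::Quit;
    else if (c == '\n' || c == '\r')  k = Key::Enter;
    else if (c == 0x1B && std::getchar() == '[') { // ESC [ A / ESC [ B are ↑ / ↓
        int d = std::getchar();
        if (d == 'A') k = Key::Up;
        else if (d == 'B') k = Key::Down;
    }

    tcsetattr(STDIN_FILENO, TCSANOW, &saved);     // restore terminal settings
    return k;
}
```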
Skip the interactive menu by providing parameters directly:
# Training with specific configuration
./lightwatch_train dataset.bin --hidden-size 512 --tbptt 128
# Training with custom data
./lightwatch_train --data my_data.txt --hidden-size 256 --tbptt 64
# Generation with specific configuration
./lightwatch_run checkpoint_latest.bin --hidden-size 512 --tbptt 128 --len 2000 --temp 0.9 --seed "Once upon a time"
The system automatically detects non-interactive environments (pipes, scripts, CI/CD) and uses default values:
- Hidden Size: 128
- TBPTT Length: 32
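Detecting a non-interactive environment usually comes down to checking whether stdin and stdout are attached to a terminal. A minimal cross-platform check looks like the sketch below; it illustrates the idea and is not necessarily how LightwatchAI implements it.

```cpp
// Minimal TTY check: fall back to defaults when input/output is piped or scripted.
#ifdef _WIN32
#include <io.h>
#define ISATTY _isatty
#define FILENO _fileno
#else
#include <unistd.h>
#define ISATTY isatty
#define FILENO fileno
#endif
#include <cstdio>

bool isInteractive() {
    return ISATTY(FILENO(stdin)) && ISATTY(FILENO(stdout));
}

// Usage sketch with hypothetical config fields, matching the README defaults (128 / 32):
// if (!isInteractive()) { cfg.hidden_size = 128; cfg.tbptt_len = 32; }
```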
./lightwatch_train [OPTIONS] [dataset.bin]
Options:
--data FILE Custom training data file (default: data.txt)
--regen Force re-preprocessing of data
--hidden-size N Model hidden size (default: 128)
--tbptt N TBPTT length (default: 32)
./lightwatch_run checkpoint.bin [OPTIONS]
Options:
--hidden-size N Model hidden size (must match training)
--tbptt N TBPTT length (must match training)
--len N Generate N characters (default: 100)
--temp T Sampling temperature 0.1-2.0 (default: 1.0)
--topk K Top-K sampling, 0=disabled (default: 50)
--seed "text" Initial seed text (default: empty)Convert your text file to binary format:
Convert your text file to binary format:
# Create a simple Python script to convert text to binary
python3 -c "
import sys
with open(sys.argv[1], 'r', encoding='utf-8') as f:
text = f.read()
with open(sys.argv[2], 'wb') as f:
f.write(text.encode('utf-8'))
" input.txt dataset.bin- "MSVCR120.dll missing": Install Visual C++ Redistributable
- CMake not found: Add CMake to PATH or reinstall with "Add to PATH" option
- CUDA not detected: Ensure CUDA_PATH environment variable is set
- "Permission denied": Make executables executable with
chmod +x bin/lightwatch_* - CUDA issues: Check
nvidia-smiand ensure CUDA_HOME is set - Missing libraries: Install development packages (
-devsuffix)
- "Developer cannot be verified": Run
sudo spctl --master-disabletemporarily - OpenMP not found: Install with
brew install libomp
- Interactive menu not working: Ensure terminal supports ANSI escape sequences
- Arrow keys not working: Try using a different terminal emulator
- Build fails: Check CMake version is 3.16+ and compiler supports C++17
LightwatchAI/
├── src/ # Source files
│ ├── data_preprocess.cpp/h # Data preprocessing utilities
│ ├── lightwatch_config.cpp/h # Interactive configuration system
│ ├── lightwatch_types.cpp/h # Runtime configurable parameters
│ ├── lightwatch_gui.cpp # GUI main application
│ ├── gui_mainwindow.h/.cpp # GUI main window implementation
│ ├── lightwatch_kernels.h # LSTM computation kernels
│ ├── lightwatch_bptt.h # Backpropagation implementation
│ ├── lightwatch_dataset.h # Data loading utilities
│ ├── lightwatch_generate.h # Text generation utilities
│ └── simple_cuda_test.cpp # CUDA functionality test source
├── build/ # Build outputs and CMake files
│ ├── build_cpu/ # CPU build directory
│ ├── build_cuda/ # CUDA build directory
│ └── ... # Other build configurations
├── bin/ # Built executables
│ └── Release/ # Release executables
│ ├── lightwatch_train(.exe) # Training executable
│ ├── lightwatch_run(.exe) # Generation executable
│ ├── lightwatch_gui(.exe) # GUI executable (if Qt available)
│ ├── data_preprocess(.exe) # Data preprocessing utility
│ └── simple_cuda_test(.exe) # CUDA functionality test
├── lightwatch_train.cpp # Training executable source
├── lightwatch_run.cpp # Generation executable source
├── CMakeLists.txt # Build configuration
└── README.md # This file
- CPU Performance: Use OpenBLAS for significant speedup on CPU
- GPU Memory: Larger models (Biggest, LiterallyInsane) require more VRAM
- Training Speed: CUDA > HIP > OpenBLAS > Plain CPU
- Model Quality: Larger hidden sizes generally produce better text
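The speed ordering above comes down to which backend ends up doing the matrix multiplications. Build options like -DUSE_CUDA, -DUSE_HIP, and -DUSE_OPENBLAS typically become compile definitions that select a GEMM path, roughly as sketched below; the wrapper function names are illustrative assumptions, not the project's actual kernel API.

```cpp
// Hypothetical backend wrappers; the names are assumptions, not LightwatchAI's real API.
void cuda_gemm(const float*, const float*, float*, int, int, int);      // cuBLAS-backed path
void hip_gemm(const float*, const float*, float*, int, int, int);       // rocBLAS/HIP path
void openblas_gemm(const float*, const float*, float*, int, int, int);  // OpenBLAS path

// Illustrative compile-time dispatch for C = A * B (row-major, m x k times k x n).
void gemm(const float* A, const float* B, float* C, int m, int n, int k) {
#if defined(USE_CUDA)
    cuda_gemm(A, B, C, m, n, k);
#elif defined(USE_HIP)
    hip_gemm(A, B, C, m, n, k);
#elif defined(USE_OPENBLAS)
    openblas_gemm(A, B, C, m, n, k);
#else
    // Plain CPU fallback: naive triple loop.
    for (int i = 0; i < m; ++i)
        for (int j = 0; j < n; ++j) {
            float acc = 0.0f;
            for (int p = 0; p < k; ++p) acc += A[i * k + p] * B[p * n + j];
            C[i * n + j] = acc;
        }
#endif
}
```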
Contributions welcome! Please:
- Fork the repository
- Create a feature branch
- Test on multiple platforms
- Submit a pull request
MIT License - see LICENSE file for details.
Pre-built Windows GUI executables are available on the Releases page. Download the latest .exe file to run the GUI without building from source.
GUI screenshots to be added.
- Inspired by Andrej Karpathy's char-rnn
- Built with modern C++17 and cross-platform compatibility in mind
- Community contributions for platform-specific optimizations
