Zen Gym

Unified AI Model Training Platform

Zen Gym is the training infrastructure for the Zen model family. It provides a single interface for supervised fine-tuning, reinforcement learning from human feedback, and model export across all Zen architectures, from 0.6B-parameter edge models to 397B MoE frontier systems.

Supported Models

Model	Parameters	Type
zen-nano	0.6B	Language
zen-eco	4B	Language
zen-eco-instruct	4B	Instruct
zen-eco-thinking	4B	Reasoning
zen-eco-coder	4B	Code
zen-eco-agent	4B	Agent
zen-voyager	32B	Language
zen4-mini	8B	Language
zen4-pro	80B MoE	Language
zen4-thinking	80B MoE	Reasoning
zen4-coder-pro	80B MoE	Code
zen-max	235B MoE	Language
zen4-max	397B MoE	Language
zen-vl-*	4B-30B	Vision-Language

Quick Start

pip install zen-gym

# LoRA fine-tune zen-eco on Alpaca
zen-gym train --model zen-eco --method lora --dataset alpaca

# Launch web UI
zen-gym ui

Training Methods

Method	Stage	Description
Full	SFT	16-bit full parameter fine-tuning
LoRA	SFT	Low-Rank Adaptation; roughly 30% of the memory of full fine-tuning
QLoRA	SFT	Quantized LoRA (4-bit or 8-bit); roughly 10% of the memory of full fine-tuning
DoRA	SFT	Weight-Decomposed Low-Rank Adaptation
DPO	RLHF	Direct Preference Optimization
PPO	RLHF	Proximal Policy Optimization
GRPO	RLHF	Group Relative Policy Optimization
GSPO	RLHF	Group Sequence Policy Optimization (MoE-optimized)
KTO	RLHF	Kahneman-Tversky Optimization
ORPO	RLHF	Odds Ratio Preference Optimization
SimPO	RLHF	Simple Preference Optimization
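
To illustrate what "group relative" means in GRPO, here is a minimal sketch of the advantage computation as it is commonly described: several completions are sampled for the same prompt, and each reward is normalized against the mean and standard deviation of its own group, so no separate value model is needed. The function name, the `eps` term, and the choice of population standard deviation are assumptions made for this example; whether Zen Gym normalizes in exactly this way is not stated here.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize the rewards of one group of completions for a single prompt.

    Hypothetical helper for illustration only, not part of the zen-gym API.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; some implementations use sample std
    # eps avoids division by zero when every completion gets the same reward
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four sampled completions for one prompt, scored by a hypothetical reward function.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Completions that score above the group mean get a positive advantage and are reinforced; those below the mean get a negative one. A group in which every completion earns the same reward yields all-zero advantages and contributes no learning signal.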

Hardware Requirements

Model Size	Full	LoRA	QLoRA (4-bit)
0.6B	8 GB	4 GB	2 GB
4B	32 GB	16 GB	8 GB
8B	48 GB	24 GB	12 GB
32B	128 GB	48 GB	24 GB
80B MoE	256 GB	80 GB	48 GB
235B MoE	512 GB	160 GB	80 GB
397B MoE	768 GB	256 GB	128 GB

VRAM is listed per GPU. Multi-GPU setups are supported via DeepSpeed ZeRO and FSDP.
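
As a quick way to read the table, the sketch below encodes its figures in a dictionary and returns the training methods that fit a given memory budget. The numbers are copied from the table above; the `feasible_methods` helper and the lowercase method keys are hypothetical names introduced for this example, not part of Zen Gym.

```python
# VRAM requirements in GB, copied from the Hardware Requirements table.
VRAM_GB = {
    "0.6B": {"full": 8, "lora": 4, "qlora": 2},
    "4B": {"full": 32, "lora": 16, "qlora": 8},
    "8B": {"full": 48, "lora": 24, "qlora": 12},
    "32B": {"full": 128, "lora": 48, "qlora": 24},
    "80B MoE": {"full": 256, "lora": 80, "qlora": 48},
    "235B MoE": {"full": 512, "lora": 160, "qlora": 80},
    "397B MoE": {"full": 768, "lora": 256, "qlora": 128},
}


def feasible_methods(model_size, available_gb):
    """Return the methods whose listed VRAM fits within available_gb."""
    row = VRAM_GB[model_size]
    return [method for method, need in row.items() if need <= available_gb]


print(feasible_methods("4B", 24))   # ['lora', 'qlora']
print(feasible_methods("32B", 24))  # ['qlora']
```

With 24 GB available, a 4B model can be trained with LoRA or QLoRA, while a 32B model fits only with 4-bit QLoRA.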

Installation

git clone https://github.com/zenlm/gym
cd gym
pip install -e ".[torch,metrics]"

Optional accelerators:

# FlashAttention-2
pip install flash-attn --no-build-isolation

# Unsloth (2-5x speedup)
pip install unsloth

Usage

CLI

# Supervised fine-tuning with LoRA
zen-gym train \
  --model zen-eco \
  --method lora \
  --dataset alpaca \
  --lora-rank 128 \
  --batch-size 4 \
  --epochs 3 \
  --lr 2e-5

# GRPO reinforcement learning
zen-gym train \
  --model zen-eco \
  --method grpo \
  --dataset preference_data \
  --lr 1e-5

# QLoRA 4-bit training
zen-gym train \
  --model zen-voyager \
  --method qlora \
  --quant-bits 4 \
  --batch-size 1 \
  --gradient-accumulation 16

# Export to GGUF
zen-gym export \
  --model ./output/zen-eco-lora \
  --format gguf \
  --quant Q4_K_M
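
The QLoRA command above pairs a batch size of 1 with 16 gradient-accumulation steps, which trades memory for wall-clock time while keeping the effective batch size reasonable. The sketch below works through that arithmetic. It assumes `--batch-size` is a per-device value and that gradients are accumulated before each optimizer step, as is conventional; the helper names and the dataset size of 52,000 examples are illustrative assumptions, not values taken from Zen Gym.

```python
import math


def effective_batch_size(batch_size, gradient_accumulation, num_gpus=1):
    """Examples contributing to each optimizer step."""
    return batch_size * gradient_accumulation * num_gpus


def optimizer_steps(num_examples, eff_batch, epochs):
    """Total optimizer steps, rounding up the last partial batch of each epoch."""
    return math.ceil(num_examples / eff_batch) * epochs


# Values from the QLoRA command: --batch-size 1 --gradient-accumulation 16
eff = effective_batch_size(batch_size=1, gradient_accumulation=16)
print(eff)  # 16

# Hypothetical dataset of 52,000 examples trained for 3 epochs
print(optimizer_steps(num_examples=52_000, eff_batch=eff, epochs=3))  # 9750
```

Under the same assumptions, a batch size of 4 with 4 accumulation steps also gives an effective batch size of 16 while using more memory per step.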

Config File

# configs/zen-eco-lora.yaml
model: zen-eco
method: lora
dataset: alpaca
lora_rank: 128
lora_alpha: 64
batch_size: 4
gradient_accumulation: 4
learning_rate: 2e-5
epochs: 3
flash_attn: true
output_dir: ./output/zen-eco-lora

zen-gym train --config configs/zen-eco-lora.yaml
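
To make the `lora_rank` and `lora_alpha` settings concrete, the sketch below computes the two quantities they control in standard LoRA: the scaling factor `lora_alpha / lora_rank` applied to the low-rank update, and the number of trainable adapter parameters per adapted weight matrix, `rank * (d_in + d_out)`. It assumes the standard LoRA formulation (not a variant such as rank-stabilized scaling), and the 4096 x 4096 layer shape is a hypothetical example rather than the actual dimensions of zen-eco.

```python
def lora_scaling(lora_alpha, lora_rank):
    """Scaling applied to the low-rank update B @ A in standard LoRA."""
    return lora_alpha / lora_rank


def lora_params(d_in, d_out, lora_rank):
    """Trainable parameters for one adapted matrix: A is (rank x d_in), B is (d_out x rank)."""
    return lora_rank * (d_in + d_out)


# Values from configs/zen-eco-lora.yaml
scale = lora_scaling(lora_alpha=64, lora_rank=128)
print(scale)  # 0.5

# Hypothetical square projection layer; real layer shapes depend on the model.
d = 4096
adapter = lora_params(d, d, lora_rank=128)
print(adapter, adapter / (d * d))  # 1048576 0.0625
```

With rank 128 and alpha 64, the adapter update is scaled by 0.5, and for this example layer the adapter trains about 6% as many parameters as the full matrix.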

Web Interface

zen-gym ui
# Open http://localhost:7860

Monitoring

Zen Gym integrates with standard experiment trackers:

TensorBoard: built in, no configuration needed
Weights and Biases: pass --report-to wandb
MLflow: pass --report-to mlflow
