A PyTorch-based framework for training multiple agents in various PettingZoo environments using independent Q-learning. Supports both cooperative and competitive scenarios with dynamic observation/action space handling.
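Under independent Q-learning, each agent owns its own Q-network and learns as if the other agents were part of the environment. A schematic sketch of one such per-agent update (hypothetical `agent`/`batch` structures, not the repository's exact API):

```python
# Schematic independent Q-learning update: each agent trains its own
# network on its own transitions; other agents are just environment dynamics.
import torch
import torch.nn.functional as F

def iql_update(agent, batch, gamma=0.95):
    """One Q-learning step for a single agent.

    `agent` is assumed to bundle a `q_net` and an `optimizer`;
    `batch` holds (obs, actions, rewards, next_obs, dones) tensors.
    """
    obs, actions, rewards, next_obs, dones = batch
    # Q-values of the actions actually taken.
    q = agent.q_net(obs).gather(1, actions.unsqueeze(1)).squeeze(1)
    # Bootstrap target from the agent's own next-state estimates.
    with torch.no_grad():
        next_q = agent.q_net(next_obs).max(1).values
    target = rewards + gamma * (1.0 - dones) * next_q
    loss = F.mse_loss(q, target)
    agent.optimizer.zero_grad()
    loss.backward()
    agent.optimizer.step()
```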
Supported environments:

| Environment | Type | Agents | Description |
|---|---|---|---|
| Simple Spread | Cooperative | 2-10 | Agents cover target landmarks while avoiding collisions |
| Simple Adversary | Mixed | 3-7 | Good agents cooperate against an adversary |
| Simple Tag | Competitive | 4-8 | Pursuit-evasion scenario with predators and prey |
| Knights Archers Zombies | Cooperative | 2-12 | Knights and archers with distinct abilities cooperate to fend off zombies |
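These correspond to PettingZoo environment modules. As a quick orientation, a minimal sketch of loading one through the parallel API (the `simple_spread_v3` suffix and keyword names vary by PettingZoo release):

```python
# Minimal sketch: load an MPE environment via PettingZoo's parallel API.
# The version suffix (_v3) and keyword names may differ across releases.
from pettingzoo.mpe import simple_spread_v3

env = simple_spread_v3.parallel_env(N=3, max_cycles=25)
observations, infos = env.reset(seed=42)

# Spaces differ per agent and per environment; the framework reads them
# dynamically instead of hard-coding sizes.
for agent in env.agents:
    print(agent, env.observation_space(agent).shape, env.action_space(agent).n)
```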
To install:

- Clone the repository:

  ```bash
  git clone <repository-url>
  cd marl-framework
  ```

- Create a virtual environment:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```
Run interactively:

```bash
python main.py
```

Or configure a run entirely from the command line:

```bash
python main.py --env simple_spread --num-agents 3 --episodes 2000 --no-render
```
| Argument | Description | Default |
|---|---|---|
| `--env` | Environment name | (interactive prompt) |
| `--num-agents` | Number of agents | (environment default) |
| `--no-render` | Disable visualization | False |
| `--episodes` | Number of training episodes | 2000 |
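A hypothetical sketch of how these flags could be declared with `argparse`; the actual parser in `main.py` may differ:

```python
import argparse

parser = argparse.ArgumentParser(description="Multi-agent Q-learning trainer")
parser.add_argument("--env", default=None,
                    help="Environment name (prompts interactively if omitted)")
parser.add_argument("--num-agents", type=int, default=None,
                    help="Number of agents (falls back to the env default)")
parser.add_argument("--no-render", action="store_true",
                    help="Disable visualization")
parser.add_argument("--episodes", type=int, default=2000,
                    help="Number of training episodes")
args = parser.parse_args()
```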
Training hyperparameters:

| Parameter | Value | Description |
|---|---|---|
| Learning Rate | 0.001 | Step size for gradient updates |
| Epsilon Start | 1.0 | Initial exploration rate |
| Epsilon End | 0.1 | Final exploration rate |
| Epsilon Decay | 0.995 | Multiplicative decay applied to epsilon |
| Gamma | 0.95 | Discount factor for future rewards |
| Batch Size | 32 | Transitions sampled per training update |
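As a concrete check on the exploration schedule, a small sketch assuming the decay is multiplicative and applied once per episode (the repository may apply it per step instead); with these values epsilon hits its 0.1 floor after roughly 460 episodes, since 0.995^460 ≈ 0.1:

```python
# Sketch of the epsilon schedule, assuming per-episode multiplicative decay.
EPS_START, EPS_END, EPS_DECAY = 1.0, 0.1, 0.995

def epsilon_at(episode: int) -> float:
    """Exploration rate after `episode` decay steps, clamped at EPS_END."""
    return max(EPS_END, EPS_START * EPS_DECAY ** episode)

print(epsilon_at(0))     # 1.0
print(epsilon_at(100))   # ~0.606
print(epsilon_at(1000))  # 0.1 (floor reached near episode 460)
```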
Network architecture (a minimal sketch follows this list):

- Input Layer: Environment-specific observation size
- Hidden Layers: 2 x 128 units with ReLU
- Output Layer: Environment-specific action size
- Weight Initialization: Orthogonal
- Optimizer: Adam
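A minimal PyTorch sketch of that architecture (a hypothetical `QNetwork` class; the repository's actual module may differ in details):

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Two 128-unit ReLU hidden layers with orthogonal initialization."""

    def __init__(self, obs_size: int, action_size: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_size, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, action_size),
        )
        # Orthogonal weight init, zero biases, as listed above.
        for layer in self.net:
            if isinstance(layer, nn.Linear):
                nn.init.orthogonal_(layer.weight)
                nn.init.zeros_(layer.bias)

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        return self.net(obs)  # one Q-value per discrete action

# e.g. Simple Spread with 3 agents: 18-dim observations, 5 actions
net = QNetwork(obs_size=18, action_size=5)
optimizer = torch.optim.Adam(net.parameters(), lr=0.001)
```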
The framework tracks:
- Episode Total Reward
- Average Game Length
- Per-agent Performance
- Best Episode Score
- Training Progress
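A hypothetical sketch of how such a tracker might accumulate these quantities per episode:

```python
from collections import defaultdict

class MetricsTracker:
    """Illustrative per-episode bookkeeping for the metrics listed above."""

    def __init__(self):
        self.episode_rewards = []                 # episode total reward
        self.episode_lengths = []                 # for average game length
        self.agent_rewards = defaultdict(list)    # per-agent performance
        self.best_score = float("-inf")           # best episode score

    def record(self, rewards_by_agent: dict, steps: int):
        total = sum(rewards_by_agent.values())
        self.episode_rewards.append(total)
        self.episode_lengths.append(steps)
        for agent, reward in rewards_by_agent.items():
            self.agent_rewards[agent].append(reward)
        self.best_score = max(self.best_score, total)
```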
Project structure:

```
marl-framework/
├── environments/   # Environment wrappers
├── agents/         # Neural network models
├── training/       # Training logic
└── main.py         # Entry point
```
Key features:

- Dynamic observation/action space handling
- Environment-specific configurations
- Real-time training visualization
- Flexible agent architectures
- Comprehensive metrics tracking
- Command-line and interactive modes
Contributions are welcome:

- Fork the repository
- Create a feature branch (`git checkout -b feature/AmazingFeature`)
- Commit your changes (`git commit -m 'Add AmazingFeature'`)
- Push to the branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
Distributed under the MIT License; see the LICENSE file for details.