ClusterEnv is a lightweight interface for distributed reinforcement learning (RL) environment execution on Slurm-managed clusters. It decouples environment simulation from training logic, enabling scalable rollout collection without adopting a monolithic framework. ClusterEnv mirrors the Gymnasium API and introduces two core components: the DETACH architecture and Adaptive Policy Synchronization (APS).
```python
from clusterenv import ClusterEnv, SlurmConfig

# Environment settings: what to simulate and how many environments per node
env_config = {
    "type": "gymnasium",
    "env_name": "LunarLander-v2",
    "kl_threshold": 0.05,   # KL divergence that triggers a weight pull (APS)
    "envs_per_node": 64,
}

# Slurm allocation for the remote rollout workers
slurm_cfg = SlurmConfig(
    job_name="ppo_lander",
    nodes=4,
    gpus_per_node=2,
    partition="gpu",
    time_limit="02:00:00",
)

env = ClusterEnv(env_config, slurm_cfg)
env.launch()  # submit the Slurm job and start the remote workers

obs = env.reset()
for _ in range(1000):
    # `agent` is any torch.nn.Module policy (see requirements below)
    obs, reward, done, info = env.step(agent)
```

Policy synchronization via APS is handled internally: agents pull updated weights only when their local policy drifts too far from the central learner, as measured by KL divergence.
```bash
git clone https://github.com/rodlaf/ClusterEnv.git
cd ClusterEnv
pip install .
```

Requirements:

- Slurm with `sbatch` submission access
- SSH access to allocated cluster nodes
- Python 3.8+
- An RL agent that is an instance of `torch.nn.Module` (see the sketch below)
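For illustration, a minimal agent satisfying the last requirement might look like the following sketch. The class name and architecture are placeholders; the observation and action dimensions match the LunarLander-v2 environment from the quickstart.

```python
import torch
import torch.nn as nn

class DiscretePolicy(nn.Module):
    """Minimal policy network; any torch.nn.Module with a forward pass works."""

    def __init__(self, obs_dim: int = 8, n_actions: int = 4):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 64),
            nn.Tanh(),
            nn.Linear(64, 64),
            nn.Tanh(),
            nn.Linear(64, n_actions),  # logits over discrete actions
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        return self.net(obs)

agent = DiscretePolicy()  # the object passed to env.step(agent) in the quickstart
```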
DETACH (Distributed Environment execution with Training Abstraction and Centralized Head) separates rollout collection from the training loop. Remote workers run only the environment's reset() and step() methods, while the learner remains centralized.
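Conceptually, each remote worker is a thin loop around a local Gymnasium environment; simulation stays on the worker, learning stays on the head node. The sketch below is only an illustration of that split, not ClusterEnv's internal code, and assumes a discrete action space; `run_worker` is a hypothetical name.

```python
import gymnasium as gym
import torch

def run_worker(env_name: str, policy: torch.nn.Module, steps: int):
    """Hypothetical worker loop: only reset() and step() run here."""
    env = gym.make(env_name)
    obs, _ = env.reset()
    rollout = []
    for _ in range(steps):
        with torch.no_grad():
            logits = policy(torch.as_tensor(obs, dtype=torch.float32))
            action = torch.distributions.Categorical(logits=logits).sample().item()
        next_obs, reward, terminated, truncated, info = env.step(action)
        rollout.append((obs, action, reward))
        obs = next_obs
        if terminated or truncated:
            obs, _ = env.reset()
    return rollout  # shipped back to the central learner
```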
APS (Adaptive Policy Synchronization) addresses policy staleness by triggering weight updates only when divergence exceeds a user-defined KL threshold. This minimizes unnecessary communication while keeping behavior on-policy enough for efficient training.
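As a rough sketch of that check (not the library's internal implementation), a worker could compare its stale local policy against the learner's current one on a batch of recent observations and pull new weights only when the mean KL divergence exceeds the configured threshold:

```python
import torch
import torch.nn.functional as F

def policy_drift_kl(local_policy, learner_policy, obs_batch: torch.Tensor) -> float:
    """Mean KL(local || learner) over a batch of observations (discrete actions)."""
    with torch.no_grad():
        local_logp = F.log_softmax(local_policy(obs_batch), dim=-1)
        learner_logp = F.log_softmax(learner_policy(obs_batch), dim=-1)
        kl = (local_logp.exp() * (local_logp - learner_logp)).sum(dim=-1)
    return kl.mean().item()

# Hypothetical synchronization rule matching kl_threshold in the quickstart config:
# if policy_drift_kl(local, learner, obs_batch) > 0.05:
#     local.load_state_dict(learner.state_dict())
```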