SERE is a lightweight framework for building symbolic, embodied reasoning environments — where agents must manipulate objects, respect spatial and causal constraints, and satisfy task goals expressed in PDDL-style logic.
It’s designed for RL + LLM training, giving you a Gym-style API but with symbolic state, grounded actions, stochasticity, and reward shaping built in.
- YAML-defined domains – types, predicates, fluents, actions (with preconditions, add/del, conditional and numeric effects).
- PDDL-style grounding – parses `(pick-up r1 mug1)` into concrete state updates.
- World state engine – maintains objects, facts, numeric fluents, and enforces invariants.
- Derived predicates – author higher-level semantics in domain YAML without extra code.
- Numeric fluents, durations, energy – model time, resources, and stochastic outcomes.
- Reward shaping & termination rules – instant milestones, potential-based shaping, and structured `all`/`any` termination.
- Multi-agent joint actions – require one action per robot and apply effects simultaneously.
- Invariant plugins – register domain-specific constraints (e.g. “object can’t be in two places”).
- Human-readable rendering – natural language + PDDL observations for LLM prompting, with affordance lists.
- Reference plans & regression tests – validate domains and ensure backward compatibility.
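To make "grounding" concrete: a lifted action schema plus a binding of concrete objects yields ground add/del facts. The sketch below is illustrative only — the dictionaries and the `ground` helper are hypothetical stand-ins, not SERE's internal data structures.

```python
# Illustrative sketch of PDDL-style grounding (hypothetical data structures,
# not SERE's internals): binding concrete objects to a lifted schema's
# parameters turns it into preconditions and add/del lists of ground facts.

LIFTED_PICK_UP = {
    "params": ["?r", "?o"],
    "pre": [("clear-hand", "?r")],
    "add": [("holding", "?r", "?o")],
    "del": [("clear-hand", "?r")],
}

def ground(schema, args):
    """Substitute args for params, e.g. grounding (pick-up r1 mug1)."""
    binding = dict(zip(schema["params"], args))
    def sub(atom):
        return tuple(binding.get(term, term) for term in atom)
    return {key: [sub(a) for a in schema[key]] for key in ("pre", "add", "del")}

effects = ground(LIFTED_PICK_UP, ["r1", "mug1"])
print(effects["add"])  # [('holding', 'r1', 'mug1')]
print(effects["del"])  # [('clear-hand', 'r1')]
```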
```text
src/sere/
├── core/
│   ├── world_state.py         # Facts, objects, fluents, invariants
│   ├── semantics.py           # Clause + numeric evaluation, traces
│   ├── invariants.py          # Generic + domain-specific plugins
│   └── pddl_env/              # RL-style environment + prompting
│       ├── env.py             # Env: reset/step/reward/done
│       ├── engine.py          # Action application, stochastic outcomes
│       ├── planning.py        # Parse/execute action blocks
│       ├── rendering.py       # Messages + obs stitching
│       ├── prompt_formatter.py  # System prompt + observations + affordances
│       └── run_mode.py        # interactive / batch / open_loop
├── pddl/                      # Domain parsing, grounding, NL mapping
├── io/                        # Task loader utilities
├── cli/                       # Command-line runner
│   └── run_task.py
└── assets/
    ├── domain/                # Domain YAMLs (kitchen, assembly, …)
    └── tasks/                 # Task YAMLs (per domain)
```
```shell
git clone https://github.com/yourname/SERE.git
cd SERE
uv venv .venv
source .venv/bin/activate
uv sync
```

Requires Python 3.11+.
From the repo root:

```shell
python -m sere.cli.run_task kitchen/t01_one_step_steep.yaml
```

Note: `cli.run_task` resolves the YAML path relative to `src/sere/assets/tasks/`.
You’ll see output like:

```text
...
State:
(at r1 hallway)
(obj-at kettle1 kitchen)
(clear-hand r1)
Goal:
(tea-ready mug1)
Reply with (action args).
```

Example step:

```text
(move r1 hallway kitchen)
```
The environment will parse and apply the action, update time/energy, and return the next observation plus reward.
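The reply format is a single S-expression per action. As a minimal illustration of that format (only a sketch — SERE's real parser in `planning.py` also handles action blocks, validation, and grounding against the domain), splitting one reply into a verb and arguments looks like:

```python
# Minimal sketch of parsing one "(action args)" reply into a verb and an
# argument list. Illustrative only; not SERE's actual parser.

def parse_action(text: str) -> tuple[str, list[str]]:
    s = text.strip()
    if not (s.startswith("(") and s.endswith(")")):
        raise ValueError(f"not an S-expression: {text!r}")
    tokens = s[1:-1].split()
    if not tokens:
        raise ValueError("empty action")
    return tokens[0], tokens[1:]

verb, args = parse_action("(move r1 hallway kitchen)")
print(verb, args)  # move ['r1', 'hallway', 'kitchen']
```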
SERE includes a Ludic wrapper that exposes each robot as a separate Ludic agent.
Agent IDs are the sorted robot symbols (e.g. `r1`, `r2`). Actions are raw PDDL
S-expressions with exactly one action per agent per step. Use `(idle r)` for a no-op.
Make sure `ludic` is importable (e.g. `export PYTHONPATH="$PWD/ludic/src"` when
using this monorepo).
```python
from integrations.ludic import SereLudicEnv, pddl_action_parser
from sere.core.pddl_env.run_mode import RunMode
from sere.io.task_loader import load_task

env, meta = load_task(
    None,
    "kitchen/t11_multi_agent_parallel_brew.yaml",
    run_mode=RunMode.INTERACTIVE,
)
ludic_env = SereLudicEnv(env)
parser = pddl_action_parser()
# agent_map = {aid: Agent(..., parser=parser, ...) for aid in ludic_env.agent_ids}
# protocol = MultiAgentProtocol(agent_map)
```

If any active agent is missing an action for a step, the wrapper returns an
`invalid_move` outcome and the episode may terminate (same semantics as SERE).
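Because a missing action invalidates the whole step, callers may want to default absent agents to `(idle r)` before stepping. The helper below is hypothetical (not part of the wrapper's API), shown only to make the one-action-per-agent contract concrete:

```python
# Hypothetical pre-step helper (not part of SereLudicEnv): fill missing
# agents with a no-op so the joint action always covers every active agent.

def complete_joint_action(agent_ids, actions):
    """Return a dict with exactly one S-expression per agent id."""
    extra = set(actions) - set(agent_ids)
    if extra:
        raise ValueError(f"actions for unknown agents: {sorted(extra)}")
    return {aid: actions.get(aid, f"(idle {aid})") for aid in agent_ids}

joint = complete_joint_action(["r1", "r2"], {"r1": "(move r1 hallway kitchen)"})
print(joint["r2"])  # (idle r2)
```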
- Domains (`assets/domain/*.yaml`) define:
  - Types (`robot`, `location`, `object`, …)
  - Predicates (`at`, `holding`, `in`, …)
  - Fluents (`energy`, `time`, …)
  - Actions (with preconditions, add/del, conditional effects, stochastic outcomes, numeric updates, durations)
  - Derived predicates (rules evaluated at runtime; never mutated by actions)

- Tasks (`assets/tasks/**/*.yaml`) define:
  - Objects with types
  - Initial state (facts + fluent values)
  - Statics (e.g. adjacency graph)
  - Termination rules (with `when` or structured `all`/`any`)
  - Optional shaping rules and reference plans
  - Optional `meta.multi_agent: true` to require joint actions (one per robot)
This separation makes it easy to randomize tasks or auto-generate curricula.
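As an illustrative sketch of that split (field names here are hypothetical, not SERE's exact schema — consult the shipped YAMLs under `assets/` for the real one), a domain fragment might declare an action:

```yaml
# Hypothetical domain fragment — illustrative schema only.
actions:
  move:
    params: [?r - robot, ?from - location, ?to - location]
    pre: [(at ?r ?from), (adjacent ?from ?to)]
    add: [(at ?r ?to)]
    del: [(at ?r ?from)]
    duration: 1
```

while a task fragment supplies the concrete objects, initial state, and termination:

```yaml
# Hypothetical task fragment — illustrative schema only.
objects:
  r1: robot
  hallway: location
  kitchen: location
init:
  facts: [(at r1 hallway)]
  fluents: {energy: 10}
statics:
  facts: [(adjacent hallway kitchen)]
termination:
  - when: (at r1 kitchen)
```

Randomizing a curriculum then only touches the task side; the domain stays fixed.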
- Improve docs and tutorials
- Add task/domain randomization hooks
- Add more domain clusters and cross-domain skill tags