Open-Source Library for Fully Cooperative Multi-LLM Reinforcement Learning
-
Updated
Apr 1, 2026 - Python
Open-Source Library for Fully Cooperative Multi-LLM Reinforcement Learning
Benchmark and evaluation code for the ICLR 2026 Workshop paper 'Residual Drift Dominates Contradiction in Multi-Turn Constraint Reasoning'. 1,020 Z3-validated multi-turn constraint problems across seating, scheduling, and logic-grid domains.
Add a description, image, and links to the multi-turn-reasoning topic page so that developers can more easily learn about it.
To associate your repository with the multi-turn-reasoning topic, visit your repo's landing page and select "manage topics."