This document tracks the issues required to reach key milestones.
How to join us
- Subscribe to our mailing list: link
- Join our Slack: link
Goal
Post-GTC / Q2 2026: Stabilize the benchmarking platform, expand accuracy coverage, harden reliability, and close all known P0 gaps following the GTC demo.
See Phase 1 roadmap: #83
ShowStopper
Functionality (P0)
- NotImplementedError: add user-friendly errors (bug: EVAL and SUBMISSION test types raise bare NotImplementedError with no user-friendly message #218)
Bug Fixes
- DuplicatePreparedStatement error in recorder (Resolve Endpoints PostGres dup element issue #213)
- max_throughput mode causes connection timeouts ("max_throughput" mode causes connection timeouts #202)
- max_duration_ms ends online test prematurely, before all samples are issued ([Runtime setting] max_duration_ms: Ends online test prematurely (soon as duration is hit). #197)
- target_qps hardcoded to 10.0 in Offline mode instead of None (bug: target_qps hardcoded to 10.0 in Offline mode instead of None #219)
- RuntimeSettings stores Random objects, so benchmarks are non-reproducible and non-serializable (bug: RuntimeSettings stores Random objects making benchmarks non-reproducible and non-serializable #221)
Accuracy & Datasets (P1)
Performance & Benchmarking (P1)
Testing & CI (P0/P1)
UI/UX & Visualization (P1)