Problem
If the tmux server crashes (OOM, bug, manual kill), all sessions become orphaned. Aegis has no mechanism to detect or recover from this.
Failure Scenario
- tmux server crashes
- All @n window IDs become invalid
- Aegis continues running, thinking sessions exist
- All API calls fail with tmux errors
- ensureSession() creates new session, but old windows are lost
Suggested Fix
- Add tmux health check to /health endpoint
- Implement session reconciliation on tmux errors
- Store window names in state, attempt re-attach by name after tmux restart
Source
Resilience audit swarm (2026-03-28)
Problem
If the tmux server crashes (OOM, bug, manual kill), all sessions become orphaned. Aegis has no mechanism to detect or recover from this.
Failure Scenario
Suggested Fix
Source
Resilience audit swarm (2026-03-28)