Reproducible evaluation suite for LLM behavior research: epistemic pathology, delegated introspection, and temporal consciousness diagnostics
python alignment language-models reproducibility ai-safety interpretability research-engineering ai-alignment research-tools rlhf llm-evaluation behavioral-research llm-evals epistemic-alignment temporal-consciousness epistemic-pathology
Updated Oct 22, 2025 - Python