Welcome to the home for everything you need to design, build, and sustain high-quality evaluations for AI systems. This repo powers a Mintlify site focused on practical guidance for teams who want to measure model quality with clarity and rigor. Inside, you'll find:
- Engaging guides that walk through the mindset, principles, and workflows behind reliable evaluations.
- Hands-on playbooks packed with facilitation tips, stakeholder prompts, and automation recipes.
- Templates and rubrics you can copy-paste into your own tooling to jumpstart experimentation (a small illustrative sketch follows this list).
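To give a flavor of what those templates look like, here is a minimal, hypothetical grading rubric; the criteria names and the 1-5 scale are placeholders for illustration, not an excerpt from the actual templates:

```markdown
| Criterion    | 1 (poor)                | 3 (acceptable)              | 5 (excellent)                   |
|--------------|-------------------------|-----------------------------|---------------------------------|
| Accuracy     | Contains factual errors | Mostly correct, minor slips | Fully correct and verifiable    |
| Completeness | Misses key requirements | Covers most requirements    | Addresses every requirement     |
| Clarity      | Hard to follow          | Understandable with effort  | Clear, well-structured response |
```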
The docs run on Mintlify. Install dependencies and start the local dev server, then visit http://localhost:3000 to preview changes:

```bash
npm install
npm run dev
```
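Mintlify sites are configured through a JSON file at the repo root that defines the site name and navigation, and new pages generally need to be registered there before they show up in the sidebar. The snippet below is only an illustrative sketch: the exact filename (for example mint.json or docs.json) and schema depend on the Mintlify version this repo uses, and the group and page names are hypothetical.

```json
{
  "name": "Evaluation Docs",
  "navigation": [
    {
      "group": "Guides",
      "pages": ["guides/getting-started", "guides/designing-rubrics"]
    }
  ]
}
```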
To contribute:
- Create a new branch for your update.
- Make your edits using MDX components when they add clarity or interactivity (see the example after this list).
- Run the local server to verify the formatting and interactive elements.
- Open a pull request with a summary of the changes and any new assets you've added.
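For example, Mintlify provides callout components such as `<Note>` and `<Tip>` that can be dropped straight into an MDX page. The snippet below is a minimal sketch assuming those standard components are available in this repo; the wording is placeholder text:

```mdx
<Note>
  Evaluation rubrics work best when every score level is paired with a concrete example.
</Note>

<Tip>
  Run a small pilot batch before scaling a new rubric to the full dataset.
</Tip>
```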
Whether you're refining prompts, collecting human preference data, or monitoring regressions, these docs should equip you with the patterns to build trustworthy evaluation loops. If you spot a gap or have a new tactic to share, contributions are encouraged!