@WillieRuemmele
Contributor

What does this PR do?

Adds a custom evaluation section to the interview.

What issues does this PR fix or reference?

@W-19120805@

@shetzel
Contributor

shetzel commented Jul 28, 2025

yarn test:nuts is expected to fail because of server-side bugs in agent create and agent test run.

I tested generating agent test specs with custom evals. The specs were generated correctly, and I could deploy and run them, although the run itself failed with a server-side bug.

@shetzel shetzel merged commit 8ee5f7e into main Jul 28, 2025
12 of 13 checks passed
@shetzel shetzel deleted the wr/customEval branch July 28, 2025 21:26

3 participants