Actions: GeorgePearse/evaluator

Showing runs from all workflows
51 workflow runs

Feat: Add LLM-as-judge system for git diff evaluation
Deploy static content to Pages #15: Commit ff8dd08 pushed by GeorgePearse
3m 58s main
Docs: Add SWE-bench reference to homepage
Deploy static content to Pages #12: Commit c54501f pushed by GeorgePearse
46s main