Skip to content

Add description of what the skatebench benchmark tests#2

Open
lcrh wants to merge 1 commit intoT3-Content:mainfrom
lcrh:update-readme-description
Open

Add description of what the skatebench benchmark tests#2
lcrh wants to merge 1 commit intoT3-Content:mainfrom
lcrh:update-readme-description

Conversation

@lcrh
Copy link

@lcrh lcrh commented Sep 6, 2025

Added a clear explanation at the top of the README describing that this benchmark tests LLMs on skateboarding trick terminology, with a concrete example of a test case showing the prompt, correct answers, and failure conditions.

🤖 Generated with Claude Code

Added a clear explanation at the top of the README describing that this benchmark tests LLMs on skateboarding trick terminology, with a concrete example of a test case showing the prompt, correct answers, and failure conditions.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@vercel
Copy link

vercel bot commented Sep 6, 2025

@lcrh is attempting to deploy a commit to the Theo's projects Team on Vercel.

A member of the Team first needs to authorize it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant