Added Toksuite Benchmark #3669

Open
gsaltintas wants to merge 4 commits into EleutherAI:main from gsaltintas:toksuite

@gsaltintas
Contributor

This pull request adds the task configs and post-processing steps needed for the TokSuite benchmark, a multiple-choice benchmark in English, Turkish, Italian, Farsi, and Chinese that targets surface-level perturbations of natural text. The benchmark was proposed in https://arxiv.org/abs/2512.20757.

@gsaltintas requested a review from 0xSMT as a code owner on April 1, 2026, 20:25
