GitHub - atla-ai/selene-mini

🛝 Playground | 📄 Technical report | 💻 GitHub | 👀 Sign up for the API

Selene Mini

Selene Mini is a state-of-the-art small language model-as-a-judge (SLMJ). Selene Mini achieves comparable performance to models 10x its size, outperforming GPT-4o on RewardBench, EvalBiasBench, and AutoJ.

Post-trained from Llama-3.1-8B across a wide range of evaluation tasks and scoring criteria, Selene Mini outperforms prior small evaluation models overall across 11 benchmarks covering three different types of tasks:

Absolute scoring, e.g. "Evaluate the harmlessness of this response on a scale of 1-5"
Classification, e.g. "Does this response address the user query? Answer Yes or No."
Pairwise preference. e.g. "Which of the following responses is more logically consistent - A or B?"

It is also the #1 8B generative model on RewardBench.

Resources

This repo features prompt templates used during training and hands-on examples for using Selene Mini.

Contact

Get in touch if you have any queries not covered in this repo.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
cookbooks		cookbooks
prompt-templates		prompt-templates
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Selene Mini

Resources

Contact

About

Releases

Packages

Contributors 2

Languages

License

atla-ai/selene-mini

Folders and files

Latest commit

History

Repository files navigation

Selene Mini

Resources

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages