Closed
Description
🚀 The feature, motivation and pitch
Verifier/reward models are going to be very important moving forward for building:
- High quality synthetic data pipelines
- Verifying model reasoning
- Multi agent systems
Could we add support for sequence classification models like Skywork/Skywork-Reward-Llama-3.1-8B
Alternatives
No response
Additional context
No response
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.