Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some improvement on [Reasoning] #53

Draft
wants to merge 5 commits into
base: main
Choose a base branch
from
Draft

Conversation

YanSong97
Copy link
Collaborator

@YanSong97 YanSong97 commented Nov 12, 2024

  1. Add LLM-as-Judge tools
  2. Configurate data loading, model resources;
  3. add huggingface model worker;

2. Configurate LM/RM address;
3. Add tensor parallel example;
4. Black format;
2. refactor prompt resources, data loading;
3. add genrm infer fn and separate Q&A before value inference
4. batch limit for vllm request
2. add lm stop str to cfg file;
3. llm-as-judge also extract clean answer for evaluation;
4. add huggingface model worker;
5. update scripts
@YanSong97 YanSong97 force-pushed the reason_llm_as_judge branch from 2b06d58 to 345cd1d Compare December 8, 2024 21:03
@YanSong97 YanSong97 requested a review from ziyuwan December 8, 2024 21:19
2. add offline RM evaluation script
2. change file type to yaml;
3. fix rstar data loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants