
Pull requests: LLM360/Reasoning360

Pull requests list

[algo] Adding CISPO policy loss
#150 opened Nov 6, 2025 by twkillian
2 tasks done
Bump sglang[all] from 0.4.6.post5 to 0.5.4.post1 (labels: dependencies, python)
#148 opened Oct 27, 2025 by dependabot (bot)
Bump torchvision from 0.20.1 to 0.24.0 (labels: dependencies, python)
#147 opened Oct 20, 2025 by dependabot (bot)
Bump ray from 2.46.0 to 2.50.1 (labels: dependencies, python)
#146 opened Oct 20, 2025 by dependabot (bot)
Pr upstream verl merge diffaware
#137 opened Sep 29, 2025 by Jianshu1only
7 tasks
Bump tokenizers from 0.21 to 0.22.1 (labels: dependencies, python)
#136 opened Sep 22, 2025 by dependabot (bot)
Update numpy requirement from <2.0.0 to <3.0.0 (labels: dependencies, python)
#126 opened Aug 26, 2025 by dependabot (bot)
Add reward function for SynLogic dataset
#123 opened Aug 14, 2025 by LiqunMa
1 task
[fix] Fix math reward hanging [WIP]
#109 opened Jul 7, 2025 by BlankCheng