-
Notifications
You must be signed in to change notification settings - Fork 10
Pull requests: opendilab/LightRFT
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feature(wjy): add R1-AQA (Audio Question Answering) training examples
enhancement
New feature or request
#44
opened Feb 12, 2026 by
JOY-SWang
Loading…
feature(pu): add init version of on_policy_distillation
enhancement
New feature or request
#43
opened Feb 10, 2026 by
puyuan1996
Loading…
feature(wzn): add language switcher and complete some Chinese documentation for LightRFT docs
documentation
Improvements or additions to documentation
style
Code or comments formatting
#42
opened Feb 10, 2026 by
zunian-wan
Loading…
3 of 40 tasks
feature(wzn): add LoRA training demo for Geo3K
enhancement
New feature or request
#41
opened Feb 10, 2026 by
zunian-wan
Loading…
1 of 47 tasks
feature(sunjx): implement dynamic sampling strategy in DAPO
#40
opened Feb 10, 2026 by
Jiaxuan-Sun
Loading…
feature(sunjx): add rejection sampling in grm_training
#38
opened Feb 6, 2026 by
Jiaxuan-Sun
Loading…
doc(pu): add init version of fast_exp_maker best practice
documentation
Improvements or additions to documentation
#37
opened Feb 3, 2026 by
puyuan1996
Loading…
polish(nyz): fix redundant requires and vllm compatibility
polish
Polish algorithms, tests or configs
#36
opened Feb 3, 2026 by
PaParaZz1
Loading…
4 of 9 tasks
feature(pu): add run_ppo_geo3k_qwen2.5_vl_7b.sh
enhancement
New feature or request
polish
Polish algorithms, tests or configs
#35
opened Feb 3, 2026 by
puyuan1996
Loading…
feature(luyd): add partial rollout in training process
enhancement
New feature or request
#29
opened Jan 22, 2026 by
AltmanD
Loading…
feature(sunjx): add GSPO and GMPO algorithms support
enhancement
New feature or request
#22
opened Jan 9, 2026 by
Jiaxuan-Sun
Loading…
refactor(sunjx): refactor loss-filter implementation
enhancement
New feature or request
refactor
Cleanup, formatting, or restructuring of existing code.
#17
opened Jan 1, 2026 by
Jiaxuan-Sun
Loading…
refactor(sunjx): refactor dataset and reward module
refactor
Cleanup, formatting, or restructuring of existing code.
#13
opened Dec 31, 2025 by
Jiaxuan-Sun
Loading…
feature(sunjx): add rejective sampling pipeline in t2i demo
enhancement
New feature or request
#3
opened Dec 25, 2025 by
Jiaxuan-Sun
Loading…
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.