generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Replaced
unittest.TestCase
with TrlTestCase
that handles tmp dir
#3863
opened Aug 7, 2025 by
qgallouedec
Loading…
[#3647] Fix: Assign default values in the GKDTrainer's constructor only when …
#3851
opened Aug 5, 2025 by
seungduk-yanolja
Loading…
2 of 5 tasks
Update profiling.py: fix scoping problems for wandb and mlflow
#3845
opened Aug 4, 2025 by
markshinyounglee
Loading…
5 tasks done
Optimize RLOO Trainer memory usage with string-level processing
#3837
opened Aug 2, 2025 by
luckyvickyricky
Loading…
2 of 5 tasks
👁️ From
AutoModelForVision2Seq
to AutoModelForImageTextToText
#3836
opened Aug 2, 2025 by
qgallouedec
Loading…
Fix SFTTrainer token accuracy computation with PromptEncoder
#3821
opened Jul 31, 2025 by
zk-quantum
Loading…
5 tasks done
GSPO docs - Sequence importance ratio and differences in relation to GRPO
#3816
opened Jul 31, 2025 by
almeidava93
Loading…
2 of 5 tasks
Adding support for different losses which are now supported by Liger
#3815
opened Jul 31, 2025 by
Manan17
Loading…
1 of 5 tasks
💇 Add soft overlong punishment reward function and update documentation
#3804
opened Jul 30, 2025 by
qgallouedec
Loading…
Add vLLM server mode support to OnlineDPOTrainer
#3783
opened Jul 27, 2025 by
vaelev
Loading…
6 tasks done
change doc for
num_iterations
and steps_per_generation
to hopefully make them more clear and differentiate between them more clearly
#3761
opened Jul 23, 2025 by
avishaiElmakies
Loading…
2 of 5 tasks
Dynamic sampling option in GRPO trainer based on DAPO paper
#3758
opened Jul 23, 2025 by
almeidava93
Loading…
2 of 5 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.