Issues: huggingface/trl
[Tracking issue] Integrate native liger-kernel losses
✨ enhancement
New feature or request
🧒 good second issue
Good for contributors with basic project familiarity
#2495
opened Dec 17, 2024 by
qgallouedec
5 tasks
Probably a more reasonable method of packing
✨ enhancement
New feature or request
🧒 good second issue
Good for contributors with basic project familiarity
🙋 help from community wanted
Open invitation for community members to contribute
🏋 SFT
Related to SFT
#2466
opened Dec 12, 2024 by
AIR-hl
Let DPOTrainer Support padding_free
🏋 DPO
Related to DPO
✨ enhancement
New feature or request
🧒 good second issue
Good for contributors with basic project familiarity
🙋 help from community wanted
Open invitation for community members to contribute
#2422
opened Dec 1, 2024 by
fzyzcjy
[SFT VLM] Add support for Molmo models
🧒 good second issue
Good for contributors with basic project familiarity
🏋 SFT
Related to SFT
👁️ VLM
Related to Visual Language Models
#2136
opened Sep 27, 2024 by
lewtun
Always allow ref_model=None
🏋 DPO
Related to DPO
✨ enhancement
New feature or request
🧒 good second issue
Good for contributors with basic project familiarity
🙋 help from community wanted
Open invitation for community members to contribute
#2047
opened Sep 10, 2024 by
qgallouedec
PPOv2Trainer reward_model throws AttributeError: '<My Custom Class>' object has no attribute 'base_model_prefix'
📚 documentation
#1977
opened Aug 26, 2024 by
RylanSchaeffer
2 of 4 tasks
PPOv2Trainer throws AttributeError: 'NoneType' object has no attribute 'modules' because value_model's default is None
✨ enhancement
New feature or request
🧒 good second issue
Good for contributors with basic project familiarity
🙋 help from community wanted
Open invitation for community members to contribute
🏋 PPO
Related to PPO
#1976
opened Aug 26, 2024 by
RylanSchaeffer
2 of 4 tasks
Last layer of the Llava-1.5 visual tower is not training
🧒 good second issue
Good for contributors with basic project familiarity
👁️ VLM
Related to Visual Language Models
#1936
opened Aug 16, 2024 by
qgallouedec
Feature Request: Self-Improving Robust Preference Optimization (SRPO)
✨ enhancement
New feature or request
🧒 good second issue
Good for contributors with basic project familiarity
#1714
opened Jun 7, 2024 by
duyvuleo
Packing without cross contamination
✨ enhancement
New feature or request
🧒 good second issue
Good for contributors with basic project familiarity
🙋 help from community wanted
Open invitation for community members to contribute
#1230
opened Jan 15, 2024 by
nivibilla