Skip to content

Issues: huggingface/trl

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Tracking issue] Integrate native liger-kernel losses ✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity
#2495 opened Dec 17, 2024 by qgallouedec
5 tasks
Probably a more reasonable method of packing ✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity 🙋 help from community wanted Open invitation for community members to contribute 🏋 SFT Related to SFT
#2466 opened Dec 12, 2024 by AIR-hl
Let DPOTrainer Support padding_free 🏋 DPO Related to DPO ✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity 🙋 help from community wanted Open invitation for community members to contribute
#2422 opened Dec 1, 2024 by fzyzcjy
[SFT VLM] Add support for Molmo models 🧒 good second issue Good for contributors with basic project familiarity 🏋 SFT Related to SFT 👁️ VLM Related to Visual Language Models
#2136 opened Sep 27, 2024 by lewtun
Always allow ref_model=None 🏋 DPO Related to DPO ✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity 🙋 help from community wanted Open invitation for community members to contribute
#2047 opened Sep 10, 2024 by qgallouedec
PPOv2Trainer reward_model throws AttributeError: '<My Custom Class>' object has no attribute 'base_model_prefix' 📚 documentation Improvements or additions to documentation 🧒 good second issue Good for contributors with basic project familiarity 🙋 help from community wanted Open invitation for community members to contribute 🏋 PPO Related to PPO
#1977 opened Aug 26, 2024 by RylanSchaeffer
2 of 4 tasks
PPOv2Trainer throws AttributeError: 'NoneType' object has no attribute 'modules' because value_model's default is None ✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity 🙋 help from community wanted Open invitation for community members to contribute 🏋 PPO Related to PPO
#1976 opened Aug 26, 2024 by RylanSchaeffer
2 of 4 tasks
Last layer of the Llava-1.5 visual tower is not training 🧒 good second issue Good for contributors with basic project familiarity 👁️ VLM Related to Visual Language Models
#1936 opened Aug 16, 2024 by qgallouedec
Feature Request: Self-Improving Robust Preference Optimization (SRPO) ✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity
#1714 opened Jun 7, 2024 by duyvuleo
Packing without cross contamination ✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity 🙋 help from community wanted Open invitation for community members to contribute
#1230 opened Jan 15, 2024 by nivibilla
ProTip! Type g i on any issue or pull request to go back to the issue listing page.