Issues: huggingface/trl

Issues list (filtered by the 🏋 DPO label)

DeepSpeed with trl
  Labels: 🐛 bug, 🚀 deepspeed, 🏋 DPO, ⏳ needs more info
  #2490 opened Dec 16, 2024 by sagie-dekel

KeyError in DPO Trainer, evaluation_loop
  Labels: 🐛 bug, 🏋 DPO
  #2473 opened Dec 13, 2024 by qingjianbuyi

Packing in DPOTrainer
  Labels: 🏋 DPO, ✨ enhancement
  #2469 opened Dec 13, 2024 by zhc7

DPOTrainer log metrics are not gathered and averaged across ranks
  Labels: 🐛 bug, 🏋 DPO
  #2468 opened Dec 13, 2024 by zhc7

Probable mistake in DPOTrainer when computing/logging grad_norm
  Labels: 🏋 DPO, ❓ question
  #2456 opened Dec 10, 2024 by AIR-hl

Out of Memory Error: DPO Trainer
  Labels: 🏋 DPO, ❓ question
  #2452 opened Dec 9, 2024 by gp-1108

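For out-of-memory reports like #2452 above, a hedged sketch of the usual memory-saving knobs. Field names come from transformers.TrainingArguments and trl.DPOConfig; the values are placeholders, and exact availability varies by TRL version.

```python
# Sketch: common memory-saving settings for DPO training.
# DPO materializes policy and reference logits for both the chosen and the
# rejected completion, so its activation cost is a multiple of plain SFT.
from trl import DPOConfig  # DPOConfig subclasses transformers.TrainingArguments

args = DPOConfig(
    output_dir="dpo-out",
    per_device_train_batch_size=1,   # each sample expands to a chosen/rejected pair
    gradient_accumulation_steps=16,  # recover the effective batch size
    gradient_checkpointing=True,     # trade recompute for activation memory
    bf16=True,                       # half-precision activations
    max_length=1024,                 # truncate prompt + completion
    max_prompt_length=512,           # truncate the prompt alone
)
```
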
DPO with Unsloth: TypeError: empty_like(): argument 'input' (position 1) must be Tensor, not NoneType
  Labels: 🐛 bug, 🏋 DPO, 🦥 unsloth, 👁️ VLM
  #2438 opened Dec 4, 2024 by davidszwjx

DPO training with 'logits/chosen': nan, 'logits/rejected': nan
  Labels: 🐛 bug, 🏋 DPO, ⏳ needs more info
  #2435 opened Dec 4, 2024 by ZengQQQ

Let DPOTrainer support padding_free
  Labels: 🏋 DPO, ✨ enhancement, 🧒 good second issue, 🙋 help from community wanted
  #2422 opened Dec 1, 2024 by fzyzcjy

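The padding_free request in #2422 is about dropping pad tokens entirely. A toy illustration of the idea, not TRL code: sequences are flattened into a single row and boundaries are encoded in position_ids, the scheme used by transformers' DataCollatorWithFlattening.

```python
# Toy illustration of padding-free batching (not TRL's implementation).
# Instead of padding every sequence to the batch maximum, concatenate them
# and reset position_ids at each boundary; attention kernels that honor
# position_ids resets then treat each reset as the start of a new sequence.
seqs = [[5, 6, 7], [8, 9], [10, 11, 12, 13]]

input_ids = [tok for seq in seqs for tok in seq]
position_ids = [pos for seq in seqs for pos in range(len(seq))]

print(input_ids)     # [5, 6, 7, 8, 9, 10, 11, 12, 13]  (no pad tokens)
print(position_ids)  # [0, 1, 2, 0, 1, 0, 1, 2, 3]      (zeros mark boundaries)
```
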
DPO does not work for a FIM task with a non-instruct model
  Labels: 🏋 DPO, ❓ question
  #2382 opened Nov 22, 2024 by AML14

DPO training issue: max_steps jumps from 1000 to 996349
  Labels: 🐛 bug, 🏋 DPO
  #2355 opened Nov 14, 2024 by seTalent

DPO training DataLoader is not shuffled
  Labels: 🏋 DPO, ✨ enhancement
  #2337 opened Nov 7, 2024 by kaiwenw

Support for MiniCPM-V reinforcement learning with Direct Preference Optimization (DPO)
  Labels: 🏋 DPO, ❓ question, 👁️ VLM
  #2326 opened Nov 5, 2024 by DarioPTWR

[Trainer] Changing the dataset dynamically during training
  Labels: 🏋 DPO, ❓ question
  #2227 opened Oct 14, 2024 by ilyasoulk

Handling of "auto" in deepspeed config causes crash under Zero3 🐛 bug Something isn't working 🚀 deepspeed Related to deepspeed 🏋 DPO Related to DPO 🙋 help from community wanted Open invitation for community members to contribute
#2154 opened Oct 2, 2024 by Ben-Schneider-code
2 of 4 tasks
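A possible workaround for #2154, assuming the crash stems from unresolved "auto" placeholders: pass an explicit ZeRO-3 config dict instead of "auto" values. The keys are standard DeepSpeed config fields; the values are placeholders and must stay in sync with the training arguments.

```python
# Hedged sketch: replace "auto" placeholders with explicit values.
# transformers accepts a dict (or a JSON file path) via the `deepspeed`
# training argument; explicit values sidestep HF's "auto" resolution.
from trl import DPOConfig

ds_config = {
    "zero_optimization": {"stage": 3},
    "bf16": {"enabled": True},
    "train_micro_batch_size_per_gpu": 1,  # explicit instead of "auto"
    "gradient_accumulation_steps": 16,    # explicit instead of "auto"
}

# Keep the TrainingArguments consistent with the DeepSpeed values above,
# otherwise HF raises a config-mismatch error at startup.
args = DPOConfig(
    output_dir="dpo-out",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=16,
    bf16=True,
    deepspeed=ds_config,
)
```
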
Always allow ref_model=None
  Labels: 🏋 DPO, ✨ enhancement, 🧒 good second issue, 🙋 help from community wanted
  #2047 opened Sep 10, 2024 by qgallouedec

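For context on #2047: TRL already accepts ref_model=None in the PEFT case, where the frozen base weights act as the implicit reference model. A minimal sketch; the model and dataset names are placeholders from the TRL docs, and the processing_class argument name follows recent TRL releases.

```python
# Sketch: DPO without a separate reference model, via PEFT adapters.
# With a peft_config, DPOTrainer can score the reference policy by simply
# disabling the adapters, so no second copy of the weights is needed.
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_id = "Qwen/Qwen2-0.5B-Instruct"  # placeholder model
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)
train_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

trainer = DPOTrainer(
    model=model,
    ref_model=None,  # valid here because peft_config is provided
    args=DPOConfig(output_dir="dpo-out"),
    train_dataset=train_dataset,
    processing_class=tokenizer,
    peft_config=LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM"),
)
trainer.train()
```
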
[Question] Why don't TR-DPO's default alpha and tau match the values suggested in the paper?
  Labels: 🏋 DPO, 👶 good first issue, 🙋 help from community wanted
  #1991 opened Aug 28, 2024 by qgallouedec

DPO models generate multiple/corrupted responses
  Labels: 🏋 DPO, 🙋 help from community wanted
  #1025 opened Nov 22, 2023 by Devy99