Issues · huggingface/trl

[Tracking issue] General dataset support

#2071 opened Sep 15, 2024 by qgallouedec

Open

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 2

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clear current search query, filters, and sorts

46 Open 37 Closed

✨ enhancement 🏋 KTO

#2554 opened Jan 10, 2025 by starmpcc

✨ enhancement 🏋 Online DPO 🏋 PPO 🏋 RLOO

#2529 opened Dec 28, 2024 by dawidm

✨ enhancement

#2525 opened Dec 28, 2024 by August-murr

3 tasks done

✨ enhancement

#2517 opened Dec 23, 2024 by AMindToThink

3 tasks

✨ enhancement 🏋 SFT

#2504 opened Dec 19, 2024 by ggbetz

✨ enhancement 🧒 good second issue

#2495 opened Dec 17, 2024 by qgallouedec

5 tasks

🏋 DPO ✨ enhancement

#2469 opened Dec 13, 2024 by zhc7

Probably a more reasonable method of packing ✨ enhancement 🧒 good second issue 🙋 help from community wanted 🏋 SFT

#2466 opened Dec 12, 2024 by AIR-hl

✨ enhancement ⏳ needs more info

#2415 opened Nov 29, 2024 by dame-cell

✨ enhancement 🙋 help from community wanted ⚡ PEFT 🏋 RLOO

#2404 opened Nov 28, 2024 by harvinyou

7 of 9 tasks

✨ enhancement 👶 good first issue 🙋 help from community wanted 🏋 PPO

#2387 opened Nov 23, 2024 by kechunFIVE

✨ enhancement 🙋 help from community wanted

#2383 opened Nov 22, 2024 by morLev

2 of 3 tasks

🏋 DPO ✨ enhancement

#2337 opened Nov 7, 2024 by kaiwenw

4 tasks

Using a different ref_model from model leads to incorrect results ✨ enhancement ❓ question

#2307 opened Nov 1, 2024 by DarshanDeshpande

2 of 4 tasks

✨ enhancement

#2195 opened Oct 7, 2024 by idanshen

Loading…

✨ enhancement

#2190 opened Oct 6, 2024 by gaetanlop • Draft

9 of 10 tasks

✨ enhancement 🙋 help from community wanted 🏋 Reward

#2110 opened Sep 24, 2024 by lewtun

ProTip! Find all open issues with in progress development work with linked:pr.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list