Change the repository type filter
All
Repositories list
69 repositories
Meta-Unlearning
Public- 🌾 OAT: Online AlignmenT for LLMs
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
CPO
Public- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View
scaling-with-vocab
Public[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623sailor-llm
Public- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
optim4rl
PublicOptim4RL is a Jax framework of learning to optimize for reinforcement learning.I-FSJ
Publicdice
PublicOfficial implementation of Bootstrapping Language Models via DPO Implicit Rewardslorahub
Publicsailcraft
Public🚢 Data Toolkit for Sailor Language Modelszero-bubble-megatron-deepspeed
Public archivesailcompass
Publicmetaformer
PublicMetaFormer Baselines for Vision (TPAMI 2024)poolformer
PublicPoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)d4ft
Publicfinetune-fair-diffusion
PublicCode of the paper: Finetuning Text-to-Image Diffusion Models for FairnessMDT
PublicCLoT
PublicCVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation".AnyDoor
Public