[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.

luka-group/mDPO

Multimodal Direct Preference Optimization (mDPO)

mDPO: Conditional Preference Optimization for Multimodal Large Language Models (published at EMNLP 2024).

🌐 Homepage | 📖 Paper | 💻 Code | 🤗 Dataset

Updates

  • 🔥 [2024-09-04] Initial release of the mDPO trainer. We are currently working on releasing the code for training and evaluating different models.

Installation

TBD

Training

Our training data is available at this link.

To train Bunny with mDPO, use the following command:

python bunny/run_mdpo_bunny.py
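
For readers new to preference optimization, the following is a minimal sketch of the generic DPO-style pairwise loss that trainers of this kind build on. It is not the code in `bunny/run_mdpo_bunny.py`, and it is not a statement of the exact mDPO objective. The function names, the `beta` default, and the example log-probabilities are illustrative assumptions. The second call shows one possible reading of "conditional preference": the same response scored under the original image versus a corrupted image.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def preference_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # assumed value, not taken from the repository
) -> float:
    """Generic DPO-style pairwise loss on sequence log-probabilities."""
    # Implicit rewards: scaled log-ratio between policy and reference model.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    # The loss shrinks as the chosen reward exceeds the rejected reward.
    return -log_sigmoid(chosen_reward - rejected_reward)


# Response preference: chosen vs. rejected response for the same image.
# All log-probabilities below are made-up illustrative numbers.
response_loss = preference_loss(-12.0, -15.0, -13.0, -14.0)

# Hypothetical image-conditional preference: the same response scored
# with the original image ("chosen") and a corrupted image ("rejected").
image_loss = preference_loss(-12.0, -12.5, -13.0, -12.8)

print(response_loss, image_loss)
```

Treating both comparisons with one function keeps the sketch short; a real trainer would compute these log-probabilities from batched model outputs.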

Evaluation

TBD

Citation

Please cite the following paper if you find this repository helpful:

@article{wang2024mdpo,
  title={mDPO: Conditional Preference Optimization for Multimodal Large Language Models},
  author={Wang, Fei and Zhou, Wenxuan and Huang, James Y and Xu, Nan and Zhang, Sheng and Poon, Hoifung and Chen, Muhao},
  journal={arXiv preprint arXiv:2406.11839},
  year={2024}
}
