MDP$^3$

Description

MDP$^3$ (Markov Decision Determinantal Point Process with Dynamic Programming) is the implementation of the paper MDP$^3$: A Training-free Approach for List-wise Frame Selection in Video-LLMs. It provides a training-free method for selecting frames in video large language models (Video-LLMs).

Hardware Used for Reproduction

Operating System: Ubuntu 20.04.6 LTS (x86_64)
CPU: AMD EPYC 7H12 (255) @ 2.600GHz
GPU: NVIDIA A100-PCIE-40GB and NVIDIA A100-PCIE-80GB

Installation

To set up the environment and install the required dependencies, follow these steps:

Create a Conda environment:

conda create -n MDP3 python==3.10.14
conda activate MDP3

Install the MDP$^3$ package and additional dependencies:

pip install -e .
pip install torchvision
pip install pysubs2
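
After installation, it can help to confirm that the dependencies import cleanly before starting a long evaluation. The following is a minimal sketch, not part of the repository: `check_modules` is a hypothetical helper, and `vlmeval` is an assumed module name for the package installed by `pip install -e .` (the code is based on VLMEvalKit).

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): report whether each
# named Python module can be imported in the active environment.
check_modules() {
    local module
    for module in "$@"; do
        if python -c "import ${module}" >/dev/null 2>&1; then
            echo "ok: ${module}"
        else
            echo "MISSING: ${module}"
        fi
    done
}

# torchvision and pysubs2 come from the steps above; vlmeval is an assumed
# module name for the package installed by `pip install -e .`.
check_modules torchvision pysubs2 vlmeval
```

Any line starting with `MISSING:` points to a package that should be reinstalled inside the activated MDP3 environment.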

Evaluation

To evaluate the MiniCPM-V2.6 model on the Video-MME dataset, use the following commands:

Single GPU

Run the evaluation without subtitles (first command) or with subtitles (second command):

CUDA_VISIBLE_DEVICES=0 python run.py --data Video-MME --model MiniCPM-V-2_6 --nframe 128
CUDA_VISIBLE_DEVICES=0 python run.py --data Video-MME --model MiniCPM-V-2_6 --nframe 128 --use-subtitle

Multi-GPU

Run the evaluation on multiple GPUs with torchrun (PyTorch distributed). The commands below use 8 GPUs, first without and then with subtitles:

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --standalone --nproc-per-node 8 run.py --data Video-MME --model MiniCPM-V-2_6 --nframe 128
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --standalone --nproc-per-node 8 run.py --data Video-MME --model MiniCPM-V-2_6 --nframe 128 --use-subtitle
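
On machines with fewer than 8 GPUs, the device list and `--nproc-per-node` value must be changed together. The sketch below shows one way to derive both from a single GPU count. `gpu_ids` and `launch_cmd` are hypothetical helpers, not part of the repository, and the script only prints the commands rather than running them. It assumes GNU `seq`, as shipped with Ubuntu.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of the repository).

# Print "0,1,...,N-1" for N GPUs (relies on GNU seq).
gpu_ids() {
    seq -s, 0 $(( $1 - 1 ))
}

# Print the torchrun command for N GPUs; any further arguments
# (for example --use-subtitle) are appended unchanged.
launch_cmd() {
    local n="$1"
    shift
    local extra="$*"
    echo "CUDA_VISIBLE_DEVICES=$(gpu_ids "$n") torchrun --standalone --nproc-per-node ${n} run.py --data Video-MME --model MiniCPM-V-2_6 --nframe 128${extra:+ ${extra}}"
}

# Dry run: print the commands for a 4-GPU machine.
launch_cmd 4
launch_cmd 4 --use-subtitle
```

To execute a printed command instead of only displaying it, wrap the call in `eval`, for example `eval "$(launch_cmd 4)"`.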

Citation

If you find MDP$^3$ useful, please cite the paper:

@article{sun2025mdp3,
  title={MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs},
  author={Sun, Hui and Lu, Shiyin and Wang, Huanyu and Chen, Qing-Guo and Xu, Zhao and Luo, Weihua and Zhang, Kaifu and Li, Ming},
  journal={arXiv preprint arXiv:2501.02885},
  year={2025}
}

Acknowledgement

This code is built on VLMEvalKit. We sincerely thank its authors for their contributions.

License

MIT License.
