We introduce Efficient Multi-Modal Long Context Learning (EMLoC), a novel training-free method that embeds many demonstration examples directly into the model input. EMLoC offers a more efficient, flexible, and scalable solution for task adaptation. By adaptively pruning tokens at each layer under a Jensen-Shannon divergence constraint, our method achieves a dramatic reduction in inference complexity without sacrificing performance.
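For intuition, here is a minimal sketch of the core idea: at a given layer, drop the least-important context tokens only while the pruned distribution stays within a Jensen-Shannon divergence budget of the original. All names below (`js_divergence`, `prune_layer_tokens`, the attention-mass importance heuristic, and the 0.05 budget) are hypothetical illustrations rather than the released implementation; see the code in this repository for the actual method.

```python
import torch

def js_divergence(p: torch.Tensor, q: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Jensen-Shannon divergence between two discrete distributions."""
    m = 0.5 * (p + q)
    kl_pm = (p * ((p + eps) / (m + eps)).log()).sum(-1)
    kl_qm = (q * ((q + eps) / (m + eps)).log()).sum(-1)
    return 0.5 * (kl_pm + kl_qm)

def prune_layer_tokens(attn: torch.Tensor, max_js: float = 0.05) -> torch.Tensor:
    """Drop the least-attended context tokens while the renormalized attention
    stays within `max_js` JS divergence of the original distribution.

    attn: [num_tokens] attention mass received by each context token.
    Returns the indices of the tokens to keep.
    """
    ref = attn / attn.sum()
    keep = torch.ones_like(attn, dtype=torch.bool)
    for idx in torch.argsort(attn):             # least important first
        keep[idx] = False                       # tentatively drop this token
        pruned = ref * keep
        pruned = pruned / pruned.sum().clamp_min(1e-8)
        if js_divergence(pruned, ref) > max_js:
            keep[idx] = True                    # dropping it distorts the layer too much
            break
    return keep.nonzero(as_tuple=True)[0]

# Toy usage: prune an 8-token context based on its attention mass.
attn = torch.tensor([0.30, 0.02, 0.25, 0.01, 0.20, 0.02, 0.15, 0.05])
print(prune_layer_tokens(attn))
```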
- If you like our project, please give us a star ⭐ on GitHub to follow the latest updates.
- [2025/5/4] 🎉 Released the paper and code of EMLoC.
Multi-modal Large Language Models
- Qwen2-VL (the monkey-patch version is still in preparation)
- Other models will be supported soon!
This work is built on Qwen2-VL, lmms-eval, and transformers. Thanks for their contributions! We modify modeling_qwen2_vl.py and cache_utils.py in transformers. In addition, we modify lmms-eval to support multi-modal in-context learning and Ascend 910B. More details can be found in the code.
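For reference, the unmodified base model can be loaded through the standard transformers API; our changes then take effect via the patched modeling_qwen2_vl.py and cache_utils.py. The checkpoint name below is the public Hugging Face release and is used only as an example.

```python
# Load the stock Qwen2-VL model and processor that the patched
# modeling_qwen2_vl.py / cache_utils.py build on. The checkpoint name is the
# public Hugging Face release and is used here only as an example.
from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Qwen/Qwen2-VL-7B-Instruct",
    torch_dtype="auto",
    device_map="auto",
)
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-7B-Instruct")
```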
git clone
cd EMLoC
conda create -y -n emloc python=3.10
conda activate emloc
pip install -r requirements.txt
## install torch-npu to support Ascend 910B
# pip install torch-npu==2.4.0
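An optional sanity check (only a suggestion) confirms that the installed backends are visible; the NPU branch applies only if torch-npu was installed for Ascend 910B:

```python
# Optional sanity check of the installed backends. The NPU check only applies
# if torch-npu was installed for Ascend 910B.
import torch

print("torch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())

try:
    import torch_npu  # noqa: F401  # only present on Ascend setups
    print("NPU available:", torch.npu.is_available())
except ImportError:
    print("torch-npu not installed (expected on non-Ascend machines)")
```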
- ImageNet1k:
  `train` and `val` should be in the root directory of imagenet1k (a quick path check is sketched after this list).
ln -s /path/to/imagenet1k/ ./data/imagenet1k
- Other datasets: lmms-eval will automatically download and split them into a few-shot set and a validation set.
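The snippet below is a small, hypothetical helper (not part of the repository) to verify that the ImageNet-1k symlink points at a directory containing the expected splits:

```python
# Hypothetical helper: check that ./data/imagenet1k contains the expected
# train/ and val/ splits after creating the symlink.
from pathlib import Path

root = Path("./data/imagenet1k")
for split in ("train", "val"):
    path = root / split
    print(f"{path}: {'ok' if path.is_dir() else 'MISSING'}")
```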
# ImageNet
sh ./scripts/EMLoC_imagenet.sh
# illusionVQA
sh ./scripts/EMLoC_illusionVQA.sh
# mmerealworld
sh ./scripts/EMLoc_mmerealworld_lite.sh
# For more datasets, see ./scripts/