Conversation

Contributor

@Luodian Luodian commented Jan 22, 2026

Summary

  • Add MMVP (Multimodal Visual Patterns) benchmark task
  • Apply verified ground-truth corrections for indices 99 and 279, as documented in #1018 (Issues about MMVP Dataset)

Description

MMVP is a benchmark that tests VLMs on "CLIP-blind pairs": pairs of images that CLIP perceives as similar despite clear visual differences. The dataset contains 300 samples (150 pairs) covering 9 basic visual patterns.

Features

  1. Dataset: Loads MMVP/MMVP from the Hugging Face Hub
  2. Metrics:
    • mmvp_accuracy: Individual question accuracy
    • mmvp_pair_accuracy: Both questions in a CLIP-blind pair must be correct (stricter metric)
  3. Ground Truth Corrections: Applies verified corrections (see the sketch after this list) for:
    • Index 99: Elephant tusks are long, not short (corrected from B to A)
    • Index 279: Person is standing, not sitting (corrected from B to A)
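
For context, here is a minimal sketch of how the dataset load and the two label corrections could be wired together. It assumes the Hugging Face `datasets` library; the column names (`Index`, `Correct Answer`) and the split name are illustrative assumptions and may differ from what the task actually uses.

```python
from datasets import load_dataset

# Verified ground-truth fixes from issue #1018: dataset index -> corrected answer.
GT_CORRECTIONS = {99: "A", 279: "A"}

def load_mmvp_with_corrections(split="test"):
    # Column names ("Index", "Correct Answer") and the split name are
    # illustrative assumptions; the task should use whatever the
    # MMVP/MMVP dataset actually exposes.
    ds = load_dataset("MMVP/MMVP", split=split)

    def patch(example):
        idx = int(example["Index"])
        if idx in GT_CORRECTIONS:
            example["Correct Answer"] = GT_CORRECTIONS[idx]
        return example

    return ds.map(patch)
```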

Usage

python -m lmms_eval --model <model> --tasks mmvp --batch_size 1

References

Add MMVP (Multimodal Visual Patterns) benchmark task that tests VLMs
on CLIP-blind pairs: images perceived as similar by CLIP but with
clear visual differences.

Key features:
- Loads dataset from MMVP/MMVP on HuggingFace
- Reports both individual accuracy and pair accuracy metrics
- Applies verified ground truth corrections for indices 99 and 279
  as documented in issue #1018

The pair accuracy metric requires models to correctly answer BOTH
questions in each CLIP-blind pair, providing a stricter evaluation
of genuine visual understanding.

Github-Issue: #1018
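
As a concrete illustration of the pair metric, here is a minimal sketch of how both scores could be derived from per-question correctness. It assumes results are ordered by dataset index so that questions 2i and 2i+1 form one CLIP-blind pair; the helper is hypothetical, not the task's actual implementation.

```python
def mmvp_scores(correct_flags):
    # correct_flags: list of 0/1 per-question results ordered by dataset index,
    # so consecutive entries (2i, 2i+1) belong to the same CLIP-blind pair
    # (pairing convention assumed for illustration).
    n = len(correct_flags)
    assert n % 2 == 0, "MMVP has 300 questions forming 150 pairs"

    individual = sum(correct_flags) / n
    # A pair counts only if BOTH of its questions are answered correctly.
    pair_hits = [correct_flags[i] and correct_flags[i + 1] for i in range(0, n, 2)]
    return {
        "mmvp_accuracy": individual,
        "mmvp_pair_accuracy": sum(pair_hits) / len(pair_hits),
    }
```

For example, mmvp_scores([1, 1, 1, 0]) yields 0.75 individual accuracy but only 0.5 pair accuracy, which is why the pair metric is the stricter of the two.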