Skip to content

NJUVISION/MG-VLQA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MG-VLQA: Multi-Granularity Quality Assessment for Image Compression via Visual Language Models

An image quality assessment framework leveraging visual-language models for multi-granularity semantic fidelity.

MG-VLQA Pipeline

Hanfei Li1  ·  Anle Ke1  ·  Jiawen Gu2  ·  Chao Zhou2  ·  Tong Chen1  ·  Zhan Ma1  · 

1 Nanjing University     2 Kuaishou Technology

🚀Quick Start

📁Prerequisites

Before installing the dependencies, please prepare the model weights.

Create the following directory structure in the project root:

weights/
├── Gemma/
└── SAM/

Download the required model weights from Hugging Face and place them in the corresponding folders:

🧪Environment Requirements

Make sure your environment meets the following requirements:

  • Python >= 3.9
  • PyTorch == 2.6.0
  • transformers == 4.53.0
  • timm == 1.0.6
  • six
  • accelerate

Then, navigate to the GroundingDINO directory and install it in editable mode:

cd GroundingDINO
pip install -e .

About

MG-VLQA: Multi-Granularity Quality Assessment for Image Compression via Visual Language Models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages