ClipCrop: Conditioned Cropping Driven by Vision-Language Model

It was fortunate to find the missing annotated data for the GAIC dataset on my old laptop (see directory "GAIC-Text-Annotations").
Hopefully these annotations will be useful to the field of smart cropping.
I personally don't follow this field, so there will be no updates.

Download the GAIC dataset (journal version) from here, which includes 2636, 200, and 500 images for training, validation and testing.

If you find this repository useful, please consider citing:

@inproceedings{zhong2023clipcrop,
  title={ClipCrop: conditioned cropping driven by vision-language model},
  author={Zhong, Zhihang and Cheng, Mingxi and Wu, Zhirong and Yuan, Yuhui and Zheng, Yinqiang and Li, Ji and Hu, Han and Lin, Stephen and Sato, Yoichi and Sato, Imari},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={294--304},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
GAIC-Text-Annotations		GAIC-Text-Annotations
README.md		README.md
info.py		info.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ClipCrop: Conditioned Cropping Driven by Vision-Language Model

About

Uh oh!

Releases

Packages

Languages

zzh-tech/ClipCrop

Folders and files

Latest commit

History

Repository files navigation

ClipCrop: Conditioned Cropping Driven by Vision-Language Model

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages