Skip to content

varyxi/compositional_knowledge_vlms

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repository is inspired by the article https://openreview.net/pdf?id=KRLUvxh8uaX.
A clear example of how VLM works when changing compositional information is presented. Augmentations for NegCLIP training are realised.

Some useful links:
CLIP model: https://huggingface.co/openai/clip-vit-base-patch32
CTC loss realization: https://github.com/mlfoundations/open_clip/blob/main/src/open_clip/loss.py
Flickr Dataset: https://www.kaggle.com/datasets/hsankesara/flickr-image-dataset
NegCLIP training: https://github.com/vinid/neg_clip

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published