
Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection (NeurIPS 2024)

Requirement

Package

Our experiments are conducted with Python 3.8 and PyTorch 1.8.1.

All required packages are based on CoOp (for training) and MCM (for evaluation). This code is built on top of the Dassl.pytorch toolbox, so the dassl environment must be installed first: follow the instructions in the Dassl.pytorch repository to install dassl together with PyTorch. After that, run pip install -r requirements.txt under LoCoOp/ to install the additional packages required by CLIP and MCM (with the dassl environment activated).
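A minimal sketch of the setup, assuming a conda-based environment and starting from the repository root; the authoritative steps are in the Dassl.pytorch repository.

# Sketch only; follow the Dassl.pytorch README for the exact steps.
conda create -n dassl python=3.8 -y
conda activate dassl
# Install PyTorch (e.g., 1.8.1) for your CUDA version, per the official instructions.

# Install Dassl.pytorch.
git clone https://github.com/KaiyangZhou/Dassl.pytorch.git
cd Dassl.pytorch
pip install -r requirements.txt
python setup.py develop
cd ..

# Extra packages for CLIP and MCM (run with the dassl environment activated).
cd LoCoOp
pip install -r requirements.txt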

Datasets

Please create a data/ folder and download the following ID and OOD datasets into it.

In-distribution Datasets

We use ImageNet-1K as the ID dataset.

  • Create a folder named imagenet/ under the data/ folder.
  • Create images/ under imagenet/.
  • Download ImageNet-1K from the official website and extract the training and validation sets to data/imagenet/images/ (a minimal command sketch follows below).
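A minimal sketch of the folder setup, assuming commands are run from the LoCoOp/ directory and the ImageNet-1K archives have already been downloaded from the official website:

# Create the ID dataset folders.
mkdir -p data/imagenet/images
# Extract the training and validation archives so that
# data/imagenet/images/train/ and data/imagenet/images/val/
# each contain one sub-folder per class (n01440764, n01443537, ...).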

Out-of-distribution Datasets

We use the large-scale OOD datasets iNaturalist, SUN, Places, and Texture curated by Huang et al. (2021), and follow the instructions in their repository to download the subsampled versions.

The overall file structure is as follows:

LoCoOp
|-- data
    |-- imagenet
        |-- images/
            |-- train/ # contains 1,000 class folders like n01440764, n01443537, etc.
            |-- val/   # contains 1,000 class folders like n01440764, n01443537, etc.
    |-- iNaturalist
    |-- SUN
    |-- Places
    |-- Texture
    ...
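An optional sanity check on the layout above (folder names assumed as shown; run from the LoCoOp/ directory):

# Both splits should report 1,000 class folders.
ls data/imagenet/images/train | wc -l
ls data/imagenet/images/val | wc -l
# The OOD datasets should sit next to imagenet/ under data/.
ls data | grep -E 'iNaturalist|SUN|Places|Texture'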

Quick Start

The training script is LoCoOp/scripts/sct/train.sh; run the commands below from the LoCoOp/ directory.

e.g., 1-shot training with ViT-B/16

CUDA_VISIBLE_DEVICES=0 bash scripts/sct/train.sh data imagenet vit_b16_ep25 end 16 1 False 0.25 200

e.g., 16-shot training with ViT-B/16

CUDA_VISIBLE_DEVICES=0 bash scripts/sct/train.sh data imagenet vit_b16_ep25 end 16 16 False 0.25 200
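For reference, the positional arguments can be read as follows; this mapping is an assumption based on the CoOp/LoCoOp training scripts and should be checked against scripts/sct/train.sh:

# Argument meanings (assumed; verify against scripts/sct/train.sh):
#   data          root directory containing the ID and OOD datasets
#   imagenet      ID dataset name
#   vit_b16_ep25  config file (ViT-B/16 backbone, 25 training epochs)
#   end           class token position in the learned prompt
#   16            number of learnable context tokens
#   16            shots per class (1 for the 1-shot run, 16 for 16-shot)
#   False         class-specific context disabled
#   0.25          weight of the OOD regularization loss (assumed)
#   200           top-K local regions used as surrogate OOD features (assumed)
CUDA_VISIBLE_DEVICES=0 bash scripts/sct/train.sh data imagenet vit_b16_ep25 end 16 16 False 0.25 200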

Acknowledgement

We appreciate the open-source code of CoOp, LoCoOp, MCM, and Dassl.pytorch, on which this repository is built.
