Stars
[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376
[VISAPP2024] Towards the Detection of Diffusion Model Deepfakes
deepfake dataset collected on the web for deepfake detection
A list of tools, papers and code related to Deepfake Detection.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
An open source implementation of CLIP.
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
EVA Series: Visual Representation Fantasies from BAAI
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
[CBMI2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".
Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.
This repository is the official implementation of our paper Robust Diffusion Model-Generated Image Detection with CLIP, accepted by MIPR 2024
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
[NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification
The official code of "Rethinking Local Perception in Lightweight Vision Transformer"
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.