Stars
Flax is a neural network library for JAX that is designed for flexibility.
Scenic: A Jax Library for Computer Vision Research and Beyond
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Object detection on multiple datasets with an automatically learned unified label space.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
PyTorch implementation of Pointnet2/Pointnet++
Deep Hough Voting for 3D Object Detection in Point Clouds
A list of papers and datasets about point cloud analysis (processing)
A resource repository for 3D machine learning
Open3D: A Modern Library for 3D Data Processing
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the came…
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…
This is Pytorch re-implementation of our CVPR 2020 paper "Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation" (https://arxiv.org/abs/1911.10194)
End-to-End Object Detection with Transformers
An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"
[CVPR 2020] CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking
Gated-Shape CNN for Semantic Segmentation (ICCV 2019)