
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,222 179 Updated Dec 31, 2024

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 903 44 Updated Oct 16, 2024

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 548 29 Updated Oct 6, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,848 633 Updated Dec 31, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,930 2,302 Updated Aug 12, 2024

MLCD & UNICOM: Large-Scale Visual Representation Model

Python 476 21 Updated Dec 31, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,112 980 Updated Nov 18, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,931 654 Updated Aug 5, 2024

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,341 5,714 Updated Sep 18, 2024

This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts…

Python 274 12 Updated Feb 12, 2024

PyTorch Implementation of the ICCV 2023 paper: Generalized Differentiable RANSAC ($\nabla$-RANSAC).

Python 177 10 Updated Dec 21, 2023

SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)

Python 3,400 680 Updated Aug 30, 2024

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.

Python 3,821 346 Updated Aug 7, 2024

Easily compute CLIP embeddings and build a CLIP retrieval system with them.

Jupyter Notebook 2,455 216 Updated Apr 15, 2024

An open source implementation of CLIP.

Python 10,692 1,005 Updated Dec 23, 2024

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

Python 132 4 Updated Mar 8, 2023

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal, transformer-based architecture for the task of Visual Document Understanding (VDU).

Python 264 40 Updated Feb 13, 2023

DocILE: Document Information Localization and Extraction Benchmark

Python 119 9 Updated May 15, 2024

QuadTree Attention for Vision Transformers (ICLR2022)

Jupyter Notebook 342 34 Updated Apr 23, 2024

Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022

Jupyter Notebook 2,380 366 Updated May 31, 2024

Training and evaluating NBM and SPAM for interpretable machine learning.

Python 76 14 Updated Mar 22, 2023

🐍 Geometric Computer Vision Library for Spatial AI

Python 10,104 979 Updated Dec 31, 2024

Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.

Jupyter Notebook 177 9 Updated Apr 17, 2022

Code for Recall@k Surrogate Loss with Large Batches and Similarity Mixup, CVPR 2022.

Python 58 8 Updated Nov 4, 2024

Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Python 261 33 Updated Oct 2, 2024

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Python 96 10 Updated Feb 20, 2023

Implementation of [Understanding and Improving Kernel Local Descriptors](https://arxiv.org/abs/1811.11147) using PyTorch.

Python 17 Updated Jan 6, 2021

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,442 248 Updated Apr 24, 2024

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 8,434 2,636 Updated Aug 13, 2024

Fast and memory-efficient exact attention

Python 14,851 1,403 Updated Dec 31, 2024