- Czech Technical University, Carnegie Mellon University, IIIT Hyderabad
- Prague, Czech Republic
- https://yash0307.github.io/
Stars
Accelerating the development of large multimodal models (LMMs) with the one-click evaluation module lmms-eval.
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
Hackable and optimized Transformers building blocks, supporting a composable construction.
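As a minimal sketch of what those building blocks look like in use, here is xformers' memory-efficient attention op; the tensor shapes, dtype, and device below are illustrative assumptions, not requirements of the library beyond its documented (batch, sequence, heads, head_dim) layout:

```python
import torch
from xformers.ops import memory_efficient_attention

# Illustrative shapes: (batch, sequence, heads, head_dim).
q = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)

# Computes softmax(q @ k^T / sqrt(d)) @ v without materializing the full
# attention matrix, keeping memory use linear in sequence length.
out = memory_efficient_attention(q, k, v)  # (2, 1024, 8, 64)
```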
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MLCD & UNICOM: Large-Scale Visual Representation Model
LAVIS - A One-stop Library for Language-Vision Intelligence
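A quick sketch of the "one-stop" interface, captioning an image with BLIP; the model and processor names follow the LAVIS model zoo, and the image path is a placeholder:

```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = "cuda" if torch.cuda.is_available() else "cpu"

# Loads a BLIP captioning model together with its matching image preprocessor.
model, vis_processors, _ = load_model_and_preprocess(
    name="blip_caption", model_type="base_coco", is_eval=True, device=device
)

raw_image = Image.open("photo.jpg").convert("RGB")
image = vis_processors["eval"](raw_image).unsqueeze(0).to(device)

# Generates a natural-language caption for the image.
print(model.generate({"image": image}))
```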
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
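For reference, prompt-based inference with a released checkpoint looks roughly like this (the checkpoint and image paths are assumptions; the repo's notebooks cover the full workflow):

```python
import numpy as np
from PIL import Image
from segment_anything import sam_model_registry, SamPredictor

# "vit_h" is the largest released backbone; checkpoint path is assumed local.
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

# SAM expects an HxWx3 uint8 RGB array.
image = np.array(Image.open("photo.jpg").convert("RGB"))
predictor.set_image(image)

# Prompt with a single foreground point (label 1 = foreground).
masks, scores, logits = predictor.predict(
    point_coords=np.array([[500, 375]]),
    point_labels=np.array([1]),
    multimask_output=True,  # return several candidate masks with scores
)
```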
This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts…
PyTorch Implementation of the ICCV 2023 paper: Generalized Differentiable RANSAC ($\nabla$-RANSAC).
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
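The downloader is also exposed as a Python function; a minimal sketch, where the file names, shard format, and worker counts are placeholder choices:

```python
from img2dataset import download

# Reads one URL per line from urls.txt, downloads and resizes the images,
# and packages them into webdataset-style tar shards under ./images.
download(
    url_list="urls.txt",
    output_folder="images",
    output_format="webdataset",
    image_size=256,
    processes_count=8,
    thread_count=32,
)
```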
Easily compute CLIP embeddings and build a CLIP retrieval system with them
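A hedged sketch of the query side using the package's ClipClient against a hosted LAION index; the service URL and index name are assumptions that may change over time:

```python
from clip_retrieval.clip_client import ClipClient

# Queries a hosted knn index of LAION images by text.
client = ClipClient(
    url="https://knn.laion.ai/knn-service",  # assumed public endpoint
    indice_name="laion5B-L-14",              # assumed index name
    num_images=10,
)

results = client.query(text="an orange tabby cat")
for r in results:
    print(r["url"], r["similarity"])
```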
An open source implementation of CLIP.
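Typical zero-shot classification with OpenCLIP, close to its README example; the pretrained tag names a LAION-2B checkpoint and the image path is a placeholder:

```python
import torch
import open_clip
from PIL import Image

model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k"
)
tokenizer = open_clip.get_tokenizer("ViT-B-32")

image = preprocess(Image.open("photo.jpg")).unsqueeze(0)
text = tokenizer(["a diagram", "a dog", "a cat"])

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)
    # Normalize, then score the image against each caption.
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

print(probs)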
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer-based architecture for the task of Visual Document Understanding (VDU)
DocILE: Document Information Localization and Extraction Benchmark
QuadTree Attention for Vision Transformers (ICLR 2022)
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Training and evaluating NBM and SPAM for interpretable machine learning.
🐍 Geometric Computer Vision Library for Spatial AI
Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.
Code for Recall@k Surrogate Loss with Large Batches and Similarity Mixup, CVPR 2022.
Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning
Implementation of [Understanding and Improving Kernel Local Descriptors](https://arxiv.org/abs/1811.11147) using PyTorch.
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Fast and memory-efficient exact attention
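The core FlashAttention op is exposed as a single function; a minimal sketch, assuming a CUDA GPU and half-precision inputs (shapes here are illustrative):

```python
import torch
from flash_attn import flash_attn_func

# FlashAttention expects (batch, seqlen, num_heads, head_dim) tensors
# in fp16/bf16 on the GPU.
q = torch.randn(2, 2048, 16, 64, device="cuda", dtype=torch.bfloat16)
k = torch.randn(2, 2048, 16, 64, device="cuda", dtype=torch.bfloat16)
v = torch.randn(2, 2048, 16, 64, device="cuda", dtype=torch.bfloat16)

# Exact attention computed tile-by-tile in on-chip SRAM: same output as
# naive softmax(QK^T)V, without storing the full attention matrix.
out = flash_attn_func(q, k, v, causal=True)  # (2, 2048, 16, 64)
```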