
Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)

Python 265 12 Updated Feb 23, 2024

OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)

Python 20 1 Updated Oct 10, 2024

Images to inference with no labeling (use foundation models to train supervised models).

Python 1,934 155 Updated Nov 1, 2024

Collection of diffusion model papers categorized by their subareas

1,238 62 Updated Nov 1, 2024

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 419 65 Updated Jun 25, 2024

LangGPT: Empowering everyone to become a prompt expert! 🚀 Structured Prompt, Language of GPT, structured prompts

Jupyter Notebook 6,463 520 Updated Oct 20, 2024

A collection of awesome image inpainting studies.

TeX 162 12 Updated Oct 7, 2024

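Llama Chinese community. The Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are aggregated in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use.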

Python 13,926 1,251 Updated Sep 5, 2024
Python 62 10 Updated Oct 23, 2021

[IEEE TPAMI-2024] Pair then Relation: Pair-Net for Panoptic Scene Graph Generation

Python 93 1 Updated Aug 9, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,506 154 Updated Oct 10, 2024

The official implementation of RAR

Python 70 Updated Mar 27, 2024

[WACV 2025] Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

Python 181 19 Updated Sep 12, 2024
Python 5 Updated Sep 10, 2024

The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

399 47 Updated Jul 11, 2024

Collection of AWESOME vision-language models for vision tasks

2,456 215 Updated Oct 19, 2024
Python 103 3 Updated Feb 19, 2024

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 292 22 Updated Jul 17, 2024

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications

602 33 Updated Oct 29, 2024

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

375 27 Updated Oct 18, 2024

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…

199 15 Updated Aug 17, 2024

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 523 33 Updated Jan 7, 2024

An official codebase of Scene-Aware Label Graph Learning for Multi-Label Image Classification, ICCV 2023.

Python 12 Updated Apr 8, 2024

VMamba: Visual State Space Models; code is based on Mamba

Python 2,161 134 Updated Oct 28, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,953 196 Updated Sep 19, 2024

Awesome Papers related to Mamba.

1,179 62 Updated Oct 17, 2024

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Python 421 69 Updated Apr 10, 2023
Python 168 14 Updated May 10, 2023

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

4,623 489 Updated Jul 30, 2024

Awesome List of Attention Modules and Plug&Play Modules in Computer Vision

Python 1,084 164 Updated May 11, 2023