hyqyoung

💭

I may be slow to respond.

linyu hyqyoung

💭

I may be slow to respond.

7 followers · 25 following

Hangzhou

Achievements

Stars

YifanXu74 / MQ-Det

Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)

Python 265 12 Updated Feb 23, 2024

Zehong-Ma / OVMR

OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)

Python 20 1 Updated Oct 10, 2024

autodistill / autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Python 1,934 155 Updated Nov 1, 2024

wangkai930418 / awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

1,238 62 Updated Nov 1, 2024

longzw1997 / Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 419 65 Updated Jun 25, 2024

langgptai / LangGPT

LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt，Language of GPT, 结构化提示词，结构化Prompt

Jupyter Notebook 6,463 520 Updated Oct 20, 2024

AlonzoLeeeooo / awesome-image-inpainting-studies

A collection of awesome image inpainting studies.

TeX 162 12 Updated Oct 7, 2024

LlamaFamily / Llama-Chinese

Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用

Python 13,926 1,251 Updated Sep 5, 2024

BCV-Uniandes / PNG

Python 62 10 Updated Oct 23, 2021

king159 / Pair-Net

[IEEE TPAMI-2024] Pair then Relation: Pair-Net for Panoptic Scene Graph Generation

Python 93 1 Updated Aug 9, 2024

InternLM / InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,506 154 Updated Oct 10, 2024

Liuziyu77 / RAR

The official implementation of RAR

Python 70 Updated Mar 27, 2024

zifuwan / Sigma

[WACV 2025] Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

Python 181 19 Updated Sep 12, 2024

hutuo1213 / CLIPViC

Python 5 Updated Sep 10, 2024

Paranioar / Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

399 47 Updated Jul 11, 2024

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

2,456 215 Updated Oct 19, 2024

wengzejia1 / Open-VCLIP

Python 103 3 Updated Feb 19, 2024

WisconsinAIVision / ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 292 22 Updated Jul 17, 2024

Event-AHU / Mamba_State_Space_Model_Paper_List

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications

602 33 Updated Oct 29, 2024

JindongGu / Awesome-Prompting-on-Vision-Language-Model

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

375 27 Updated Oct 18, 2024

Charles-Xie / awesome-described-object-detection

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…

199 15 Updated Aug 17, 2024