Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek3, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 5,146 445 Updated Jan 24, 2025

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,829 124 Updated Oct 30, 2024

HVision-NKU / TAR3D

Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'

53 Updated Dec 26, 2024

zhengli97 / ATPrompt

Official PyTorch Code for "ATPrompt: Textual Prompt Learning with Embedded Attributes"

Python 18 Updated Dec 23, 2024

penghao-wu / vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 559 38 Updated Jan 7, 2024

swordlidev / Evaluation-Multimodal-LLMs-Survey

A Survey on Benchmarks of Multimodal Large Language Models

83 6 Updated Jan 2, 2025

HVision-NKU / DenseVLM

15 Updated Dec 11, 2024

HVision-NKU / MaskCLIPpp

Official repository of the paper "MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation"

Python 17 1 Updated Jan 17, 2025

HVision-NKU / Cascade-CLIP

Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Python 46 3 Updated Aug 15, 2024

HVision-NKU / SRFormer

Official code for "SRFormer: Permuted Self-Attention for Single Image Super-Resolution" (ICCV 2023) and SRFormerV2

Python 251 22 Updated Aug 18, 2024

HVision-NKU / Conv2Former

Python 171 13 Updated Jan 2, 2025

HVision-NKU / CamoFormer

Python 90 15 Updated Dec 21, 2024

CircleRadon / Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Python 783 42 Updated Aug 5, 2024

Ucas-HaoranWei / Vary-toy

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 613 46 Updated Dec 30, 2024

zhengyuan-xie / ECCV24_NeST

[ECCV 2024] Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Python 28 2 Updated Nov 1, 2024

baaivision / EVA

EVA Series: Visual Representation Fantasies from BAAI

Python 2,397 173 Updated Aug 1, 2024

votchallenge / toolkit

The official VOT Challenge evaluation and analysis toolkit

Python 173 48 Updated Dec 18, 2024

oxuva / long-term-tracking-benchmark

[ECCV'18] Long-term Tracking in the Wild: A Benchmark

Python 180 37 Updated Dec 26, 2019

lxtGH / OMG-Seg

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,214 44 Updated Dec 11, 2024

dair-ai / ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

14,114 1,416 Updated Feb 13, 2023

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 38,473 5,047 Updated Jan 23, 2025

visionml / pytracking

Visual tracking library based on PyTorch.

Python 3,308 608 Updated Aug 8, 2024

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,187 991 Updated Nov 18, 2024

HVision-NKU / StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,139 616 Updated Sep 26, 2024

shenyunhang / APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 503 31 Updated May 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ashun989

Achievements