Skip to content
View zouhaoa's full-sized avatar
  • Zhejiang University
  • HangZhou

Block or report zouhaoa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

99 repositories

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,029 2,313 Updated Aug 12, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,390 177 Updated Nov 27, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,534 2,927 Updated Sep 2, 2024

Awesome-LLM: a curated list of Large Language Model

20,502 1,673 Updated Jan 10, 2025

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,536 94 Updated Dec 11, 2024

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…

Python 3,213 232 Updated Aug 20, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,869 265 Updated Jun 4, 2024

Supercharged BLIP-2 that can handle videos

Python 118 6 Updated Dec 1, 2023

VisionLLM Series

Python 974 32 Updated Jan 4, 2025

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,137 256 Updated Nov 26, 2024

Recent LLM-based CV and related works. Welcome to comment/contribute!

849 36 Updated Jun 5, 2024

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,448 248 Updated Apr 24, 2024

Oscar and VinVL

Python 1,040 251 Updated Aug 28, 2023
Jupyter Notebook 221 28 Updated Dec 18, 2023

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Python 3,848 416 Updated Dec 20, 2024

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"

Python 787 109 Updated Jun 30, 2021

Code for ALBEF: a new vision-language pre-training method

Python 1,596 200 Updated Sep 20, 2022

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,143 987 Updated Nov 18, 2024

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Python 1,225 59 Updated Oct 18, 2022

Inference code for Llama models

Python 57,154 9,650 Updated Aug 18, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,731 4,056 Updated Jul 17, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,656 4,641 Updated Jan 10, 2025

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,748 1,855 Updated Jun 27, 2024

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Python 3,680 472 Updated Oct 12, 2023

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

Python 1,578 129 Updated Jan 1, 2025

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,323 827 Updated Jan 8, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,151 826 Updated Jun 10, 2024

总结Prompt&LLM论文,开源数据&模型,AIGC应用

2,786 283 Updated Jan 9, 2025