Starred repositories
Analysis of tomato leaf disease identification techniques
Chinese medical multimodal large language model: Large Chinese Language-and-Vision Assistant for BioMedicine
Official Implementation of NeurIPS 2024 paper "G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering"
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
[NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retrieval (Lerner et al., ECIR'24)
This repository contains the official implementation of the CVPR 2024 paper "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training"
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
A KBQA solution framework based on the agent-environment paradigm in the era of LLMs.
ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)
Codebase for the AAAI 2024 paper "Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning"
[CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
Localized Symbolic Knowledge Distillation for Visual Commonsense Models (NeurIPS 2023)
Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"
Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge [ECCV'24]
[CVPR 23] Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
[arXiv 2024] Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models