Too lazy to organize my desktop, make gpt + BLIP-2 do it
Updated Oct 24, 2023 · Python
Caption generator using LAVIS and Argos Translate
Caption images across your datasets with state-of-the-art models from Hugging Face and Replicate!
This repository is for profiling, extracting, visualizing, and reusing generative AI weights, with the goal of building more accurate AI models and auditing/scanning weights at rest to identify knowledge domains for risk.
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
Modifying LAVIS' BLIP2 Q-former with models pretrained on Japanese datasets.
The Multimodal Model for Vietnamese Visual Question Answering (ViVQA)
Implementation of the Q-Former from BLIP-2 in Zeta Lego blocks.
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
A true multimodal LLaMA derivative, on Discord!
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding