Skip to content

Fine-Tuning a Generative VLM for Image Describing

Notifications You must be signed in to change notification settings

Holy-Morphism/VLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fine-Tuning VLM and Demo App

This repository contains a notebook for fine-tuning the BLIP-2 vision-language model with LoRA on the Flickr8k dataset for image captioning and a demo for interacting with the finetuned model.

Here is a blog explaining how I fine-tuned this VLM, you can also read about this on my website Fine-Tuning BLIP-2 with LoRA on the Flickr8k Dataset for Image Captioning

About

Fine-Tuning a Generative VLM for Image Describing

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published