Skip to content

generate a meaningful audio from uploaded photo using HuggingFace + Langchain+ Open AI

Notifications You must be signed in to change notification settings

uday160386/image-audio-hf-openai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Gen AI: generate a meaningful audio from uploaded photo using HuggingFace + Langchain+ Open AI

Pre-requisites:

Install below libraries from requirements.txt file

pip install -r requirements.txt 

Design info:

  • used hugging face to consume ready made AI models.
  • for image-to-text with model as "(salesforce/blip-image-captioning-base)"
  • for text to audio with model as "kan-bayashi_ljspeech_vits".
  • used langchain+Chat GPT to geenrate a text
  • published image to audio using streamlit

Build and run?

streamlit run app.py

Image to Audio:

screenshot

Read more here

Releases

No releases published

Packages

No packages published

Languages