uday160386 / image-audio-hf-openai Public


Generate meaningful audio from an uploaded photo using Hugging Face + LangChain + OpenAI

Files:

images
README.md
app.py
man_standing_with_camera.jpeg
requirements.txt


Gen AI: Generate meaningful audio from an uploaded photo using Hugging Face + LangChain + OpenAI

Pre-requisites:

Install the libraries listed in the requirements.txt file:

pip install -r requirements.txt

Design info:

Uses Hugging Face to consume ready-made AI models:
for image-to-text, the model "Salesforce/blip-image-captioning-base"
for text-to-audio, the model "kan-bayashi_ljspeech_vits"
Uses LangChain + ChatGPT to generate the text that is converted to audio.
Publishes the image-to-audio app using Streamlit.
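
The three stages above can be sketched as a small pipeline. This is a minimal sketch, not the code in app.py: the function names (caption_image, image_to_audio) and the stand-in stages are hypothetical, and only the captioning stage is wired to a real model, assuming the transformers package is installed. The text-generation and text-to-audio stages are passed in as callables so the flow can be followed without API keys.

```python
from typing import Callable


def caption_image(image_path: str) -> str:
    """Stage 1 (image-to-text). Assumes `transformers` is installed.

    The import is done lazily so the rest of this sketch runs without it.
    """
    from transformers import pipeline

    captioner = pipeline(
        "image-to-text", model="Salesforce/blip-image-captioning-base"
    )
    # The pipeline returns a list of dicts with a "generated_text" key.
    return captioner(image_path)[0]["generated_text"]


def image_to_audio(
    image_path: str,
    caption_fn: Callable[[str], str],
    story_fn: Callable[[str], str],
    tts_fn: Callable[[str], bytes],
) -> bytes:
    """Chain the stages: image -> caption -> generated text -> audio bytes."""
    caption = caption_fn(image_path)
    story = story_fn(caption)
    return tts_fn(story)


# Hypothetical stand-ins for the three stages, used only to show the data flow.
def fake_caption(image_path: str) -> str:
    return "a man standing with a camera"


def fake_story(caption: str) -> str:
    return "Once upon a time, " + caption + "."


def fake_tts(text: str) -> bytes:
    # A real implementation would return encoded audio, not UTF-8 text.
    return text.encode("utf-8")


audio = image_to_audio(
    "man_standing_with_camera.jpeg", fake_caption, fake_story, fake_tts
)
```

To run against the real captioning model, pass caption_image as caption_fn. The text-generation and text-to-audio callables would wrap LangChain + ChatGPT and the text-to-audio model respectively.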

Build and run:

streamlit run app.py
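
For orientation, a Streamlit front end for this kind of app could look roughly like the sketch below. It is an assumption about the shape of the UI, not a copy of app.py: render_app and to_audio are hypothetical names, the "audio/wav" format is assumed, and the Streamlit module is passed in as a parameter so the function can be exercised without a running Streamlit server.

```python
def render_app(st, to_audio):
    """Draw an upload form and play the generated audio.

    `st` is expected to be the streamlit module; `to_audio` is a hypothetical
    callable that takes image bytes and returns audio bytes.
    """
    st.title("Image to Audio")
    uploaded = st.file_uploader("Choose an image", type=["jpg", "jpeg", "png"])
    if uploaded is None:
        # Nothing uploaded yet, so there is nothing to convert.
        return None

    image_bytes = uploaded.getvalue()
    st.image(image_bytes, caption="Uploaded image")

    audio_bytes = to_audio(image_bytes)
    # Assumed format; adjust to whatever the text-to-audio stage produces.
    st.audio(audio_bytes, format="audio/wav")
    return audio_bytes


# In a script launched with `streamlit run`, the wiring would be roughly:
#   import streamlit as st
#   render_app(st, my_image_to_audio)   # my_image_to_audio is hypothetical
```

Streamlit reruns the script on every interaction, so render_app is called again each time a file is uploaded.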

Image to Audio:

Read more here
