Install below libraries from requirements.txt file
pip install -r requirements.txt
- used hugging face to consume ready made AI models.
- for image-to-text with model as "(salesforce/blip-image-captioning-base)"
- for text to audio with model as "kan-bayashi_ljspeech_vits".
- used langchain+Chat GPT to geenrate a text
- published image to audio using streamlit
streamlit run app.py
Read more here