#imagecaptioning

Here are 65 public repositories matching this topic...

A versatile app that converts images into short stories and lifelike audio locally. It combines Hugging Face's image captioning, Groq's story generation, and Parler TTS for local text-to-speech synthesis. Ideal for AI-driven projects with fast, reliable on-device TTS.

  • Updated Sep 29, 2024
  • Python

Generative AI Models is a repository of generative AI model implementations in Python. It includes models for image captioning and text-to-image generation, built on architectures such as Vision Transformers (ViT), GPT-2, and Stable Diffusion.

  • Updated Aug 26, 2024
  • Jupyter Notebook
