image2text

Here are 23 public repositories matching this topic...

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

python machine-learning ocr latex deep-learning image-processing pytorch dataset transformer vit image2text im2text im2latex im2markup math-ocr vision-transformer latex-ocr

Updated Jan 18, 2025
Python

zai-org / GLM-V

Star

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

video-understanding reasoning vlm image2text

Updated Oct 15, 2025
Python

OleehyO / TexTeller

Star

TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.

image2text latex-ocr

Updated Aug 22, 2025
Python

prabhakar267 / image2text

Star

📋 Python wrapper to grab text from images and save as text files using Tesseract Engine

ocr tesseract python-wrapper tesseract-ocr optical-character-recognition image2text tesseract-engine tesseract-installation

Updated Aug 14, 2025
Python

Hangover3832 / ComfyUI-Hangover-Nodes

Star

Various nodes for ComfyUI

image2text stable-diffusion comfyui kosmos-2

Updated May 18, 2025
Python

etosworld / etos-deepcut

Star

Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr . a tool to do automatically object segmentation from extreme points.

deep-learning annotation pytorch segmentation semantic-segmentation deeplab grabcut pspnet object-segmentation image2text image-segmantation

Updated Dec 27, 2020
Python

TheLime1 / CheatoMate

Star

A collection of scripts to "help" you with your programming exams and assignments.

chat ai assignment cheating exam cheat codebase network-card image2text pdf2text

Updated Jan 21, 2024
Python

JulioPeixoto / softrag

Star

Minimal local-first multimodal RAG library powered by SQLite + sqlite-vec.

nlp agent open-source sql openai sqlite3 image2text rag vector-database text2text llm generative-ai chatgpt retrieval-augmented-generation

Updated Jul 26, 2025
Python

thefcraft / civitai-stable-diffusion-337k

Star

Civitai Stable Diffusion 337k Dataset; dataset of ai generated image

dataset image-classification image-generation image2text stable-diffusion civitai

Updated Jan 7, 2025
Python

sssingh / pic-to-story

Star

A Large Language Model (LLM) Based App to Generate Stories from Pictures

openapi generative-model gradio image2text huggingface gpt-3-text-generation large-language-models llm huggingface-spaces langchain

Updated Oct 10, 2023
Python

Jerey / image-to-pdf-and-txt

Sponsor

Star

Python tool, which takes 1..n images, tries to rotate them based on the text, extract the text and store 1..n images to a pdf.

ocr tesseract python3 hacktoberfest pyocr opencv-python image2text

Updated Feb 13, 2023
Python

TAO71-AI / I4.0

Star

TAO71 I4.0 is an AI created by TAO71 in Python.

python linux api client ai server chatbot transformers python3 artificial-intelligence chatbots text2image image2text python311 diffusers gpt4all llama-cpp-python

Updated Aug 21, 2025
Python

eddieir / Image_to_Text

Star

ocr tesseract tesseract-ocr image2text

Updated Nov 5, 2019
Python

VityaVitalich / IMAD

Star

[AINL 2023] IMAD: IMage Augmented multi-modal Dialogue

deep-learning dataset dialogue-systems image2text multimodal multimodal-deep-learning

Updated May 28, 2023
Python

yhwang / im2txt-inference

Star

Run im2txt trained model in inference mode

python flask tensorflow show-and-tell image2text inference-mode

Updated Dec 22, 2017
Python

ergonomech / BLIP-2-Image-Describer

Star

A web-based application that leverages the BLIP-2 model to generate detailed descriptions of uploaded images.

gradio image2text blip2

Updated Nov 7, 2024
Python

iohanngrig / gptassistant

Star

AI based apps

ai text2speech text2image image2text aiassistant

Updated Jan 29, 2024
Python

BinhQuocLy / Pdf2Quiz

Star

A Pdf2Quiz NLP model.

nlp image2text pdf2text pdf2question pdf2quiz

Updated Jun 2, 2024
Python

Emsley1d / Project03-NutriCO2

Star

A CRUD application; my third project for GA Software Engineering Immersive.

python html api image2text

Updated Mar 13, 2023
Python

davidserra9 / cross-modal-retrieval-with-triplet-network

Star

Text-to-Image and Image-to-Text model retrieval

computer-vision deep-learning text2image image2text

Updated Jul 8, 2022
Python

Improve this page

Add a description, image, and links to the image2text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the image2text topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

image2text

Here are 23 public repositories matching this topic...

lukas-blecher / LaTeX-OCR

zai-org / GLM-V

OleehyO / TexTeller

prabhakar267 / image2text

Hangover3832 / ComfyUI-Hangover-Nodes

etosworld / etos-deepcut

TheLime1 / CheatoMate

JulioPeixoto / softrag

thefcraft / civitai-stable-diffusion-337k

sssingh / pic-to-story

Jerey / image-to-pdf-and-txt

TAO71-AI / I4.0

eddieir / Image_to_Text

VityaVitalich / IMAD

yhwang / im2txt-inference

ergonomech / BLIP-2-Image-Describer

iohanngrig / gptassistant

BinhQuocLy / Pdf2Quiz

Emsley1d / Project03-NutriCO2

davidserra9 / cross-modal-retrieval-with-triplet-network

Improve this page

Add this topic to your repo