pix2tex: Using a ViT to convert images of equations into LaTeX code.
-
Updated
Jan 18, 2025 - Python
pix2tex: Using a ViT to convert images of equations into LaTeX code.
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
📋 Python wrapper to grab text from images and save as text files using Tesseract Engine
Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr . a tool to do automatically object segmentation from extreme points.
A collection of scripts to "help" you with your programming exams and assignments.
Minimal local-first multimodal RAG library powered by SQLite + sqlite-vec.
Civitai Stable Diffusion 337k Dataset; dataset of ai generated image
A Large Language Model (LLM) Based App to Generate Stories from Pictures
Python tool, which takes 1..n images, tries to rotate them based on the text, extract the text and store 1..n images to a pdf.
TAO71 I4.0 is an AI created by TAO71 in Python.
[AINL 2023] IMAD: IMage Augmented multi-modal Dialogue
Run im2txt trained model in inference mode
A web-based application that leverages the BLIP-2 model to generate detailed descriptions of uploaded images.
A CRUD application; my third project for GA Software Engineering Immersive.
Text-to-Image and Image-to-Text model retrieval
Add a description, image, and links to the image2text topic page so that developers can more easily learn about it.
To associate your repository with the image2text topic, visit your repo's landing page and select "manage topics."