OCR for Math expressions

Main ideas

Generate training images of math expressions with LaTeX.
Train the model on those samples.
When doing OCR, capture an image of a math expression with a snipping GUI built with PyQt6.
Use the trained model to run OCR on the captured image.
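
The first step above can be sketched roughly as follows. This is a minimal illustration, not the actual contents of generate.py: the template, the helper names `make_tex_source` and `build_commands`, the file stem, and the choice of Ghostscript's `pnggray` device are all assumptions made for the example. It only builds the LaTeX source and the command lines; nothing is executed.

```python
from pathlib import Path

# Hypothetical template: a standalone document cropped to one displayed formula.
TEX_TEMPLATE = r"""\documentclass[preview,border=2pt]{standalone}
\begin{document}
$\displaystyle <EXPR>$
\end{document}
"""


def make_tex_source(expression: str) -> str:
    """Wrap one math expression in a minimal LaTeX document."""
    return TEX_TEMPLATE.replace("<EXPR>", expression)


def build_commands(stem: str, out_dir: Path, dpi: int = 200) -> list:
    """Return the pdflatex and Ghostscript command lines for one sample."""
    tex, pdf, png = (out_dir / f"{stem}{ext}" for ext in (".tex", ".pdf", ".png"))
    return [
        # Compile the .tex file to a PDF in out_dir.
        ["pdflatex", "-interaction=nonstopmode",
         f"-output-directory={out_dir}", str(tex)],
        # Rasterize the PDF to a grayscale PNG at the requested resolution.
        ["gswin64c", "-dBATCH", "-dNOPAUSE", "-sDEVICE=pnggray",
         f"-r{dpi}", f"-sOutputFile={png}", str(pdf)],
    ]


if __name__ == "__main__":
    print(make_tex_source(r"\frac{a}{b}"))
    for cmd in build_commands("sample_0001", Path("samples")):
        print(" ".join(cmd))
```

To actually render a sample, one would write the source to the `.tex` path and pass each command to `subprocess.run(cmd, check=True)`.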

Required software and packages

The following instructions were tested on Windows 11, but I believe they can be adapted to other operating systems with modest modifications.

Install Python 3, TeX Live, Ghostscript, and ImageMagick. Don't forget to add them to the PATH environment variable if necessary. You can check the installation by running the following commands in PowerShell.

python --version
pdflatex --version
gswin64c --version
convert --version
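
The same check can be done in one pass from Python. The sketch below is only a convenience and not part of the repository; the helper name `find_missing` is made up for the example. It reports which of the commands above cannot be found on PATH.

```python
import shutil

# Command names taken from the checks above.
TOOLS = ["python", "pdflatex", "gswin64c", "convert"]


def find_missing(tools):
    """Return the tools that cannot be located on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = find_missing(TOOLS)
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All tools found on PATH.")
```

Finding a command on PATH does not prove it is the right program. On Windows, `convert` may also resolve to an unrelated system utility, so it is still worth looking at the `--version` output.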

Install the Python packages NumPy, Pillow, and PyQt6.
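
A quick way to confirm that the packages are importable is sketched below. It is an illustration only; the helper name `missing_packages` is made up for the example. Note that Pillow is imported under the module name `PIL`.

```python
import importlib.util

# Display name -> importable module name.
REQUIRED = {"NumPy": "numpy", "Pillow": "PIL", "PyQt6": "PyQt6"}


def missing_packages(required):
    """Return the display names of packages whose module cannot be found."""
    return [
        name
        for name, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    print("Missing packages:", ", ".join(missing) if missing else "none")
```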

Make sure Windows allows PowerShell scripts to run. See "How to enable execution of PowerShell scripts?" for details.

Building instructions

Generate all sample images:

python generate.py

Run the snipping GUI:

python snipping.py
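
A snipping tool has to turn a mouse drag into a capture rectangle, and the drag can go in any direction. The sketch below shows one way to handle that. It is not taken from snipping.py: the function names and the minimum-size threshold are assumptions made for the example.

```python
def selection_to_bbox(press, release):
    """Return (left, top, right, bottom) for a drag from press to release."""
    (x0, y0), (x1, y1) = press, release
    return (min(x0, x1), min(y0, y1), max(x0, x1), max(y0, y1))


def is_usable(bbox, min_size=5):
    """Reject selections that are too small to contain an expression."""
    left, top, right, bottom = bbox
    return (right - left) >= min_size and (bottom - top) >= min_size


if __name__ == "__main__":
    # A drag from the lower right to the upper left still gives a valid box.
    bbox = selection_to_bbox((120, 200), (40, 80))
    print(bbox, is_usable(bbox))
```

A box in this form could then be passed to Pillow's `ImageGrab.grab(bbox=...)` to capture that region of the screen.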
