You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+2-2
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# pix2tex - LaTeX OCR
2
2
3
-
[](https://github.com/lukas-blecher/LaTeX-OCR)[](https://pypi.org/project/pix2tex)[](https://pypi.org/project/pix2tex)[](https://github.com/lukas-blecher/LaTeX-OCR/releases)[](https://colab.research.google.com/drive/1ba_qCGJl29dFQqfBjdqMik3o_EqPE4fr)
3
+
[](https://github.com/lukas-blecher/LaTeX-OCR)[](https://pypi.org/project/pix2tex)[](https://pypi.org/project/pix2tex)[](https://github.com/lukas-blecher/LaTeX-OCR/releases)[](https://colab.research.google.com/github/lukas-blecher/LaTeX-OCR/blob/master/notebooks/LaTeX_OCR_test.ipynb)
4
4
5
5
The goal of this project is to create a learning based system that takes an image of a math formula and returns corresponding LaTeX code.
6
6
@@ -33,7 +33,7 @@ The model works best with images of smaller resolution. That's why I added a pre
33
33
34
34
Always double check the result carefully. You can try to redo the prediction with an other resolution if the answer was wrong.
35
35
36
-
## Training the model [](https://colab.research.google.com/drive/1MqZSKzSgEnJB9lU7LyPma4bo4J3dnj1E)
36
+
## Training the model [](https://colab.research.google.com/github/lukas-blecher/LaTeX-OCR/blob/master/notebooks/LaTeX_OCR_training.ipynb)
37
37
38
38
1. First we need to combine the images with their ground truth labels. I wrote a dataset class (which needs further improving) that saves the relative paths to the images with the LaTeX code they were rendered with. To generate the dataset pickle file run
"In this brief notebook I show how you can finetune/train an OCR model.\n",
25
+
"\n",
26
+
"I've opted to mix in handwritten data into the regular pdf LaTeX images. For that I started out with the released pretrained model and continued training on the slightly larger corpus."
"Now we generate the datasets. We can string multiple datasets together to get one large lookup table. The only thing saved in these pkl files are image sizes, image location and the ground truth latex code. That way we can serve batches of images with the same dimensionality."
0 commit comments