Lstmbox #2216
I have tested this for [...] Since the character-level bounding boxes are NOT accurate with the LSTM engine, I changed the format to the one used by ocrd-train, i.e. bounding info at TEXTLINE level for all characters on a line.
Sorry if I missed some information, but what is the use case for this?
When tesseract is used with [...] This PR allows creation of box files from images, in the format needed by LSTM training. The box files will still need to be edited for accuracy (similar to tesseract 3). This can be useful when someone wants to finetune for a particular typeface which is not available as a font.
Since there are lots of requests from users who want to train using images, I wanted to add this option.
Test results for eng, hin, ara, chi_sim and chi_tra. Both chi_sim and chi_tra include extra spaces when a word has both Chinese and Latin script.
(cherry picked from commit 049db10)
text2image does not output a space char at EOL.
@amitdo Thanks. Will change. Have you checked the output for RTL, e.g. Hebrew?
The ara lstmbox in testbox.zip looks fine.
Create box files (using code similar to the tsv renderer) in the format needed for LSTM training, i.e. with a line for a space after every word and a line with a tab to mark the end of each text line.
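For readers unfamiliar with the layout described above, here is a minimal Python sketch of how such box lines could be assembled for one text line. It is not the renderer code from this PR. The function name `lstmbox_lines`, the `space_at_eol` switch, and the `(left, bottom, right, top)` tuple are hypothetical, and the field order `symbol left bottom right top page` is assumed from the usual Tesseract box file convention. Every symbol on the line carries the same textline-level coordinates.

```python
from typing import List, Tuple

# Assumed order: (left, bottom, right, top), as in Tesseract box files.
BBox = Tuple[int, int, int, int]


def lstmbox_lines(words: List[str], line_bbox: BBox, page: int = 0,
                  space_at_eol: bool = False) -> List[str]:
    """Build box lines for one text line (hypothetical helper).

    Each character gets the bounding box of the whole text line.
    Note: this iterates over code points; scripts with combining
    marks (e.g. Hindi) would need grapheme clusters instead.
    """
    left, bottom, right, top = line_bbox
    coords = f"{left} {bottom} {right} {top} {page}"
    out: List[str] = []
    for i, word in enumerate(words):
        for ch in word:
            out.append(f"{ch} {coords}")
        is_last = i == len(words) - 1
        # A line whose symbol is a space separates words. Whether one is
        # also written after the last word is left as a switch here.
        if not is_last or space_at_eol:
            out.append(f"  {coords}")
    # A line whose symbol is a tab marks the end of the text line.
    out.append(f"\t {coords}")
    return out


if __name__ == "__main__":
    for row in lstmbox_lines(["ab", "c"], (10, 20, 110, 50)):
        print(repr(row))
```

The `space_at_eol` default of `False` mirrors the remark that text2image does not output a space char at EOL; set it to `True` to get a space line after every word.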