STR-food-menu-TH

Scene Text Recognition on Thai food menu images

Overview

This project performs Scene Text Recognition (STR), a subset of Optical Character Recognition (OCR): it extracts text from images. The domain is Thai food menu images, which contain table-like data. The Text Localization and Text Recognition models are private, so they are not provided in this repository. However, I've provided all of the pipeline code and described the expected behavior of the missing model functions, so you can plug your own models into the blank spaces in the code.

Scope Of Work

Thai food menu image
Dishname should be in the form of Thai text
The content should be in the form of "Front + Dishname + Price + Back"
- Front: The text in front of the Dishname, e.g. 1, 1., 1.
- Dishname: The name of the dish
- Price: The price of the dish
- Back: The text behind the Price, e.g. บาท, -, .-, .- บาท and บ.
The model should not only recognize the dish names and prices but also pair them correctly (returned as a list of lists, one per pair)
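
To make the "Front + Dishname + Price + Back" format concrete, here is a minimal sketch that splits one already-recognized menu line into its four parts with a regular expression. The pattern, the function name `split_menu_line`, and the assumption that the dish name consists only of Thai characters and spaces are illustrative choices, not part of this repository's code.

```python
import re

# Hypothetical pattern (an assumption, not the repository's implementation):
# optional Front (digits with an optional dot), a Thai Dishname,
# a numeric Price, and whatever follows as Back.
LINE_RE = re.compile(
    r"^\s*(?P<front>\d+\.?)?\s*"
    r"(?P<dishname>[\u0E00-\u0E7F][\u0E00-\u0E7F\s]*?)\s*"
    r"(?P<price>\d+)\s*"
    r"(?P<back>.*?)\s*$"
)


def split_menu_line(line):
    """Return a dict with front, dishname, price and back, or None if no match."""
    match = LINE_RE.match(line)
    if match is None:
        return None
    # Optional groups that did not take part in the match become empty strings.
    return {key: (value or "") for key, value in match.groupdict().items()}


parts = split_menu_line("1. ข้าวผัด 50 บาท")
print(parts)
```

In the actual pipeline the dish name and the price usually arrive as separate text boxes, so this only illustrates the content format, not the pairing step.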

Pipeline

Preprocessing: ROI from user
Text Localization: Private model
Text Recognition: Private model
Post Processing: Matching Algorithm
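
The four stages above can be chained as in the sketch below. Because the localization and recognition models are private, every stage is passed in as a callable; the function name `run_pipeline` and the parameter names are hypothetical and only show how your own models could be plugged in.

```python
def run_pipeline(image, roi, preprocess, localize, recognize, match):
    """Chain the four stages; each stage is a caller-supplied function.

    preprocess(image, roi) -> region of the image selected by the user
    localize(region)       -> list of text boxes found in the region
    recognize(region, box) -> text read from one box
    match(items)           -> list of [dishname, price] pairs
    All four signatures are assumptions made for this sketch.
    """
    region = preprocess(image, roi)
    boxes = localize(region)
    # Keep each recognized text together with its box for the matching step.
    items = [(recognize(region, box), box) for box in boxes]
    return match(items)
```

Keeping the stages as arguments means the private models can be swapped for any localizer or recognizer with a compatible interface.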

Preprocessing

We need an ROI (Region Of Interest) from the user to address two problems:

To deal with outliers in the image

Without ROI

With ROI

To deal with images that contain more than one region
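
As a rough illustration of the ROI step, the sketch below crops a user-supplied rectangle from an image stored as a list of pixel rows. The `(x0, y0, x1, y1)` ROI format and the function name `crop_roi` are assumptions; with a NumPy array the same crop would simply be `image[y0:y1, x0:x1]`.

```python
def crop_roi(image, roi):
    """Crop the rectangle roi = (x0, y0, x1, y1) from a list of pixel rows.

    x1 and y1 are exclusive, as in Python slicing.
    """
    x0, y0, x1, y1 = roi
    height = len(image)
    width = len(image[0]) if height else 0
    # Clamp the ROI to the image bounds so a sloppy selection still works.
    x0, x1 = max(0, x0), min(width, x1)
    y0, y1 = max(0, y0), min(height, y1)
    if x0 >= x1 or y0 >= y1:
        raise ValueError("ROI does not overlap the image")
    return [row[x0:x1] for row in image[y0:y1]]


# A tiny 4x4 "image" whose pixel value encodes its row and column.
image = [[r * 10 + c for c in range(4)] for r in range(4)]
print(crop_roi(image, (1, 1, 3, 3)))  # [[11, 12], [21, 22]]
```

Everything outside the rectangle, such as outlier text or a second menu region, never reaches the localization model.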

Text Localization

Localizes the text in the image

Text Recognition

Recognizes the text in each output image from the Text Localizer

Post Processing

The Matching Algorithm has two steps:

Classify whether each text is a dishname (+front) or a price (+back) using the index of the number in the text
Map each dishname to its price using the perpendicular distance between them
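
The two steps can be sketched as follows. This is one possible reading, not the repository's exact algorithm: a text counts as a price when whatever follows its last digit is empty or one of the Back examples listed in the scope, and each dishname is greedily paired with the price whose box center has the smallest perpendicular distance to the horizontal center line of the dishname box. The `(text, (x_min, y_min, x_max, y_max))` item format and all function names are assumptions.

```python
import math

# Back examples taken from the scope list; the empty string covers a bare price.
BACK_SUFFIXES = {"", "บาท", "-", ".-", ".- บาท", "บ."}


def is_price(text):
    """Step 1 (assumed rule): use the index of the last digit in the text."""
    digit_positions = [i for i, ch in enumerate(text) if ch.isdigit()]
    if not digit_positions:
        return False
    tail = text[digit_positions[-1] + 1:].strip()
    return tail in BACK_SUFFIXES


def perpendicular_distance(point, line_start, line_end):
    """Distance from a point to the infinite line through two distinct points."""
    (x0, y0), (x1, y1), (x2, y2) = point, line_start, line_end
    numerator = abs((x2 - x1) * (y1 - y0) - (x1 - x0) * (y2 - y1))
    return numerator / math.hypot(x2 - x1, y2 - y1)


def center(box):
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2, (y_min + y_max) / 2)


def center_line(box):
    """Left and right midpoints of a box; assumes the box has positive width."""
    x_min, y_min, x_max, y_max = box
    cy = (y_min + y_max) / 2
    return (x_min, cy), (x_max, cy)


def match_menu(items):
    """Step 2: pair each dishname with its nearest unused price."""
    dishes = [it for it in items if not is_price(it[0])]
    prices = [it for it in items if is_price(it[0])]
    pairs = []
    for dish_text, dish_box in dishes:
        if not prices:
            break
        start, end = center_line(dish_box)
        best = min(
            prices,
            key=lambda p: perpendicular_distance(center(p[1]), start, end),
        )
        prices.remove(best)  # each price is used at most once
        pairs.append([dish_text, best[0]])
    return pairs


items = [
    ("1. ข้าวผัด", (10, 10, 100, 30)),
    ("50 บาท", (200, 12, 250, 32)),
    ("2. ต้มยำกุ้ง", (10, 50, 120, 70)),
    ("120.-", (200, 48, 250, 68)),
]
print(match_menu(items))
```

A dish name that itself ends in a number would be misclassified by this rule, so a real implementation would likely need extra checks.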

NatthanonNon/STR-food-menu-TH
