This is a pytorch implementation of An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition.
This implementation is similar to https://github.com/meijieru/crnn.pytorch, we rewrite it and make it easy to understand.
A GRCNN implementaion is in https://github.com/jingjing-you/GRCNN_OCR.pytorch.
- Pytorch >= 0.4.0
- opencv
- warp_ctc_pytorch
- tqdm
The names of train and val data are constructed as bellow:
img_xxx_$$$.jpg
where 'xxx' represents the number of this image and '$$$' represents the label of this image. For example,‘img_0_WHLU.jpg’.
Other examples are in data_sample
directory.
You can run python train.py
to train your model.
You can run python eval.py
to eval your model.