Adding an example on handwriting recognition #594

sayakpaul · 2021-08-16T16:02:22Z

A Colab Notebook is available here.

`.py` file

haifeng-jin

Thank you for your contribution!

haifeng-jin · 2021-08-18T18:35:54Z

examples/vision/handwriting_recognition.py

+[IAM Dataset](https://fki.tic.heia-fr.ch/databases/iam-handwriting-database)
+that has variable length ground-truths. IAM Dataset is widely used across many OCR
+benchmarks so we hope this example serves as a good starting point. 
+"""


Can we add a little more description of the dataset, like each sample in the dataset is an image of hand-written sentences, the prediction target of a sample is a string?

haifeng-jin · 2021-08-18T18:39:48Z

examples/vision/handwriting_recognition.py

+"""
+## Introduction
+
+This example shows how the [Captcha OCR](https://keras.io/examples/vision/captcha_ocr/)


May add one sentence introduction to what is OCR.

I don't think this is required since it's a sequel example.

haifeng-jin · 2021-08-18T19:07:13Z

examples/vision/handwriting_recognition.py

+"""
+
+
+class CTCLayer(keras.layers.Layer):


Do we have to make the loss a Layer instead of a loss function or Loss subclass?

Our idea with this example is to keep it as close to the Captcha OCR example as possible while showing the bits that need to be changed. IMO, that helps to ensure a good reading experience. Ccing @AakashKumarNain if he has other points of view.

When defining a normal loss function, we generally assume that the shape of y_true and y_pred are same which isn't the case here. Defining it as an endpoint layer makes it easy to calculate the length of inputs and the labels on the fly during training, and then pass the same to the ctc_batch_cost(...)

haifeng-jin

Thank you for the updates! LGTM.

sayakpaul · 2021-08-20T07:21:19Z

@haifeng-jin,
@AakashKumarNain and I may have another component to add to this tutorial. We are working on that. So, please expect a bit of delay as we finalize things.

haifeng-jin · 2021-08-20T17:13:21Z

@sayakpaul Sure, just ping me when it's ready. Thank you!

sayakpaul · 2021-08-21T03:27:19Z

@haifeng-jin now, it's good to go for another round of review. We incorporated Edit Distance as an evaluation metric.

examples/vision/handwriting_recognition.py

haifeng-jin

LGTM, Thanks for the update!

fchollet

Awesome example! Very strong follow-up to the original OCR example.

I added a few minor copyedits. Please add the generated files. Thanks @haifeng-jin for the review.

sayakpaul · 2021-08-25T02:41:41Z

@fchollet added the generated files. The only major change is the reduced number of epochs because my system was getting stalled after 10 epochs. Tried on a commodity GCP Notebook instance too but didn't help much.

Cc: @AakashKumarNain

AakashKumarNain · 2021-08-25T04:07:23Z

Thanks for the review @haifeng-jin @fchollet .

@sayakpaul should we run this on a bigger VM (only if it is required)?

sayakpaul · 2021-08-25T04:12:57Z

I don't think that's required given that we have explicitly noted the number of epochs should be at least 50. However, if you want to do it go right ahead.

fchollet · 2021-08-25T10:15:05Z

Yes, I think the restriction should be fine. Thank you!

fchollet · 2021-08-25T10:19:13Z

@sayakpaul @AakashKumarNain there were only 2 image files included in the PR, but the example is intended to have 3 figures (it's missing the last one). Please add the missing figure (in a new PR)

sayakpaul · 2021-08-25T10:27:48Z

but the example is intended to have 3 figures (it's missing the last one). Please add the missing figure (in a new PR)

@fchollet it actually plots two figures and uses markdown for the other one. See here.

sayakpaul and others added 6 commits August 16, 2021 08:18

adding example

41bfa06

date fix

3d6e2cc

.ds_store deletion

76fbd64

feedback round I

f341d38

ds_store delete

33723d2

Merge pull request #1 from sayakpaul/handwriting-ocr

332445b

`.py` file

google-cla bot added the cla: yes label Aug 16, 2021

fchollet assigned haifeng-jin Aug 16, 2021

fchollet requested a review from haifeng-jin August 16, 2021 19:59

haifeng-jin requested changes Aug 18, 2021

View reviewed changes

feedback round I

fe33fc0

haifeng-jin approved these changes Aug 20, 2021

View reviewed changes

incorporated edit distance thanks to aakash

d771533

AakashKumarNain reviewed Aug 23, 2021

View reviewed changes

examples/vision/handwriting_recognition.py Outdated Show resolved Hide resolved

removed redundant prediction_model

7ea3066

haifeng-jin approved these changes Aug 24, 2021

View reviewed changes

copyedits

f6f0bae

fchollet approved these changes Aug 25, 2021

View reviewed changes

adding generated files

b06b4c8

fchollet merged commit 4891e16 into keras-team:master Aug 25, 2021

sayakpaul mentioned this pull request Aug 25, 2021

Command-level edits and image fixes to Handwriting OCR #606

Merged

sitamgithub-MSIT mentioned this pull request Apr 29, 2024

Re-updating OCR model for reading Captchas Keras 3 example (TF-Only) #1843

Open

		"""


		class CTCLayer(keras.layers.Layer):

Adding an example on handwriting recognition #594

Adding an example on handwriting recognition #594

Uh oh!

Conversation

sayakpaul commented Aug 16, 2021

Uh oh!

haifeng-jin left a comment

Choose a reason for hiding this comment

Uh oh!

haifeng-jin Aug 18, 2021

Choose a reason for hiding this comment

Uh oh!

sayakpaul Aug 19, 2021

Choose a reason for hiding this comment

Uh oh!

haifeng-jin Aug 18, 2021

Choose a reason for hiding this comment

Uh oh!

sayakpaul Aug 19, 2021

Choose a reason for hiding this comment

Uh oh!

haifeng-jin Aug 18, 2021

Choose a reason for hiding this comment

Uh oh!

sayakpaul Aug 19, 2021

Choose a reason for hiding this comment

Uh oh!

AakashKumarNain Aug 19, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

haifeng-jin left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul commented Aug 20, 2021

Uh oh!

haifeng-jin commented Aug 20, 2021

Uh oh!

sayakpaul commented Aug 21, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

haifeng-jin left a comment

Choose a reason for hiding this comment

Uh oh!

fchollet left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul commented Aug 25, 2021

Uh oh!

AakashKumarNain commented Aug 25, 2021

Uh oh!

sayakpaul commented Aug 25, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fchollet commented Aug 25, 2021

Uh oh!

fchollet commented Aug 25, 2021

Uh oh!

sayakpaul commented Aug 25, 2021

Uh oh!

Uh oh!

AakashKumarNain Aug 19, 2021 •

edited

Loading

sayakpaul commented Aug 21, 2021 •

edited

Loading

sayakpaul commented Aug 25, 2021 •

edited

Loading