WIP Keras 3 captcha_ocr #1609

mattdangerw · 2023-11-11T00:37:53Z

Not ready to land, but opening for some questions...

mattdangerw · 2023-11-11T00:41:10Z

@fchollet I could use your thoughts on two things here.

1. The guide uses keras.backend.ctc_batch_cost. I have just copied the functions into the example for now (it is a little hefty, but oh well). There is also keras._legacy.backend.ctc_batch_cost, but that does not seem to export. Should it? Would using it be better?
2. The second error seems like potentially a functional model bug! We have a model made like this

prediction_model = keras.models.Model(
    model.get_layer(name="image").input, model.get_layer(name="dense2").output
)

Which fails on predict like this...

File ~/miniconda3/envs/keras-nlp-tensorflow/lib/python3.10/site-packages/keras/src/ops/operation.py:47, in Operation.__call__(self, *args, **kwargs)
     45 if any_symbolic_tensors(args, kwargs):
     46     return self.symbolic_call(*args, **kwargs)
---> 47 return self.call(*args, **kwargs)

File ~/miniconda3/envs/keras-nlp-tensorflow/lib/python3.10/site-packages/keras/src/models/functional.py:188, in Functional.call(self, inputs, training, mask)
    186         if mask is not None:
    187             x._keras_mask = mask
--> 188 outputs = self._run_through_graph(
    189     inputs, operation_fn=lambda op: operation_fn(op, training=training)
    190 )
    191 return unpack_singleton(outputs)

File ~/miniconda3/envs/keras-nlp-tensorflow/lib/python3.10/site-packages/keras/src/ops/function.py:148, in Function._run_through_graph(self, inputs, operation_fn)
    146 output_tensors = []
    147 for x in self.outputs:
--> 148     output_tensors.append(tensor_dict[id(x)])
    150 return pack_sequence_as(self._outputs_struct, output_tensors)

KeyError: 140610153812160

Is this a valid use case? And a valid bug?

Thanks!

fchollet · 2023-11-11T03:35:49Z

> The second error seems like potentially a functional model bug! We have a model made like this

Yes, looks like a bug indeed. The use case looks valid as far as I can tell.

> The guide uses keras.backend.ctc_batch_cost. I have just copied the functions into the example for now (it is a little hefty, but oh well). There is also keras._legacy.backend.ctc_batch_cost, but that does not seem to export. Should it? Would using it be better?

It's no longer meant to be public. The version copied from the old Keras backend also isn't too great since it relies heavily on tf.compat.v1 which we should definitely stay away from.

I think the long-term fix would be to introduce a cross-backend CTC op, maybe written from scratch. Right now none of the solutions are satisfying. Seems like a big gap tbh, CTC is still the reference for OCR problems today.

CTC was already completely left behind in TF 2, for what it's worth. Not sure why. Only TF 1 had some support (and even then it was barely usable, which is why we had these hefty backend functions to work around TF 1 APIs).
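For readers unfamiliar with what a from-scratch CTC op would have to compute, here is a minimal sketch of the CTC negative log-likelihood for a single sequence, using the standard forward (alpha) recursion in plain Python. It is not the code copied into the example and not any Keras API: the function name `ctc_neg_log_likelihood`, its arguments, and the choice of index 0 for the blank are assumptions made for illustration. It works in probability space, so it would underflow on long sequences; a real op would work in log space and be batched.

```python
import math


def ctc_neg_log_likelihood(probs, label, blank=0):
    """Return -log p(label | probs) for one sequence (illustrative sketch).

    probs: list of T rows, each a list of per-class probabilities
           (assumed to be already softmaxed).
    label: list of class indices, without blanks.
    blank: index of the blank class (assumed to be 0 here).
    """
    # Extended label: a blank before, between, and after every symbol.
    ext = [blank]
    for c in label:
        ext += [c, blank]
    num_states = len(ext)

    # alpha[s] = total probability of all partial alignments that end
    # in extended-label position s at the current time step.
    alpha = [0.0] * num_states
    alpha[0] = probs[0][ext[0]]
    if num_states > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, len(probs)):
        new_alpha = [0.0] * num_states
        for s in range(num_states):
            total = alpha[s]  # stay on the same symbol
            if s >= 1:
                total += alpha[s - 1]  # advance by one position
            # Skipping the blank is allowed only between two different symbols.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new_alpha[s] = total * probs[t][ext[s]]
        alpha = new_alpha

    # Valid alignments end on the last symbol or on the trailing blank.
    p = alpha[-1] + (alpha[-2] if num_states > 1 else 0.0)
    return -math.log(p)


# Two time steps, two classes (blank and one symbol), uniform probabilities.
# The label [1] has three valid alignments ("1 1", "1 _", "_ 1"), each with
# probability 0.25, so the loss is -log(0.75).
uniform = [[0.5, 0.5], [0.5, 0.5]]
loss = ctc_neg_log_likelihood(uniform, [1])
```

Note that a repeated symbol such as the label [1, 1] needs at least three time steps, because a blank must separate the two occurrences.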

fchollet · 2023-11-22T04:58:19Z

I debugged the "functional model issue" and, as it turns out, the framework is fine. However, there was a semantic change in Keras 3 (the new behavior is more consistent).

The code tried to query .input on an Input layer. In Keras 2 this returns the same Input. In Keras 3 this is empty (which makes more sense to me: the entry node doesn't have itself as its own entry node...). To get the model's input, it is better to use the model's .input (instead of the .input of the model's Input layer).

So you need to modify the prediction model creation as such:

prediction_model = keras.models.Model(
    model.input[0], model.get_layer(name="dense2").output
)

The [0] is needed because model.input is a list of two input tensors (image and label).
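To make the KeyError in the traceback easier to picture, here is a toy sketch of a graph runner that, like the `tensor_dict[id(x)]` lookup in the traceback, stores computed values keyed by tensor identity. It is a simplified model and not the Keras implementation: the `Tensor` and `Op` classes and the `run_through_graph` function are hypothetical stand-ins. When the declared inputs are empty, no node can be computed, so the output lookup fails.

```python
class Tensor:
    """Hypothetical stand-in for a symbolic tensor; only its identity matters."""

    def __init__(self, name):
        self.name = name


class Op:
    """Hypothetical graph node: computes `output` from `inputs` using `fn`."""

    def __init__(self, fn, inputs, output):
        self.fn = fn
        self.inputs = inputs
        self.output = output


def run_through_graph(graph_inputs, ops, graph_outputs, values):
    # Seed the lookup table with the values supplied for the declared inputs.
    tensor_dict = {id(t): v for t, v in zip(graph_inputs, values)}
    for op in ops:  # ops are assumed to be listed in topological order
        # A node whose inputs were never computed cannot run, so skip it.
        if not all(id(t) in tensor_dict for t in op.inputs):
            continue
        args = [tensor_dict[id(t)] for t in op.inputs]
        tensor_dict[id(op.output)] = op.fn(*args)
    # Raises KeyError if an output was never reached from the declared inputs.
    return [tensor_dict[id(t)] for t in graph_outputs]


image = Tensor("image")
hidden = Tensor("hidden")
dense2 = Tensor("dense2")
ops = [
    Op(lambda x: x * 2, [image], hidden),
    Op(lambda x: x + 1, [hidden], dense2),
]

# Declared input is the real entry tensor: the output is reachable.
ok = run_through_graph([image], ops, [dense2], [3])

# No declared inputs (analogous to an empty `.input` on an Input layer):
# nothing gets computed and the output lookup fails.
try:
    run_through_graph([], ops, [dense2], [])
    failed = False
except KeyError:
    failed = True
```

Passing the model's own input tensor, as in the fix above, corresponds to the first call, where the output can be reached from the declared input.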

mattdangerw · 2023-11-22T23:14:11Z

Thanks! I will re-render this guide.

mattdangerw · 2023-11-23T02:58:49Z

Done! All rendered.

fchollet

LGTM

Copying the old code (compat.v1 and all) is still the least bad option.

github-actions bot assigned sachinprasadhs Nov 11, 2023

mattdangerw changed the title ~~WIP for captcha ocr~~ WIP Keras 3 captcha_ocr Nov 11, 2023

mattdangerw added 2 commits November 22, 2023 18:53

WIP for captcha ocr

69d3b41

Fix example

936dd23

mattdangerw force-pushed the keras-3-captcha-ocr branch from 12e18cb to 936dd23 Compare November 23, 2023 02:58

mattdangerw marked this pull request as ready for review November 23, 2023 02:58

fchollet approved these changes Nov 23, 2023

fchollet merged commit ada0e46 into keras-team:keras-3 Nov 23, 2023

sitamgithub-MSIT mentioned this pull request Apr 29, 2024

Re-updating OCR model for reading Captchas Keras 3 example (TF-Only) #1843

Open
