Add DeiT Model #2203

Open

Sohaib-Ahmed21 wants to merge 18 commits into master

Conversation

@Sohaib-Ahmed21 (Author) commented Apr 3, 2025

Add the ViT-based DeiT (Data-efficient Image Transformers) model to keras-hub, along with its backbone, layers, tests, and checkpoint conversion.
Paper: https://arxiv.org/pdf/2012.12877
Model card: https://huggingface.co/facebook/deit-base-distilled-patch16-384
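A hypothetical usage sketch (the class name, preset handle, and feature shape here are assumptions based on keras-hub conventions; the final API is whatever ships with this PR):

import numpy as np
import keras_hub

# Load the DeiT backbone from a preset (name is illustrative).
backbone = keras_hub.models.DeiTBackbone.from_preset(
    "deit_base_distilled_patch16_384_imagenet"
)
images = np.random.uniform(size=(1, 384, 384, 3)).astype("float32")
features = backbone(images)  # ViT-style encoder features for the image batch.
print(features.shape)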

google-cla bot commented Apr 3, 2025

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

@Sohaib-Ahmed21 (Author) commented Apr 18, 2025

@divyashreepathihalli @JyotinderSingh , kindly approve the workflows.

@sachinprasadhs (Collaborator) commented

@Sohaib-Ahmed21, thanks for the PR! Could you please add the checkpoint_conversion and link a Colab notebook here verifying the numerics and parameter matching, and demonstrating an end-to-end usage example of the model?

@Sohaib-Ahmed21 (Author) commented

> @Sohaib-Ahmed21, thanks for the PR! Could you please add the checkpoint_conversion and link a Colab notebook here verifying the numerics and parameter matching, and demonstrating an end-to-end usage example of the model?

Yeah sure, I'll do that and update the PR, thanks!

@Sohaib-Ahmed21 (Author) commented

Added the checkpoint_conversion; I'll share the parameter/numerics verification and end-to-end demo notebooks soon.

@Sohaib-Ahmed21 (Author) commented

I’m sharing the following resources related to this PR:

  • DeiT Checkpoint Conversion and Numerics Verification Demo (across multiple backends): Notebook Link
  • DeiT End-to-End Demo (zero-shot and finetuning): Notebook Link
  • Here are the converted DeiT presets from Hugging Face checkpoints for reference.

@Sohaib-Ahmed21 (Author) commented

This PR is ready for review. Kindly approve the workflows and review the PR, thanks!

@sachinprasadhs (Collaborator) left a comment

Thanks for the demo notebook; I added a few more comments.

Also, in the Colab, can you assert that the parameter counts match between the Keras and HF models? For example, the tiny variant has "params": 5524800; can you show the output where it matches the HF parameter count?
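A minimal sketch of such an assertion (the Hugging Face side uses the real transformers.DeiTModel; keras_backbone is an illustrative name standing in for the converted keras-hub model):

from transformers import DeiTModel

# Load the reference HF checkpoint (the model card linked in the PR description).
hf_model = DeiTModel.from_pretrained("facebook/deit-base-distilled-patch16-384")
hf_params = sum(p.numel() for p in hf_model.parameters())

# keras_backbone is assumed to be the converted keras-hub DeiT backbone.
keras_params = keras_backbone.count_params()
assert keras_params == hf_params, (keras_params, hf_params)
print(f"Parameter counts match: {keras_params}")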

@Sohaib-Ahmed21 (Author) commented

Thanks for the detailed review. I'll address the comments soon.

> can you show the output where it matches the HF parameter count?

Yes, I'll show that.

@Sohaib-Ahmed21 (Author) commented

I've addressed all review comments; kindly review the updated PR. The notebook has also been updated to include parameter verification.

@sachinprasadhs (Collaborator) commented

Awesome, this looks great! Thank you.

@kokoro-team removed the kokoro:force-run (Runs Tests on GPU) label May 2, 2025
@Sohaib-Ahmed21 (Author) commented

Is the failing test related to the PR? Kindly confirm and re-run the tests if required.

@divyashreepathihalli added the kokoro:force-run (Runs Tests on GPU) label May 5, 2025
@kokoro-team removed the kokoro:force-run (Runs Tests on GPU) label May 5, 2025
@mattdangerw (Member) commented

@Sohaib-Ahmed21 no, the jax-gpu segfault popped up recently; it's probably related to our test environment (we haven't tracked it down yet). You can ignore it.

@mattdangerw (Member) left a comment

A few misc comments, but this is looking good!


# Metadata for loading pretrained model weights.
backbone_presets = {
    "deit-base-distilled-patch16-384_imagenet": {
@mattdangerw (Member) commented

I think we use only underscores in our preset names, no mixing dashes and underscores like this.

@mattdangerw (Member) commented

This should be true for the preset names here and on Kaggle; we want consistency.
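Following that convention, the preset key above would become (illustrative):

"deit_base_distilled_patch16_384_imagenet": {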

@@ -73,7 +76,10 @@ def load_task(self, cls, load_weights, load_task_weights, **kwargs):
             cls, load_weights, load_task_weights, **kwargs
         )
         # Support loading the classification head for classifier models.
-        if architecture == "ViTForImageClassification":
+        if (
@mattdangerw (Member) commented

Maybe we could just check if ForImageClassification is in the name here?
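A sketch of that check, assuming architecture holds the Hugging Face architecture string (e.g. "DeiTForImageClassificationWithTeacher"):

if "ForImageClassification" in architecture:
    # Covers ViTForImageClassification, DeiTForImageClassification,
    # DeiTForImageClassificationWithTeacher, etc.
    ...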
