Quantized ImageBind by ahmedsaed · Pull Request #1 · ahmedsaed/ImageBind

ahmedsaed · 2025-04-21T11:05:30Z

This pull request introduces quantization support for the ImageBind model, enabling efficient inference with reduced precision. The changes include adding new quantization-related classes, modifying existing components to support quantization, and providing a script for model quantization and evaluation.

Quantization Support Enhancements:

New Quantization Classes:
- Added QuantizableMultiheadAttention and QuantizedMultiheadAttention classes to support quantized attention mechanisms in the transformer module (imagebind/models/transformer.py). [1] [2]
- Introduced QuantizedDropPath to handle quantization-aware stochastic depth. This class includes quantization and dequantization stubs for compatibility with quantized models (imagebind/models/transformer.py).
Integration of Quantization in Model Components:
- Replaced DropPath with QuantizedDropPath and added nn.quantized.FloatFunctional for skip connections in the transformer blocks (imagebind/models/transformer.py). [1] [2]
- Substituted MultiheadAttention with QuantizableMultiheadAttention in the model's trunk instantiation (imagebind/models/imagebind_model.py).

Codebase Refactoring for Readability:

Reformatted import statements for better readability and consistency across files (imagebind/models/imagebind_model.py).
Simplified string formatting in extra_repr for the LearnableLogitScaling class (imagebind/models/helpers.py).

Quantization Workflow Script:

Added a new script (quantized.py) to demonstrate the quantization process, including:
- Preparing the ImageBind model for static quantization.
- Creating dummy data for calibration.
- Evaluating the similarity between original and quantized model outputs using cosine similarity.

Minor Improvements:

Added a check to dequantize inputs if they are quantized before normalization in the Normalize class (imagebind/models/helpers.py).
Adjusted model weight loading to improve code readability (imagebind/models/imagebind_model.py).

- Added dequantization step in Normalize class for quantized inputs. - Updated QuantizedImageBindModel to accept a quantization configuration. - Refactored quantization stubs to utilize the provided q_config. - Introduced quantization in MultiheadAttention and DropPath classes. - Modified quantized.py to include q_config setup and model quantization process.

…antization

ahmedsaed added 15 commits April 19, 2025 22:26

Add quantization methods for ImageBind model

3519167

attempt: quantizable MHA

76120fb

attempt: quantizable MHA

5b00667

Merge branch 'quantization' of github.com:Ahmedsaed/ImageBind into qu…

ab3f6b4

…antization

Quantize MultiHeadAttention modules

5b7a23c

fix: revert accidental modification of original model arch

07801d8

restore styling of the original model arch file

962c2e2

refactor: remove hardcoded qconfig from DropPath

0e5b89c

Skip quantization of atten_mask

3e903a6

Add random data for depth, thermal, imu

bf7e1da

refactor quantization code

cb6b3db

Fix: test on same data

8ec481f

refactor: quantization.py for static quantization

04ea7fb

refactor: model loaders

08c6979

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Quantized ImageBind#1

Quantized ImageBind#1
ahmedsaed wants to merge 15 commits intomainfrom
quantization

ahmedsaed commented Apr 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

ahmedsaed commented Apr 21, 2025

Quantization Support Enhancements:

Codebase Refactoring for Readability:

Quantization Workflow Script:

Minor Improvements:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant