
Ensure dtype consistency in Pooling forward method #2492

Merged
4 commits merged into UKPLab:master on Feb 21, 2024

Conversation

EliasKassapis
Contributor

Adjusted the Pooling module's forward function to initialize new tensors with the same dtype as the input tensors, fixing dtype mismatch errors in mixed-precision settings (e.g., after model.half()). This change prevents errors arising from hard-coded .float() usage, enabling seamless operation across different dtype environments.

@tomaarsen
Collaborator

Hello!

I think this makes sense at a glance! You mention that model.half() currently causes problems; is that during inference? Please let me know, or provide a simple reproduction snippet, so I can verify whether this PR solves the problem.

  • Tom Aarsen

@EliasKassapis
Contributor Author

Hey Tom, thanks for replying so quickly. Before the fix, this part of my code led to a runtime error:

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("distiluse-base-multilingual-cased-v2").to(device)  # device defined elsewhere
model.half()  # cast all module parameters to float16
model.encode(input_text, show_progress_bar=False)  # input_text defined elsewhere; raised the error below

Error: RuntimeError: mat1 and mat2 must have the same dtype, but got Float and Half

The error originates in the model.encode(...) call, specifically in the model's Dense layer (whose parameters are float16 after invoking model.half()), which follows the Pooling module. The Pooling module returned a dict in which the sentence_embedding key held a float32 tensor, due to hardcoded .float() tensor operations in its forward method.
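
The underlying PyTorch behaviour can also be reproduced without sentence-transformers; the snippet below is only an illustration of the error (layer sizes are arbitrary), not code from the library:

import torch

dense = torch.nn.Linear(768, 512).half()  # stands in for the Dense layer after model.half()
pooled = torch.randn(1, 768)              # float32, like the sentence_embedding returned by Pooling
dense(pooled)                             # RuntimeError: mat1 and mat2 must have the same dtype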

This commit resolves the mismatch by replacing the hardcoded .float() with .to(token_embeddings.dtype), thereby maintaining dtype consistency within the Pooling module's forward function.
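
For reference, a self-contained sketch of the mean-pooling path with the dtype-consistent mask (assuming the usual masked-mean formulation; not the exact diff from the commit):

import torch

def mean_pool(token_embeddings: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    # Before the fix, the expanded mask was created with .float(), silently upcasting
    # the result to float32 even when token_embeddings were float16.
    mask = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).to(token_embeddings.dtype)
    summed = torch.sum(token_embeddings * mask, dim=1)
    counts = mask.sum(dim=1).clamp(min=1e-9)
    return summed / counts  # keeps the dtype of token_embeddings

token_embeddings = torch.randn(2, 4, 8, dtype=torch.float16)
attention_mask = torch.ones(2, 4, dtype=torch.long)
print(mean_pool(token_embeddings, attention_mask).dtype)  # torch.float16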

@tomaarsen
Collaborator

Very clear! I can reproduce this, too. I've run make style to satisfy the code quality check, and I've added a very simple test case. Thanks for this work! I'll merge it when the CI is green :)

  • Tom Aarsen
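
The added test isn't reproduced here, but a minimal regression test for this behaviour could look roughly like the following (model name, arguments, and assertion are illustrative assumptions; running float16 end to end may require a GPU or a PyTorch build with CPU float16 support):

import torch
from sentence_transformers import SentenceTransformer

def test_encode_half_precision():
    model = SentenceTransformer("distiluse-base-multilingual-cased-v2")
    model.half()
    # Before the fix this raised: RuntimeError: mat1 and mat2 must have the same dtype
    embedding = model.encode("A simple sentence.", convert_to_tensor=True, show_progress_bar=False)
    assert embedding.dtype == torch.float16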

@tomaarsen merged commit 20056c6 into UKPLab:master on Feb 21, 2024
9 checks passed