
[Tests] Fix failing 8bit test #25564

Merged
2 commits merged into huggingface:main from fix-bnb-test-einops on Aug 17, 2023

Conversation

younesbelkada
Contributor

What does this PR do?

Fixes two failing tests in https://github.com/huggingface/transformers/actions/runs/5873964880/job/15928072870

  • tests/quantization/bnb/test_mixed_int8.py::MixedInt8Test::test_get_keys_to_not_convert
  • tests/quantization/bnb/test_mixed_int8.py::MixedInt8GPT2Test::test_get_keys_to_not_convert

Context: #25105 added stronger checks to enable the correct quantization of models on the Hub, and with it a test that checks that mpt-7b is correctly quantized. Since that model requires einops as a dependency, I propose simply adding einops to the Docker image.
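
For context, here is a minimal sketch of the kind of load the failing test exercises (the quantization arguments are illustrative assumptions, not the exact test code); loading the checkpoint with `trust_remote_code=True` executes the modeling files hosted on the Hub, which import einops at module level, so it fails with an `ImportError` when einops is missing from the image:

```python
# Hedged sketch, not the exact test: loading mosaicml/mpt-7b with
# trust_remote_code=True runs the Hub modeling files (e.g. attention.py),
# which import einops at module level, so this raises ImportError if
# einops is not installed in the test Docker image.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

model = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b",
    trust_remote_code=True,  # run the modeling code from the Hub
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)
```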

cc @ydshieh

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Aug 17, 2023

The documentation is not available anymore as the PR was closed or merged.

@ydshieh
Collaborator

ydshieh commented Aug 17, 2023

Hi @younesbelkada! Could you say a bit more about why MPT needs einops?

@ydshieh
Collaborator

ydshieh commented Aug 17, 2023

Also, you probably need to refresh your CircleCI token?

@younesbelkada
Contributor Author

younesbelkada commented Aug 17, 2023

For MPT, as you can see from the remote model, einops is used in files that are imported by modeling_mpt.py, such as attention.py here: https://huggingface.co/mosaicml/mpt-7b/blob/main/attention.py#L7. It appears to be a hard dependency (hence the error in the attached slow test).
It is also listed as a dependency in the requirements file, together with their custom fork of triton: https://huggingface.co/mosaicml/mpt-7b/blob/main/requirements.txt#L2, but the triton dependency is optional.

@ydshieh
Collaborator

ydshieh commented Aug 17, 2023

OK, I think I got confused by the fact that we have a modeling file in transformers and we also use the same checkpoint name in our model tests. Now I get it, but one question: why do we want to use the remote code rather than the code in transformers for this quantization test? Is it really necessary to use the remote code for the quantization?

@younesbelkada
Contributor Author

Thanks!
Regarding your question, yes, I think so: #25105 added support for correct quantization of most remote-code models, so it is crucial to cover both the remote-code model and the non-remote model in the test, as sketched below.
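
As a rough illustration of covering both paths, here is a hedged sketch (the import path of `get_keys_to_not_convert` is an assumption and has moved between transformers releases, and the non-remote checkpoint is a hypothetical choice for illustration, not the exact test parametrization):

```python
# Hedged sketch, not the exact test: get_keys_to_not_convert should return
# the modules to keep in full precision for both a remote-code model and a
# model implemented natively in transformers.
from accelerate import init_empty_weights
from transformers import AutoConfig, AutoModelForCausalLM
from transformers.utils.bitsandbytes import get_keys_to_not_convert  # import path is an assumption

def keys_for(checkpoint: str, trust_remote_code: bool = False):
    # Build the model on the meta device so no weights are downloaded.
    config = AutoConfig.from_pretrained(checkpoint, trust_remote_code=trust_remote_code)
    with init_empty_weights():
        model = AutoModelForCausalLM.from_config(config, trust_remote_code=trust_remote_code)
    return get_keys_to_not_convert(model)

# Remote-code model: importing its Hub modeling files requires einops.
print(keys_for("mosaicml/mpt-7b", trust_remote_code=True))
# Non-remote model (hypothetical checkpoint, for illustration only).
print(keys_for("gpt2"))
```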

Collaborator

@ydshieh ydshieh left a comment


Thanks for explaining. Let's try it!

I am not sure what the original purpose of testing the modeling code on the Hub is: although we pin a revision, the code on the Hub might change, and our tests won't detect failures, if any, in the new code.

But as it is already there, let's keep it; just don't add any more remote models to the relevant tests 🙏

Collaborator

@ArthurZucker ArthurZucker left a comment


Quite aligned with @ydshieh here. I don't think we should be testing code on the Hub with trust_remote_code. Why not use the model now that it is in transformers?
EDIT: read the original PR; I guess it's okay, though I'm not a big fan of testing remote code 😅

@younesbelkada younesbelkada merged commit d4c0aa1 into huggingface:main Aug 17, 2023
3 checks passed
@younesbelkada younesbelkada deleted the fix-bnb-test-einops branch August 17, 2023 15:34
blbadger pushed a commit to blbadger/transformers that referenced this pull request Nov 8, 2023
* fix failing 8bit test

* trigger CI