[Major] Add support for Mixtral8x7b #16

cylinbao · 2024-04-11T17:10:34Z

Add simulated quantization for Mixtral8x7b.
One major difference from Llama is that the activation quantization is moved to after the gate operation of the SparseMoeBlock.
I also updated the transformers library version to 4.39.0 for better support of the Mixtral model.
Currently, we have 4.41 perplexity on wikitext2 for W4A4 quantization.
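
To illustrate the placement described above, here is a minimal NumPy sketch, not the code from this PR. It assumes the router (gate) reads the unquantized hidden states, that activations are fake-quantized once afterwards, and that each expert is a single linear layer (real Mixtral experts are gated MLPs). The names `fake_quantize` and `moe_forward` are hypothetical, and the symmetric per-token scheme is an assumption.

```python
import numpy as np


def fake_quantize(x, n_bits=4):
    """Simulated symmetric per-token quantization: round to n_bits, then dequantize."""
    qmax = 2 ** (n_bits - 1) - 1
    scale = np.abs(x).max(axis=-1, keepdims=True) / qmax
    scale = np.where(scale == 0, 1.0, scale)  # avoid dividing by zero on all-zero rows
    q = np.clip(np.round(x / scale), -qmax - 1, qmax)
    return q * scale


def moe_forward(hidden, gate_w, expert_ws, top_k=2, n_bits=4):
    """Toy sparse MoE block; each expert is one linear layer (a simplification)."""
    # 1) The router (gate) sees the unquantized hidden states.
    logits = hidden @ gate_w.T                            # (tokens, n_experts)
    top_idx = np.argsort(-logits, axis=-1)[:, :top_k]     # (tokens, top_k)
    top_logits = np.take_along_axis(logits, top_idx, axis=-1)
    weights = np.exp(top_logits - top_logits.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)        # softmax over the top-k logits

    # 2) Activation quantization happens after the gate, once for all experts.
    hidden_q = fake_quantize(hidden, n_bits)

    # 3) Each selected expert consumes the quantized activations.
    out = np.zeros((hidden.shape[0], expert_ws[0].shape[0]))
    for e, w in enumerate(expert_ws):
        for slot in range(top_k):
            rows = top_idx[:, slot] == e
            out[rows] += weights[rows, slot:slot + 1] * (hidden_q[rows] @ w.T)
    return out, top_idx
```

In this sketch the routing decision is unaffected by quantization error, since only the expert inputs go through `fake_quantize`. Weight quantization (the W4 half of W4A4) is left out for brevity.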

happierpig · 2024-04-12T13:11:24Z

model/modelutils_mixtral.py

Note that the current implementation shares one set of reordering indices across all sparse experts. Ideally, each expert would have its own sampled and generated indices. However, for efficiency (we want to fuse the reorder operator into the preceding layer norm operator) and because evaluation showed no large accuracy difference between the two approaches, we chose shared indices.
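
The shared-index idea can be sketched as follows. This is a hedged illustration, not the contents of `model/modelutils_mixtral.py`: the helper names `shared_reorder_index` and `reorder_experts` are hypothetical, and sorting channels by peak calibration magnitude is an assumed criterion. The point is that one permutation is applied once to the activations (which is what makes fusing it into the preceding norm possible) and the same permutation is applied to every expert's input channels, leaving the outputs unchanged.

```python
import numpy as np


def shared_reorder_index(calib_acts):
    """One permutation for all experts: sort input channels by peak magnitude.

    calib_acts has shape (samples, channels); outlier channels end up last.
    """
    channel_peak = np.abs(calib_acts).max(axis=0)
    return np.argsort(channel_peak, kind="stable")


def reorder_experts(expert_ws, index):
    """Permute the input channels (columns) of every expert with the same index."""
    return [w[:, index] for w in expert_ws]


# Usage: permute activations once, reuse the result for every expert.
rng = np.random.default_rng(0)
acts = rng.normal(size=(6, 8))
acts[:, 3] = 100.0                       # make channel 3 an obvious outlier
experts = [rng.normal(size=(5, 8)) for _ in range(3)]

index = shared_reorder_index(acts)
acts_reordered = acts[:, index]          # done once, before the experts
experts_reordered = reorder_experts(experts, index)
```

With per-expert indices, the activations would have to be permuted separately for each expert, so the reorder could no longer be folded into a single upstream operator.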

cylinbao added 7 commits April 8, 2024 07:10

mixtral wip, support regular w-act quant

81aeda7

make compatible with newer transformers lib

11bae09

mixtral wip, support regular w-act quant

391a0af

Merge branch 'mixtral-test' into mixtral

06afa2a

Merge branch 'yilong' into mixtral

97647be

[Major] add mixtral8x7b support

7dafad0

[upd] fix zero shot

0a4da67

cylinbao requested a review from happierpig April 11, 2024 17:10

happierpig reviewed Apr 12, 2024

happierpig approved these changes Apr 12, 2024

happierpig merged commit d509398 into efeslab:main Apr 12, 2024
