Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CORE] [QUANT] Support for GPTQModel's
dynamic
quantization per module override/control #7086[CORE] [QUANT] Support for GPTQModel's
dynamic
quantization per module override/control #7086Changes from 1 commit
f470b26
c56e3de
502edb3
18064cd
1b132c3
4b63754
90258d2
a5d3c8b
c84793f
5682124
d651668
9a36694
fbc594f
e9ae8f5
19d7772
7057dbb
8565328
84ada54
e81a7da
68291ce
7867405
c63ba51
5f9b712
3692578
f902b2d
a570509
9b9d7e3
0559137
b29a094
3a2bb94
3c0d45a
74b1d42
b0672ae
066f489
6dc56a6
98a198e
b2861d8
c4a29eb
25703e3
1fd690e
4f48d1b
070ae3c
13b2b7b
6850e6d
40562d1
7b774bb
c72125a
c298195
bbc049d
78f8818
2cfec63
93ee576
6ebf85c
59bdf54
9de0382
67d0882
5623936
e41bdd7
965d7da
1a34027
0b249a1
4de04ae
4c0608b
8f21375
e3084e3
25dbd5a
874076c
17704df
c7f10be
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
Check failure on line 13 in vllm/model_executor/layers/quantization/gptq_marlin.py
Ruff (E501)
Check failure on line 76 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 76 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 76 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 76 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 77 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 77 in vllm/model_executor/layers/quantization/gptq_marlin.py
Ruff (E501)
Check failure on line 77 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 77 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 77 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 78 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 78 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 78 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 78 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 79 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 79 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 79 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 79 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Ruff (E501)
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 144 in vllm/model_executor/layers/quantization/gptq_marlin.py
Check failure on line 159 in vllm/model_executor/layers/quantization/gptq_marlin.py
Ruff (E501)