[ CI/Build ] Added E2E Test For Compressed Tensors #5839

Merged
fixed classmethod
robertgshaw2-neuralmagic committed Jun 26, 2024
commit 0bbcab5955a4c8a3099df1e8884ed3a97c8d3dfb
4 changes: 4 additions & 0 deletions tests/models/test_compressed_tensors.py
@@ -7,6 +7,8 @@

 import pytest

+from tests.quantization.utils import is_quant_method_supported
+
 from .utils import check_logprobs_close

 MODELS = [
@@ -17,6 +19,8 @@
 NUM_LOGPROBS = 5


+@pytest.mark.skipif(not is_quant_method_supported("compressed-tensors"),
+                    reason="compressed-tensors is not supported on this machine type.")
 @pytest.mark.parametrize("model_name", MODELS)
 def test_models(
     vllm_runner,
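
The hunk above gates the test behind a support check so that it is skipped, rather than failed, on machines that cannot run the quantization method. A minimal sketch of that pattern follows. The helper `is_method_supported_sketch`, the `MIN_CAPABILITY` table, and the `DEVICE_CAPABILITY` constant are hypothetical stand-ins for illustration; the real `is_quant_method_supported` in `tests/quantization/utils` may decide support differently.

```python
import pytest

# Hypothetical table: quantization method -> minimum device capability.
MIN_CAPABILITY = {"compressed-tensors": 60}

# Hypothetical value for the current machine; 0 models "no suitable device".
DEVICE_CAPABILITY = 0


def is_method_supported_sketch(method: str,
                               device_capability: int = DEVICE_CAPABILITY) -> bool:
    """Return True if the method is known and the device meets its minimum."""
    required = MIN_CAPABILITY.get(method)
    return required is not None and device_capability >= required


# The skipif condition is evaluated once, when pytest collects the module.
@pytest.mark.skipif(not is_method_supported_sketch("compressed-tensors"),
                    reason="compressed-tensors is not supported on this machine type.")
@pytest.mark.parametrize("model_name", ["example/model-a"])
def test_models_sketch(model_name):
    assert model_name
```

Because the condition is computed at collection time, the support check should be cheap and should not raise on machines without the required hardware.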
@@ -33,7 +33,8 @@ def get_supported_act_dtypes(cls) -> List[torch.dtype]:
         return [torch.float16, torch.bfloat16]

     # Need to figure it out
-    def get_min_capability(self) -> int:
+    @classmethod
+    def get_min_capability(cls) -> int:
         return 60

     def get_name(self) -> str:
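
A short sketch of why the `@classmethod` change matters, assuming callers query the minimum capability on the config class itself before any instance exists. The class names and the `is_supported` helper below are hypothetical and are not taken from the PR.

```python
class InstanceMethodConfig:
    """Hypothetical config using the pre-fix, instance-method signature."""

    def get_min_capability(self) -> int:
        return 60


class ClassMethodConfig:
    """Hypothetical config using the post-fix, classmethod signature."""

    def __init__(self, weight_bits: int) -> None:
        self.weight_bits = weight_bits

    @classmethod
    def get_min_capability(cls) -> int:
        return 60


def is_supported(config_cls, device_capability: int) -> bool:
    # Queries the class directly; no instance or constructor arguments needed.
    return device_capability >= config_cls.get_min_capability()


def call_on_class_fails(config_cls) -> bool:
    """Return True if calling get_min_capability on the class raises TypeError."""
    try:
        config_cls.get_min_capability()
    except TypeError:
        return True
    return False
```

With the instance-method form, `InstanceMethodConfig.get_min_capability()` raises `TypeError` because `self` is missing, so a class-level check such as `is_supported` only works with the classmethod form.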