Description
📚 The doc issue
The quantization example at https://docs.vllm.ai/en/latest/quantization/auto_awq.html can't be run as written. It looks like AutoAWQ is looking for safetensors files, and https://huggingface.co/lmsys/vicuna-7b-v1.5/tree/main doesn't have them:
return model_class.from_pretrained(
File "/env/lib/conda/stas-inference/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3477, in from_pretrained
raise EnvironmentError(
OSError: Error no file named model.safetensors found in directory /data/huggingface/hub/models--lmsys--vicuna-7b-v1.5/snapshots/3321f76e3f527bd14065daf69dad9344000a201d.
autoawq=0.2.6
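A quick way to confirm this before starting a quantization run is to look at the repo's file listing. The following is only a minimal sketch: the helper names `has_safetensors` and `repo_has_safetensors` are made up for illustration, and the second one assumes `huggingface_hub` is installed and the Hub is reachable.

```python
from typing import Iterable


def has_safetensors(filenames: Iterable[str]) -> bool:
    """Return True if any file in the listing is a .safetensors weight file."""
    return any(name.endswith(".safetensors") for name in filenames)


def repo_has_safetensors(repo_id: str) -> bool:
    """Check a Hub repo's file listing (needs network access)."""
    # Imported lazily so the pure helper above works without huggingface_hub.
    from huggingface_hub import list_repo_files

    return has_safetensors(list_repo_files(repo_id))


# Illustrative listings only, not copied from any real repo:
bin_only = ["config.json", "pytorch_model-00001-of-00002.bin"]
with_safetensors = ["config.json", "model-00001-of-00002.safetensors"]
```

With network access, `repo_has_safetensors("lmsys/vicuna-7b-v1.5")` would show whether the model used in the docs example ships safetensors weights at the time of the check.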
Suggest a potential alternative/fix
I tried another model that has .safetensors files but then it fails with:
File "/env/lib/conda/stas-inference/lib/python3.10/site-packages/datasets/data_files.py", line 332, in resolve_pattern
fs, _, _ = get_fs_token_paths(pattern, storage_options=storage_options)
File "/env/lib/conda/stas-inference/lib/python3.10/site-packages/fsspec/core.py", line 681, in get_fs_token_paths
paths = [f for f in sorted(fs.glob(paths)) if not fs.isdir(f)]
File "/env/lib/conda/stas-inference/lib/python3.10/site-packages/huggingface_hub/hf_file_system.py", line 417, in glob
return super().glob(path, **kwargs)
File "/env/lib/conda/stas-inference/lib/python3.10/site-packages/fsspec/spec.py", line 613, in glob
pattern = glob_translate(path + ("/" if ends_with_sep else ""))
File "/env/lib/conda/stas-inference/lib/python3.10/site-packages/fsspec/utils.py", line 732, in glob_translate
raise ValueError(
ValueError: Invalid pattern: '**' can only be an entire path component
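For anyone unsure what that error message means: the rule it states is that `**` must stand alone between path separators. Below is a simplified restatement of that rule, not fsspec's actual code; the function name `double_star_is_valid` is hypothetical, and the traceback does not show which pattern `datasets` actually passed in.

```python
def double_star_is_valid(pattern: str) -> bool:
    """Return True if every '**' in the pattern is a whole path component."""
    for part in pattern.split("/"):
        # '**' glued to other characters (e.g. '**.json') breaks the rule.
        if "**" in part and part != "**":
            return False
    return True


# Example patterns, chosen only to illustrate the rule:
ok_pattern = "data/**/*.json"   # '**' is its own component
bad_pattern = "data/**.json"    # '**' is fused with '.json'
```

A pattern like `bad_pattern` is the kind that would trigger the `ValueError` above, while `ok_pattern` would pass this check.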
I see that this example was copied from https://github.com/casper-hansen/AutoAWQ?tab=readme-ov-file#examples; the two are identical, so the example is broken at the source as well.
Edit: I think the issue is the datasets version. I'm able to run this version of the example, https://github.com/casper-hansen/AutoAWQ/blob/6f14fc7436d9a3fb5fc69299e4eb37db4ee9c891/examples/quantize.py, with datasets==2.21.0.
The version from https://docs.vllm.ai/en/latest/quantization/auto_awq.html still fails as explained above.
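Since the failure seems to depend on package versions, it may help to include the installed versions of the packages that appear in the tracebacks when reproducing this. A small sketch using only the standard library; the package list and the helper names are my own choice, not part of any of these projects.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Dict, Optional, Sequence

# Packages that show up in the tracebacks above, plus autoawq itself.
PACKAGES = ("autoawq", "datasets", "fsspec", "huggingface_hub", "transformers")


def installed_version(package: str) -> Optional[str]:
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def version_report(packages: Sequence[str] = PACKAGES) -> Dict[str, Optional[str]]:
    """Map each package name to its installed version (None if missing)."""
    return {name: installed_version(name) for name in packages}


if __name__ == "__main__":
    for name, ver in version_report().items():
        print(f"{name}=={ver}" if ver else f"{name}: not installed")
```

Running this in the environment that fails and in one that works (e.g. with datasets==2.21.0) should make the relevant difference easier to spot.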