Skip to content

Commit 4a71563

Browse files
committed
fix qwen mxfp4
Signed-off-by: yiliu30 <yi4.liu@intel.com>
1 parent a2257bc commit 4a71563

File tree

1 file changed

+3
-1
lines changed
  • examples/pytorch/nlp/huggingface_models/language-modeling/quantization/auto_round/qwen

1 file changed

+3
-1
lines changed

examples/pytorch/nlp/huggingface_models/language-modeling/quantization/auto_round/qwen/quantize.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -62,10 +62,12 @@ def quant_model(args):
6262
quant_config = AutoRoundConfig(
6363
tokenizer=tokenizer,
6464
scheme=config["scheme"],
65-
enable_torch_compile=args.enable_torch_compile,
65+
enable_torch_compile=True,
6666
iters=config["iters"],
6767
fp_layers=config["fp_layers"],
6868
export_format=export_format,
69+
disable_opt_rtn=True,
70+
low_gpu_mem_usage=True,
6971
output_dir=output_dir,
7072
)
7173

0 commit comments

Comments
 (0)