add support for scheme FP8_STATIC to export llm_compressor format #816
Conversation
Signed-off-by: n1ck-guo <heng.guo@intel.com>
Pull Request Overview
This PR adds support for the FP8_STATIC quantization scheme when exporting models in the llm_compressor format. The change enables static FP8 quantization of both weights and activations, with the configuration required for compressed-tensors compatibility.
Key Changes
- Adds FP8_STATIC scheme detection and format conversion to llm_compressor
- Implements static FP8 quantization export with compressed-tensors configuration
- Consolidates common save functionality across export modules
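The "static" part of FP8_STATIC means each tensor's scale is computed once (offline, from calibration data) rather than per batch at runtime. A minimal plain-Python sketch of that per-tensor scaling idea follows; the function names and the clamp-only rounding shortcut are illustrative, not code from this PR:

```python
# Hypothetical sketch of per-tensor static FP8 (E4M3) scaling.
# Real implementations use torch.float8_e4m3fn; here we only model the scale.
FP8_E4M3_MAX = 448.0  # largest finite value representable in float8_e4m3fn

def static_fp8_scale(values):
    """Pre-compute a static per-tensor scale mapping |max| onto the FP8 range."""
    amax = max(abs(v) for v in values)
    return max(amax, 1e-12) / FP8_E4M3_MAX  # guard against all-zero tensors

def fake_quantize(values):
    """Quantize-dequantize: scale down, clamp to the FP8 range, scale back up.
    (True FP8 rounding to the E4M3 grid is omitted for brevity.)"""
    scale = static_fp8_scale(values)
    out = [max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, v / scale)) * scale for v in values]
    return out, scale

out, scale = fake_quantize([448.0, -224.0])
print(scale)  # 1.0 for this input, since |max| already equals FP8_E4M3_MAX
```

At inference time the stored scale is reused as-is, which is what makes the scheme "static" as opposed to dynamic activation quantization.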
Reviewed Changes
Copilot reviewed 10 out of 10 changed files in this pull request and generated 4 comments.
Show a summary per file
| File | Description |
|---|---|
| auto_round/utils.py | Modified is_static_wfp8afp8 to accept string parameters for format detection |
| auto_round/export/utils.py | Added shared save function to reduce code duplication across export modules |
| auto_round/export/export_to_llmcompressor/export_to_static_fp.py | New module implementing FP8_STATIC export with compressed-tensors configuration |
| auto_round/export/export_to_llmcompressor/export.py | Added FP8_STATIC support to the main export dispatcher |
| auto_round/autoround.py | Added FP8_STATIC format detection and validation logic |
| auto_round/export/export_to_awq/export.py | Refactored to use shared save function |
| auto_round/export/export_to_autoround/export_to_fp8.py | Renamed class and refactored to use shared save function |
| auto_round/export/export_to_autoround/export.py | Refactored to use shared save function |
| auto_round/export/export_to_autogptq/export.py | Refactored to use shared save function |
| test/test_cpu/test_llmcompressor.py | Added test case for FP8_STATIC export validation |
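For context on the "compressed-tensors configuration" the table mentions: exports in this format typically embed a `quantization_config` block in the model's `config.json`. The sketch below shows an assumed shape of such a block; the field names follow the general compressed-tensors convention but are not copied from this PR:

```python
# Illustrative (assumed) quantization_config for a static FP8 export.
# Keys and values are a sketch, not the PR's actual output.
quantization_config = {
    "quant_method": "compressed-tensors",
    "format": "float-quantized",
    "config_groups": {
        "group_0": {
            "targets": ["Linear"],
            "weights": {
                "num_bits": 8,
                "type": "float",
                "strategy": "tensor",   # one static scale per tensor
                "symmetric": True,
                "dynamic": False,
            },
            "input_activations": {
                "num_bits": 8,
                "type": "float",
                "strategy": "tensor",
                "symmetric": True,
                "dynamic": False,       # static: scales calibrated offline
            },
        }
    },
}
print(quantization_config["format"])
```

Consumers such as vLLM read this block to decide how to dequantize the checkpoint at load time.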
for more information, see https://pre-commit.ci
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Support it in the AutoRound format as well, and add nvfp4/fp8_static support on the vLLM side later.
Signed-off-by: n1ck-guo <heng.guo@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
This reverts commit 038ff1d.