[`model_free_ptq`] Add pathway for day-zero weight quantization support #1971

kylesayrs · 2025-10-28T21:41:17Z

Purpose

Create a pathway which can quantize model weights without needing a model definition or the use of a calibration pipeline. Such a pathway provides fast and reliable support for models which:
- Do not have a HF model definition yet
- Have complications with sequential pipelines (very large vision towers, tracing failure, long calibration runtime)

Usage

model_free_ptq(
    model_stub="meta-llama/Llama-3.2-1B-Instruct",
    save_directory="Llama-3.2-1B-Instruct-FP8_block",
    scheme="FP8_BLOCK",
    ignore=["model.embed_tokens", "lm_head"],
    max_workers=15,
    device="cuda:0",
):

Testing

Added test_model_free_ptq_matches_oneshot which tests that saved tensors and configs exactly match between model_free_ptq and oneshot entrypoints for the same arguments. This test takes about 10 seconds to run.

Future Extensions

Mixed-precision quantization (multiple recipes/targets)
Multi-GPU support (work is already parallelized by threads, but if GPU is the bottleneck we can split the work across GPUs)
Multi-process support (is python processing is the bottleneck, we can replace multithreading with multiprocessing)

github-actions · 2025-10-28T21:41:25Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs force-pushed the kylesayrs/weights-only branch from f4423c1 to 294a78a Compare October 30, 2025 20:21

kylesayrs changed the base branch from main to 03_untie_fix October 31, 2025 02:41

Base automatically changed from 03_untie_fix to main October 31, 2025 16:22

kylesayrs changed the title ~~[PTQ] weights_ptq pathway for day-zero weight quantization support~~ [Weights-only] weights_ptq pathway for day-zero weight quantization support Nov 3, 2025

kylesayrs changed the title ~~[Weights-only] weights_ptq pathway for day-zero weight quantization support~~ [Weights-only] ptq_weights pathway for day-zero weight quantization support Nov 3, 2025

kylesayrs changed the title ~~[Weights-only] ptq_weights pathway for day-zero weight quantization support~~ [Weights-only] ptq_weights pathway for day-zero weight quantization support Nov 3, 2025

ptq_weights

6fe9db9

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs force-pushed the kylesayrs/weights-only branch from 1c56a75 to 6fe9db9 Compare November 3, 2025 16:32

fix rebase

4f366d7

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs mentioned this pull request Nov 3, 2025

Switch backend to use llm-compressor neuralmagic/AutoFP8#33

Open

kylesayrs added 2 commits November 3, 2025 16:46

fix tests

fd88038

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

docstrings

221de59

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs marked this pull request as ready for review November 3, 2025 18:02

kylesayrs added 2 commits November 3, 2025 18:15

model_free_ptq

f1f0e29

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

rename tests

81cfe50

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs changed the title ~~[Weights-only] ptq_weights pathway for day-zero weight quantization support~~ [model_free_ptq] Add pathway for day-zero weight quantization support Nov 3, 2025

change naem

9790ac9

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[`model_free_ptq`] Add pathway for day-zero weight quantization support #1971

[`model_free_ptq`] Add pathway for day-zero weight quantization support #1971

Uh oh!

kylesayrs commented Oct 28, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[model_free_ptq] Add pathway for day-zero weight quantization support #1971

Are you sure you want to change the base?

[model_free_ptq] Add pathway for day-zero weight quantization support #1971

Uh oh!

Conversation

kylesayrs commented Oct 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Usage

Testing

Future Extensions

Uh oh!

github-actions bot commented Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[`model_free_ptq`] Add pathway for day-zero weight quantization support #1971

[`model_free_ptq`] Add pathway for day-zero weight quantization support #1971

kylesayrs commented Oct 28, 2025 •

edited

Loading