Feature/calibration data device by avtc · Pull Request #2421 · ModelCloud/GPTQModel

avtc · 2026-02-19T16:16:37Z

@Qubitium Hi, this feature allows specify in config the device where calibration data inputs/outputs will be stored, allowing to use more calibration data samples for quantization, because calibration data can be placed on device different to cuda:0 which already stores all layer modules.

Before the feature initial calibration data was stored on CPU and after first pass it was stored on DEVICE_0 (cuda:0 usually).
After the feature if calibration_data_device is not set initial behavior preserved.
calibration_data_device can be set to "cpu", "cuda:1" (or any other torch device), and to "balanced" - in "balanced" mode calibration data distributed between compute devices available: DEVICE_0 .. DEVICE_N

P.S. I have used this feature previously several times but on another old branch. This PR is based on latest master.
Also I have fixed examples in config file for using moe parameter, and fixed sys.abiflags typo which failed build.

Note: the handling of layer with all modules excluded from quantization was also fixed, as current main code did not do forward replay it seems.

I have run several small tests (few first layers) ensuring nothing fail with auto_forward_data_parallel enabled and disabled, on qwen3-30b-a3b with calibration_data_device set to cpu, cuda:1, balanced and removed from config.

…anced to distribute between compute devices

Qubitium · 2026-02-21T18:36:47Z

@avtc Thanks again for amother gem! Can you whip up some unit tests so there is good test coverage on the diffs so I can run it ok our gpus and check for regressions.

…ss" - filter only by compute_device_filter

avtc · 2026-02-23T18:39:08Z

@Qubitium I have added tests with help of GLM-5, please review if it is OK

Qubitium · 2026-02-24T05:36:18Z

@avtc Will be checking and merging in the next 48 hours.

avtc added 3 commits February 19, 2026 16:05

feat: calibration_data_device - specific device cpu/cuda:1 etc or bal…

b210aa7

…anced to distribute between compute devices

update config examples for "moe" parameter with batch_size specified

90720fd

fix typo sys.abiflags

0339f2a

avtc marked this pull request as draft February 23, 2026 15:08

avtc added 4 commits February 23, 2026 17:09

tests added. removed "Exclude calibration data device from forward pa…

7f4e0bd

…ss" - filter only by compute_device_filter

fixing tests

ad9dcfc

fixing tests - 2

6a546bf

fixing tests - 3

bb8b12f

avtc marked this pull request as ready for review February 23, 2026 18:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/calibration data device#2421

Feature/calibration data device#2421
avtc wants to merge 7 commits intoModelCloud:mainfrom
avtc:feature/calibration-data-device

avtc commented Feb 19, 2026 •

edited

Loading

Uh oh!

Qubitium commented Feb 21, 2026

Uh oh!

avtc commented Feb 23, 2026

Uh oh!

Qubitium commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

avtc commented Feb 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Qubitium commented Feb 21, 2026

Uh oh!

avtc commented Feb 23, 2026

Uh oh!

Qubitium commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

avtc commented Feb 19, 2026 •

edited

Loading