Prototyping load weight scale for qwen3. #741

inho9606 · 2025-09-25T04:08:07Z

Description

This PR is a prototype to load weight_scale values from the Qwen3 checkpoint.

first commit
- Reads .safetensors file as a PT framework first since the "flax" framework Numpy does not support float8. And then it converts the tensor to jnp. As this commit modifies utility functions, it may cause unexpected errors by other models using the same utility functions.
Second function
- Reads the weight_scale from weight files and save them in Qwen3ForCausalLM instance with a new attribute named 'quant_scales'.
- I think it may better have them in nnx.Module with other layers, but it is quite complex.. As it is a prototyping, I implemented it with the easier way first.

FIXES: b/446023123

Tests

the following command runs the model on JAX path loading weight_scales:

python3 examples/offline_inference.py --model=RedHatAI/Qwen3-32B-FP8-dynamic --tensor_parallel_size=8 --task=generate --max_model_len=1024 --download_dir=/mnt/disks/persist

Signed-off-by: inho9606 <inhoseo@google.com>

inho9606 closed this Sep 25, 2025

inho9606 reopened this Sep 25, 2025

inho9606 requested review from BirdsOfAFthr and kyuyeunk and removed request for kyuyeunk September 25, 2025 05:19

inho9606 added 2 commits September 26, 2025 00:46

convert otrch tensor to jnp for float8 dtype support

b9e8e85

Signed-off-by: inho9606 <inhoseo@google.com>

Prototyping weight_scale load from weight files for Qwen3

ae345c3

Signed-off-by: inho9606 <inhoseo@google.com>

inho9606 force-pushed the prototyping_load_weight_scale_for_qwen3 branch from a4f5732 to ae345c3 Compare September 26, 2025 01:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Prototyping load weight scale for qwen3. #741

Prototyping load weight scale for qwen3. #741

Uh oh!

inho9606 commented Sep 25, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Prototyping load weight scale for qwen3. #741

Are you sure you want to change the base?

Prototyping load weight scale for qwen3. #741

Uh oh!

Conversation

inho9606 commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

inho9606 commented Sep 25, 2025 •

edited

Loading