Skip to content

Conversation

@yiliu30
Copy link
Contributor

@yiliu30 yiliu30 commented Sep 22, 2025

No description provided.

Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
@yiliu30 yiliu30 marked this pull request as ready for review September 22, 2025 10:25
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
@yiliu30 yiliu30 changed the title [WIP]NVFP4 Loading support NVFP4 Loading support Sep 24, 2025
Signed-off-by: yiliu30 <yi4.liu@intel.com>
@yiliu30
Copy link
Contributor Author

yiliu30 commented Sep 24, 2025

Local tests passed:

==================================== PASSES ====================================
=========================== short test summary info ============================
SKIPPED [1] test/test_cuda/test_transformers.py:166: test requires Intel Extension for PyTorch to be installed and match current PyTorch version, see https://github.com/intel/intel-extension-for-pytorch
SKIPPED [1] test/test_cuda/test_transformers.py:136: test requires multiple CUDA GPUs
SKIPPED [1] test/test_cuda/test_transformers.py:101: test requires Intel Extension for PyTorch to be installed and match current PyTorch version, see https://github.com/intel/intel-extension-for-pytorch
============ 6 passed, 3 skipped, 17 warnings in 180.40s (0:03:00) =============


==================================== PASSES ====================================
_______________________ test_e2e_quant_and_infer[mxfp8] ________________________
------------------------------ Captured log call -------------------------------
WARNING  bitsandbytes.cextension:cextension.py:94 Could not find the bitsandbytes CUDA binary at PosixPath('/home/yliu7/miniforge3/envs/ao/lib/python3.11/site-packages/bitsandbytes/libbitsandbytes_cuda128.so')
WARNING  bitsandbytes.cextension:cextension.py:101 The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.
=================== 3 passed, 1 warning in 240.91s (0:04:00) ===================

Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
@yiliu30 yiliu30 merged commit 0751337 into main Sep 24, 2025
8 checks passed
@yiliu30 yiliu30 deleted the nvfp4 branch September 24, 2025 06:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants