Skip to content

Conversation

@tianleiwu
Copy link
Contributor

@tianleiwu tianleiwu commented Jul 16, 2025

Description

Fix cuda build error when DEBUG_GENERATION is defined.

Motivation and Context

In #24821, a dumping API was removed:
void Print(const char* name, int index, bool end_line)
But related code is not updated.

In MatMulNBits, there is a recent change to add bfloat16 support, but the tensor dumper only support BFloat16 but not __nv_bfloat16. This PR adds functions to support __nv_bfloat16 in cuda tensor dumper.

@tianleiwu tianleiwu merged commit 2d6a525 into main Jul 16, 2025
93 checks passed
@tianleiwu tianleiwu deleted the tlwu/fix_tensor_dump branch July 16, 2025 18:06
qti-yuduo pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Aug 8, 2025
### Description

Fix cuda build error when DEBUG_GENERATION is defined.

### Motivation and Context

In microsoft#24821, a dumping API
was removed:
`void Print(const char* name, int index, bool end_line)`
But related code is not updated.

In MatMulNBits, there is a recent change to add bfloat16 support, but
the tensor dumper only support BFloat16 but not __nv_bfloat16. This PR
adds functions to support __nv_bfloat16 in cuda tensor dumper.
sanketkaleoss pushed a commit to sanketkaleoss/onnxruntime that referenced this pull request Aug 11, 2025
### Description

Fix cuda build error when DEBUG_GENERATION is defined.

### Motivation and Context

In microsoft#24821, a dumping API
was removed:
`void Print(const char* name, int index, bool end_line)`
But related code is not updated.

In MatMulNBits, there is a recent change to add bfloat16 support, but
the tensor dumper only support BFloat16 but not __nv_bfloat16. This PR
adds functions to support __nv_bfloat16 in cuda tensor dumper.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants