-
Couldn't load subscription status.
- Fork 285
[BugFix] Correct direct copy from bf16 to fp8 #1090
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
Cunxiao2002
wants to merge
20
commits into
tile-ai:main
Choose a base branch
from
Cunxiao2002:fix_1046
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+61
−18
Open
Changes from all commits
Commits
Show all changes
20 commits
Select commit
Hold shift + click to select a range
9923cd8
[BugFix] Correct direct copy from bf16 to fp8
d2e495f
Merge branch 'main' into fix_1046
cb907dd
fix lint
8281b05
implement overloaded cast codegen for type conversion
3051cf4
fix lint
37804b3
remove test
999e74e
fix lint
900ae67
trigger CI
5c25147
Overload fp8 for implicit conversion
c49edee
format
cda16d5
Merge branch 'main' into fix_1046
0aad651
new format
e448754
fix: Reinterpret types to cute types in GEMM
dad541c
new format
76c0b1a
fix lint
4741fd6
Merge branch 'main' into fix_1046
6d885a4
new format
7f1a507
fix lint
19e8a7d
Merge branch 'main' into fix_1046
Cunxiao2002 521c0bb
format
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Explicitly include cuda_bf16.h to guarantee __nv_bfloat16 availability.
Avoid relying on transitive includes; add the CUDA header so host/RTC builds consistently see __nv_bfloat16.
Apply this diff near the existing cuda_runtime include:
#ifndef __CUDACC_RTC__ #include <cuda_runtime.h> +#include <cuda_bf16.h> #endif🤖 Prompt for AI Agents