Commit 0841cf1
committed
mx roofline: adjust mxfp8 formulas
Summary:
It's not clear whether we can write a fast dim0 + dim1 cast kernel, so
adjusting the roofline estimation formulas to use separate dim0 and dim1 kernels
Test Plan:
```
python benchmarks/float8/float8_roofline.py ~/local/tmp/20250325_b200_mxfp8_v2_triton.csv --mx_recipe_name mxfp8_cublas --shape_gen_name pow2_extended
```
Reviewers:
Subscribers:
Tasks:
Tags:
ghstack-source-id: 66a95b3
ghstack-comment-id: 2752441017
Pull Request resolved: #19531 parent 36b6545 commit 0841cf1
1 file changed
+9
-20
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
183 | 183 | | |
184 | 184 | | |
185 | 185 | | |
186 | | - | |
187 | | - | |
188 | | - | |
189 | | - | |
190 | | - | |
191 | | - | |
192 | | - | |
193 | | - | |
194 | | - | |
195 | | - | |
196 | | - | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
197 | 192 | | |
198 | | - | |
199 | | - | |
200 | | - | |
201 | | - | |
202 | | - | |
203 | | - | |
204 | | - | |
205 | | - | |
206 | | - | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
207 | 196 | | |
208 | 197 | | |
209 | 198 | | |
| |||
0 commit comments