We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 9fcebdc commit f199105Copy full SHA for f199105
benchmark/matmul_fp8/README.md
@@ -27,10 +27,10 @@ PY
27
28
| K | Latency (s) | Throughput (TFLOPs) |
29
|-------|-------------|---------------------|
30
-| 256 | 0.091488 | 0.376 |
31
-| 512 | 0.110496 | 0.622 |
32
-| 1024 | 0.148256 | 0.927 |
33
-| 2048 | 0.234080 | 1.174 |
34
-| 4096 | 0.398944 | 1.378 |
35
-| 8192 | 0.752416 | 1.461 |
36
-| 16384 | 1.443808 | 1.523 |
+| 256 | 0.091488 | 376 |
+| 512 | 0.110496 | 622 |
+| 1024 | 0.148256 | 927 |
+| 2048 | 0.234080 | 1174 |
+| 4096 | 0.398944 | 1378 |
+| 8192 | 0.752416 | 1461 |
+| 16384 | 1.443808 | 1523 |
0 commit comments