You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -82,6 +84,9 @@ In this section, you’ll learn how to write and execute a straightforward GEMM
82
84
Below is an example that demonstrates more advanced features: layout annotation, parallelized copy, and swizzle for improved L2 cache locality. This snippet shows how to adapt your kernel to maximize performance on complex hardware.
83
85
84
86
```python
87
+
# Copyright (c) Microsoft Corporation.
88
+
# Licensed under the MIT License.
89
+
import tilelang
85
90
import tilelang.language as T
86
91
# `make_mma_swizzle_layout` is a python defined layout function
87
92
# specifically designed for for MMA operations
@@ -91,6 +96,7 @@ from tilelang.intrinsics import (
91
96
make_mma_swizzle_layout as make_swizzle_layout,)
92
97
93
98
defmatmul(M, N, K, block_M, block_N, block_K, dtype="float16", accum_dtype="float"):
99
+
# add decorator @tilelang.jit if you want to return a torch function
# 4. Retrieve and inspect the generated CUDA source (optional)
172
+
cuda_source = jit_kernel.get_kernel_source()
173
+
print("Generated CUDA kernel:\n", cuda_source)
174
+
175
+
# 5.Pofile latency with kernel
176
+
profiler = jit_kernel.get_profiler()
177
+
178
+
latency = profiler.do_bench()
179
+
180
+
print(f"Latency: {latency} ms")
136
181
```
137
182
138
183
### Dive Deep into TileLang Beyond GEMM
@@ -152,4 +197,4 @@ TileLang has now been used in project [BitBLAS](https://github.com/microsoft/Bit
152
197
153
198
## Acknowledgements
154
199
155
-
We learned a lot from the [TVM](https://github.com/apache/tvm) community and would like to thank them for their contributions.
200
+
We learned a lot from the [TVM](https://github.com/apache/tvm) community and would like to thank them for their contributions. The initial version of this project is mainly contributed by [LeiWang1999](https://github.com/LeiWang1999), [chengyupku](https://github.com/chengyupku) and [nox-410](https://github.com/nox-410). Part of this work was done during the internship at Microsoft Research, under the supervision of Dr. Lingxiao Ma, Dr. Yuqing Xia, Dr. Jilong Xue, and Dr. Fan Yang.
Copy file name to clipboardExpand all lines: docker/README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
To ease the process of installing all the dependencies, we provide a Dockerfile and a simple guideline to build a Docker image with all of above installed. The Docker image is built on top of Ubuntu 20.04, and it contains all the dependencies required to run the experiments. We only provide the Dockerfile for NVIDIA GPU, and the Dockerfile for AMD GPU will be provided upon request.
Copy file name to clipboardExpand all lines: docs/Installation.md
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,7 +9,7 @@
9
9
10
10
The easiest way to install TileLang is directly from the PyPi using pip. To install the latest version, run the following command in your terminal.
11
11
12
-
**Note**: Currently, TileLang whl is only supported on Ubuntu 20.04 or later version as we build the whl files on this platform. Currently we only provide whl files for CUDA>=11.0 and with Python>=3.8. **If you are using a different platform or environment, you may need to [build TileLang from source](https://github.com/TileLang/tile-lang/blob/main/docs/Installation.md#building-from-source).**
12
+
**Note**: Currently, TileLang whl is only supported on Ubuntu 20.04 or later version as we build the whl files on this platform. Currently we only provide whl files for CUDA>=11.0 and with Python>=3.8. **If you are using a different platform or environment, you may need to [build TileLang from source](https://github.com/tile-ai/tilelang/blob/main/docs/Installation.md#building-from-source).**
0 commit comments