Skip to content

Commit 5118085

Browse files
committed
Update blog
1 parent 76a629a commit 5118085

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

content/blog/2025-11-05-1762335811.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,9 @@ Some final takeaways:
1818

1919
## ML compilers might break CUDA's moat (and fix AMD's ROCm support)
2020

21-
It's pretty clear that ML compilers are going to be a big deal. NVIDIA's TensorRT is also an ML compiler, but it only targets their GPUs. Once the generated machine code (from cross-vendor ML compilers) is comparable in performance to hand-tuned kernels, these compilers are going to break the (in)famous moat of CUDA. And thankfully, this will also finally make AMD's consumer GPUs more accessible to developers (by making AMD's terrible support for ROCm on consumer GPUs unnecessary). Yes, cheap shot, but I've lost a lot of hair trying to support AMD's consumer GPUs over the years.
21+
It's pretty clear that ML compilers are going to be a big deal. NVIDIA's TensorRT is also an ML compiler, but it only targets their GPUs. Once the generated machine code (from cross-vendor ML compilers) is comparable in performance to hand-tuned kernels, these compilers are going to break the (in)famous moat of CUDA.
22+
23+
And thankfully, this will also finally make AMD's consumer GPUs more accessible to developers (by making AMD's terrible support for ROCm on consumer GPUs unnecessary). Yes, cheap shot, but I've lost a lot of hair trying to support AMD's consumer GPUs over the years.
2224

2325
Hand-written kernels could go the way of hand-written assembly code. This was always going to happen eventually, but I think it's pretty close now.
2426

0 commit comments

Comments
 (0)