Skip to content

Commit a9de2f9

Browse files
committed
Update blog
1 parent a22628d commit a9de2f9

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

content/blog/2025-10-27-1761560082.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ For e.g. reading constantly from the Global Memory is like driving between the f
2121
Therefore the job of running a computation graph (like ONNX) efficiently on GPU(s) is like planning the logistics of a manufacturing company. You've got raw materials in the main warehouse that you need to transfer between cities, and store/process/transfer artifacts across different factories and machines. You need to make sure that:
2222
- the production process follows the chart laid out in the computation graph.
2323
- every machine in each factory is being utilized optimally
24-
- account for the time it takes to move things between cities/factories/machines.
24+
- account for the time it takes to move things between cities/factories/machines
2525

2626
And most importantly, you need to focus on your overall goal, i.e. either the time it takes to produce the finished product (i.e. latency) or maximum utilisation of all your machines (i.e. throughput).
2727

0 commit comments

Comments
 (0)