2x speedup with IBM foundation stack #1615

casper-hansen · 2023-11-09T22:07:58Z

They rewrite Llama model definition to be compatible with torch.compile. Completely open source.

WoosukKwon · 2023-11-09T23:51:50Z

Hi @casper-hansen, thanks for bringing it up. Indeed, we are actively investigating torch.compile support.

hmellor · 2024-03-13T12:58:40Z

Closing because torch.compile is on the roadmap #2681

WoosukKwon added the performance Performance-related issues label Nov 9, 2023

hmellor closed this as completed Mar 13, 2024

Provide feedback