Closed as not planned
Description
Microsoft have claimed that ”Splitwise“ is supported in vLLM, see
https://www.microsoft.com/en-us/research/blog/splitwise-improves-gpu-usage-by-splitting-llm-inference-phases/
So how to use it in vLLM? I could not find keyword about ”Splitwise“.