Closed
Description
Hi vLLM genius @zhuohan123 @WoosukKwon
I find a new project https://github.com/ModelTC/lightllm
After reading their blog, the performance advantage on the 7b model is not very obvious, but the gap is larger on the 65b. We will also do some verification and comparison later. The reason for bringing up this issue is to hope that we may see what the LightLLM does well, so that we can refer to and port similar optimizations to vLLM. Cheers.