I think it would be better if we could provide dockerfile in vLLM, for distributed serving, and single-gpu serving. Then we could use it on Kubernetes or other container-based environments.