Closed
Description
@infwinston Feel free to use FastChat's completion template to implement a chat completion endpoint in our demo server. You can use the completion API as a reference:
vllm/vllm/entrypoints/openai/api_server.py
Lines 88 to 101 in 9d27b09