Skip to content

Commit

Permalink
Fix #3501
Browse files Browse the repository at this point in the history
VLLM needs top_j to be int, not float
  • Loading branch information
surak authored Sep 23, 2024
1 parent e208d56 commit bbce136
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion fastchat/serve/vllm_worker.py
Original file line number Diff line number Diff line change
Expand Up @@ -71,7 +71,7 @@ async def generate_stream(self, params):
request_id = params.pop("request_id")
temperature = float(params.get("temperature", 1.0))
top_p = float(params.get("top_p", 1.0))
top_k = params.get("top_k", -1.0)
top_k = params.get("top_k", -1)
presence_penalty = float(params.get("presence_penalty", 0.0))
frequency_penalty = float(params.get("frequency_penalty", 0.0))
max_new_tokens = params.get("max_new_tokens", 256)
Expand Down

0 comments on commit bbce136

Please sign in to comment.