You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
SUMMARY:
Adds end to end model tests
TEST PLAN:
Compares logprobs of results from hf model vs vllm model at fp16 and
bfloat16
---------
Co-authored-by: Michael Goin <michael@neuralmagic.com>
I noticed that we use conditions like this to check whether it is greedy sampling
https://github.com/WoosukKwon/cacheflow/blob/189ae231336857bcc4c6f6157bf7868cdf56fb5f/cacheflow/sampling_params.py#L45
However, I guess this will result in several problems
==
for floating point numbersI typically use something like this https://github.com/lm-sys/FastChat/blob/a94fd259a97128f7f4483ddb760690f467888d84/fastchat/serve/inference.py#L227
@WoosukKwon, @zhuohan123 What do you think? If you are happy, I can change all "==" to "<=".
The text was updated successfully, but these errors were encountered: