Using flashinfer in sglang with google/gemma-7b-it
File "/home/ubuntu/sglang-venv/lib/python3.11/site-packages/flashinfer/prefill.py", line 462, in forward
return self._wrapper.forward(
^^^^^^^^^^^^^^^^^^^^^^
RuntimeError: BatchPrefillWithPagedKVCache failed to dispatch with dtype Half
I don't know if this is caused by Gemma's bfloat16 dtype or my inappropriate usage.