[SpecDecode][Kernel] Use Flashinfer for Rejection Sampling in Speculative Decoding #7244
Merged
youkaichao merged 20 commits into vllm-project:main from LiuXiaoxuanPKU:flashinfer-rejection-sampler on Sep 2, 2024
+306 -109
Commits
Commits on Aug 6, 2024
Commits on Aug 7, 2024
Commits on Aug 12, 2024
Commits on Aug 13, 2024
Commits on Aug 18, 2024
Commits on Aug 20, 2024
Commits on Aug 21, 2024
Commits on Aug 28, 2024
Commits on Aug 29, 2024
Commits on Aug 30, 2024