Skip to content

[SpecDecode][Kernel] Use Flashinfer for Rejection Sampling in Speculative Decoding#7244

Merged
youkaichao merged 20 commits intovllm-project:mainfrom LiuXiaoxuanPKU:flashinfer-rejection-samplerSep 2, 2024

Commits

Commits on Aug 6, 2024

Commits on Aug 7, 2024

Commits on Aug 13, 2024

Commits on Aug 20, 2024

Commits on Aug 21, 2024

Commits on Aug 28, 2024

Commits on Aug 29, 2024

Commits on Aug 30, 2024

Commits on Aug 31, 2024