It seems that assisted generation can further reduce sampling latency. Is there scope for adding support for that in vllm? Assisted generation [docs](https://huggingface.co/blog/assisted-generation)