Closed
Description
🚀 The feature, motivation and pitch
Recently outlines updated their interface from FSM to Guide to support "acceleration"/"fast-forward" which will output next sets of tokens if they are directly available. For JSON schema, the cases are the keys, the "
, and }
etc.
This is non-trivial but very useful to improve vLLM for. It should also help other framework like AICI #3714.
Alternatives
No response
Additional context
No response