-
Notifications
You must be signed in to change notification settings - Fork 84
Labels
enhancementNew feature or requestNew feature or request
Description
Is your feature request related to a problem?
- Performance Gain
- Remove all the mess with preparing hidden states / mask... in the executor.
prepare_batch. With external paged kernel, we could simplify ourexecutorworkflow.
Describe the Solution you'd like
- Update KV Cache Manager to include Paged KV;
- Check how to leverage open-sourced paged KV kernel on MAC;
Alternatives Considered (Optional)
No response
Additional Context (Optional)
No response
Metadata
Metadata
Labels
enhancementNew feature or requestNew feature or request