Skip to content

an override flag for the size of per layer KV caches? #13568

Thellton started this conversation in Ideas
Discussion options

You must be logged in to vote

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@Thellton
Comment options

@ggerganov
Comment options

@Thellton
Comment options

@Thellton
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Ideas
Labels
None yet
3 participants