You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To ensure consistent hashing from prompt tokens to block hashes that align with the underlying KVCache index, the block hasher logic must be factored out into a single, shared implementation.
Right now, if each prefix store implements its own block hashing, there’s a risk of divergence. Instead, we should rely on one canonical block hasher (compatible with vLLM’s prefix caching design) so that block boundaries and hash values are uniform across all stores.