It says "(WIP) prefix-aware routing" in the README here:
https://github.com/vllm-project/production-stack/tree/main/src/router
A very quick scan through the Python files there, I can't see anything.
Can the results in the blog be reproduced using the code that is currently in this repo?