Frictionless Computing: Entropy-Based Operation Filtering for 10x-1000x Effective Speedup
Updated Dec 28, 2025
Backend-agnostic benchmarking suite for evaluating LLM inference systems across local runtimes and hosted APIs, with a focus on latency, throughput, token efficiency, and runtime stability.