CacheRoute is an innovative LLM scheduling scheme dedicated to enabling flexible KV cache reuse across LLM systems, improving task performance and system efficiency.
network routing knowledge-injection llm vllm llm-inference kvcache lmcache llm-task-scheduling kvcache-reuse
-
Updated
Jul 2, 2026 - Python