[2025?7?23?]() [Benchmark](https://identia.digital/lmcache/en/category/benchmark/), [decoding](https://identia.digital/lmcache/en/tag/decoding-en/), [spec decode](https://identia.digital/lmcache/en/tag/spec-decode-en/), [speculative](https://identia.digital/lmcache/en/tag/speculative-en/)
???Kuntai Du
??????LMCache Lab ????????????/???????????????60%??
—
?????? KV cache?????? LMCache Lab——??LLM?prefilling?????????????????????????decoding??????LLM?????????????????????????????????????????????? LLM ???????:money_with_wings:
???decoding?????????
???????????????????????token??????????token?????? 60%?????????/?????????????????????????????????????????——??????????????????????????
Benchmarks:bar_chart:
?????????? vLLM ? Python ???docstrings????????????????

????????????????????VLLM?????60%
??:wrench:
????????????????????????????????????

??????????????????
????????????early access?????????????????????????????
??????:raised_hands:
????????????????????????LMIgnite????????LMCache Lab ?????——????????????????????[????](https://lmignite.tensormesh.ai/)???????????????????????????????????