Explore practical recipes for deploying LMCache across different model architectures, serving engines, storage backends, and environments, along with roadmap updates and contribution guidelines for the open-source community.
Recipes
Recipes are practical deployment guides that show how to launch LMCache in a specific setup, including supported serving engines, compatible LMCache functionalities, and any known limitations or configuration notes.
Qwen3 MoE
A mixture-of-experts architecture designed to improve scaling efficiency by activating only a subset of model parameters per token.
Whether you’re fixing a bug, improving docs, adding model support, writing tests, or helping other users, there are many ways to contribute to LMCache.
Contribution Guide
Learn how to open issues, submit pull requests, follow the review process, and contribute code, documentation, tests, or new model support.