  • Docs
  • Resources
  • Community
  • Blog
  • Demo
  • Contacts
  • 中文

Author: slshen

LMCache Multi-node P2P CPU Memory Sharing & Control: From Experimental Feature to Production

Baolong Mao (Tencent), Chunxiao Zheng (Tencent), Weishu Deng (Tensormesh), Darren Peng (Tensormesh), Samuel Shen (Tensormesh)

What is P2P and what does it promise?

In this blog post, we will go over:

Most production vLLM deployments run multiple identical instances behind a load balancer. Each instance builds its own KV cache only from the traffic it […]

  • EXPLORE
  • GitHub
  • Documentation
  • Blog
  • Community
  • Slack
  • Meetings
  • Contributing
  • Code of Conduct
  • Tools
  • KV Cache Calculator
  • KV Cache Memory Explosion

LMCache is a community-driven open-source project. Contributions are welcome.
© 2026 LMCache Contributors.
