README.md: 7 additions & 3 deletions
@@ -29,8 +29,10 @@ limitations under the License. -->
 
 
 ### 📢 News
-- 🎉 We recently have released our [xLLM Technical Report](https://arxiv.org/abs/2510.14686) on arXiv, providing comprehensive technical blueprints and implementation insights.
-
+- 2025-12-05: 🎉 We now support high-performance inference for the [GLM-4.5/GLM-4.6](https://github.com/zai-org/GLM-4.5/blob/main/README_zh.md) series models.
+- 2025-12-05: 🎉 We now support high-performance inference for the [VLM-R1](https://github.com/om-ai-lab/VLM-R1) model.
+- 2025-12-05: 🎉 We have built hybrid KV cache management based on [Mooncake](https://github.com/kvcache-ai/Mooncake), supporting global KV cache management with intelligent offloading and prefetching.
+- 2025-10-16: 🎉 We recently released our [xLLM Technical Report](https://arxiv.org/abs/2510.14686) on arXiv, providing comprehensive technical blueprints and implementation insights.
 
 
 ## 1. Project Overview
@@ -112,6 +114,8 @@ Supported models list:
 - Qwen2.5-VL
 - Qwen3 / Qwen3-MoE
 - Qwen3-VL / Qwen3-VL-MoE
+- GLM4.5 / GLM4.6
+- VLM-R1
 
 ---
 
@@ -244,4 +248,4 @@ If you think this repository is helpful to you, welcome to cite us: