docs: add GLM-4.6V model support in README. (#499)

yq33victor · web-flow · commit 03c4676a8bd3 · 2025-12-08T21:05:28.000+08:00
Signed-off-by: pengtao.156 <pengtao.156@jd.com>
diff --git a/README.md b/README.md
@@ -29,6 +29,7 @@ limitations under the License. -->
 
 
 ### 📢 News
+- 2025-12-08: 🎉 We provide day-0 support for high-performance inference of the [GLM-4.6V](https://github.com/zai-org/GLM-V) model.
 - 2025-12-05: 🎉 We now support high-performance inference for the [GLM-4.5/GLM-4.6](https://github.com/zai-org/GLM-4.5/blob/main/README_zh.md) series models.
 - 2025-12-05: 🎉 We now support high-performance inference for the [VLM-R1](https://github.com/om-ai-lab/VLM-R1) model.
 - 2025-12-05: 🎉 We build hybrid KV cache management based on [Mooncake](https://github.com/kvcache-ai/Mooncake), supporting global KV cache management with intelligent offloading and prefetching.
@@ -114,7 +115,7 @@ Supported models list:
 - Qwen2.5-VL
 - Qwen3 / Qwen3-MoE
 - Qwen3-VL / Qwen3-VL-MoE
-- GLM4.5 / GLM4.6
+- GLM-4.5 / GLM-4.6 / GLM-4.6V
 - VLM-R1
 
 ---
diff --git a/README_zh.md b/README_zh.md
@@ -28,6 +28,7 @@ limitations under the License. -->
 
 ### 📢 新闻
 
+- 2025-12-08: 🎉 我们在第一时间内支持了[GLM-4.6V](https://github.com/zai-org/GLM-V)模型的高效推理。
 - 2025-12-05: 🎉 我们支持了[GLM-4.5/GLM-4.6](https://github.com/zai-org/GLM-4.5/blob/main/README_zh.md)系列模型.
 - 2025-12-05: 🎉 我们支持了[VLM-R1](https://github.com/om-ai-lab/VLM-R1) 模型.
 - 2025-12-05: 🎉 我们基于[Mooncake](https://github.com/kvcache-ai/Mooncake)构建了混合 KV 缓存管理机制，支持具备智能卸载与预取能力的全局 KV 缓存管理。
@@ -110,7 +111,7 @@ xLLM 提供了强大的智能计算能力，通过硬件系统的算力优化与
 - Qwen2.5-VL
 - Qwen3 / Qwen3-MoE
 - Qwen3-VL / Qwen3-VL-MoE
-- GLM-4.5 / GLM-4.6
+- GLM-4.5 / GLM-4.6 / GLM-4.6V
 - VLM-R1
 
 ---