Skip to content

Commit 9b666aa

Browse files
Deploying to gh-pages from @ dstackai/dstack@117195d 🚀
1 parent 52adcb3 commit 9b666aa

5 files changed

Lines changed: 12 additions & 12 deletions

File tree

7.83 KB
Loading

docs/examples/inference/dynamo.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
title: Dynamo
3-
description: Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo
2+
title: NVIDIA Dynamo
3+
description: Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo with Prefill-Decode disaggregation.
44
---
55

66
# Dynamo

docs/examples/inference/dynamo/index.html

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66
<meta charset="utf-8">
77
<meta name="viewport" content="width=device-width,initial-scale=1">
88

9-
<meta name="description" content="Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo">
9+
<meta name="description" content="Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo with Prefill-Decode disaggregation.">
1010

1111

1212
<meta name="author" content="dstack GmbH">
@@ -29,7 +29,7 @@
2929

3030

3131

32-
<title>Dynamo - dstack</title>
32+
<title>NVIDIA Dynamo - dstack</title>
3333

3434

3535

@@ -134,16 +134,16 @@
134134

135135

136136
<meta property="og:type" content="website" />
137-
<meta property="og:title" content="Dynamo - dstack" />
138-
<meta property="og:description" content="Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo" />
137+
<meta property="og:title" content="NVIDIA Dynamo - dstack" />
138+
<meta property="og:description" content="Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo with Prefill-Decode disaggregation." />
139139
<meta property="og:image" content="https://dstack.ai/assets/images/social/docs/examples/inference/dynamo.png" />
140140
<meta property="og:image:type" content="image/png" />
141141
<meta property="og:image:width" content="1200" />
142142
<meta property="og:image:height" content="630" />
143143
<meta property="og:url" content="https://dstack.ai/docs/examples/inference/dynamo/" />
144144
<meta property="twitter:card" content="summary_large_image" />
145-
<meta property="twitter.title" content="Dynamo - dstack" />
146-
<meta property="twitter:description" content="Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo" />
145+
<meta property="twitter.title" content="NVIDIA Dynamo - dstack" />
146+
<meta property="twitter:description" content="Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo with Prefill-Decode disaggregation." />
147147
<meta property="twitter:image" content="https://dstack.ai/assets/images/social/docs/examples/inference/dynamo.png" />
148148
</head>
149149

@@ -214,7 +214,7 @@
214214
<div class="md-header__topic" data-md-component="header-topic">
215215
<span class="md-ellipsis">
216216

217-
Dynamo
217+
NVIDIA Dynamo
218218

219219
</span>
220220
</div>

llms-full.txt

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12868,8 +12868,8 @@ Currently, auto-scaling only supports `rps` as the metric. TTFT and ITL metrics
1286812868
# docs/examples/inference/dynamo.md
1286912869

1287012870
---
12871-
title: Dynamo
12872-
description: Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo
12871+
title: NVIDIA Dynamo
12872+
description: Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo with Prefill-Decode disaggregation.
1287312873
---
1287412874

1287512875
# Dynamo

llms.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@
6262
### Inference
6363

6464
- [SGLang](https://dstack.ai/docs/examples/inference/sglang.md): Deploying Qwen3.6-27B using SGLang on NVIDIA and AMD GPUs
65-
- [Dynamo](https://dstack.ai/docs/examples/inference/dynamo.md): Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo
65+
- [Dynamo](https://dstack.ai/docs/examples/inference/dynamo.md): Deploying zai-org/GLM-4.5-Air-FP8 using NVIDIA Dynamo with Prefill-Decode disaggregation.
6666
- [vLLM](https://dstack.ai/docs/examples/inference/vllm.md): Deploying Qwen3.6-27B using vLLM on NVIDIA and AMD GPUs
6767
- [NIM](https://dstack.ai/docs/examples/inference/nim.md): Deploying Nemotron-3-Super-120B-A12B using NVIDIA NIM
6868
- [TensorRT-LLM](https://dstack.ai/docs/examples/inference/trtllm.md): Deploying Qwen3-235B-A22B-FP8 using NVIDIA TensorRT-LLM on NVIDIA GPUs

0 commit comments

Comments
 (0)