docs: describe local_vllm_model and extend docs for vllm_model by marta-sd · Pull Request #1430 · NVIDIA-NeMo/Gym

marta-sd · 2026-05-27T09:43:39Z

This PR solves issue #1106

copy-pr-bot · 2026-05-27T09:43:43Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

marta-sd · 2026-05-27T09:43:53Z

/ok to test 86b5e63

cwing-nvidia · 2026-05-27T19:59:37Z

Could we also include local_vllm_model_proxy in this PR, or plan how to structure it in the docs if it will be a follow-on PR? Could you describe the rationale for separate pages for each vLLM variant versus a combined docs page for all vLLM variants?

marta-sd · 2026-05-29T08:13:41Z

Thanks for the feedback, @cwing-nvidia!

> Could we also include local_vllm_model_proxy in this PR, or plan how to structure it in the docs if it will be a follow-on PR?

I have added it to this PR, please take a look 🙂

> Could you describe the rationale for separate pages for each vLLM variant versus a combined docs page for all vLLM variants?

My only motivation was that the existing vLLM page was already quite long, and I felt it would be easier to navigate if the variants were separated. I don't see any blockers to having them all on one page; I'm happy to go with whichever you think works best for users.

Another option to consider is having three pages:

- Overview (how Gym connects to vLLM, how the model servers differ, and when to choose which)
- Self-managed vLLM server (the existing page)
- Deploying a vLLM server with Gym (shared page for local and local proxy)

Let me know which structure you prefer 🙂

marta-sd · 2026-05-29T08:26:13Z

/ok to test 1bd2b63

Style-guide-only cleanup stacked on top of #1430. No content or meaning changes: this is purely an NVIDIA style-guide pass over the docs that #1430 introduces.

## Fixes

| File | Fix |
|------|-----|
| `local-vllm.mdx` | `e.g.` → "for example" (×2); `on/off` slash → "on or off"; `you've` → "you have" |
| `local-vllm-proxy.mdx` | typo `extising` → "existing"; `disabled in through` → "disabled through"; `etc.` → "and so on" |
| `vllm.mdx` | `you'd` → "you would" (in the new `<Note>`) |
| `responses_api_models/vllm_model/README.md` | removed exclamation mark |

## Scope notes

- Scoped to content **#1430 adds/changes**. Pre-existing violations in `vllm.mdx` (the `Please…!` line, `HuggingFace` spelling, bare `[here]` links) are out of scope for a style check of this PR.
- Left heading-case and `see → refer to` untouched; both are inconsistent repo-wide, so changing only these pages would worsen consistency.

Targets `martas/1106` so it merges into #1430 before that PR lands on `main`.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Signed-off-by: Lawrence Lane <llane@nvidia.com>
Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>

cmunley1

Can we give an example, or link to one, of how to use this with training frameworks like NeMo RL and verl?

copy-pr-bot Bot temporarily deployed to public May 27, 2026 09:44 Inactive

copy-pr-bot Bot temporarily deployed to public May 27, 2026 09:45 Inactive

marta-sd requested a review from lbliii May 27, 2026 09:46

marta-sd self-assigned this May 27, 2026

copy-pr-bot Bot temporarily deployed to public May 27, 2026 09:46 Inactive

marta-sd force-pushed the martas/1106 branch from 6ae85fa to fcd398b Compare May 29, 2026 08:01

marta-sd added 4 commits May 29, 2026 10:14

add docs for local vllm model server

4da26bc

Signed-off-by: Marta Stepniewska-Dziubinska <martas@nvidia.com>

add responses_api_models/vllm_model/README.md (similar to the one for local_vllm_model)

b8309eb

Signed-off-by: Marta Stepniewska-Dziubinska <martas@nvidia.com>

add option to login to fern with device code (needed for machines with no browser)

2cb553c

Signed-off-by: Marta Stepniewska-Dziubinska <martas@nvidia.com>

add docs for the local vllm proxy

1bd2b63

Signed-off-by: Marta Stepniewska-Dziubinska <martas@nvidia.com>

marta-sd force-pushed the martas/1106 branch from fcd398b to 1bd2b63 Compare May 29, 2026 08:20

marta-sd marked this pull request as ready for review May 29, 2026 08:26

lbliii mentioned this pull request May 29, 2026

docs: NVIDIA style guide pass on vLLM model-server docs #1455

Merged

lbliii approved these changes May 29, 2026

cmunley1 reviewed May 29, 2026

marta-sd wants to merge 5 commits into main from martas/1106

4 participants
