chore: add-benchmark skill: NVSkills integration #1445

Open
ananthsub wants to merge 3 commits into main from ci/nvskills-add-benchmark

@ananthsub
Contributor

Summary

  • Add license: Apache-2.0 to the frontmatter of .claude/skills/add-benchmark/SKILL.md.
  • Add .claude/skills/add-benchmark/evals/evals.json with 3 positive trigger cases (scaffold native benchmark, wrap external library, add training env with GitLab dataset) and 2 negative cases that delegate to sibling skills.

Motivation

Prepares the add-benchmark skill for NVSkills CI signing.

Test plan

  • After dependencies merge, comment /nvskills-ci on this PR. Expect a signature commit attaching skill-card.md and skill.oms.sig under .claude/skills/add-benchmark/.

@ananthsub ananthsub requested review from a team and cwing-nvidia May 28, 2026 18:38
@copy-pr-bot

copy-pr-bot Bot commented May 28, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@ananthsub ananthsub changed the title from "ci: add-benchmark skill: NVSkills integration" to "chore: add-benchmark skill: NVSkills integration" May 28, 2026
@ananthsub ananthsub force-pushed the ci/nvskills-add-benchmark branch from b84a4b4 to d8c6353 May 28, 2026 22:20
@copy-pr-bot

copy-pr-bot Bot commented May 28, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@ananthsub
Contributor Author

/nvskills-ci

@ananthsub ananthsub marked this pull request as ready for review May 28, 2026 22:21
@ananthsub
Contributor Author

/nvskills-ci

@ananthsub ananthsub force-pushed the ci/nvskills-add-benchmark branch from d8c6353 to bf2e7e3 May 28, 2026 23:06
@ananthsub
Contributor Author

/nvskills-ci

## Summary

- Move `.claude/skills/add-benchmark/` to top-level
  `skills/add-benchmark/` so this PR touches files under the central
  `team-request.yml` trigger allowlist (`skills/`, `team-skills/`,
  `rules/team-rules/`, `plugins/`).
- Replace `.claude/skills/add-benchmark/` with a symlink to
  `../../skills/add-benchmark` so Claude Code and Cursor continue to
  discover the skill via the conventional
  `.claude/skills/<name>/SKILL.md` path with no tool-side change.
- Add `license: Apache-2.0` to `skills/add-benchmark/SKILL.md`
  frontmatter (required by NVSkills CI).
- Add `skills/add-benchmark/evals/evals.json` with 3 positive trigger
  cases (scaffold native benchmark, wrap external library, add
  training env with GitLab dataset) and 2 negative cases that
  delegate to sibling skills.
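
The move-plus-symlink layout described above can be sketched as follows. This is a minimal illustration run in a throwaway directory, not the commands used in this PR (a real migration would more likely use `git mv` and `ln -s`); the helper name `migrate_skill` and the sample `SKILL.md` contents are assumptions made for the example.

```python
import os
import shutil
import tempfile
from pathlib import Path


def migrate_skill(repo_root: Path, name: str) -> Path:
    """Move .claude/skills/<name> to skills/<name> and leave a relative symlink behind.

    Hypothetical helper for illustration only.
    """
    old_dir = repo_root / ".claude" / "skills" / name
    new_dir = repo_root / "skills" / name
    new_dir.parent.mkdir(parents=True, exist_ok=True)
    shutil.move(str(old_dir), str(new_dir))
    # The target is relative to the directory holding the link (.claude/skills/),
    # so ../../skills/<name> resolves to <repo_root>/skills/<name>.
    os.symlink(Path("..") / ".." / "skills" / name, old_dir, target_is_directory=True)
    return old_dir


# Throwaway directory standing in for a repository checkout.
demo_root = Path(tempfile.mkdtemp())
skill_dir = demo_root / ".claude" / "skills" / "add-benchmark"
skill_dir.mkdir(parents=True)
(skill_dir / "SKILL.md").write_text("---\nlicense: Apache-2.0\n---\n")

link = migrate_skill(demo_root, "add-benchmark")
legacy_path = link / "SKILL.md"  # conventional discovery path
canonical_path = demo_root / "skills" / "add-benchmark" / "SKILL.md"
print(legacy_path.resolve() == canonical_path.resolve())
```

Using a relative link target keeps the symlink valid wherever the repository is cloned, which is why the legacy path keeps resolving without any tool-side change.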

## Motivation

Prepares the `add-benchmark` skill for NVSkills CI signing. The
other four Gym skills remain at `.claude/skills/<name>/` until each
one is similarly migrated in its own follow-up PR — keeping per-PR
diffs small and letting NVSkills CI evaluate one skill at a time
rather than all five simultaneously.

A separate follow-up PR against `NVIDIA/skills` will update
`components.d/nemo-gym.yml` `path:` from `.claude/skills/` to
`skills/` once all five skills have migrated, to keep the daily
sync pipeline aligned with the new source layout.

## Test plan

- [ ] Comment `/nvskills-ci` on this PR. Expect the request workflow
      to dispatch (not skip) and `svc-nvskills-signing` to attach
      `skill-card.md` and `skill.oms.sig` under `skills/add-benchmark/`.
- [ ] Claude Code discovers `add-benchmark` via
      `.claude/skills/add-benchmark/SKILL.md` (follows symlink).
- [ ] Cursor discovers `add-benchmark` via
      `.claude/skills/add-benchmark/SKILL.md` (follows symlink).

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
@ananthsub ananthsub force-pushed the ci/nvskills-add-benchmark branch from bf2e7e3 to 644a1b0 May 28, 2026 23:55
@ananthsub
Contributor Author

/nvskills-ci

Resolve the NV-BASE validation and context-deduplication failures from
the first /nvskills-ci run on add-benchmark. The trigger now fires; the
remaining failures were all in skill content.

Frontmatter and schema:
- Shorten the description and add negative triggers (not for debugging
  or docs) to reduce over-triggering.
- Add metadata.author and metadata.tags.

Sections:
- Add Purpose, When not to use this skill, and Examples sections.
- Rename Workflow to Instructions.

Security notes (documentation only, no behavior change):
- SKILL.md: note credential handling for env.yaml, and that NeMo-Gym
  does not currently sandbox executed code, so code-execution benchmarks
  should run in an isolated, disposable environment the operator controls.
- references/patterns.md: equivalent notes on the external-tool
  auto-install pattern and the Ray subprocess pattern.

Context deduplication:
- Collapse the two ng_collect_rollouts blocks into one canonical block in
  Step 6, with Step 7 referencing it for the baseline run.
- Replace the duplicated ng_prepare_data block in SKILL.md with a
  cross-link to the canonical version in references/patterns.md.
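
As a rough local aid for the deduplication described above, the sketch below flags fenced code blocks that appear more than once across a set of markdown texts. It is an assumption about how such a check could work, not the NVSkills CI implementation; the function name, the section labels, and the elided command arguments are all hypothetical.

```python
import hashlib
import re
from collections import defaultdict

FENCE = "`" * 3  # built programmatically so this example can itself sit inside a fence
FENCE_RE = re.compile(FENCE + r"[^\n]*\n(.*?)" + FENCE, re.DOTALL)


def find_duplicate_blocks(docs: dict[str, str]) -> dict[str, list[str]]:
    """Map a short digest of each repeated fenced block to the documents containing it."""
    seen = defaultdict(list)
    for doc_name, text in docs.items():
        for body in FENCE_RE.findall(text):
            # Normalise trailing whitespace so cosmetic differences do not hide a duplicate.
            normalised = "\n".join(line.rstrip() for line in body.strip().splitlines())
            digest = hashlib.sha256(normalised.encode("utf-8")).hexdigest()[:12]
            seen[digest].append(doc_name)
    return {digest: names for digest, names in seen.items() if len(names) > 1}


# Hypothetical section texts; the command arguments are deliberately elided.
rollout_block = f"{FENCE}bash\nng_collect_rollouts <args elided>\n{FENCE}\n"
docs = {
    "step-6": "Run the rollouts:\n" + rollout_block,
    "step-7": "Run the baseline:\n" + rollout_block,
    "patterns": f"Prepare data:\n{FENCE}bash\nng_prepare_data <args elided>\n{FENCE}\n",
}
duplicates = find_duplicate_blocks(docs)
print(list(duplicates.values()))
```

Any group reported here is a candidate for keeping one canonical block and replacing the others with a cross-reference.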

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
@ananthsub
Contributor Author

/nvskills-ci

- Shorten the description to under 150 characters.
- Add Prerequisites, Limitations, and Troubleshooting sections.
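
The frontmatter constraints mentioned in this PR (a `license: Apache-2.0` field and a description under 150 characters) can be checked locally before requesting CI. The sketch below is an assumption about what such a pre-check might look like, not the NV-BASE validator itself; it uses a deliberately naive stdlib parser, and the sample description text is invented for the example.

```python
def read_frontmatter(markdown: str) -> dict[str, str]:
    """Return top-level `key: value` pairs from a leading '---'-delimited block."""
    lines = markdown.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    fields = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break
        # Skip nested (indented) entries and comments; a real check would use a YAML parser.
        if not line or line[0] in " \t#" or ":" not in line:
            continue
        key, _, value = line.partition(":")
        fields[key.strip()] = value.strip().strip("\"'")
    return fields


def check_skill(markdown: str, max_description: int = 150) -> list[str]:
    """Return a list of problems; an empty list means the assumed checks pass."""
    fields = read_frontmatter(markdown)
    problems = []
    if fields.get("license") != "Apache-2.0":
        problems.append("license must be Apache-2.0")
    description = fields.get("description", "")
    if not description:
        problems.append("description is missing")
    elif len(description) >= max_description:
        problems.append(
            f"description has {len(description)} characters; "
            f"expected fewer than {max_description}"
        )
    return problems


# Hypothetical SKILL.md content used only for this example.
sample = """---
name: add-benchmark
description: Hypothetical short description used only for this example.
license: Apache-2.0
metadata:
  author: example
---
# Body
"""
print(check_skill(sample))
```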

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>