chore: add flash agent skill co-located with source code by TimPietruskyRunPod · Pull Request #254 · runpod/flash

TimPietruskyRunPod · 2026-03-05T15:48:38Z

Summary

Adds flash/SKILL.md (588 lines) rewritten around the unified Endpoint class API
Replaces the old skill in runpod/skills which documented the deprecated 8-class resource hierarchy (LiveServerless, CpuLiveServerless, etc.)
Co-locating the skill with source code ensures it stays in sync with the codebase
Discoverable via npx skills add runpod/flash

Skill contents

Quick reference and getting started (flash login, flash init, flash run, flash deploy)
Four Endpoint modes: QB decorator, LB decorator, external image client, existing endpoint client
Full constructor parameter table (verified against endpoint.py)
EndpointJob API
GPU groups & types, CPU instance types (verified against enums)
Cloudpickle scoping rules
All CLI commands (verified against cli/main.py)
Common patterns matching skeleton templates
Architecture overview and common gotchas

Test plan

Verify npx skills add runpod/flash discovers the skill
Verify code examples match skeleton templates (gpu_worker.py, cpu_worker.py, lb_worker.py)
Verify all constructor parameters match Endpoint.__init__ in endpoint.py
Verify CLI commands match registrations in cli/main.py
Verify GPU/CPU enums match current definitions

Adds flash/SKILL.md rewritten around the unified Endpoint class API. Replaces the old skill in runpod/skills which documented the deprecated 8-class resource hierarchy. Co-locating the skill ensures it stays in sync with the codebase. Discoverable via `npx skills add runpod/flash`.

Remove content an agent doesn't need: architecture internals, full enum listings, verbose CLI option tables, redundant code patterns. Keep: constructor params, four modes, cloudpickle rules, gotchas. Point agents to source files for enum details they can read themselves.

- replace v1.6.0 content with eval-tested v1.7.0 skill - remove non-existent flash login command - fix GpuType.ANY to GpuGroup (GpuType has no ANY member) - consolidate from four modes to three (QB, LB, client) - all examples use Endpoint class exclusively - scored 18/18 on eval assertions across 3 test prompts

…o skill

…th sections

…auto-switch gotcha

runpod-Henrik

PR #254 Review: `chore: add flash agent skill co-located with source code`

1. Bug: Fabricated CLI subcommands (high confidence — verified against cli/main.py and flash deploy --help)

The CLI section documents subcommands that don't exist:

flash deploy new staging      # ❌ not a real command
flash deploy send staging     # ❌ not a real command
flash deploy list staging     # ❌ not a real command
flash deploy info staging     # ❌ not a real command
flash deploy delete staging   # ❌ not a real command

The actual flash deploy is a single command with --env:

flash deploy --env staging           # build + deploy to staging
flash deploy --exclude torch,pkg2    # exclude packages

Environment management is via flash env:

flash env list
flash env create
flash env get
flash env delete

2. Issue: execution_timeout_ms missing from the constructor table

Endpoint.__init__ has execution_timeout_ms: int = 0 but it's absent from the skill's parameter table. This is user-facing (needed for long-running jobs) and was a known fix (executionTimeoutMs → execution_timeout_ms snake_case rename in 1.7.0).

3. Question: "Auto GPU switching requires workers >= 5"

Gotcha #8 states this as a rule. Is this a documented platform policy? If not verified, it could mislead users into setting unnecessarily high worker counts.

4. Nit: ADA_80_PRO VRAM labeled "80GB" but includes H100 NVL

The source docstring says: "NVIDIA H100 PCIe, NVIDIA H100 80GB HBM3, NVIDIA H100 NVL". H100 NVL is 94GB, not 80GB. Worth noting the label is approximate.

5. Nit: Missing less-common constructor params

accelerate_downloads, datacenter, scaler_type, scaler_value are in __init__ but absent from the table. Fine to omit as advanced, but accelerate_downloads=False is a useful workaround for slow dep installs.

Verdict: NEEDS WORK — The CLI section needs to be corrected before merge. An agent using this skill will generate flash deploy new/send/list commands that fail immediately.

🤖 Reviewed by Henrik's AI-Powered Bug Finder

- Replace made-up `flash deploy new/send/list/info/delete` with actual `flash deploy --env`, `flash env list/create/get/delete` commands - Add `flash deploy --preview` for local Docker preview - Add `execution_timeout_ms` to Endpoint constructor

TimPietruskyRunPod · 2026-03-05T22:56:24Z

we continue in runpod/skills#7

TimPietruskyRunPod marked this pull request as draft March 5, 2026 17:26

TimPietruskyRunPod added 5 commits March 5, 2026 21:33

fix: use correct "Runpod" casing in skill

9f6bfb2

chore: remove deprecated class mention from skill

c96a25d

TimPietruskyRunPod force-pushed the chore/add-flash-skill branch from c951deb to 16a0a2d Compare March 5, 2026 21:39

TimPietruskyRunPod added 18 commits March 5, 2026 22:43

chore: add auth section, restore flash login, remove architecture noise

76483f2

chore: remove unnecessary allowed-tools from skill frontmatter

2fb0952

chore: shorten skill description, move version out of title

fcefce0

fix: use correct "Runpod" casing in skill

b82da18

chore: remove redundant intro, lead with install + imports

0a40f3e

chore: remove repo-specific source path from skill

f2f9cd5

chore: add NetworkVolume, PodTemplate, flashboot, gpu_count details t…

efda28f

…o skill

chore: add full CpuInstanceType enum table to skill

5dd2196

chore: trim redundant cloudpickle wrong/correct example

ed44e42

chore: remove redundant cloudpickle section, keep in gotchas

db721ed

chore: remove redundant import from intro line

a7e5d76

chore: remove version from skill title

7da03d4

chore: add local dev workflow context to skill intro

b71449a

chore: move CLI to top as code block with examples, remove old CLI/au…

ee6de57

…th sections

chore: add setup section with install and auth before CLI

a43d1c8

chore: separate flash login and RUNPOD_API_KEY as distinct auth options

8d27d27

chore: simplify endpoint intro line

f81c031

chore: add multi-GPU list support, update examples to workers=5, add …

ee8a866

…auto-switch gotcha

TimPietruskyRunPod marked this pull request as ready for review March 5, 2026 22:21

runpod-Henrik reviewed Mar 5, 2026

View reviewed changes

TimPietruskyRunPod closed this Mar 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: add flash agent skill co-located with source code#254

chore: add flash agent skill co-located with source code#254
TimPietruskyRunPod wants to merge 24 commits intomainfrom
chore/add-flash-skill

TimPietruskyRunPod commented Mar 5, 2026

Uh oh!

runpod-Henrik left a comment

Uh oh!

TimPietruskyRunPod commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

TimPietruskyRunPod commented Mar 5, 2026

Summary

Skill contents

Test plan

Uh oh!

runpod-Henrik left a comment

Choose a reason for hiding this comment

PR #254 Review: chore: add flash agent skill co-located with source code

Uh oh!

TimPietruskyRunPod commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

PR #254 Review: `chore: add flash agent skill co-located with source code`