feat(cli): add --keep-sandbox flag to fullsend run by rh-hemartin · Pull Request #1595 · fullsend-ai/fullsend

rh-hemartin · 2026-05-27T13:26:50Z

Summary

Adds --keep-sandbox flag to fullsend run
When set, skips sandbox deletion at end of run regardless of success or failure
Prints the sandbox name and the exact openshell sandbox exec command to drop into a shell

Why

Post-failure sandbox inspection currently requires re-running with extra instrumentation. This flag lets developers exec directly into the sandbox after a failure to check uploaded files, env contents, or reproduce commands manually.

Closes #1594

Test plan

Run fullsend run triage --keep-sandbox and confirm sandbox is not deleted
Confirm printed openshell sandbox exec command drops into a shell in the sandbox
Run without flag and confirm sandbox is still deleted normally

🤖 Generated with Claude Code

github-actions · 2026-05-27T13:28:26Z

Site preview

Preview: https://47e3d241-site.fullsend-ai.workers.dev

Commit: 4138c8e085bad3f9628775741c37ffe85a26e742

fullsend-ai-review · 2026-05-27T13:32:02Z

Review

Findings

Medium

[intent-alignment] internal/cli/run.go — PR includes undocumented scope beyond issue #1594 (cli: add --keep-sandbox flag to fullsend run for post-failure inspection): the claude-output.jsonl stream capture via io.TeeReader in runAgentWithProgress and the removal of the --verbose TODO comment in buildClaudeCommand are neither mentioned in the PR body nor authorized by the linked issue. The JSONL capture introduces a file write into the agent execution hot path and changes the I/O plumbing (replacing direct stdout reads with a TeeReader). While the implementation is correct and degrades gracefully on file-creation failure, it should be described in the PR summary so reviewers can evaluate it independently.
Remediation: Either (a) update the PR description to document the output-capture feature and explain why it belongs in this PR, or (b) split it into a separate PR.

Low

[docs-currency] docs/guides/user/running-agents-locally.md — The user guide has a "Testing without side effects" section (line 365) documenting --no-post-script for debugging workflows. The new --keep-sandbox flag serves a similar debugging purpose (post-failure sandbox inspection) and would naturally belong in that section or a nearby "Debugging sandbox failures" subsection.
Remediation: Add a short paragraph documenting --keep-sandbox near the --no-post-script section.
[test-coverage] internal/cli/run.go — No unit tests for the --keep-sandbox defer-path behavior or the TeeReader wiring in runAgentWithProgress. The TeeReader integration (file creation failure degradation, drain-on-error path) is particularly testable and worth covering.

Previous run

Review

Findings

Medium

[intent-alignment] internal/cli/run.go — PR includes undocumented scope beyond issue #1594 (cli: add --keep-sandbox flag to fullsend run for post-failure inspection): the claude-output.jsonl stream capture via io.TeeReader in runAgentWithProgress and the removal of the --verbose TODO comment in buildClaudeCommand are neither mentioned in the PR body nor authorized by the linked issue. The JSONL capture is a useful companion feature for post-failure debugging, but it changes the I/O path of the agent runner (introducing a file write into the hot path) and should be described in the PR summary so reviewers can evaluate it independently. Consider either (a) adding a section to the PR body explaining the JSONL capture rationale, or (b) splitting it into a separate PR.

Low

[docs-currency] docs/guides/user/running-agents-locally.md — The user guide documents fullsend run flags inline (e.g., --no-post-script, --env-file, --fullsend-binary) but will not mention the new --keep-sandbox flag after this PR merges. Consider adding a brief mention alongside the --no-post-script documentation.
[test-coverage] internal/cli/run.go — No unit tests for the --keep-sandbox defer-path behavior or the TeeReader wiring in runAgentWithProgress. The existing test file (run_test.go) tests flag registration but not runtime behavior. The TeeReader integration (file creation failure degradation, drain-on-error path) is particularly testable and worth covering.

Previous run (2)

Review

Findings

Medium

[intent-alignment] internal/cli/run.go — PR bundles undocumented changes beyond the --keep-sandbox feature described in the PR body and in issue #1594 (cli: add --keep-sandbox flag to fullsend run for post-failure inspection). The claude-output.jsonl TeeReader capture (new outputPath parameter in runAgentWithProgress, file creation at iterDir/claude-output.jsonl) is a distinct feature that tees agent stdout to disk. The --verbose comment removal in buildClaudeCommand is also undocumented. These changes are individually reasonable but should be described in the PR body or split into separate commits so reviewers can evaluate each change on its own merits.
Remediation: Either (a) update the PR description to document the output-capture feature and explain why it belongs in this PR, or (b) split it into a separate commit/PR.

Low

[docs-currency] docs/guides/user/running-agents-locally.md — The user guide has a "Testing without side effects" section (line 365) that documents --no-post-script for debugging workflows. The new --keep-sandbox flag serves a similar debugging purpose (post-failure sandbox inspection) and would naturally belong in that section or a nearby "Debugging sandbox failures" section.
Remediation: Add a short paragraph documenting --keep-sandbox near the --no-post-script section.

Info

[test-coverage] internal/cli/run.go — No test coverage for the keepSandbox early-return path or the new outputPath parameter in runAgentWithProgress. These are CLI integration paths that are difficult to unit test, but the TeeReader fallback behavior (graceful degradation when os.Create fails) would benefit from a targeted test.

Skips sandbox deletion at end of run when --keep-sandbox is set. Prints sandbox name and openshell exec command for direct inspection. Useful for post-failure debugging without re-running the full agent.

Closes #1594

Signed-off-by: Hector Martinez <hemartin@redhat.com>

github-actions Bot deployed to site-preview May 27, 2026 13:28 View deployment

fullsend-ai-review Bot added the requires-manual-review label (Review requires human judgment) May 27, 2026

rh-hemartin force-pushed the feat/keep-sandbox branch from 48fd0c5 to b0a4fc0 Compare May 27, 2026 15:19

github-actions Bot deployed to site-preview May 27, 2026 15:21 View deployment

fullsend-ai-review Bot added and removed the requires-manual-review label (Review requires human judgment) May 27, 2026

rh-hemartin self-assigned this May 28, 2026

rh-hemartin force-pushed the feat/keep-sandbox branch from b0a4fc0 to 4138c8e Compare May 28, 2026 07:54

github-actions Bot deployed to site-preview May 28, 2026 07:59 View deployment

fullsend-ai-review Bot added and removed the requires-manual-review label (Review requires human judgment) May 28, 2026

rh-hemartin wants to merge 1 commit into main from feat/keep-sandbox
