running_harness

A reusable long-running agent harness skill: turn a long task into a recoverable loop of

init -> run (small step) -> verify (gates) -> retry/rollback -> escalate -> handoff.

This repo contains a single skill folder (originally developed under .claude/skills/).

Layout

SKILL.md: Skill entrypoint (trigger + workflow)
agents/openai.yaml: UI metadata (optional but included)
script/: Executable harness tools
reference/: Playbooks / guidance
assets/: Icons

Quick Start

Initialize a task directory:

bash script/init_harness.sh \
  --task "Fix XXX and keep regression green" \
  --verify-cmd "./gradlew test --tests com.foo.BarTest"

This prints a task dir like .agent-harness/<task-id> and creates durable artifacts:

task.json, feature_list.json, claude-progress.txt, init.sh

(Recommended) Get bearings at the start of each session:

bash script/get_bearings.sh --task-dir .agent-harness/<task-id>

Run attempts with verify gates:

bash script/run_attempt.sh \
  --task-dir .agent-harness/<task-id> \
  --run-cmd "<your incremental command>" \
  --verify-cmd "./gradlew test --tests com.foo.BarTest" \
  --durability sync \
  --heartbeat-seconds 15 \
  --max-attempts 12

Verify Plan (Multiple Stages)

cat >/tmp/verify.plan <<'EOF_PLAN'
unit::./gradlew test --tests com.foo.UnitTest
integration::./gradlew integrationTest
EOF_PLAN

bash script/run_attempt.sh \
  --task-dir .agent-harness/<task-id> \
  --run-cmd "<your incremental command>" \
  --verify-plan /tmp/verify.plan \
  --escalate-after 2 \
  --stop-on-escalation \
  --max-attempts 12

Handoff

Generate a handoff file (also generated automatically on success/escalation/failure):

bash script/write_handoff.sh --task-dir .agent-harness/<task-id>

Maintenance

Run the full maintenance gate (validate + self-test + lint):

bash script/release_check.sh

Safety Notes

--rollback git is destructive and requires --allow-hard-reset.
By default run_attempt.sh locks the task dir to prevent concurrent writers.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

running_harness

Layout

Quick Start

Verify Plan (Multiple Stages)

Handoff

Maintenance

Safety Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
agents		agents
assets		assets
reference		reference
script		script
README.md		README.md
SKILL.md		SKILL.md

Folders and files

Latest commit

History

Repository files navigation

running_harness

Layout

Quick Start

Verify Plan (Multiple Stages)

Handoff

Maintenance

Safety Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages