PRD.md - K8s Agent Execution via OpenCode Subagents

Project Name: Agentic SDLC Runner
Version: 1.1.0
Date: 2026-02-15

1. Problem Statement

The Challenge

Developers need to run AI coding agents on remote Kubernetes infrastructure while maintaining visibility and control from their local development environment.

Current State

OpenCode runs locally only
No way to leverage remote compute for async tasks
No visibility into remote agent progress

Desired State

Run [ASYNC] tasks in K8s pods
See all agent activity from one CLI
Autonomous execution with human oversight
Secure, production-ready deployment

2. User Stories

Story 1: Run Async Task in K8s

As a developer
I want to run an [ASYNC] task in a Kubernetes pod
So that I can leverage remote compute for well-defined tasks

Acceptance Criteria:

Task from tasks.md spawns a K8s pod
Pod runs opencode with task context
Changes are committed and pushed

Story 2: Visibility from Main Session

As a developer
I want to see what all async agents are doing from my main OpenCode session
So that I have full visibility without switching tools

Acceptance Criteria:

Pod logs stream back to main session
Human sees all agent activity in one place
Can monitor multiple agents simultaneously

Story 3: Parallel Execution

As a developer
I want multiple [ASYNC] tasks to run in parallel
So that I can speed up feature implementation

Acceptance Criteria:

Multiple pods spawn for parallel [ASYNC] tasks
All pods visible from main session
Pods do not block one another or compete for shared resources

Story 4: Git-Based Workflow

As a developer
I want changes to be committed and pushed automatically
So that my work is tracked and reviewable

Acceptance Criteria:

Agent commits on completion
Changes pushed to remote branch
Human can review before merging

Story 5: Worktree Isolation for Same-Branch Parallel Execution

As a developer
I want multiple [ASYNC] tasks from the same feature branch to run in parallel without conflicts
So that I can speed up feature implementation

Acceptance Criteria:

Same-branch parallel tasks each get their own Git worktree
Worktrees created in .agentic-sdlc/worktrees/ directory
Each worktree has isolated working directory
Branch names include task identifier to prevent conflicts
Worktree automatically cleaned up after task completion
Branch preserved for review/merge
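
A minimal sketch of how the worktree criteria above could be met with plain Git commands. The function names and the `<branch>-<task>` naming scheme are illustrative assumptions (the scheme matches the `feature-login-task1` example in the architecture diagram); only the `.agentic-sdlc/worktrees/` location comes from this document.

```shell
# Sketch: per-task Git worktrees for same-branch parallel tasks.
# Function names and the naming scheme are assumptions, not part of the PRD.
WORKTREE_ROOT=".agentic-sdlc/worktrees"

# Build a task branch name: "feature/login" + "task1" -> "feature-login-task1".
# Slashes are replaced so the name can double as a directory name.
task_branch_name() {
  local base_branch="$1" task_id="$2"
  printf '%s-%s\n' "$(printf '%s' "$base_branch" | tr '/' '-')" "$task_id"
}

# Create an isolated worktree on a new task branch forked from the feature branch.
create_task_worktree() {
  local base_branch="$1" task_id="$2" name
  name="$(task_branch_name "$base_branch" "$task_id")"
  git worktree add -b "$name" "$WORKTREE_ROOT/$name" "$base_branch"
}

# Remove the working directory after the task finishes.
# The task branch itself is left in place for review and merge.
cleanup_task_worktree() {
  local base_branch="$1" task_id="$2"
  git worktree remove "$WORKTREE_ROOT/$(task_branch_name "$base_branch" "$task_id")"
}
```

With this shape, `create_task_worktree feature/login task1` would be called before the agent starts and `cleanup_task_worktree feature/login task1` after its push completes.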

Story 6: Secure Secret Management

As a security engineer
I want secrets managed via External Secrets Operator
So that there are no hardcoded credentials

Acceptance Criteria:

No secrets in ConfigMaps or code
Integration with External Secrets Operator
Support for cloud secret managers (GCP, AWS, Azure)
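
For illustration, the sketch below prints the kind of `ExternalSecret` manifest these criteria imply. In the chart this would more likely live under `templates/`; it is written as a shell function here only to keep the example self-contained. The resource name `agent-git-credentials`, the `token` key, and the use of a `ClusterSecretStore` are hypothetical, and the `apiVersion` should be adjusted to whatever the installed External Secrets Operator serves.

```shell
# Sketch: render an ExternalSecret for a Git credential.
# Resource names, the secret key, and the store kind are hypothetical placeholders.
render_external_secret() {
  local namespace="$1" store_name="$2" remote_key="$3"
  cat <<EOF
apiVersion: external-secrets.io/v1beta1
kind: ExternalSecret
metadata:
  name: agent-git-credentials
  namespace: ${namespace}
spec:
  refreshInterval: 1h
  secretStoreRef:
    name: ${store_name}
    kind: ClusterSecretStore
  target:
    name: agent-git-credentials
  data:
    - secretKey: token
      remoteRef:
        key: ${remote_key}
EOF
}
```

The store referenced by `secretStoreRef` is where the provider (GCP, AWS, Azure) is configured, so the manifest itself stays provider-neutral.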

3. Success Criteria

Functional Requirements

ID	Requirement	Priority
F1	Subagent spawns K8s pod for [ASYNC] task	Must
F2	Pod logs stream back to main session	Must
F3	Agent autonomously implements task	Must
F4	Changes committed and pushed automatically	Must
F5	Integration with spec-kit /implement command	Must
F6	Parallel execution of multiple [ASYNC] tasks	Must
F7	Helm-based deployment	Must
F8	External Secrets Operator integration	Must
F9	Multi-environment support (dev/stg/prod)	Must
F10	Worktree isolation for same-branch parallel tasks	Must

Non-Functional Requirements

ID	Requirement	Priority
NF1	Security: No hardcoded secrets, ESO integration	Must
NF2	Visibility: All agent activity visible from main CLI	Must
NF3	Integrability: Works with existing spec-kit workflow	Must
NF4	Autonomy: Agents work without human intervention	Must
NF5	GitOps: ArgoCD deployment support	Should
NF6	Workload Identity: GKE/EKS support	Should

4. Requirements

Must Have

Helm Chart - Standard Kubernetes deployment
- Chart.yaml, values.yaml, templates/
- _helpers.tpl for template functions
- External Secrets integration
- RBAC configuration
Multi-Environment Releases - Dev, staging, production
- Separate values files per environment
- ArgoCD Application manifests
- Namespace isolation
Helper Scripts - Simplified pod management
- spawn-pod.sh - Creates pods via Helm templates
- tail-logs.sh - Streams pod logs
- Environment-aware (dev/stg/prod)
- SSH secret support
OpenCode Docker image - Container with OpenCode + git
Integration with spec-kit - Works with /implement command
Worktree Support - Git worktree for same-branch parallel tasks
- Worktree creation per parallel task
- Branch naming with task identifier
- Worktree cleanup after completion
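
As a rough sketch of "creates pods via Helm templates", spawn-pod.sh could render a single template from the environment's release chart and pipe it to kubectl. The template path `templates/pod.yaml` and the value key `task.id` are hypothetical; the release, chart, and namespace names follow the deployment commands in the appendix.

```shell
# Sketch of what scripts/spawn-pod.sh might do.
# templates/pod.yaml and the task.id value key are hypothetical.
HELM="${HELM:-helm}"
KUBECTL="${KUBECTL:-kubectl}"

spawn_pod() {
  local env="$1" task_id="$2"
  local release="agentic-sdlc-agent-runner-$env"
  # Render only the pod template from the environment's release chart,
  # then create the pod in that environment's namespace.
  "$HELM" template "$release" "releases/$release" \
    --show-only templates/pod.yaml \
    --set "task.id=$task_id" \
    | "$KUBECTL" apply -n "agent-runner-$env" -f -
}
```

Keeping the binaries behind `HELM` and `KUBECTL` variables makes it possible to preview the rendered command without touching a cluster.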

Should Have

Auto-cleanup of completed pods
Timeout handling
Error reporting to main session
GKE Workload Identity support

Out of Scope

External controller
K-Agent framework integration
Beads issue tracking
Web dashboard
Mayor pattern (Gastown)

5. Architecture

Pattern: Subagent-Based Orchestration with Helm

Main Session → Subagent → scripts/spawn-pod.sh → K8s Pod
                    ↓
              scripts/tail-logs.sh → streams to main
                    ↓
              git commit/push → completion
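
The flow above could be driven from the subagent roughly as follows. This is a sketch under assumptions: the PRD names the two scripts but does not define their arguments, so the `<env> <task-id>` interface is hypothetical, and task ids are assumed to be simple tokens such as `T1`.

```shell
# Sketch of the spawn -> tail flow.
# The script arguments (environment, task id) are assumptions.
SPAWN_CMD="${SPAWN_CMD:-scripts/spawn-pod.sh}"
TAIL_CMD="${TAIL_CMD:-scripts/tail-logs.sh}"

# Run one [ASYNC] task: create the pod, then stream its logs until they end.
run_async_task() {
  local env="$1" task_id="$2"
  "$SPAWN_CMD" "$env" "$task_id" || return 1
  "$TAIL_CMD" "$env" "$task_id"
}

# Run several tasks in parallel, prefixing every log line with its task id
# so the main session can tell the agents apart.
run_parallel() {
  local env="$1" task_id
  shift
  for task_id in "$@"; do
    run_async_task "$env" "$task_id" 2>&1 | sed "s/^/[$task_id] /" &
  done
  wait
}
```

For example, `run_parallel dev T1 T2` would start both tasks in the background and return once both log streams have ended.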

Deployment Architecture

┌─────────────────────────────────────────────────────────────────┐
│                        Helm Chart                                │
│                                                                  │
│   ┌──────────────┐  ┌──────────────┐  ┌──────────────┐          │
│   │ Namespace    │  │ ServiceAccount│  │ RBAC         │          │
│   │              │  │ + Workload ID │  │ Role/Binding │          │
│   └──────────────┘  └──────────────┘  └──────────────┘          │
│                                                                  │
│   ┌──────────────┐  ┌──────────────┐                            │
│   │ ExternalSecret│  │ ConfigMap   │                            │
│   │ (password)    │  │ (config)    │                            │
│   └──────────────┘  └──────────────┘                            │
│                                                                  │
└─────────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────────┐
│                     Agent Pods (dynamic)                         │
│                                                                  │
│   ┌──────────────┐  ┌──────────────┐  ┌──────────────┐          │
│   │ Pod: Task 1  │  │ Pod: Task 2  │  │ Pod: Task N  │          │
│   │ worktree-1   │  │ worktree-2   │  │ worktree-N   │          │
│   │ opencode run │  │ opencode run │  │ opencode run │          │
│   │ git push     │  │ git push     │  │ git push     │          │
│   └──────────────┘  └──────────────┘  └──────────────┘          │
│                                                                  │
│   Worktrees: .agentic-sdlc/worktrees/                           │
│   - feature-login-task1 (isolated working directory)            │
│   - feature-login-task2 (isolated working directory)            │
│                                                                  │
└─────────────────────────────────────────────────────────────────┘

Why This Pattern?

Simple: No complex controller needed, just Helm + kubectl
Native: Uses OpenCode's built-in subagent capability
Visible: All activity streams to main session
Git-based: No separate state management
Secure: External Secrets Operator manages credentials
Standard: Follows ai-directives K8s standards

Alternative Approaches Considered

Approach	Why Not Chosen
External K8s controller	More machinery than needed; Helm plus kubectl covers this use case
K-Agent framework	Too complex, not needed
Gastown Mayor	We have structured workflow via spec-kit
Beads issue tracking	Git-branch pattern solves same problems
Raw YAML manifests	Not scalable, replaced with Helm

6. Timeline

Phase 1: Foundation (Week 1)

Task	Description
T1.1	Create Helm chart structure
T1.2	Create templates (namespace, RBAC, serviceaccount)
T1.3	Create External Secret template
T1.4	Test Helm deployment manually

Milestone: Helm chart deploys successfully

Phase 2: Integration (Week 2)

Task	Description
T2.1	Build OpenCode Docker image
T2.2	Create pod template
T2.3	Create multi-environment releases (dev/stg/prod)
T2.4	Create helper scripts (spawn-pod.sh, tail-logs.sh)
T2.5	Test pod creation via scripts and kubectl

Milestone: Can create pod via scripts and see logs

Phase 3: E2E & GitOps (Week 3)

Task	Description
T3.1	Test git clone + opencode run in pod
T3.2	Test git commit + push from pod
T3.3	Create ArgoCD Application manifests
T3.4	Integrate with spec-kit /implement

Milestone: End-to-end autonomous execution works

Phase 4: Polish (Week 4)

Task	Description
T4.1	Error handling and reporting
T4.2	Cleanup logic for completed pods
T4.3	Documentation (README, SPEC, PRD)
T4.4	User testing

Milestone: Production-ready

7. Open Questions

Q1: Git Authentication

Options:

SSH key (mounted as secret)
Git token (via External Secrets Operator)
Workload identity (GKE/EKS specific)

Decision: Support all three via Helm values configuration

Q2: Completion Detection

Options:

Log parsing (detect "git push" in logs)
Git webhook (external service)
Polling (check pod status)

Decision: Log parsing for simplicity
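
A minimal sketch of the log-parsing decision: read the pod's log stream and stop at the first line that contains a marker. The function name is hypothetical. Note that matching the literal text "git push" only shows the command appeared in the logs, not that the push succeeded; a dedicated sentinel line printed after a successful push (an assumption, not something this PRD specifies) would be more reliable.

```shell
# Sketch: log-parsing completion detection.
# Reads a log stream on stdin. Returns 0 as soon as a line contains the marker,
# or 1 if the stream ends without it. The default marker follows the PRD.
wait_for_push_marker() {
  local marker="${1:-git push}"
  grep -q -F -- "$marker"
}
```

It would typically sit at the end of a pipeline, for example `scripts/tail-logs.sh dev T1 | wait_for_push_marker`, where the arguments to tail-logs.sh are again assumed.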

Q3: Pod Timeout

Options:

No timeout (run until done)
Fixed timeout (e.g., 2 hours)
Configurable timeout

Decision: Configurable via Helm values, default 2 hours
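
One way to wire the configurable timeout, sketched under assumptions: accept a human-friendly duration, default to 2 hours, and pass it to the chart in seconds so the pod template can set the Pod spec field `activeDeadlineSeconds`. The value key `pod.activeDeadlineSeconds` and both function names are hypothetical.

```shell
# Sketch: normalize a timeout such as "2h", "90m", "45s", or plain seconds.
# Defaults to 2 hours, matching the decision above.
to_seconds() {
  local value="${1:-2h}" number unit
  number="${value%[hms]}"
  unit="${value#"$number"}"
  case "$number" in
    ''|*[!0-9]*|0[0-9]*)
      echo "invalid duration: $value" >&2
      return 1 ;;
  esac
  case "$unit" in
    h) echo $(( number * 3600 )) ;;
    m) echo $(( number * 60 )) ;;
    *) echo "$number" ;;
  esac
}

# Build the Helm flag for the timeout. The value key is a hypothetical name.
timeout_flag() {
  local secs
  secs="$(to_seconds "$1")" || return 1
  printf '%s\n' "--set pod.activeDeadlineSeconds=$secs"
}
```

Letting Kubernetes enforce the deadline means a hung pod is terminated even if the main session that spawned it has gone away.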

Q4: Cleanup Policy

Options:

Always delete after completion
Keep for debugging
Configurable

Decision: Configurable via Helm values

Q5: Secret Store Provider

Options:

GCP Secret Manager
AWS Secrets Manager
Azure Key Vault
HashiCorp Vault

Decision: Generic External Secrets Operator, user configures provider

8. Risks

Risk	Impact	Mitigation
External Secrets Operator not installed	High	Document prerequisite
Pod fails to clone repo	High	Test git credentials early
OpenCode run fails in pod	Medium	Proper error logging
Pod hangs indefinitely	Medium	Implement timeout
Git push conflicts	Low	Each task pushes to its own branch

9. Success Metrics

Metric	Target
Pod creation success rate	>95%
Task completion rate	>90%
Time from spawn to completion	<2 hours average
Human visibility	100% of agent activity visible
Security audit pass	No hardcoded secrets

10. Appendix

Terminology

Term	Definition
[ASYNC] task	Task that can be delegated to an autonomous agent
[SYNC] task	Task requiring human interaction
Subagent	OpenCode's built-in agent spawning capability
Worktree	Git feature for multiple working directories; used for parallel task isolation
ESO	External Secrets Operator
Workload Identity	Cloud-native identity for K8s workloads
GitOps	Declarative continuous deployment using git

Deployment Commands

# Development
helm upgrade --install agentic-sdlc-agent-runner-dev \
  releases/agentic-sdlc-agent-runner-dev \
  -n agent-runner-dev --create-namespace

# Staging
helm upgrade --install agentic-sdlc-agent-runner-stg \
  releases/agentic-sdlc-agent-runner-stg \
  -n agent-runner-stg --create-namespace

# Production
helm upgrade --install agentic-sdlc-agent-runner-prod \
  releases/agentic-sdlc-agent-runner-prod \
  -n agent-runner-prod --create-namespace
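
The three commands differ only in the environment suffix, so they could be wrapped in one helper. This is a sketch; the function name `deploy_env` and the `HELM` override are assumptions, while the release, chart, and namespace names are taken from the commands above.

```shell
# Sketch: one function for the three per-environment commands.
HELM="${HELM:-helm}"   # set HELM=echo to preview the command without running it

deploy_env() {
  local env="$1"
  case "$env" in
    dev|stg|prod) ;;
    *)
      echo "unknown environment: $env (expected dev, stg, or prod)" >&2
      return 1 ;;
  esac
  "$HELM" upgrade --install "agentic-sdlc-agent-runner-$env" \
    "releases/agentic-sdlc-agent-runner-$env" \
    -n "agent-runner-$env" --create-namespace
}
```

Rejecting unknown environment names up front avoids creating a stray namespace from a typo, since `--create-namespace` would otherwise create it silently.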
