Batch Feature Processing

Last Updated: 2025-12-23
Version: Enhanced in v3.24.0, Simplified in v3.32.0 (Issue #88), Automatic retry added v3.33.0 (Issue #89), Consent bypass added v3.35.0 (Issue #96), Git automation added v3.36.0 (Issue #93), Dependency analysis added v3.44.0 (Issue #157)
Command: /batch-implement

This document describes the batch feature processing system for sequential multi-feature development with intelligent state management, automatic context management, and per-feature git automation.


Overview

Process multiple features sequentially with persistent, file-based state and automatic context management. Supports 50+ features without manual intervention.

Workflow: Parse input → Create batch state → For each: /auto-implement → Continue (Claude Code handles context automatically)


Usage Options

1. File-Based Input

Create a plain text file with one feature per line:

# Authentication
Add user login with JWT
Add password reset flow

Then run:

/batch-implement <features-file>

2. GitHub Issues Input (NEW in v3.24.0)

Fetch feature titles directly from GitHub issues:

/batch-implement --issues 72 73 74
# Fetches: "Issue #72: [title]", "Issue #73: [title]", "Issue #74: [title]"

Requirements:

  • gh CLI v2.0+ installed
  • One-time authentication: gh auth login

Prerequisites for Unattended Batch Processing (NEW in v3.35.0 - Issue #96)

For fully unattended batch processing (4-5 features, ~2 hours), configure git automation to bypass interactive prompts.

Why This Matters: By default, /auto-implement prompts for consent on first run. During batch processing, this prompt blocks the entire batch from continuing, defeating the purpose of unattended processing.

Configure for Unattended Batches

Option 1: .env File (Recommended)

Create or update .env in your project root:

# Enable automatic git operations (no prompts during batch)
AUTO_GIT_ENABLED=true

# Optional: Control specific git operations
AUTO_GIT_PUSH=true   # Default: auto-push to remote
AUTO_GIT_PR=true     # Default: auto-create pull requests

Then run your batch:

/batch-implement features.txt
# No prompts - runs fully unattended

Option 2: Environment Variables (Shell)

Set environment variables before running batch:

export AUTO_GIT_ENABLED=true
export AUTO_GIT_PUSH=true
export AUTO_GIT_PR=true

/batch-implement features.txt

Option 3: Minimal (Commit Only, No Push)

If you prefer committing locally without pushing during batch:

# .env file
AUTO_GIT_ENABLED=true
AUTO_GIT_PUSH=false    # Don't push during batch

Then:

/batch-implement features.txt
# Features committed locally, not pushed
# Manually push when batch completes: git push

How It Works

Issue #96 (v3.35.0): /auto-implement STEP 5 now checks AUTO_GIT_ENABLED environment variable BEFORE showing interactive consent prompt.

Behavior:

  • AUTO_GIT_ENABLED=true: Auto-proceed with git operations, skip prompt
  • AUTO_GIT_ENABLED=false: Skip git operations entirely, skip prompt
  • Not set: First run shows the interactive consent prompt; the answer is stored and reused on future runs

In Batches: When processing multiple features, the environment variable is checked for each feature:

  • Feature 1: Checks env var → auto-proceeds (no prompt)
  • Feature 2: Checks env var → auto-proceeds (no prompt)
  • Feature 3-5: Checks env var → auto-proceeds (no prompt)

Result: Fully unattended processing with zero blocking prompts.
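
The env-var check described above can be sketched in a few lines (function name hypothetical; the actual logic lives in /auto-implement STEP 5):

```python
import os

def git_consent_from_env(env=None):
    """Resolve git-automation consent from AUTO_GIT_ENABLED.

    Returns True (auto-proceed), False (skip git ops), or None
    (variable unset: fall back to the interactive consent prompt).
    """
    env = os.environ if env is None else env
    raw = env.get("AUTO_GIT_ENABLED")
    if raw is None:
        return None  # first run: interactive prompt decides
    return raw.strip().lower() in ("true", "1", "yes")
```

In batch mode this check runs once per feature, which is why no prompt ever blocks the batch.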

Verification

Before starting your batch, verify configuration:

# Check environment variable
echo $AUTO_GIT_ENABLED

# Or check .env file
grep AUTO_GIT .env

Expected output:

AUTO_GIT_ENABLED=true

State Management (Enhanced in v3.24.0)

Persistent State

State tracked in .claude/batch_state.json:

{
  "batch_id": "batch-20251116-123456",
  "current_index": 3,
  "completed": ["feature1", "feature2", "feature3"],
  "failed": [],
  "status": "in_progress",
  "context_token_estimate": 85000,
  "issue_numbers": [72, 73, 74],
  "source_type": "github_issues"
}
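
Reading and updating this file takes only stdlib Python; a minimal sketch (helper names are illustrative, not the batch_state_manager API):

```python
import json
from pathlib import Path

STATE_PATH = Path(".claude/batch_state.json")  # location from this doc

def load_batch_state(path=STATE_PATH):
    """Load batch state, or None if no batch is in progress."""
    path = Path(path)
    if not path.exists():
        return None
    return json.loads(path.read_text())

def mark_feature_done(state, feature_name, path=STATE_PATH):
    """Record a completed feature, advance the index, and persist."""
    state["completed"].append(feature_name)
    state["current_index"] += 1
    Path(path).write_text(json.dumps(state, indent=2))
    return state
```

Because every update is flushed to disk, progress survives crashes and context compaction alike.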

Dependency Analysis (NEW in v3.44.0 - Issue #157)

Smart dependency ordering for intelligent feature sequencing

Overview

When processing multiple features with /batch-implement, features may have implicit dependencies (e.g., implementing auth before testing it, or modifying a shared file). The dependency analyzer automatically detects these relationships and reorders features to prevent conflicts.

How It Works:

  1. Analyze Phase: Parse feature descriptions for dependency keywords
  2. Detect Phase: Build dependency graph from keyword analysis
  3. Order Phase: Use topological sort to find optimal execution order
  4. Validate Phase: Detect circular dependencies (prevent impossible orderings)
  5. Execute Phase: Process features in dependency-optimized order

Dependency Keywords

The analyzer detects these keywords in feature descriptions:

Dependency Keywords:

  • requires - Feature X requires Feature Y to be implemented first
  • depends - Feature X depends on Feature Y
  • after - Feature X should run after Feature Y
  • before - Feature X should run before Feature Y
  • uses - Feature X uses/modifies code from Feature Y
  • needs - Feature X needs Feature Y as a prerequisite

File References:

  • .py, .md, .json, .yaml, .yml, .sh, .ts, .js, .tsx, .jsx

Example: Dependency Detection

Given these features:

Add JWT authentication module
Add tests for JWT validation (requires JWT authentication)
Add password reset endpoint (requires auth, uses email service)
Add email service module

The analyzer detects:

  • Feature 1 (tests) depends on Feature 0 (auth)
  • Feature 2 (password reset) depends on Feature 0 (auth)
  • Feature 2 (password reset) depends on Feature 3 (email)
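
The keyword detection above can be approximated with a short sketch (the idea only, not the shipped feature_dependency_analyzer): words following a dependency keyword are matched against the other features' descriptions.

```python
import re

# Forward dependency keywords from this doc; "before" reverses the
# direction and is omitted from this sketch.
KEYWORDS = {"requires", "depends", "after", "uses", "needs"}

def _words(text):
    return re.findall(r"[a-z0-9]+", text.lower())

def _title_words(ws):
    """Words before the first keyword (the feature's 'title' part)."""
    out = []
    for w in ws:
        if w in KEYWORDS:
            break
        out.append(w)
    return out

def detect_dependencies(features):
    """Map feature index -> indices it depends on."""
    words = [_words(f) for f in features]
    titles = [_title_words(ws) for ws in words]
    deps = {i: [] for i in range(len(features))}
    for i, ws in enumerate(words):
        for k, w in enumerate(ws):
            if w not in KEYWORDS:
                continue
            # phrase = words after the keyword, up to the next keyword
            phrase = []
            for nxt in ws[k + 1:]:
                if nxt in KEYWORDS:
                    break
                if len(nxt) >= 3:  # skip filler like "on", "a"
                    phrase.append(nxt)
            for j in range(len(features)):
                if j == i or j in deps[i]:
                    continue
                # prefix match lets "auth" hit "authentication"
                if any(v.startswith(p) for p in phrase for v in titles[j]):
                    deps[i].append(j)
    return deps
```

On the four features above, this sketch reproduces the three detected dependencies.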

Optimal Ordering

Using topological sort (Kahn's algorithm), features are reordered:

Original Order:        Optimized Order:
1. Add JWT auth        1. Add JWT auth (no deps)
2. Add tests (dep 1)   2. Add email service (no deps)
3. Add password reset  3. Add tests (depends on JWT)
4. Add email service   4. Add password reset (depends on JWT, email)

Benefits:

  • Tests run after implementation (can pass)
  • Features with dependencies run after prerequisites (can access needed code)
  • Files modified in correct order (avoid conflicts)

Circular Dependency Detection

If the analyzer detects circular dependencies, it:

  1. Reports the cycle - Shows which features form the loop
  2. Gracefully degrades - Falls back to original order
  3. Continues processing - Batch doesn't fail, just uses original order

Example Circular:

Feature A depends on Feature B
Feature B depends on Feature A

Result: Uses original order, logs warning

ASCII Graph Visualization

When dependency analysis completes, users see:

Dependency Analysis Complete:
  Total dependencies detected: 3
  Independent features: 1
  Dependent features: 3

Feature Dependency Graph
========================

Feature 0: Add JWT authentication
  └─> [no dependencies]

Feature 1: Add tests for JWT (requires JWT)
  └─> [depends on] Feature 0: Add JWT authentication

Feature 2: Add password reset (requires auth, uses email)
  └─> [depends on] Feature 0: Add JWT authentication
  └─> [depends on] Feature 3: Add email service

Feature 3: Add email service
  └─> [no dependencies]
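
Rendering this view is just a walk over the dependency map; a sketch of the idea behind visualize_graph() (output format approximated):

```python
def render_graph(features, deps):
    """Render a dependency graph in the ASCII style shown above.

    deps maps feature index -> list of prerequisite indices.
    """
    lines = ["Feature Dependency Graph", "=" * 24, ""]
    for i, feat in enumerate(features):
        lines.append(f"Feature {i}: {feat}")
        if not deps.get(i):
            lines.append("  \u2514\u2500> [no dependencies]")
        for j in deps.get(i, []):
            lines.append(f"  \u2514\u2500> [depends on] Feature {j}: {features[j]}")
        lines.append("")
    return "\n".join(lines)
```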

State Storage

Dependency information is stored in batch state:

{
  "batch_id": "batch-20251223-features",
  "feature_order": [0, 3, 1, 2],
  "feature_dependencies": {
    "0": [],
    "1": [0],
    "2": [0, 3],
    "3": []
  },
  "analysis_metadata": {
    "stats": {
      "total_dependencies": 3,
      "independent_features": 1,
      "dependent_features": 3,
      "max_depth": 2,
      "total_features": 4
    },
    "analyzed_at": "2025-12-23T10:00:00Z"
  }
}

Performance

Analysis Time:

  • Typical (50 features): <100ms
  • Large (500 features): <500ms
  • Max (1000 features): <1000ms (timeout: 5 seconds)

Memory: O(V + E) where V = features, E = dependencies

  • Linear in feature count, not exponential
  • Safe for 100+ feature batches

Algorithm: Kahn's algorithm for topological sort

  • Time complexity: O(V + E)
  • Space complexity: O(V + E)
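
Kahn's algorithm with the circular-dependency fallback described above can be sketched as:

```python
from collections import deque

def topo_order(n, deps):
    """Kahn's algorithm sketch. deps maps feature index -> prerequisite
    indices. Returns an order where every prerequisite precedes its
    dependents; falls back to the original order on a cycle."""
    indegree = {i: len(deps.get(i, [])) for i in range(n)}
    dependents = {i: [] for i in range(n)}
    for i in range(n):
        for j in deps.get(i, []):
            dependents[j].append(i)
    # start from features with no prerequisites, lowest index first
    queue = deque(sorted(i for i in range(n) if indegree[i] == 0))
    order = []
    while queue:
        j = queue.popleft()
        order.append(j)
        for i in dependents[j]:
            indegree[i] -= 1
            if indegree[i] == 0:
                queue.append(i)
    if len(order) < n:  # leftover nodes => circular dependency
        return list(range(n))  # graceful degradation: original order
    return order
```

On the JWT example (deps {1: [0], 2: [0, 3]}), this yields [0, 3, 1, 2], matching the feature_order shown in the state storage example.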

Security

Input Validation:

  • Text sanitization (max 10,000 chars per feature)
  • No shell execution
  • Path traversal protection (CWE-22)
  • Command injection prevention (CWE-78)

Resource Limits:

  • MAX_FEATURES: 1000
  • TIMEOUT_SECONDS: 5
  • Memory limits enforced
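
A sketch of the input-validation step, assuming only the limits stated above (the shipped analyzer adds path-traversal and command-injection checks on top):

```python
MAX_FEATURES = 1000        # resource limits from this section
MAX_FEATURE_CHARS = 10_000

def sanitize_features(features):
    """Enforce count and length limits before analysis (sketch)."""
    if len(features) > MAX_FEATURES:
        raise ValueError(f"too many features: {len(features)} > {MAX_FEATURES}")
    cleaned = []
    for f in features:
        f = f.strip()
        if len(f) > MAX_FEATURE_CHARS:
            raise ValueError("feature description exceeds 10,000 chars")
        cleaned.append(f)
    return cleaned
```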

Graceful Degradation

If dependency analysis fails:

from plugins.autonomous_dev.lib.feature_dependency_analyzer import (
    analyze_dependencies,
    topological_sort,
)

try:
    deps = analyze_dependencies(features)
    order = topological_sort(features, deps)
except Exception as e:
    print(f"Dependency analysis failed: {e}")
    order = list(range(len(features)))  # Use original order
    print("Continuing with original order...")

Result: Batch processing continues with original order, no data loss

Implementation Details

File: plugins/autonomous-dev/lib/feature_dependency_analyzer.py (509 lines)

Key Functions:

  • analyze_dependencies(features) - Main entry point
  • topological_sort(features, deps) - Reorder using Kahn's algorithm
  • visualize_graph(features, deps) - Generate ASCII visualization
  • get_execution_order_stats(features, deps, order) - Statistics

See docs/LIBRARIES.md section 33 for complete API reference.

Integration with /batch-implement

STEP 1.5 of /batch-implement now analyzes dependencies:

# STEP 1.5: Analyze Dependencies and Optimize Order (Issue #157)

from plugins.autonomous_dev.lib.feature_dependency_analyzer import (
    analyze_dependencies,
    topological_sort,
    visualize_graph,
    get_execution_order_stats
)

try:
    deps = analyze_dependencies(features)
    feature_order = topological_sort(features, deps)
    stats = get_execution_order_stats(features, deps, feature_order)
    graph = visualize_graph(features, deps)

    state.feature_dependencies = deps
    state.feature_order = feature_order
    state.analysis_metadata = {"stats": stats}

    print(f"Dependencies detected: {stats['total_dependencies']}")
    print(graph)

except Exception as e:
    print(f"Dependency analysis failed: {e}")
    feature_order = list(range(len(features)))
    state.feature_order = feature_order
    state.feature_dependencies = {i: [] for i in range(len(features))}

Then STEP 2+ uses state.feature_order for processing order.

Related Documentation

  • docs/LIBRARIES.md section 33 - Complete API reference
  • plugins/autonomous-dev/commands/batch-implement.md - STEP 1.5 implementation
  • plugins/autonomous-dev/lib/batch_state_manager.py - State storage

Examples

Example 1: Simple Linear Dependency

Implement database schema
Add migrations for schema
Run migrations in test

Detected dependencies:

  • Feature 1 depends on Feature 0
  • Feature 2 depends on Feature 1

Optimized order: [0, 1, 2] (same as original - already correct)

Example 2: Multiple Independent Trees

Add JWT authentication
Add tests for JWT
Add password hashing utility
Add hashing tests
Add login endpoint

Detected dependencies:

  • Feature 1 (JWT tests) depends on Feature 0 (JWT)
  • Feature 3 (hashing tests) depends on Feature 2 (hashing)
  • Feature 4 (login) depends on Feature 0 (JWT) and Feature 2 (hashing)

Optimized order: [0, 2, 1, 3, 4]

Example 3: Circular Dependencies (Graceful Degradation)

Feature A (requires B)
Feature B (requires C)
Feature C (requires A)

Detected: Circular dependency detected among [A, B, C]

Result: Uses original order [0, 1, 2], continues processing


Git Automation (NEW in v3.36.0 - Issue #93)

Per-feature git commits during batch processing - Each feature in /batch-implement workflow now automatically creates a git commit with conventional commit messages, optional push, and optional PR creation.

Overview

When processing multiple features with /batch-implement, the workflow now includes automatic git operations for each completed feature:

  1. Feature completes: All tests pass, docs updated, quality checks done
  2. Git automation triggers: execute_git_workflow() called with in_batch_mode=True
  3. Commit created: Conventional commit message generated and applied
  4. State recorded: Git operation details saved in batch_state.json for audit trail
  5. Continue: Batch processing moves to next feature

Configuration

Git automation in batch mode uses the same environment variables as /auto-implement:

# .env file (project root)
AUTO_GIT_ENABLED=true      # Master switch (default: true)
AUTO_GIT_PUSH=false        # Disable push during batch (default: true)
AUTO_GIT_PR=false          # Disable PR creation during batch (default: true)

Batch Mode Differences

Batch mode differs from /auto-implement in three ways:

  1. Skips first-run consent prompt - Uses environment variables silently
  2. No interactive prompts - All decisions made via .env configuration
  3. Audit trail in state - Git operations recorded in batch_state.json for debugging

Git State Tracking

Each git operation is recorded in batch_state.json with complete metadata:

{
  "batch_id": "batch-20251206-feature-1",
  "git_operations": {
    "0": {
      "commit": {
        "success": true,
        "timestamp": "2025-12-06T10:00:00Z",
        "sha": "abc123def456",
        "branch": "feature/auth"
      },
      "push": {
        "success": true,
        "timestamp": "2025-12-06T10:00:15Z",
        "branch": "feature/auth",
        "remote": "origin"
      }
    },
    "1": {
      "commit": {
        "success": true,
        "timestamp": "2025-12-06T10:15:00Z",
        "sha": "def456abc123",
        "branch": "feature/jwt"
      }
    }
  }
}
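
Appending to this audit trail is a small state update; an illustrative sketch (helper name hypothetical, not the batch_state_manager API):

```python
from datetime import datetime, timezone

def record_git_operation(state, feature_index, op, result):
    """Record a git operation result under git_operations, mirroring
    the structure shown above. op is "commit", "push", or "pr"."""
    ops = state.setdefault("git_operations", {})
    entry = ops.setdefault(str(feature_index), {})
    entry[op] = dict(result, timestamp=datetime.now(timezone.utc).isoformat())
    return state
```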

Per-Feature Commit Messages

Each feature gets its own commit with a conventional commit message:

feat(auth): add JWT token validation

- Implement token validation middleware
- Add refresh token support
- Update authentication docs

Co-Authored-By: Claude <noreply@anthropic.com>

Generated by the commit-message-generator agent based on changed files and feature context.

Error Handling in Batch

If a git operation fails during batch processing:

  1. Commit failure: Feature still marked as completed; the failed commit is recorded in batch_state.json
  2. Push failure: Commit succeeds, push marked as failed, batch continues
  3. PR failure: Commit and push succeed, PR marked as failed, batch continues

All failures are non-blocking - batch continues to next feature with detailed error recorded.

Audit Trail

View git operation history for a batch:

# Check what git operations succeeded
cat .claude/batch_state.json | jq '.git_operations'

# Example output
{
  "0": {
    "commit": {"success": true, "sha": "abc123..."},
    "push": {"success": false, "error": "Permission denied"}
  },
  "1": {
    "commit": {"success": true, "sha": "def456..."},
    "push": {"success": true}
  }
}

Implementation API

The git automation for batch mode is exposed via:

from auto_implement_git_integration import execute_git_workflow

# Batch mode usage
result = execute_git_workflow(
    workflow_id='batch-20251206-feature-1',
    request='Add JWT validation',
    in_batch_mode=True  # Skip first-run prompts
)

# Returns git operation results (commit sha, push success, PR URL, etc.)

The in_batch_mode=True parameter signals that:

  • First-run consent prompt should be skipped
  • Environment variable consent is still checked
  • This is part of a larger batch workflow

Context Management (Compaction-Resilient)

The batch system uses a compaction-resilient design that survives Claude Code's automatic context summarization, enabling truly unattended operation for large batches.

How It Works:

  1. Externalized state: All progress tracked in batch_state.json, not conversation memory
  2. Self-contained features: Each /auto-implement bootstraps fresh from external sources
  3. Auto-compaction safe: When Claude Code summarizes context, processing continues seamlessly
  4. Git preserves work: Every completed feature is committed before moving on
  5. Resume for crashes only: --resume only needed if Claude Code actually exits/crashes

Why This Works:

Each feature implementation reads from external state:

  • Requirements: Fetched from GitHub issue (not memory)
  • Codebase state: Read from filesystem (not memory)
  • Progress: Tracked in batch_state.json (not memory)
  • Completed work: Committed to git (permanent)

Benefits:

  • Fully unattended: No manual /clear cycles needed
  • Unlimited batch sizes: 50+ features run continuously
  • Auto-compaction safe: Claude Code's summarization doesn't break workflow
  • Zero data loss: State externalized, not dependent on conversation context
  • Crash recovery: --resume available for actual crashes

Crash Recovery

Resume from last completed feature:

/batch-implement --resume batch-20251116-123456

Recovery Process:

  1. Loads state from .claude/batch_state.json
  2. Validates status ("in_progress" for normal resume)
  3. Skips completed features
  4. Continues from current_index
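
The recovery steps above can be sketched as a helper that filters the feature list (illustrative only, not the actual implementation):

```python
def remaining_features(state, features):
    """Return (index, feature) pairs still to process, skipping
    completed features and resuming from current_index."""
    if state.get("status") != "in_progress":
        raise ValueError(f"cannot resume batch in status {state.get('status')!r}")
    start = state.get("current_index", 0)
    done = set(state.get("completed", []))
    return [(i, f) for i, f in enumerate(features)
            if i >= start and f not in done]
```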

Automatic Failure Recovery (NEW in v3.33.0 - Issue #89)

Automatic retry with intelligent failure classification for transient errors and safety limits.

Overview

When a feature fails during /batch-implement, the system automatically classifies the error and retries transient failures while skipping permanent errors.

Key Features:

  • Transient Retry: Network errors, timeouts, API rate limits (automatically retried)
  • Permanent Skip: Syntax errors, import errors, type errors (not retried)
  • Safety Limits: Max 3 retries per feature, circuit breaker after 5 consecutive failures
  • User Consent: First-run prompt (opt-in), can be overridden via .env
  • Audit Logging: All retry attempts logged for debugging

Transient vs Permanent Errors

Transient (Retriable):

  • Network errors (ConnectionError, NetworkError)
  • Timeout errors (TimeoutError)
  • API rate limits (RateLimitError, 429, 503)
  • Temporary service failures (502, 504, TemporaryFailure)

Permanent (Not Retriable):

  • Syntax errors (SyntaxError, IndentationError)
  • Import errors (ImportError, ModuleNotFoundError)
  • Type errors (TypeError, AttributeError, NameError)
  • Value errors (ValueError, KeyError, IndexError)
  • Logic errors (AssertionError)

Retry Decision Logic

When a feature fails, the system checks in order:

  1. Global Retry Limit: Max 50 total retries across all features (hard limit)
  2. Circuit Breaker: Blocks retries after 5 consecutive failures (safety mechanism)
  3. Failure Type: Permanent errors never retried
  4. Per-Feature Limit: Max 3 retries per individual feature

If all checks pass, the feature is automatically retried.
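
The four checks, applied in order, can be sketched as follows (limits from this section; real error classification lives in failure_classifier.py and is richer than this substring match):

```python
MAX_RETRIES_PER_FEATURE = 3
MAX_GLOBAL_RETRIES = 50
CIRCUIT_BREAKER_THRESHOLD = 5

# Transient error markers from this doc (classification sketch)
TRANSIENT = ("ConnectionError", "NetworkError", "TimeoutError",
             "RateLimitError", "429", "502", "503", "504",
             "TemporaryFailure")

def should_retry(error_name, feature_retries, global_retries,
                 consecutive_failures):
    """Apply the four retry checks in order; return (decision, reason)."""
    if global_retries >= MAX_GLOBAL_RETRIES:
        return False, "global retry limit reached"
    if consecutive_failures >= CIRCUIT_BREAKER_THRESHOLD:
        return False, "circuit breaker open"
    if not any(t in error_name for t in TRANSIENT):
        return False, "permanent error, not retried"
    if feature_retries >= MAX_RETRIES_PER_FEATURE:
        return False, "per-feature retry limit reached"
    return True, "transient error, retrying"
```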

First-Run Consent

On first use, you'll see:

╔══════════════════════════════════════════════════════════════╗
║                                                              ║
║  🔄 Automatic Retry for /batch-implement (NEW)              ║
║                                                              ║
║  Automatic retry enabled for transient failures:            ║
║    ✓ Network errors                                         ║
║    ✓ API rate limits                                        ║
║    ✓ Temporary service failures                             ║
║                                                              ║
║  Max 3 retries per feature (prevents infinite loops)        ║
║  Circuit breaker after 5 consecutive failures (safety)      ║
║                                                              ║
║  HOW TO DISABLE:                                            ║
║    Add to .env: BATCH_RETRY_ENABLED=false                   ║
║                                                              ║
╚══════════════════════════════════════════════════════════════╝

Your response is saved to ~/.autonomous-dev/user_state.json and reused for future runs.

Environment Variable Override

To control retry behavior via environment variable:

# Enable automatic retry
export BATCH_RETRY_ENABLED=true

# Disable automatic retry
export BATCH_RETRY_ENABLED=false

# Or in .env file
echo "BATCH_RETRY_ENABLED=true" >> .env

Monitoring Retries

Retry attempts are logged to .claude/audit/ directory with audit trails:

# View retry audit log for specific batch
cat .claude/audit/batch-20251118-123456_retry_audit.jsonl

Each audit entry includes:

  • Timestamp
  • Feature index
  • Retry attempt number
  • Error message (sanitized)
  • Global retry count
  • Decision reason

Circuit Breaker

When a batch experiences 5 consecutive failures:

  1. Circuit Breaker Opens: Retries blocked to prevent resource exhaustion
  2. Continue Processing: Failed features are marked as failed (not skipped)
  3. Manual Reset: Use command to reset breaker after investigation:
    python .claude/batch_retry_manager.py reset-breaker batch-20251118-123456

State Persistence

Retry state persists in .claude/batch_*_retry_state.json. retry_counts maps feature index to retry attempts (here, feature 0 was retried twice and feature 5 once):

{
  "batch_id": "batch-20251118-123456",
  "retry_counts": {
    "0": 2,
    "5": 1
  },
  "global_retry_count": 5,
  "consecutive_failures": 0,
  "circuit_breaker_open": false,
  "created_at": "2025-11-18T10:00:00Z",
  "updated_at": "2025-11-18T10:15:00Z"
}

This allows resuming with retry state intact across crashes.

Security

Automatic retry implements defensive security:

  • CWE-117: Log injection prevention via error message sanitization
  • CWE-22: Path validation for state files
  • CWE-59: Symlink rejection for user state file
  • CWE-400: Resource exhaustion prevention via circuit breaker
  • CWE-732: File permissions secured (0o600 for user state file)

State Tracking

Tracked Metrics

  • Completed features: Successfully processed features
  • Failed features: Features that encountered errors
  • Processing history: Timestamps and token estimates for debugging
  • Current index: Position in feature list
  • Context tokens: Estimated token count (informational only)
  • Issue numbers: Original GitHub issue numbers (for --issues flag)
  • Source type: Input method (file or github_issues)

Progress Maintenance

  • State persists across crashes
  • Automatic resume on restart
  • No duplicate processing
  • Full audit trail of completed work

Use Cases

  1. Sprint Backlogs: Process 10-50 features from sprint planning
  2. Overnight Processing: Queue large feature sets for batch processing
  3. Technical Debt: Clean up 50+ small improvements sequentially
  4. Large Migrations: Handle 50+ feature migrations with state-based tracking

Performance

  • Per Feature: ~20-30 minutes (same as /auto-implement)
  • Context Management: Automatic (Claude Code manages 200K token budget)
  • State Save/Load: <10 seconds per feature (persistent tracking)
  • Scalability: Tested with 50+ features without manual intervention
  • Recovery: Resume from exact failure point

Implementation Files

  • Command: plugins/autonomous-dev/commands/batch-implement.md
  • State Manager: plugins/autonomous-dev/lib/batch_state_manager.py (enhanced v3.33.0 with retry tracking, v3.36.0 with git operations)
  • GitHub Fetcher: plugins/autonomous-dev/lib/github_issue_fetcher.py (v3.24.0)
  • Failure Classifier: plugins/autonomous-dev/lib/failure_classifier.py (v3.33.0 - Issue #89)
  • Retry Manager: plugins/autonomous-dev/lib/batch_retry_manager.py (v3.33.0 - Issue #89)
  • Consent Handler: plugins/autonomous-dev/lib/batch_retry_consent.py (v3.33.0 - Issue #89)
  • Git Integration: plugins/autonomous-dev/lib/auto_implement_git_integration.py (v3.36.0 with execute_git_workflow() batch mode support - Issue #93)
  • State File: .claude/batch_state.json (created automatically, includes git_operations field v3.36.0 - Issue #93)
  • Retry State File: .claude/batch_*_retry_state.json (created per batch for retry tracking)

See Also