
Commit 90997ad

gltanaka and claude committed
feat: Add agentic test workflow for UI test generation (#332)
Add new `pdd test <github_issue_url>` agentic workflow that automates UI test creation through a 9-step process:

1. Duplicate check
2. Documentation research
3. Requirements clarification
4. Frontend detection
5. Test plan creation
6. Test generation
7. Test execution
8. Fix/iterate cycle
9. PR submission

New prompts:
- agentic_test_python.prompt - CLI entry point
- agentic_test_orchestrator_python.prompt - 9-step workflow orchestrator
- 9 LLM step prompts for each workflow phase

Modified:
- cli_python.prompt - Add re-export for agentic_test_main
- README.md - Document new `pdd test` agentic command
- docs/TUTORIALS.md - Add usage tutorial

The existing `pdd test` command remains backward compatible. The --manual flag preserves original behavior.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
1 parent 275111b commit 90997ad

16 files changed

Lines changed: 1739 additions & 0 deletions

README.md

Lines changed: 62 additions & 0 deletions
@@ -28,6 +28,7 @@ For CLI users, PDD also offers powerful **agentic commands** that implement GitH
 - `pdd change <issue-url>` - Implement feature requests (12-step workflow)
 - `pdd bug <issue-url>` - Create failing tests for bugs
 - `pdd fix <issue-url>` - Fix the failing tests
+- `pdd test <issue-url>` - Generate UI tests from issue descriptions (9-step workflow)

 For prompt-based workflows, the **`sync`** command automates the complete development cycle with intelligent decision-making, real-time visual feedback, and sophisticated state management.

@@ -540,6 +541,7 @@ flowchart TB
 change["pdd change &lt;url&gt;"]
 bug["pdd bug &lt;url&gt;"]
 fix_url["pdd fix &lt;url&gt;"]
+test_url["pdd test &lt;url&gt;"]
 end

 sync["pdd sync"]
@@ -570,6 +572,7 @@ flowchart TB
 - **[`change`](#8-change)**: Implement feature requests from GitHub issues (12-step workflow)
 - **[`bug`](#14-bug)**: Analyze bugs and create failing tests from GitHub issues
 - **[`fix`](#6-fix)**: Fix failing tests (supports issue-driven and manual modes)
+- **[`test`](#4-test)**: Generate UI tests from GitHub issues (9-step workflow in agentic mode)

 ### Core Commands (Prompt-Based)
 - **[`sync`](#1-sync)**: **[PRIMARY FOR PROMPT WORKFLOWS]** Automated prompt-to-code cycle
@@ -1433,6 +1436,64 @@ pdd [GLOBAL OPTIONS] example --output examples/factorial_calculator_example.py f

 ### 4. test

+Generate or enhance unit tests for a given code file and its corresponding prompt file. Also supports **agentic mode** for generating UI tests from GitHub issues.
+
+#### Agentic Mode (UI Test Generation)
+
+Generate UI tests from a GitHub issue. The issue describes what needs to be tested (a webpage, CLI, or desktop app), and an agentic workflow analyzes the target, creates a test plan, and generates comprehensive UI tests.
+
+```
+pdd [GLOBAL OPTIONS] test <github-issue-url>
+```
+
+**How it works (9-step workflow with GitHub comments):**
+
+1. **Duplicate check** - Search for existing issues describing the same test requirements. If found, merge the content and close the duplicate. Posts a comment with findings.
+
+2. **Documentation check** - Review the repo documentation and codebase to understand what needs to be tested. Posts a comment with findings.
+
+3. **Analyze & clarify** - Determine whether the issue contains enough information to create tests. Posts a comment requesting clarification if needed.
+
+4. **Detect frontend** - Identify the frontend type: web UI (Next.js, React, etc.), CLI, or desktop app, and determine the appropriate testing framework (e.g., Playwright for web). Posts a comment with the frontend analysis.
+
+5. **Create test plan** - Design a comprehensive test plan and verify it is achievable. Posts a comment requesting information (e.g., credential access) if the plan is blocked.
+
+6. **Generate tests** - Create UI tests in a new worktree, following the test plan. Posts a comment with the generated test code.
+
+7. **Run tests** - Execute the generated tests against the target. Posts a comment with the test results.
+
+8. **Fix & iterate** - Fix any failing tests and re-run until they pass. Posts a comment with fix attempts and the final status.
+
+9. **Submit PR** - Create a draft pull request with the UI tests, linked to the issue. Posts a comment with the PR link.
+
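The nine steps above run in order, and the workflow stops (and later resumes) at any step that needs input. As a rough illustration only, not the actual PDD implementation, the control flow can be sketched like this; `run_step` and the step names are assumptions based on the description above:

```python
from typing import List, Tuple

# Illustrative step names taken from the 9-step description above.
STEPS: List[str] = [
    "duplicate_check", "documentation_check", "analyze_clarify",
    "detect_frontend", "create_test_plan", "generate_tests",
    "run_tests", "fix_iterate", "submit_pr",
]

def run_step(name: str) -> Tuple[bool, str]:
    """Stand-in for an LLM-backed step runner; always succeeds in this sketch."""
    return True, f"{name} completed"

def run_workflow(steps: List[str]) -> List[str]:
    """Run steps in order, stopping at the first blocked step so a later
    invocation can resume from that point."""
    log: List[str] = []
    for name in steps:
        ok, summary = run_step(name)
        log.append(summary)
        if not ok:
            break  # e.g., waiting on clarification in the issue comments
    return log

if __name__ == "__main__":
    for line in run_workflow(STEPS):
        print(line)
```

In the real workflow each step would also post its summary as a GitHub issue comment, which is what makes the run auditable and resumable.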
+**Agentic Options:**
+- `--timeout-adder FLOAT`: Add additional seconds to each step's timeout (default: 0.0)
+- `--no-github-state`: Disable GitHub issue comment-based state persistence and use local-only state
+- `--manual`: Use the legacy prompt-based mode instead of agentic mode
+
+**Cross-Machine Resume**: By default, workflow state is stored in a hidden comment on the GitHub issue, enabling resume from any machine. Use `--no-github-state` to disable this feature. You can also set the `PDD_NO_GITHUB_STATE=1` environment variable.
+
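The `--no-github-state` flag and the `PDD_NO_GITHUB_STATE` environment variable presumably resolve to the same switch. A hedged sketch of how such a toggle is typically computed; the helper name is illustrative, not PDD's actual code:

```python
import os

def use_github_state(no_github_state_flag: bool = False) -> bool:
    """Illustrative helper: decide whether workflow state is stored in a
    hidden GitHub issue comment (True) or kept local-only (False).

    Either the --no-github-state CLI flag or PDD_NO_GITHUB_STATE=1 in the
    environment disables GitHub-backed state.
    """
    if no_github_state_flag:
        return False  # explicit flag always wins
    return os.environ.get("PDD_NO_GITHUB_STATE") != "1"

if __name__ == "__main__":
    os.environ.pop("PDD_NO_GITHUB_STATE", None)
    print(use_github_state())                           # no flag, env unset
    print(use_github_state(no_github_state_flag=True))  # flag disables it
```

Putting the env-var check behind the flag check gives the command-line option precedence, which matches the usual CLI convention.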
+**Example (Agentic Mode):**
+```bash
+# Generate UI tests from a GitHub issue
+pdd test https://github.com/myorg/myrepo/issues/789
+
+# Resume after answering clarifying questions
+pdd test https://github.com/myorg/myrepo/issues/789
+```
+
+**Next Step - Fixing Test Issues:**
+
+If the generated tests reveal issues that need code fixes, use `pdd fix` with the same issue URL:
+
+```bash
+pdd fix https://github.com/myorg/myrepo/issues/789
+```
+
+---
+
+#### Manual Mode (Prompt-Based)
+
 Generate or enhance unit tests for a given code file and its corresponding prompt file.

 Test organization:
@@ -1441,6 +1502,7 @@ Test organization:

 ```
 pdd [GLOBAL OPTIONS] test [OPTIONS] PROMPT_FILE CODE_OR_EXAMPLE_FILE
+pdd [GLOBAL OPTIONS] test --manual [OPTIONS] PROMPT_FILE CODE_OR_EXAMPLE_FILE
 ```

 Arguments:
Lines changed: 167 additions & 0 deletions
@@ -0,0 +1,167 @@
"""
Example usage of the agentic_test_orchestrator module.

This script demonstrates how to invoke the `run_agentic_test_orchestrator` function.
Since the orchestrator relies on internal modules like `run_agentic_task` and
`load_prompt_template`, this example mocks those dependencies to simulate a
successful UI test generation workflow without making actual LLM calls or
requiring a real GitHub issue.

Scenario:
We simulate an issue where a user requests UI tests for a login page.
The orchestrator will step through the 9-step process:
1. Check for duplicate test requests
2. Review codebase documentation
3. Analyze and ask clarifying questions if needed
4. Detect frontend type (web/CLI/desktop)
5. Create test plan
6. Generate UI tests
7. Run tests
8. Fix and iterate on failing tests
9. Submit PR
"""

import sys
from pathlib import Path
from typing import Optional
from unittest.mock import patch

# Ensure the project root is in sys.path so we can import the module
project_root = Path(__file__).resolve().parent.parent
sys.path.append(str(project_root))

try:
    from pdd.agentic_test_orchestrator import run_agentic_test_orchestrator
except ImportError:
    print("Error: Could not import 'pdd.agentic_test_orchestrator'.")
    print("Ensure your PYTHONPATH is set correctly or the file structure matches.")
    sys.exit(1)


def mock_load_prompt_template(template_name: str) -> str:
    """
    Mock implementation of load_prompt_template.

    Returns a dummy prompt string based on the requested template name.
    """
    return f"MOCK PROMPT FOR: {template_name}\nContext: {{issue_content}}"


def mock_run_agentic_task(
    instruction: str,
    cwd: Path,
    verbose: bool,
    quiet: bool,
    label: str,
    timeout: Optional[float] = None,
    max_retries: int = 3,
):
    """
    Mock implementation of run_agentic_task.

    Simulates the output of an LLM agent for each step of the 9-step UI test workflow.
    """
    step_num = label.replace("step", "")

    # Default return values
    success = True
    cost = 0.15  # Simulated cost per step
    provider = "anthropic"
    output = ""

    if step_num == "1":
        output = "No duplicate test requests found. Proceeding with UI test generation."
    elif step_num == "2":
        output = """Codebase review complete:
- Frontend: Next.js application in /frontend
- Auth pages: /frontend/pages/auth/login.tsx, /frontend/pages/auth/register.tsx
- Components: LoginForm, RegisterForm in /frontend/components/auth/
- API routes: /api/auth/login, /api/auth/register"""
    elif step_num == "3":
        output = """Requirements are clear:
- Test login page functionality
- Verify form validation (email format, password requirements)
- Test successful and failed login scenarios
- No clarification needed from author."""
    elif step_num == "4":
        output = """Frontend detected: Next.js (React)
Test framework: Playwright
Base URL: http://localhost:3000
Authentication: Session-based via NextAuth.js"""
    elif step_num == "5":
        output = """Test Plan:
1. Login page renders correctly
2. Form validation - invalid email shows error
3. Form validation - password too short shows error
4. Successful login redirects to dashboard
5. Failed login shows error message
6. Remember me checkbox persists session

Estimated tests: 6 test cases
Files to create: tests/e2e/login.spec.ts"""
    elif step_num == "6":
        output = """FILES_CREATED: tests/e2e/login.spec.ts

Generated Playwright test file with 6 test cases:
- test('login page renders correctly')
- test('shows error for invalid email')
- test('shows error for short password')
- test('successful login redirects to dashboard')
- test('failed login shows error message')
- test('remember me persists session')"""
    elif step_num == "7":
        output = """Test execution results:
6 tests total
5 passed
1 failed: 'remember me persists session' - localStorage not mocked

Overall: 83% pass rate"""
    elif step_num == "8":
        output = """Fixed failing test:
- Added localStorage mock in beforeEach hook
- Re-ran tests: 6/6 passed

FILES_MODIFIED: tests/e2e/login.spec.ts
All tests now passing."""
    elif step_num == "9":
        output = """PR Created: https://github.com/example/myapp/pull/456

Title: Add UI tests for login page (#123)
Branch: test/issue-123
Files: tests/e2e/login.spec.ts"""
    else:
        output = f"Unknown step executed: {step_num}"

    return success, output, cost, provider


def main():
    """Main function to run the agentic test orchestrator simulation."""
    # Define dummy issue data
    issue_data = {
        "issue_url": "https://github.com/example/myapp/issues/123",
        "issue_content": "Create UI tests for the login page. Should test form validation, successful login, and error handling.",
        "repo_owner": "example",
        "repo_name": "myapp",
        "issue_number": 123,
        "issue_author": "test_requester",
        "issue_title": "Add UI tests for login page",
        "cwd": Path("./temp_workspace"),
        "verbose": True,
        "quiet": False,
        "timeout_adder": 0.0,
        "use_github_state": False,  # Disable for simulation
    }

    print("Starting Agentic UI Test Orchestrator Simulation...")
    print("-" * 60)

    # Patch the internal dependencies
    with patch("pdd.agentic_test_orchestrator.load_prompt_template", side_effect=mock_load_prompt_template), \
         patch("pdd.agentic_test_orchestrator.run_agentic_task", side_effect=mock_run_agentic_task):

        # Run the orchestrator
        success, final_msg, total_cost, model, changed_files = run_agentic_test_orchestrator(
            **issue_data
        )

    print("-" * 60)
    print("Simulation Complete.")
    print(f"Success: {success}")
    print(f"Final Message: {final_msg}")
    print(f"Total Cost: ${total_cost:.2f}")
    print(f"Model Used: {model}")
    print(f"Changed Files: {changed_files}")
    print("\nNext step: Review the generated tests and merge the PR.")


if __name__ == "__main__":
    main()

docs/TUTORIALS.md

Lines changed: 28 additions & 0 deletions
@@ -75,6 +75,34 @@ This tutorial walks through implementing a GitHub issue using PDD.
 - The PR is updated with the fix
 - Review and merge when ready

+### Method 4: Generating UI Tests
+
+1. **Create a GitHub Issue**
+   - Describe what needs to be tested (webpage URL, CLI command, or desktop app)
+   - Include screenshots or text descriptions of expected behavior
+   - Specify what elements/interactions should be verified
+
+2. **Generate UI Tests**
+   ```bash
+   pdd test https://github.com/myorg/myrepo/issues/789
+   ```
+   This analyzes the target and creates comprehensive UI tests.
+
+3. **Handle Clarifying Questions**
+   - If PDD needs more information (e.g., credentials or test environment setup), it posts questions to the issue
+   - Answer them in the GitHub issue comments
+   - Run `pdd test` again to resume
+
+4. **Review the Generated Tests**
+   - The PR contains tests for the specified UI (Playwright for web, pytest for CLI, etc.)
+   - Review and adjust the tests as needed
+
+5. **Fix Any Issues Found**
+   ```bash
+   pdd fix https://github.com/myorg/myrepo/issues/789
+   ```
+   Use this if the tests reveal bugs that need fixing.
+
 ### Tips

 - **Resume from anywhere**: Workflow state is saved to GitHub, so you can continue on any machine
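Re-running `pdd test` after answering questions works because the workflow records which steps already completed. A minimal sketch of that resume logic, with a local dict standing in for the hidden GitHub state comment (all names here are illustrative, not PDD's actual internals):

```python
from typing import Dict, List

# Illustrative step names matching the 9-step workflow.
STEPS: List[str] = [
    "duplicate_check", "documentation_check", "analyze_clarify",
    "detect_frontend", "create_test_plan", "generate_tests",
    "run_tests", "fix_iterate", "submit_pr",
]

def resume_point(state: Dict[str, bool], steps: List[str]) -> int:
    """Return the index of the first step not yet marked complete."""
    for i, name in enumerate(steps):
        if not state.get(name, False):
            return i
    return len(steps)  # every step already done

if __name__ == "__main__":
    # A first run that stopped while waiting for clarification would have
    # recorded something like this before exiting:
    saved = {"duplicate_check": True, "documentation_check": True}
    print("resuming at:", STEPS[resume_point(saved, STEPS)])
```

Storing this map in an issue comment rather than a local file is what makes the resume work from any machine.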
