KHAEntertainment · Mar 31, 2026
diff --git a/.claude/skills/grok-multi-agent-api/SKILL.md b/.claude/skills/grok-multi-agent-api/SKILL.md
@@ -0,0 +1,153 @@
+---
+name: grok-multi-agent-api
+description: xAI Grok Multi-Agent API reference for developing and maintaining this plugin. Triggers: "multi-agent api", "grok api", "agent_count", "reasoning effort", "openai sdk usage", "grok-4.20-multi-agent", "api configuration"
+version: 1.0.0
+---
+
+# xAI Grok 4.20 Multi-Agent API Reference
+
+Reference for the Realtime Multi-agent Research API that this plugin wraps. Use this when modifying `src/bridge/grok_bridge.py`, `src/agent/grok_agent.py`, or any bridge code that communicates with xAI/OpenRouter.
+
+## Model ID
+
+```
+grok-4.20-multi-agent
+```
+
+> **Note:** This plugin currently uses `x-ai/grok-4.20-multi-agent-beta` via OpenRouter. The direct xAI API uses `grok-4.20-multi-agent`. Both refer to the same underlying model.
+
+## API Endpoints
+
+| Provider | Base URL | Endpoint |
+|----------|----------|----------|
+| xAI Direct | `https://api.x.ai/v1` | `/responses` |
+| OpenRouter | `https://openrouter.ai/api/v1` | `/chat/completions` |
+
+**This plugin uses OpenRouter** as the gateway. The bridge sends requests to OpenRouter, which proxies them to xAI.
+
+## Agent Count Configuration
+
+| SDK / API | Parameter | 4 Agents | 16 Agents |
+|-----------|-----------|----------|-----------|
+| xAI SDK | `agent_count` | `4` | `16` |
+| OpenAI SDK | `reasoning.effort` | `"low"` or `"medium"` | `"high"` or `"xhigh"` |
+| Vercel AI SDK | `reasoningEffort` | `"low"` or `"medium"` | `"high"` or `"xhigh"` |
+| REST API | `reasoning.effort` | `"low"` or `"medium"` | `"high"` or `"xhigh"` |
+
+- **4 agents**: Quick research, focused queries, lower cost
+- **16 agents**: Deep research, complex multi-faceted topics, higher token usage
+
+In this plugin's bridge code (`grok_bridge.py`), the agent count is sent as `extra_body={"agent_count": N}` via the OpenAI SDK rather than through `reasoning.effort`.
+
+## Built-in Tools
+
+xAI provides server-side tools that can be enabled per request:
+
+| Tool | Description |
+|------|-------------|
+| `web_search` | Web search |
+| `x_search` | X/Twitter search |
+| `code_execution` | Code execution |
+| `collections_search` | Collections search |
+
+When enabled, the server runs the agent loop automatically, invoking tools until the final answer is generated. Server-side tool calls are billed in addition to token usage.
+
+**Important for this plugin:** The bridge currently does NOT pass through built-in tools; it uses the agents for pure reasoning over provided file context. To add tool support, pass the tool definitions in the `tools` parameter.
+
+## Output Behavior
+
+- Only the **leader agent's** final response and tool calls are returned to the caller
+- Sub-agent state (intermediate reasoning, tool calls, outputs) is encrypted
+- Encrypted sub-agent state is included only when `use_encrypted_content=True` (xAI SDK)
+- This keeps default responses clean while preserving context for multi-turn conversations
+
+## Multi-turn Conversations
+
+Use `previous_response_id` to chain turns. The agents use prior context for more targeted follow-up answers.
+
+## API Limitations
+
+- **No Chat Completions API on the direct xAI endpoint**: use the Responses API (`/responses`) or the xAI SDK. The OpenRouter path used by this plugin goes through `/chat/completions` instead.
+- **No `max_tokens`**: the parameter is not supported
+- **No client-side/custom tools**: only built-in tools and remote MCP tools are supported
+- **Only leader output exposed**: sub-agent details are encrypted and omitted unless explicitly requested
+
+## Example: Direct xAI API (Python OpenAI SDK)
+
+```python
+import os
+from openai import OpenAI
+
+client = OpenAI(
+    api_key=os.getenv("XAI_API_KEY"),
+    base_url="https://api.x.ai/v1",
+)
+
+# 4-agent setup
+response = client.responses.create(
+    model="grok-4.20-multi-agent",
+    reasoning={"effort": "low"},
+    input=[
+        {"role": "user", "content": "Analyze this code..."},
+    ],
+)
+
+# 16-agent setup
+response = client.responses.create(
+    model="grok-4.20-multi-agent",
+    reasoning={"effort": "high"},
+    input=[
+        {"role": "user", "content": "Deep analysis..."},
+    ],
+)
+```
+
+## Example: Via OpenRouter (This Plugin's Path)
+
+```python
+import os
+from openai import OpenAI
+
+client = OpenAI(
+    api_key=os.getenv("OPENROUTER_API_KEY"),
+    base_url="https://openrouter.ai/api/v1",
+)
+
+response = client.chat.completions.create(
+    model="x-ai/grok-4.20-multi-agent-beta",
+    extra_body={"agent_count": 4},  # or 16
+    messages=[
+        {"role": "system", "content": "You are..."},
+        {"role": "user", "content": "Analyze..."},
+    ],
+)
+```
+
+## Prompting Best Practices
+
+When constructing system prompts for the multi-agent model:
+
+1. **Set scope and depth explicitly** — "Compare X across dimensions A, B, C" not "Tell me about X"
+2. **Request structured output** — "Present as a comparison table with categories..."
+3. **Specify sources/perspectives** — "Cite academic papers from 2024-2025"
+4. **Break complex research into turns** — Start broad, narrow with follow-ups
+5. **Provide context** — Include relevant constraints and prior knowledge
+
+## Pricing Considerations
+
+All tokens from **both leader and sub-agents** are billed (input, output, reasoning). Server-side tool calls by any agent also count. A single multi-agent request may use significantly more tokens than a standard request. Monitor via `usage` and `server_side_tool_usage` fields.
+
+## Streaming
+
+The xAI SDK supports streaming with `include=["verbose_streaming"]`:
+
+```python
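+import os
+from xai_sdk import Client
+from xai_sdk.chat import user
+
+# xAI SDK client (not the OpenAI client used in the earlier examples)
+client = Client(api_key=os.getenv("XAI_API_KEY"))
+chat = client.chat.create(
+    model="grok-4.20-multi-agent",
+    include=["verbose_streaming"],
+)
+chat.append(user("Analyze this code..."))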
+for response, chunk in chat.stream():
+    if chunk.content:
+        print(chunk.content, end="", flush=True)
+```
+
+This plugin's bridge does not currently stream — it waits for the full response. Streaming support would require changes to `grok_bridge.py:call_grok()` and `src/bridge/index.js`.
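Note on the Agent Count Configuration section of `SKILL.md` above: the table and the two examples can be condensed into one small helper. This is a minimal sketch and not code from `grok_bridge.py`. The names `effort_for_agents` and `build_request_kwargs` are hypothetical, and picking `"low"` and `"high"` is an arbitrary choice among the effort values the table lists. The helper only builds keyword arguments; it sends no request.

```python
from typing import Any, Dict

# Effort values come from the SKILL.md table. "low" and "high" are one
# arbitrary pick among the listed options ("low"/"medium", "high"/"xhigh").
_EFFORT_BY_AGENTS = {4: "low", 16: "high"}


def effort_for_agents(agent_count: int) -> str:
    """Return a reasoning effort string for a supported agent count."""
    try:
        return _EFFORT_BY_AGENTS[agent_count]
    except KeyError:
        raise ValueError(f"agent_count must be 4 or 16, got {agent_count}") from None


def build_request_kwargs(provider: str, agent_count: int, prompt: str) -> Dict[str, Any]:
    """Build keyword arguments for the OpenAI SDK call on either path.

    provider="xai"        -> kwargs for client.responses.create(...)
    provider="openrouter" -> kwargs for client.chat.completions.create(...)
    """
    effort = effort_for_agents(agent_count)  # also validates agent_count
    if provider == "xai":
        return {
            "model": "grok-4.20-multi-agent",
            "reasoning": {"effort": effort},
            "input": [{"role": "user", "content": prompt}],
        }
    if provider == "openrouter":
        return {
            "model": "x-ai/grok-4.20-multi-agent-beta",
            "extra_body": {"agent_count": agent_count},
            "messages": [{"role": "user", "content": prompt}],
        }
    raise ValueError(f"unknown provider: {provider!r}")
```

With a configured client, the result would be unpacked into the matching call, for example `client.chat.completions.create(**build_request_kwargs("openrouter", 4, "Analyze..."))`.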
diff --git a/CLAUDE.md b/CLAUDE.md
@@ -0,0 +1,97 @@
+# CLAUDE.md
+
+This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
+
+## Project Overview
+
+A dual-platform plugin (Claude Code + OpenClaw) that bridges to xAI's **Grok 4.20 Multi-Agent Beta** via the **OpenRouter** API. It gives AI coding agents access to Grok's multi-agent swarm (4 or 16 agents) with ~2M token context for code analysis, refactoring, and generation.
+
+## Build & Development Commands
+
+```bash
+# Build (copies Python bridge + Node wrapper to dist/)
+npm run build
+
+# Test (only checks CLI --help flag)
+npm test
+
+# Lint
+npm run lint
+
+# Clean
+npm run clean
+
+# Install to local platforms
+./install.sh openclaw    # copies to ~/.openclaw/
+./install.sh claude      # copies to ~/.claude/plugins/grok-swarm/
+./install.sh both        # both platforms
+
+# Python deps
+pip3 install -r requirements.txt
+```
+
+Requires Node.js 18+ and Python 3.8+.
+
+## Architecture
+
+Layered bridge pattern — each layer has a single responsibility:
+
+```
+Plugin Layer (TypeScript/manifests)
+    ↓ registers tools and skills
+CLI Wrapper (Node.js — src/bridge/index.js)
+    ↓ timeout enforcement, process spawning
+Python Bridge (src/bridge/grok_bridge.py)
+    ↓ OpenAI SDK → OpenRouter API
+xAI Grok 4.20 Multi-Agent Beta
+```
+
+**Key modules:**
+
+- `src/bridge/grok_bridge.py` — Core API logic: key resolution, mode-based system prompts, file context assembly, code block parsing. The `call_grok()` function is the central entry point.
+- `src/bridge/cli.py` — Unified CLI that dispatches to grok_bridge with argparse.
+- `src/bridge/apply.py` — Parses annotated code blocks and writes files to disk. Supports three annotation formats: `lang:path`, `FILE:` marker, and `# filename.py` comments.
+- `src/bridge/index.js` — Node.js wrapper that enforces timeouts on Python subprocess.
+- `src/bridge/oauth_setup.py` — PKCE OAuth flow for OpenRouter (keeps keys out of LLM context).
+- `src/bridge/usage_tracker.py` — Persistent token/cost tracking.
+- `src/agent/grok_agent.py` — Autonomous loop: discover files → call Grok → apply changes → verify → iterate.
+- `src/shared/patterns.py` — Centralized regex patterns for filename detection, shared between bridge and agent.
+- `src/plugin/index.ts` — OpenClaw plugin: registers `grok_swarm` (single call) and `grok_swarm_agent` (autonomous loop) tools.
+
+## API Key Resolution Priority
+
+`grok_bridge.py:get_api_key()` checks in order:
+1. `OPENROUTER_API_KEY` environment variable
+2. `~/.config/grok-swarm/config.json`
+3. `~/.claude/grok-swarm.local.md`
+4. OpenClaw auth profiles
+
+## Thinking Levels
+
+- **Low** (default): 4-agent swarm — faster, cheaper
+- **High**: 16-agent swarm — triggered by phrases like "16 agent swarm", "high thinking mode", or `--thinking high`
+
+## File Annotation Formats
+
+Code blocks can be annotated three ways for `apply.py` to write them:
+1. Fenced block with language:path — ` ```python:src/main.py `
+2. `FILE: path/to/file.py` marker inside the block
+3. Comment header — `# filename.py` (uses `shared/patterns.py` regex)
+
+## Task Tracking
+
+Uses **bd (beads)** — not TodoWrite or markdown lists:
+```bash
+bd ready              # Find available work
+bd show <id>          # View issue details
+bd update <id> --claim
+bd close <id>
+```
+
+## Code Duplication Note
+
+`skills/grok-refactor/bridge/` and `skills/grok-refactor/shared/` are copies of `src/bridge/` and `src/shared/` respectively (not symlinks). Changes to bridge/shared code must be applied in both locations.
+
+## Version Locations
+
+Version is defined in multiple places and must be kept in sync: `package.json`, `VERSION`, `pyproject.toml`, `CLAWHUB.md`, `.claude-plugin/marketplace.json`, and `platforms/claude/.claude-plugin/plugin.json`. Use `<VERSION>` as the canonical placeholder when referencing version numbers.
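Note on the File Annotation Formats section of `CLAUDE.md` above: the three formats could be detected roughly as follows. This is a sketch under stated assumptions. The regexes and the `detect_path` helper are hypothetical, the real patterns live in `src/shared/patterns.py` and may differ, and the sketch assumes the `FILE:` marker or comment header sits on the first line of the block body.

```python
import re
from typing import Optional

# Hypothetical patterns for illustration only.
# 1. Opening fence with language:path, e.g. python:src/main.py after the backticks
FENCE_PATH = re.compile(r"^`{3}[\w+-]+:(?P<path>\S+)\s*$")
# 2. FILE: marker, optionally behind a comment prefix
FILE_MARKER = re.compile(r"^\s*(?:#|//)?\s*FILE:\s*(?P<path>\S+)\s*$")
# 3. Comment header that consists of nothing but a file name
COMMENT_HEADER = re.compile(r"^\s*(?:#|//)\s*(?P<path>[\w./-]+\.\w+)\s*$")


def detect_path(fence_line: str, first_body_line: str) -> Optional[str]:
    """Return the target path for a code block, or None if it is unannotated."""
    match = FENCE_PATH.match(fence_line.strip())
    if match:
        return match.group("path")
    # Fall back to markers on the first line inside the block.
    for pattern in (FILE_MARKER, COMMENT_HEADER):
        match = pattern.match(first_body_line)
        if match:
            return match.group("path")
    return None
```

The fence annotation is checked first because it is the most explicit; an ordinary comment such as `# set up the client` does not match the header pattern, since it contains spaces and no file extension.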