Claude Proxy Architecture Overview v1.0.20

Purpose

The Claude Proxy provides a low‑cost gateway for Claude Code tools by forwarding requests from Claude (via the ANTHROPIC_BASE_URL environment variable) to either xAI or GroqCloud back‑ends. It translates between Anthropic‑style messages and the OpenAI‑style payloads expected by the back‑ends, injects ultra‑simple tool schemas with OS‑aware descriptions, and selects the most appropriate model for the request.

Key Benefits:

15-20x cost reduction vs direct Anthropic API
All 15+ Claude Code tools working with proper schemas
OS-aware command tool selection (Windows/Unix/macOS)
Non-verbose tool execution (no announcement chatter)
Automatic environment detection and command wrapping

High‑Level Data Flow

+-----------------+       HTTP POST /v1/messages       +-------------------+
|   Claude Code   | ----------------------------------> |   Proxy Server    |
|   (Claude CLI) |                                      | (Flask app)       |
+-----------------+                                      +-------------------+
                                                             |
               +----------------------+   +--------------------------+
               | BaseClaudeProxy      |   |   Console Thread         |
               | (generic server)     |   |  (R/Q/H commands)        |
               +----------------------+   +--------------------------+
                             |
           +-----------------+-----------------+
           |                                   |
+-------------------+                +--------------------+
|   XAIAdapter      |                |   GroqAdapter      |
| (xAI specific)   |                | (Groq specific)    |
+-------------------+                +--------------------+
           |                                   |
+-------------------+                +--------------------+
|   XAIApiClient    |                |   GroqApiClient    |
+-------------------+                +--------------------+
           |                                   |
+-------------------+                +--------------------+
|   XAIModelSelector|               |   GroqModelSelector|
+-------------------+                +--------------------+
           |                                   |
   External xAI API                  External GroqCloud API

Core Components

Component	Responsibility
BaseClaudeProxy (`proxy_core.py`)	Sets up the Flask app, generic `/v1/<path>` route, port conflict detection, and a console thread for runtime commands (R‑restart, Q‑quit, H‑help). Delegates authentication and request handling to an adapter implementation.
Adapter Interface (required methods)	`authenticate() -> bool` – validates the API key. `handle_proxy_request(data: dict) -> Flask response` – processes the incoming payload and returns a response. `name` – human‑readable identifier used in logs.
XAIAdapter (`xai_adapter.py`)	Implements the adapter for the xAI back‑end. Uses `XAIApiClient` for HTTP calls, `XAIModelSelector` for model choice, and the shared `ClaudeToolMapper` / `MessageTransformer` utilities.
GroqAdapter (`groq_claude_proxy_enhanced.py`)	Implements the adapter for GroqCloud. Mirrors the XAI flow but includes special handling for web‑search tools (switches to the `groq/compound` model).
Model Selectors (`XAIModelSelector`, `GroqModelSelector`)	Inspect the user messages to decide which model and reasoning effort to use. Heuristics look for keywords related to reasoning complexity, coding, or explicit web‑search requests.
API Clients (`XAIApiClient`, `GroqApiClient`)	Retrieve API keys from environment variables (`XAI_API_KEY`, `GROQ_API_KEY`) or the Windows registry. Provide a thin wrapper around `requests.post` with retry/back‑off and optional streaming for xAI.
ClaudeToolMapper (`proxy_common.py`)	Maps Groq tool names to Claude‑Code tool names, generates ultra‑simple JSON schemas that Groq/xAI accept, and performs parameter renaming (e.g., `path` → `file_path`).
MessageTransformer (`proxy_common.py`)	Converts Anthropic‑style messages ↔ OpenAI‑style messages, and translates tool calls/results back into the Claude tool‑use format.

Request Flow (Step‑by‑Step)

Client Request – Claude sends a POST to http://localhost:<port>/v1/messages with a JSON payload containing model, messages, optional tools, etc.
BaseClaudeProxy receives the request on the /v1/<path> route, checks the API key via the adapter, and forwards the JSON body to adapter.handle_proxy_request.
Adapter extracts the original model name and messages, then:
- Calls its Model Selector to decide which back‑end model to use and the appropriate reasoning_effort (if applicable).
- Uses MessageTransformer to convert Anthropic messages into the OpenAI‑style format required by the back‑end.
- If the client supplied tools, the adapter injects ultra‑simple tool schemas generated by ClaudeToolMapper and sets tool_choice: "auto".
API Client sends the assembled request to the external service (https://api.x.ai/v1/chat/completions or https://api.groq.com/openai/v1/chat/completions).
Response Handling –
- For text‑only replies, MessageTransformer converts the OpenAI response back to Anthropic format.
- For tool calls, MessageTransformer maps each Groq tool call to the corresponding Claude tool name and argument shape, returning a tool_use block.
- For web‑search tool calls (Groq only), the proxy intercepts the call, re‑issues the request using the groq/compound model, and returns the search results as a plain‑text message.
Flask Response – The transformed payload is returned to Claude, completing the round‑trip.

Model Selection Logic

Groq – GroqModelSelector.select_model_and_config chooses:
- groq/compound when a web‑search tool is present.
- openai/gpt-oss-120b with a reasoning_effort (high/medium) derived from keyword analysis of the message content.
xAI – XAIModelSelector.select_model evaluates two keyword lists (reasoning vs. coding). It prefers:
- grok-4-0709 with high reasoning for complex analytical requests or any Opus model name.
- grok-code-fast-1 for coding‑centric tasks.

Both selectors default to the higher‑capacity model when the heuristics indicate a need for deeper reasoning.

Web‑Search Interception (Groq Only)

When a tool call named web_search or browser_search is detected, the proxy:

Extracts the query.
Switches to the groq/compound model, which has built‑in web‑search capability.
Sends a simple user‑message request ("Search the web for: <query>").
Returns the raw search results as a plain‑text Claude message.
If the secondary request fails, a fallback message informs the user that the search is unavailable.

Configuration & Secrets

Environment Variables –
- GROQ_API_KEY – GroqCloud authentication token.
- XAI_API_KEY – xAI authentication token.
The proxy also reads these variables from the Windows registry under HKCU\Environment for convenience.
No secrets are logged; only high‑level status messages are printed.

Console Commands (Runtime)

Command	Action
`R`	Restart the proxy (re‑executes the Python process).
`Q` / `QUIT`	Gracefully shut down the server and exit.
`H` / `HELP`	Show the help menu with the above commands.

These commands are processed in a background thread (_console_input_handler).

v1.0.26 Architecture Improvements

Bash Command Standardization:

Standard Unix/bash Syntax: Updated tool descriptions to use standard bash commands (rm, ls, cd, cat)
Eliminated PowerShell/cmd Confusion: Removed Windows-specific command instructions that caused execution errors
Claude Code Compatibility: Aligned with Claude Code's native bash execution environment
Command Pass-through: Simplified to pure command forwarding without shell detection or wrapping

v1.0.25 Architecture Improvements

Command Processing Simplification:

Native Claude Code Handling: Removed all proxy command wrapping and translation
Direct Pass-through: Commands sent unchanged to Claude Code for native processing
OS Detection Delegation: Let Claude Code handle shell detection and execution

v1.0.24 Architecture Improvements

PowerShell Integration Attempt:

PowerShell Wrapping: Attempted Windows PowerShell command wrapping
Tool Description Updates: Updated examples to show PowerShell syntax
Translation Removal: Eliminated command translation in favor of native generation

v1.0.23 Architecture Improvements

Windows Command Reliability:

PowerShell Adoption: Switched to PowerShell for Windows command execution
Smart Detection: Added PowerShell vs cmd detection to prevent double-wrapping
Cross-platform Handling: Maintained Unix/Linux command pass-through

v1.0.22 Architecture Improvements

Command Wrapping Removal:

No Command Modification: Removed all Windows command wrapping logic
Direct Execution: Commands passed through unchanged to Claude Code
Simplified Processing: Eliminated complex Windows command detection

v1.0.21 Architecture Improvements

Windows Command Chain Fixes:

Pushd/Popd Implementation: Converted cd /d && command chains to pushd/popd for reliability
Directory Change Handling: Improved Windows directory navigation in command chains
Command Chain Reliability: Fixed issues with && command execution on Windows

v1.0.20 Architecture Improvements

Command Execution Fixes:

Double CMD Wrapping Fix: Prevented duplicate cmd /c wrapping causing command hangs
Parameter Schema Validation: Fixed null parameter filtering for read_file operations
Edit File Parameter Mapping: Corrected path→file_path parameter translation for edit operations
Unicode Encoding Fix: Removed Unicode characters causing Windows startup failures
Windows Command Detection: Improved logic for detecting Windows native commands

v1.0.19 Architecture Improvements

Path Correction & Command Wrapping:

Intelligent Path Correction: Automatic path correction for Windows/Unix compatibility
Command Double-Wrapping Prevention: Fixed infinite cmd /c wrapping loops
Schema Validation Enhancement: Improved tool parameter validation and error handling

v1.0.18 Architecture Improvements

Baseten Adapter & Cost Optimization:

Baseten Integration: Added Baseten adapter for 20% additional cost savings
Model Routing Optimization: Enhanced model selection for cost-effective processing
Dual-Parameter Schema Validation: Fixed schema validation for complex parameter structures

v1.0.17 Architecture Improvements

Plan Mode Removal & Tool Improvements:

Plan Mode Elimination: Removed redundant plan mode causing double announcements
ExitPlanMode Formatting: Improved formatting for plan presentation to users
Schema Validation Fixes: Resolved dual-parameter validation errors

v1.0.16 Architecture Improvements

Version Synchronization:

Unified Version Display: Synchronized all proxy component versions to v1.0.15
Documentation Updates: Updated installer and documentation version references
Release Process: Streamlined version management across all components

v1.0.15 Architecture Improvements

Anti-Verbose Tool Execution:

Direct Tool Execution: Added "Execute directly without announcements" to tool descriptions
Eliminated Verbose Announcements: Prevents "I need to use..." messages before tool calls
Streamlined UX: Tools now execute immediately without explanation overhead
Applied to Critical Tools: ExitPlanMode, TodoWrite, EditFile, MultiEditFile

v1.0.14 Architecture Improvements

Enhanced Parameter Mapping Debugging:

Critical Debug Logging: Added INFO level logging for parameter mapping issues
Tool Validation Tracing: Detailed logging of parameter transformations
Diagnostic Improvements: Better visibility into tool call processing

v1.0.13 Architecture Improvements

Complete Tool Validation Fix:

TodoWrite Parameter Mapping: Fixed tasks→todos, description→content parameter translation
Null Parameter Filtering: Removes null values causing GroqCloud schema validation failures
Tool Schema Compliance: All Claude Code 2025 tools now work correctly
Parameter Auto-Fix: Automatically corrects malformed tool calls from AI models

v1.0.12 Architecture Improvements

TodoWrite Schema Compliance:

Parameter Mapping: Fixed tasks→todos, description→content field mapping
Structure Validation: Auto-generates missing activeForm fields
Format Handling: Supports both string arrays and object arrays
Null Filtering: Removes null parameters preventing validation errors

v1.0.11 Architecture Improvements

Parameter Validation Enhancement:

Null Value Filtering: Removes null limit/offset parameters from file operations
Schema Compliance: Resolves GroqCloud 400 errors for optional parameters
File Operation Stability: Read, Write, Edit tools work without validation failures

v1.0.10 Architecture Improvements

Command Execution Stability:

Double-Wrap Prevention: Prevents cmd /c "cmd /c "command"" nesting
Loop Prevention: Resolves infinite command execution loops
Bash Tool Reliability: Commands execute correctly without hanging

v1.0.9 Architecture Improvements

Windows Compatibility & Debugging:

Unicode Character Removal: Eliminated all Unicode checkmarks preventing Windows encoding errors
Enhanced Debugging: Added comprehensive logging for tool schema generation and parameter mapping
Startup Reliability: Adapters now initialize successfully on all Windows systems
Debug Visibility: Tool validation and parameter mapping operations now logged for troubleshooting

v1.0.8 Architecture Improvements

Critical Tool Parameter Fix:

Fixed Edit Tool Parameters: Resolved parameter mapping for edit_file and multi_edit_file tools
Path-to-FilePath Mapping: Models calling with 'path' parameter now correctly mapped to 'file_path'
Tool Validation Success: Eliminates GroqCloud 400 errors about missing file_path properties
File Operations Working: Edit, MultiEdit, Read, and Write tools now work correctly

v1.0.7 Architecture Improvements

Critical Bug Fix:

Fixed Command Execution: Resolved double cmd /c wrapping that broke git, python, npm and other external commands
Smart Command Detection: Now only wraps Windows internal commands (dir, type, echo) while leaving external programs unwrapped
Proper Git Integration: Git commands now execute correctly without Windows command prompt interference

v1.0.6 Architecture Improvements

Tool Schema Enhancements:

OS-Aware Descriptions: Tools automatically adapt descriptions based on detected OS (Windows/Unix/macOS)
Non-Verbose Execution: Removed "MANDATORY" language that caused announcement chatter
Smart Command Wrapping: run_cmd only wraps Windows internal commands (dir, type, echo) with cmd /c, leaves external programs (git, python, npm) unwrapped
Proper Field Validation: TodoWrite uses exact 3-field schema (content, status, activeForm)

Startup & Debugging:

Version Logging: Displays proxy version on startup for debugging
OS Detection Logging: Shows detected OS and recommended command tools
Centralized Versioning: Single version constant in proxy_common.py

Schema Compliance:

2025 Claude Code Compatible: All tools match latest official specifications
JSON Schema Standards: Proper type, properties, required, enum validation
Cross-Provider Tested: Validated with both xAI Grok and GroqCloud

Error Elimination:

Fixed TodoWrite Corruption: No more invalid id/priority fields
Fixed ExitPlanMode Verbosity: Clean plan execution without announcements
Fixed Command Confusion: Clear OS-specific examples prevent syntax errors

Extending the Proxy

Add a New Back‑End – Implement a new Adapter class that satisfies the three required methods and plug it into BaseClaudeProxy.
Custom Tool Schemas – Extend ClaudeToolMapper.generate_ultra_simple_tools with additional tool definitions. Ensure the schema stays ultra‑simple (no additionalProperties).
Enhanced Model Selection – Refine the keyword lists or add a machine‑learning based classifier inside the selector classes.
Metrics & Logging – Hook into the existing logging configuration to emit structured JSON logs for observability.

This document provides a concise developer‑oriented overview of the Claude Proxy architecture, its components, and the request lifecycle.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Claude Proxy Architecture Overview v1.0.20

Purpose

High‑Level Data Flow

Core Components

Request Flow (Step‑by‑Step)

Model Selection Logic

Web‑Search Interception (Groq Only)

Configuration & Secrets

Console Commands (Runtime)

v1.0.26 Architecture Improvements

v1.0.25 Architecture Improvements

v1.0.24 Architecture Improvements

v1.0.23 Architecture Improvements

v1.0.22 Architecture Improvements

v1.0.21 Architecture Improvements

v1.0.20 Architecture Improvements

v1.0.19 Architecture Improvements

v1.0.18 Architecture Improvements

v1.0.17 Architecture Improvements

v1.0.16 Architecture Improvements

v1.0.15 Architecture Improvements

v1.0.14 Architecture Improvements

v1.0.13 Architecture Improvements

v1.0.12 Architecture Improvements

v1.0.11 Architecture Improvements

v1.0.10 Architecture Improvements

v1.0.9 Architecture Improvements

v1.0.8 Architecture Improvements

v1.0.7 Architecture Improvements

v1.0.6 Architecture Improvements

Extending the Proxy

FilesExpand file tree

architecture.md

Latest commit

History

architecture.md

File metadata and controls

Claude Proxy Architecture Overview v1.0.20

Purpose

High‑Level Data Flow

Core Components

Request Flow (Step‑by‑Step)

Model Selection Logic

Web‑Search Interception (Groq Only)

Configuration & Secrets

Console Commands (Runtime)

v1.0.26 Architecture Improvements

v1.0.25 Architecture Improvements

v1.0.24 Architecture Improvements

v1.0.23 Architecture Improvements

v1.0.22 Architecture Improvements

v1.0.21 Architecture Improvements

v1.0.20 Architecture Improvements

v1.0.19 Architecture Improvements

v1.0.18 Architecture Improvements

v1.0.17 Architecture Improvements

v1.0.16 Architecture Improvements

v1.0.15 Architecture Improvements

v1.0.14 Architecture Improvements

v1.0.13 Architecture Improvements

v1.0.12 Architecture Improvements

v1.0.11 Architecture Improvements

v1.0.10 Architecture Improvements

v1.0.9 Architecture Improvements

v1.0.8 Architecture Improvements

v1.0.7 Architecture Improvements

v1.0.6 Architecture Improvements

Extending the Proxy