[codex] Add on-demand fusion model routing#63
Conversation
|
Codex review: needs real behavior proof before merge. Reviewed July 2, 2026, 6:18 AM ET / 10:18 UTC. Summary Reproducibility: yes. for the review blockers: source inspection shows the catalog/UI advertisement and dynamic local-model pricing paths without needing a live Worker. This PR is a feature, so issue-style reproduction is otherwise not applicable. Review metrics: 2 noteworthy metrics.
Merge readiness Overall follows the weaker of proof and patch quality, so missing proof can cap an otherwise strong patch. Rank-up moves:
Proof guidance:
Mantis proof suggestion Risk before merge
Maintainer options:
Next step before merge
Security Review findings
Review detailsBest possible solution: Repair readiness-gated advertisement and dynamic local-model budget semantics, add full route/admin real behavior proof, then let maintainers decide whether this default-off fusion/provider surface belongs in core. Do we have a high-confidence way to reproduce the issue? Yes for the review blockers: source inspection shows the catalog/UI advertisement and dynamic local-model pricing paths without needing a live Worker. This PR is a feature, so issue-style reproduction is otherwise not applicable. Is this the best way to solve the issue? No; the current patch is not the best landing shape because advertised availability and budget behavior do not match the documented/proxy behavior. The safer path is to fix those semantics and add focused regression coverage before product acceptance. Full review comments:
Overall correctness: patch is incorrect AGENTS.md: found and applied where relevant. Codex review notes: model internal, reasoning high; reviewed against d39ef9cfa07f. Label changesLabel justifications:
Evidence reviewedWhat I checked:
Likely related people:
What the crustacean ranks mean
Shiny media proof means a screenshot, video, or linked artifact directly shows the changed behavior. Runtime, network, CSP, and security claims still need visible diagnostics. How this review workflow works
|
Summary
clawrouter/fusion, an explicitly selected virtual chat modellocal-openaiprovider for Ollama, LM Studio, vLLM, and other OpenAI-compatible serversDesign
Fusion is one sparse adviser layer plus one final synthesizer. Adviser failures fail open. Adviser requests are text-only, exclude tool schemas and images, use bounded context/output, and are inserted into the final request as delimited untrusted evidence. The final request preserves the caller's original tools, multimodal content, streaming mode, and output contract.
The default cost-oriented profile pairs
local/qwen3:8bwithopenai/gpt-4.1-mini. Operators can select any registered chat models. A hosted Cloudflare Worker cannot reach laptop loopback, so production local-model use requires a secured network-reachable endpoint or a locally run ClawRouter.Validation
node --test worker/test/*.test.mjs— 27 passednode --test admin/test/*.test.mjs— 19 passednode --test test/*.test.mjs— 43 passedqwen3:0.6bOpenAI-compatible adviser request returnedLOCAL_FUSION_OKBrowser visual QA was unavailable locally because the existing-Chrome bridge is not installed; the production UI build and admin CI job passed.
Operational notes
clawrouter/fusionis advertised only when enabled and the caller can execute the configured synthesizer