[codex] Add on-demand fusion model routing#63
Conversation
|
Codex review: needs real behavior proof before merge. Reviewed July 5, 2026, 12:06 AM ET / 04:06 UTC. Summary Reproducibility: yes. for the review blockers from source inspection: the catalog/admin code gates fusion model insertion on Review metrics: 3 noteworthy metrics.
Merge readiness Overall follows the weaker of proof and patch quality, so missing proof can cap an otherwise strong patch. Rank-up moves:
Proof guidance:
Mantis proof suggestion Risk before merge
Maintainer options:
Next step before merge
Security Review findings
Review detailsBest possible solution: Land this only after fusion is advertised strictly when executable, dynamic local routes preserve intended pricing metadata, the contributor adds redacted live proof for the route and UI, and a maintainer accepts the new core routing/config surface. Do we have a high-confidence way to reproduce the issue? Yes for the review blockers from source inspection: the catalog/admin code gates fusion model insertion on Is this the best way to solve the issue? No: the proposed direction is plausible, but the current PR is not yet the best mergeable solution because it advertises unavailable fusion routes, misprices dynamic local models, and lacks inspectable real-behavior proof. Full review comments:
Overall correctness: patch is incorrect AGENTS.md: found and applied where relevant. Codex review notes: model internal, reasoning high; reviewed against f3b733f801ae. Label changesLabel changes:
Label justifications:
Evidence reviewedWhat I checked:
Likely related people:
What the crustacean ranks mean
Shiny media proof means a screenshot, video, or linked artifact directly shows the changed behavior. Runtime, network, CSP, and security claims still need visible diagnostics. How this review workflow works
Review history (1 earlier review cycle)
|
Summary
clawrouter/fusion, an explicitly selected virtual chat modellocal-openaiprovider for Ollama, LM Studio, vLLM, and other OpenAI-compatible serversDesign
Fusion is one sparse adviser layer plus one final synthesizer. Adviser failures fail open. Adviser requests are text-only, exclude tool schemas and images, use bounded context/output, and are inserted into the final request as delimited untrusted evidence. The final request preserves the caller's original tools, multimodal content, streaming mode, and output contract.
The default cost-oriented profile pairs
local/qwen3:8bwithopenai/gpt-4.1-mini. Operators can select any registered chat models. A hosted Cloudflare Worker cannot reach laptop loopback, so production local-model use requires a secured network-reachable endpoint or a locally run ClawRouter.Validation
node --test worker/test/*.test.mjs— 27 passednode --test admin/test/*.test.mjs— 19 passednode --test test/*.test.mjs— 43 passedqwen3:0.6bOpenAI-compatible adviser request returnedLOCAL_FUSION_OKBrowser visual QA was unavailable locally because the existing-Chrome bridge is not installed; the production UI build and admin CI job passed.
Operational notes
clawrouter/fusionis advertised only when enabled and the caller can execute the configured synthesizer