fix: use per candidate provider for model_fallbacks by corevibe555 · Pull Request #2143 · sipeed/picoclaw

corevibe555 · 2026-03-29T03:11:12Z

📝 Description

When using model_fallbacks with models from different providers, all fallback requests
were sent to the primary model's api_base with the primary model's api_key instead
of each fallback's own configuration. This made cross-provider fallback chains
non-functional (e.g. an OpenRouter primary with a Gemini fallback would send the Gemini
request to OpenRouter's API, resulting in a 404).

Root cause: a single LLMProvider was constructed from the primary model's
ModelConfig at startup and reused for every candidate in the fallback chain. The chain
only swapped the model ID string — the underlying HTTP client (with its baked-in
api_base and api_key) never changed.

Fix: at agent initialization, a dedicated LLMProvider is pre-created for each
candidate found in model_list and stored in a new CandidateProviders map on
AgentInstance (keyed by provider/model). The fallback run closure now selects the
correct provider for the active candidate from this map, falling back to
agent.Provider when no override is found. This covers both primary fallback candidates
and light-model routing candidates.

🗣️ Type of Change

🐞 Bug fix (non-breaking change which fixes an issue)
✨ New feature (non-breaking change which adds functionality)
📖 Documentation update
⚡ Code refactoring (no functional changes, no api changes)

🤖 AI Code Generation

🤖 Fully AI-generated (100% AI, 0% Human)
🛠️ Mostly AI-generated (AI draft, Human verified/modified)
👨‍💻 Mostly Human-written (Human lead, AI assisted or none)

🔗 Related Issue

Fixes #2140

📚 Technical Context (Skip for Docs)

Reference URL: [BUG] model_fallbacks inherits primary model's api_base/api_key instead of using each fallback's own config #2140
Reasoning: The FallbackChain.Execute callback captured agent.Provider (the
primary model's provider) in its closure and passed only the model ID string per
candidate. Creating providers eagerly at agent creation time (rather than lazily per
request) avoids runtime overhead while ensuring each fallback uses its own credentials.
The CandidateProviders map is keyed by providers.ModelKey(provider, model) to
match the same key used inside the fallback chain's run closure.

🧪 Test Environment

Hardware: PC
OS: Linux
Model/Provider: OpenRouter (primary) + Google Gemini (fallback)
Channels: —

📸 Evidence (Optional)

Click to view Logs/Screenshots

Before fix — fallback routed to OpenRouter with wrong key:

CLAassistant · 2026-03-29T03:11:18Z

All committers have signed the CLA.

yinwm

Thanks for the fix! The root cause analysis is spot-on and the pre-creation approach is clean. I prefer this over #1637 (see my reasoning there).

Two blocking issues before we can merge:

1. Rebase needed — conflicts with merged #2038

This PR branches from before #2038 was merged. In the current main, the fallback closure uses activeProvider (which may be LightProvider when routing selects the light tier):

// current main (after #2038)
return activeProvider.Chat(ctx, messagesForCall, toolDefsForCall, model, llmOpts)

But your change defaults to agent.Provider:

p := agent.Provider  // should be activeProvider

Please rebase onto latest main and ensure the default fallback respects light model routing. The fix should be:

p := activeProvider
if cp, ok := agent.CandidateProviders[providers.ModelKey(provider, model)]; ok {
    p = cp
}
return p.Chat(...)

2. Model matching in registerCandidateProviders is fragile

The direct string comparison fullModel == cfg.ModelList[i].Model assumes model_list entries always use "provider/model" format. Consider reusing resolvedModelConfig() from model_resolution.go instead, which already handles alias resolution and model config lookup.

Non-blocking suggestions:

Replace log.Printf with logger.WarnCF for consistency with the rest of the codebase
Add unit tests for cross-provider fallback resolution (this is important for a core routing fix)

Once these are addressed, I'm happy to approve.

corevibe555 · 2026-03-29T18:12:42Z

Thanks for the feedback.
I am working on this.

corevibe555 · 2026-03-29T22:19:22Z

Done
Done as requested(Updated)

corevibe555 · 2026-03-31T02:06:26Z

@yinwm Would you give feedback so I can improve as per your view? Thank you!

corevibe555 · 2026-03-31T05:09:51Z

Squashed commit history, so I made the commits into one meaningful commit.

corevibe555 · 2026-04-01T23:35:58Z

@yinwm Another update here.
I've gone through the PR and updated again as per your original request.

Rebase needed — conflicts with merged fix(agent): use light provider for routed model calls #2038
Rebase done, code updated properly as per your comment.
Model matching in registerCandidateProviders is fragile
Utilized resolvedModelConfig(), fixed tests.

Each fallback model now uses its own api_base and api_key from model_list instead of inheriting the primary model's provider config. Previously, a single LLMProvider was created from the primary model's ModelConfig and reused for all fallback candidates — only the model ID string was swapped. This caused all fallback requests to be routed to the primary provider's endpoint, making cross-provider fallback chains non-functional (e.g., OpenRouter primary with Gemini fallback would send the Gemini request to OpenRouter's API). Fix: pre-create a per-candidate LLMProvider at agent initialization time by looking up each candidate's ModelConfig from model_list. The fallback run closure now selects the correct provider per candidate via CandidateProviders map, falling back to agent.Provider when no override is found. Fixes sipeed#2140 Made-with: Cursor test: add test for instance.go fix: fix test refactor: optimize fix: fix Golang lint issues chore: comment cleanup

yinwm

Great work! Both blocking issues from the first review have been properly addressed. The rebase onto latest main is clean, activeProvider is correctly used as the default, and resolvedModelConfig() is now the canonical resolution path. The test coverage is thorough — 352 lines covering the exact #2140 scenario, edge cases, and the graceful fallback to activeProvider for unregistered candidates.

One non-blocking suggestion for a follow-up: populateCandidateProvidersFromNames uses resolvedModelConfig() (which only matches by model_name), but buildModelListResolver (used by resolveModelCandidates) has an additional fallback path that also matches by Model and modelID. This means if a fallback is referenced by model ID instead of alias, the candidate will be created in the fallback chain but won't have a corresponding CandidateProviders entry. Consider aligning these two resolution paths in a follow-up PR.

sipeed-bot · 2026-04-08T05:35:33Z

@corevibe555 Great debugging on the model_fallbacks cross-provider issue. Pinning it down to a single LLMProvider getting reused across every candidate is exactly the kind of root cause that's easy to miss, so the per candidate rebuild is a clean fix.

We're setting up the PicoClaw Dev Group on Discord for contributors to connect and collaborate. If you'd like to join, drop a note to support@sipeed.com with the subject [Join PicoClaw Dev Group] + corevibe555 and we'll send the invite link your way.

* fix: use per-candidate provider for model_fallbacks Each fallback model now uses its own api_base and api_key from model_list instead of inheriting the primary model's provider config. Previously, a single LLMProvider was created from the primary model's ModelConfig and reused for all fallback candidates — only the model ID string was swapped. This caused all fallback requests to be routed to the primary provider's endpoint, making cross-provider fallback chains non-functional (e.g., OpenRouter primary with Gemini fallback would send the Gemini request to OpenRouter's API). Fix: pre-create a per-candidate LLMProvider at agent initialization time by looking up each candidate's ModelConfig from model_list. The fallback run closure now selects the correct provider per candidate via CandidateProviders map, falling back to agent.Provider when no override is found. Fixes sipeed#2140 Made-with: Cursor test: add test for instance.go fix: fix test refactor: optimize fix: fix Golang lint issues chore: comment cleanup * refactor: use resolvedModelConfig() instead of buildModelIndex() * fix

corevibe555 force-pushed the fix/model-fallbacks-per-candidate-provider branch from 73762da to 5635190 Compare March 29, 2026 03:15

sipeed-bot bot added type: bug Something isn't working domain: agent domain: provider go Pull requests that update go code labels Mar 29, 2026

yinwm requested changes Mar 29, 2026

View reviewed changes

yinwm mentioned this pull request Mar 29, 2026

fix(agent): dispatch per-candidate provider in fallback chain #1637

Closed

3 tasks

corevibe555 force-pushed the fix/model-fallbacks-per-candidate-provider branch 3 times, most recently from 6bf2ce7 to e766036 Compare March 29, 2026 20:33

sipeed deleted a comment Mar 31, 2026

corevibe555 force-pushed the fix/model-fallbacks-per-candidate-provider branch from f318ea1 to 3e8b1cf Compare March 31, 2026 05:02

corevibe555 requested a review from yinwm March 31, 2026 22:08

github-actions bot mentioned this pull request Apr 2, 2026

🦞 OpenClaw 生态日报 2026-04-02 gsscsd/big_model_radar#121

Open

corevibe555 added 2 commits April 2, 2026 13:05

refactor: use resolvedModelConfig() instead of buildModelIndex()

15181e5

corevibe555 force-pushed the fix/model-fallbacks-per-candidate-provider branch from 9362c6e to 15181e5 Compare April 2, 2026 11:07

corevibe555 added 2 commits April 2, 2026 13:15

fix: resolve merge conflicts

e4345ce

fix

1ccc40d

github-actions bot mentioned this pull request Apr 4, 2026

🦞 OpenClaw 生态日报 2026-04-04 gsscsd/big_model_radar#131

Open

yinwm approved these changes Apr 7, 2026

View reviewed changes

yinwm merged commit 6ce0306 into sipeed:main Apr 7, 2026
4 checks passed

github-actions bot mentioned this pull request Apr 8, 2026

🦞 OpenClaw 生态日报 2026-04-08 gsscsd/big_model_radar#153

Open

github-actions bot mentioned this pull request Apr 9, 2026

🦞 OpenClaw 生态日报 2026-04-09 gsscsd/big_model_radar#158

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: use per candidate provider for model_fallbacks#2143

fix: use per candidate provider for model_fallbacks#2143
yinwm merged 4 commits intosipeed:mainfrom
corevibe555:fix/model-fallbacks-per-candidate-provider

corevibe555 commented Mar 29, 2026

Uh oh!

CLAassistant commented Mar 29, 2026 •

edited

Loading

Uh oh!

yinwm left a comment

Uh oh!

corevibe555 commented Mar 29, 2026

Uh oh!

corevibe555 commented Mar 29, 2026 •

edited

Loading

Uh oh!

corevibe555 commented Mar 31, 2026

Uh oh!

corevibe555 commented Mar 31, 2026

Uh oh!

corevibe555 commented Apr 1, 2026

Uh oh!

yinwm left a comment

Uh oh!

Uh oh!

sipeed-bot bot commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

corevibe555 commented Mar 29, 2026

📝 Description

🗣️ Type of Change

🤖 AI Code Generation

🔗 Related Issue

📚 Technical Context (Skip for Docs)

🧪 Test Environment

📸 Evidence (Optional)

Uh oh!

CLAassistant commented Mar 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yinwm left a comment

Choose a reason for hiding this comment

Uh oh!

corevibe555 commented Mar 29, 2026

Uh oh!

corevibe555 commented Mar 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

corevibe555 commented Mar 31, 2026

Uh oh!

corevibe555 commented Mar 31, 2026

Uh oh!

corevibe555 commented Apr 1, 2026

Uh oh!

yinwm left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sipeed-bot bot commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CLAassistant commented Mar 29, 2026 •

edited

Loading

corevibe555 commented Mar 29, 2026 •

edited

Loading