From 013252f5d708b24eb43e144d98904ac0bc05be8d Mon Sep 17 00:00:00 2001 From: Sam Xu Date: Sun, 31 May 2026 15:23:42 +0800 Subject: [PATCH] chore(litellm): bump dev image to v1.88.0-rc.1 (fixes chatgpt Responses output bug) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Codex calls (openai-codex/gpt-5.4*) were silently falling through to the OpenRouter Nemotron fallback because LiteLLM's chatgpt/ provider threw `'ResponsesAPIResponse' object has no attribute 'output'` on every real ChatGPT response. Agents posted (via Nemotron) so the failure was invisible — but the dev tier was never running on Codex output despite paying for two ChatGPT accounts. Verified live 2026-05-31: - v1.83.7-stable (prior): fails with the ChatgptException above. - v1.86.2 (latest stable that drops -stable suffix): SAME bug — fix not in v1.86.x. - v1.88.0-rc.1: openai-codex/gpt-5.4-mini returns real Codex output (model field "openai-codex/gpt-5.4-mini", not Nemotron). Pinning to the RC because v1.87.0 stable isn't published on ghcr yet (only RCs available). Revisit when v1.87.0 or v1.88.0 stable lands. No prod values change — values.yaml still on v1.82.3-stable; only dev gets the RC. Co-Authored-By: Claude Opus 4.7 (1M context) --- k8s/helm/commonly/values-dev.yaml | 20 +++++++++++++++----- 1 file changed, 15 insertions(+), 5 deletions(-) diff --git a/k8s/helm/commonly/values-dev.yaml b/k8s/helm/commonly/values-dev.yaml index 30336c75..796d813f 100644 --- a/k8s/helm/commonly/values-dev.yaml +++ b/k8s/helm/commonly/values-dev.yaml @@ -229,13 +229,23 @@ cloudflared: litellm: enabled: true - # Pinned override on top of values.yaml's v1.82.3-stable default. Testing - # whether v1.83.7-stable resolves the chatgpt/gpt-5.4 device-code-on-startup - # crashloop (distinct from but likely related to BerriAI/litellm#25429, which - # is still open upstream as of 2026-04-21). Roll back here if it regresses. + # Pinned override on top of values.yaml's v1.82.3-stable default. + # + # 2026-05-31: bumped from v1.83.7-stable to v1.88.0-rc.1 to fix the + # ChatgptException `'ResponsesAPIResponse' object has no attribute + # 'output'` bug. Symptom: every openai-codex/* call silently fell + # through to the OpenRouter fallback chain (Nemotron) even with valid + # auth.json — agents posted but never on Codex output. The fix lands + # in v1.87.x; first ghcr stable that includes it would be v1.87.0 + # (not yet published as plain v1.87.x — only RCs available). v1.88.0- + # rc.1 is the earliest tag that ghcr serves which has the patch. + # + # Pinning to the RC is the price; revisit when v1.87.0 stable or + # v1.88.0 stable lands on ghcr.io/berriai/litellm. v1.86.2 confirmed + # still broken via direct kubectl set image test (2026-05-31). image: repository: ghcr.io/berriai/litellm - tag: v1.83.7-stable + tag: v1.88.0-rc.1 # BerriAI/litellm#25429 workaround used to live here. Now off — acpx is # being deprecated in commonly (see ADR-005 Stage 3) and we'd rather route # everything through LiteLLM than chase an upstream bug for a code path