Fix Responses API cache retrieval error #9131

Olocool17 · 2025-12-15T17:31:32Z

Previous PR: #9123

Cache retrieval fix

While implementing the fix outlined in #9130 I additionally stumbled on an error pertaining to responses models when using caching.
When an item is successfully retrieved from the cache, an ad-hoc attribute .cache_hit is created and set to True on the response object.

Unfortunately, this is only possible for litellm.ModelResponse (the response for chat models) and not for litellm.ResponsesAPIResponse (the response for response models), because the latter is a Pydantic model without extra="allow" set in its config.

My proposed fix for this is to simply remove this ad-hoc attribute alltogether, since it is actually superflueous: the response.usage attribute is cleared on cache hit anyways, which makes settings.usage_tracker.add_usage(self.model, dict(results.usage) a null-op.

fix: cache retrieval error when using responses model

6906010

Olocool17 mentioned this pull request Dec 15, 2025

Fix responses structured outputs + cache retrieval error #9123

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Responses API cache retrieval error #9131

Fix Responses API cache retrieval error #9131

Uh oh!

Olocool17 commented Dec 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Fix Responses API cache retrieval error #9131

Are you sure you want to change the base?

Fix Responses API cache retrieval error #9131

Uh oh!

Conversation

Olocool17 commented Dec 15, 2025

Cache retrieval fix

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant