fix(langfuse): avoid double-counting cache and reasoning tokens in usage #108

Merged
flore2003 merged 2 commits into main from fix/langfuse-usage-double-counting on Apr 14, 2026
Conversation

@flore2003
Member

Summary

  • Langfuse sums all usageDetails keys containing "input" into "Input usage" and all keys containing "output" into "Output usage". Since input already included cache tokens and output already included reasoning tokens, those tokens were double-counted in the UI breakdown.
  • input now reports inputTokens - cacheReadTokens - cacheWriteTokens (non-cached portion only)
  • output now reports outputTokens - reasoningTokens (non-reasoning portion only)
  • Dropped the explicit total key — Langfuse derives it correctly by summing all keys
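The mapping described above can be sketched as follows. This is a minimal illustration, not the actual patch: the token field names (inputTokens, cacheReadTokens, cacheWriteTokens, outputTokens, reasoningTokens) come from the PR summary, but the function name and the usageDetails key names are hypothetical.

```typescript
// Illustrative token counts as reported by a provider; field names
// follow the PR summary, everything else is assumed.
interface Usage {
  inputTokens: number;       // includes cached input tokens
  outputTokens: number;      // includes reasoning tokens
  cacheReadTokens?: number;
  cacheWriteTokens?: number;
  reasoningTokens?: number;
}

// Langfuse sums every usageDetails key containing "input" into
// "Input usage" and every key containing "output" into "Output usage",
// so each key must carry a disjoint portion of the totals.
function toUsageDetails(u: Usage): Record<string, number> {
  const cacheRead = u.cacheReadTokens ?? 0;
  const cacheWrite = u.cacheWriteTokens ?? 0;
  const reasoning = u.reasoningTokens ?? 0;
  return {
    // non-cached portion only; cache tokens get their own keys
    input: u.inputTokens - cacheRead - cacheWrite,
    input_cache_read: cacheRead,
    input_cache_write: cacheWrite,
    // non-reasoning portion only
    output: u.outputTokens - reasoning,
    output_reasoning: reasoning,
    // no explicit "total" key: Langfuse derives it by summing all keys
  };
}
```

With disjoint keys, summing every "input" key reproduces inputTokens exactly, and summing every "output" key reproduces outputTokens, so nothing is counted twice.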

Test plan

  • All 9 existing tests updated and passing

Langfuse sums all keys containing "input" for Input usage and "output"
for Output usage. Since inputTokens/outputTokens already included
cache/reasoning tokens, they were double-counted. Now input/output
report only their non-overlapping portion, and total is omitted so
Langfuse derives it correctly by summing all keys.
@flore2003 flore2003 merged commit fb9162a into main Apr 14, 2026
3 checks passed
@flore2003 flore2003 deleted the fix/langfuse-usage-double-counting branch April 14, 2026 20:14