The "120x fewer tokens" claim comes from a controlled benchmark. Here's the methodology so you can verify it yourself.
Setup: 5 structural questions about a real codebase (function lookup, call tracing, dead code, route listing, architecture overview). Each question was asked twice — once via codebase-memory-mcp graph queries, once via a Claude Code Explorer agent that uses the Grep/Glob/Read tools.
Measurement: Total input + output tokens consumed by all tool calls to answer each question.
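The measurement above can be sketched in a few lines. This is an illustrative sketch, not the actual benchmark harness — the field names and per-call numbers are made up; only the totals match the table below.

```python
# Sum input + output tokens across every tool call made while answering
# one question. Field names are illustrative, not a real API.
def total_tokens(tool_calls):
    """tool_calls: list of dicts with 'input_tokens' and 'output_tokens'."""
    return sum(c["input_tokens"] + c["output_tokens"] for c in tool_calls)

# One graph query vs. a multi-step grep/read session (hypothetical values):
graph_session = [{"input_tokens": 50, "output_tokens": 150}]
explorer_session = [
    {"input_tokens": 200, "output_tokens": 12_000},  # read file listing
    {"input_tokens": 300, "output_tokens": 18_000},  # grep + read matches
    {"input_tokens": 250, "output_tokens": 14_250},  # follow-up reads
]
print(total_tokens(graph_session))     # 200
print(total_tokens(explorer_session))  # 45000
```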
Results:
| Question Type | Graph (tokens) | Explorer (tokens) | Ratio |
| --- | --- | --- | --- |
| Find function by pattern | ~200 | ~45,000 | 225x |
| Trace call chain (depth 3) | ~800 | ~120,000 | 150x |
| Dead code detection | ~500 | ~85,000 | 170x |
| List all routes | ~400 | ~62,000 | 155x |
| Architecture overview | ~1,500 | ~100,000 | 67x |
| **Total** | **~3,400** | **~412,000** | **121x** |
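You can recompute the per-question and overall ratios from the table yourself:

```python
# (graph_tokens, explorer_tokens) per question, from the table above.
results = {
    "Find function by pattern": (200, 45_000),
    "Trace call chain (depth 3)": (800, 120_000),
    "Dead code detection": (500, 85_000),
    "List all routes": (400, 62_000),
    "Architecture overview": (1_500, 100_000),
}
for name, (graph, explorer) in results.items():
    print(f"{name}: {explorer / graph:.0f}x")

total_graph = sum(g for g, _ in results.values())     # 3400
total_explorer = sum(e for _, e in results.values())  # 412000
print(f"Total: {total_explorer / total_graph:.0f}x")  # 121x
```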
The Explorer agent has to: read file listings → grep for patterns → read matching files → parse the output → grep again for related files → read those. Each step is a tool call with full file contents in the response.
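One round of that loop looks roughly like this — a hypothetical sketch, not the agent's actual implementation. Note that every matching file's full text ends up in the result, which is what the model must then carry in context:

```python
import re
from pathlib import Path

def grep_and_read(root, pattern):
    """One grep-then-read round: returns full contents of matching files."""
    pat = re.compile(pattern)
    hits = []
    for path in Path(root).rglob("*.py"):
        text = path.read_text(errors="ignore")
        if pat.search(text):
            hits.append(text)  # the whole file lands in the context window
    return hits
```

Tracing a call chain means repeating this round: grep for the caller, read the matches, grep those for the callee, read again — and each round adds whole files, not just the lines that matched.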
The graph query returns exactly the structural information in one call. No file contents, no noise, no irrelevant matches.
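To see why that's cheap, here is a minimal sketch of what a call-chain query over a prebuilt graph looks like. The function and node names are hypothetical, and the real codebase-memory-mcp API may differ — the point is that the answer is a handful of edges, not file bodies:

```python
def trace_calls(graph, start, depth):
    """graph: {caller: [callees]}. Returns call edges reachable within `depth`."""
    edges, frontier = [], [start]
    for _ in range(depth):
        next_frontier = []
        for fn in frontier:
            for callee in graph.get(fn, []):
                edges.append((fn, callee))
                next_frontier.append(callee)
        frontier = next_frontier
    return edges

# Toy graph with made-up function names:
graph = {"handle_request": ["authorize", "load_user"],
         "load_user": ["query_db"]}
print(trace_calls(graph, "handle_request", 2))
# [('handle_request', 'authorize'), ('handle_request', 'load_user'),
#  ('load_user', 'query_db')]
```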
Why it matters beyond fitting in the context window: cost ($3-15 per million tokens adds up), latency (seconds of file reading vs. a sub-millisecond graph query), and accuracy (LLMs lose track of details in large contexts).
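The cost point is easy to put numbers on. Back-of-envelope, using the benchmark totals above and the quoted $3-15/M input-token rates:

```python
# Cost of the full 5-question set at two illustrative per-million-token rates.
for rate in (3, 15):
    explorer_cost = 412_000 / 1_000_000 * rate
    graph_cost = 3_400 / 1_000_000 * rate
    print(f"${rate}/M tokens: explorer ${explorer_cost:.2f}, "
          f"graph ${graph_cost:.4f}")
```

At $15/M that's roughly $6.18 per question set for the Explorer agent versus about half a cent for the graph queries.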
Full benchmark data: See BENCHMARK_REPORT.md and the Performance section in the README.