storage_tokens_processed undercounts by tensor_parallel factor by FileSystemGuy · Pull Request #402 · mlcommons/storage

FileSystemGuy · 2026-06-02T15:53:42Z

num_tokens = entry_size / self.model_config.kv_cache_size_per_token — entry_size is the TP-sharded per-rank size (1/tensor_parallel of the full entry), but kv_cache_size_per_token is the full unsharded value. Dividing a sharded byte count by an unsharded per-token size undercounts storage_tokens_processed by a factor of tensor_parallel for every NVMe read when tensor_parallel > 1.

github-actions · 2026-06-02T15:53:59Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

FileSystemGuy · 2026-06-02T21:03:36Z

Please review: @russfellows

storage_tokens_processed undercounts by tensor_parallel factor

e6b4eee

FileSystemGuy requested a review from a team June 2, 2026 15:53

FileSystemGuy requested a review from hazemawadalla June 2, 2026 15:54

idevasena approved these changes Jun 2, 2026

View reviewed changes

dslik added the KVCache TF label Jun 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

storage_tokens_processed undercounts by tensor_parallel factor#402

storage_tokens_processed undercounts by tensor_parallel factor#402
FileSystemGuy wants to merge 1 commit into
mainfrom
FileSystemGuy-kvcache-storage_tokens_processed

FileSystemGuy commented Jun 2, 2026

Uh oh!

github-actions Bot commented Jun 2, 2026

Uh oh!

FileSystemGuy commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

FileSystemGuy commented Jun 2, 2026

Uh oh!

github-actions Bot commented Jun 2, 2026

Uh oh!

FileSystemGuy commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants