
feat(http_server): expose tokenizer SHA256 on /get_model_info for parity verification #15

Open

DavidBellamy wants to merge 19 commits into main from feat/tokenizer-sha256-on-model-info-llm360-fork

Conversation

@DavidBellamy
Collaborator

Summary

Add a `tokenizer_sha256` field to the `/get_model_info` endpoint that returns a deterministic hash of the active tokenizer's canonical JSON form. This lets clients verify that two SGLang instances (or an SGLang instance and a separate trainer/embedding service) are using bit-identical tokenizers before relying on cross-process token-id assumptions.

Why

When token IDs cross process boundaries (e.g. an inference worker emits token IDs that another component uses as input to a separate process), the two sides must use bit-identical tokenizers — including merges, special tokens, and byte fallbacks. A subtle mismatch silently corrupts downstream logic in ways that are hard to diagnose because the IDs still look plausible.

Exposing a tokenizer hash on the existing model-info endpoint gives clients a one-call way to do this consistency check at startup.

Changes (`python/sglang/srt/entrypoints/http_server.py`)

  • New `_compute_tokenizer_sha256()` helper that returns a SHA256 of `tokenizer.backend_tokenizer.to_str()` for HF fast tokenizers, or `None` if the active tokenizer doesn't expose that interface. Cached after first call.
  • `/get_model_info` payload includes `tokenizer_sha256` next to the existing `tokenizer_path` field.

Behavior

  • Field is optional (`None` when the tokenizer doesn't support `to_str()`); existing clients ignore unknown fields.
  • One-time hash, cached.
  • No new dependencies (stdlib `hashlib`).

Provenance

One of five focused PRs that supersede #3.

mickqian and others added 19 commits April 4, 2026 23:37
…alistic perf and auto-discover ut (sgl-project#22086)

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
…gl-project#21649)

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
