core: Implement load balancing policy delay plumbing by AgraVator · Pull Request #12807 · grpc/grpc-java

AgraVator · 2026-05-13T16:47:39Z

This PR implements the plumbing required to propagate delay reason tokens from load balancing policies up to the transport layer and tracers, as specified in the LB policy delay design.

What changed

api: Added delayReasonToken to PickResult and factory method withNoResult(String).
api: Added delayStarted(String) and delayEnded() hooks to ClientStreamTracer to track delay segments.
core: Updated DelayedClientTransport to track delay tokens in PendingStream and notify tracers when delays start, change, or end.
core/util: Updated PickFirst and RoundRobin policies to emit cached tokens when connecting.
xds: Updated RingHash, RLS, and CDS policies to emit specific delay tokens when buffering picks.
xds: Updated PriorityLoadBalancer to wrap child pickers and prepend priority_X: to child tokens to track failovers.

Notes

This change focuses strictly on the plumbing of the delay reasons. Implementation of actual OpenTelemetry metrics and spans is deferred to a later phase.

This commit implements the plumbing required to propagate delay reason tokens from load balancing policies up to the transport layer and tracers, as specified in the LB policy delay design.

AgraVator · 2026-05-19T08:00:22Z

        connectivityState = newState;
-        picker = newPicker;
+        if (newState == CONNECTING || newState == IDLE) {
+          picker = new PriorityPicker(newPicker, priority);


appends "priority_p0:" to the child delay token

shivaspeaks

Quick review, LGTM overall. I'll take a deeper look with implementation doc when I'll be back in office.

This change looks to be something that should be consistent in all the languages. Is there a gRFC baking for this? If so link that PR in description?

shivaspeaks · 2026-06-09T01:47:06Z

+      PickResult childResult = delegate.pickSubchannel(args);
+      if (!childResult.hasResult() && childResult.getDelayReasonToken() != null) {
+        return PickResult.withNoResult(
+            "priority_" + priority + ":" + childResult.getDelayReasonToken());


A question to understand this better from performance perspective.
This string concatenation happens on the hot path for every buffered RPC. If the priority tree is deep or policies are nested, this may lead to,
Allocation Overhead- repeated string and PickResult allocations on every pickSubchannel call.
Metric Cardinality- These nested tokens (e.g., priority_p0:priority_p1:ring_hash:connecting) are used as metric labels. Highly nested tokens can cause a cardinality explosion.

Is there a way we can cache the concatenated PickResult in the PriorityPicker (if the child's result is also cached/static) to avoid per-pick allocations? I assume we need the new childResult's DelayReasonToken, I'm not sure if that stays static or is dynamic. If it stays static then we can move out or else we should at least create "priority_" + priority + ":" + statically?

AgraVator added 2 commits May 13, 2026 22:15

core,api,xds: Implement load balancing policy delay plumbing

b50ca84

This commit implements the plumbing required to propagate delay reason tokens from load balancing policies up to the transport layer and tracers, as specified in the LB policy delay design.

fix: tests

c38ce1d

AgraVator commented May 19, 2026

View reviewed changes

AgraVator added 2 commits May 19, 2026 13:40

fix: minor changes

a992bdf

add missing endDelay()

6a55ff2

AgraVator marked this pull request as ready for review June 8, 2026 11:13

AgraVator requested review from ejona86 and shivaspeaks June 8, 2026 11:14

shivaspeaks reviewed Jun 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core: Implement load balancing policy delay plumbing#12807

core: Implement load balancing policy delay plumbing#12807
AgraVator wants to merge 4 commits into
grpc:masterfrom
AgraVator:lb-policy-delay

AgraVator commented May 13, 2026 •

edited

Loading

Uh oh!

AgraVator May 19, 2026

Uh oh!

shivaspeaks left a comment

Uh oh!

shivaspeaks Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AgraVator commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changed

Notes

Uh oh!

AgraVator May 19, 2026

Choose a reason for hiding this comment

Uh oh!

shivaspeaks left a comment

Choose a reason for hiding this comment

Uh oh!

shivaspeaks Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AgraVator commented May 13, 2026 •

edited

Loading