
Conversation

@michaelawyu
Collaborator

Description of your changes

As a performance improvement, set the work applier to skip status updates if possible.

I have:

  • Run make reviewable to ensure this PR is ready for review.

How has this code been tested

  • Integration tests

Special notes for your reviewer

Signed-off-by: michaelawyu <chenyu1@microsoft.com>
@michaelawyu
Collaborator Author

Note: as anticipated, after this PR the status update logic in the work applier reaches the linter's code complexity limit. To unblock progress, that specific method is now exempted from the linter; separate PRs will follow to refactor it.
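
(For reference, the exemption is just a lint directive along the lines of the sketch below; the method, type, and linter names here are placeholders, not the actual code.)

```go
package applier

// workStatusReconciler is a placeholder type for illustration only.
type workStatusReconciler struct{}

// refreshWorkStatus stands in for the exempted status update method.
//
//nolint:gocyclo // Known complexity hot spot; a refactor is planned in follow-up PRs.
func (r *workStatusReconciler) refreshWorkStatus() error {
	// ...the long status update logic lives here...
	return nil
}
```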

@michaelawyu
Collaborator Author

This PR precedes #118.

Contributor

@ryanzhang-oss left a comment


How can we make sure that a work's status in the cache is actually the same as in etcd? If we just compare against the cache and stop updating, we run the risk of the status never getting updated to the correct state (although the chance is small).

@michaelawyu
Collaborator Author

How can we make sure that a work's status in the cache is actually the same as in etcd? If we just compare against the cache and stop updating, we run the risk of the status never getting updated to the correct state (although the chance is small).

Hi Ryan! The work applier already requeues periodically (with back-off), so unless the client-side cache becomes significantly out of sync with the API server (in which case all writes would be rejected by optimistic locking anyway), any inconsistent status will be overwritten within at most 15 minutes.
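
Roughly, the requeue flow looks like the sketch below (names and constants are illustrative, not the actual work applier code):

```go
package applier

import (
	"time"

	ctrl "sigs.k8s.io/controller-runtime"
)

// maxRequeueDelay caps the exponential back-off; any status left stale by a
// skipped update is re-evaluated within this window at the latest.
const maxRequeueDelay = 15 * time.Minute

// nextRequeueDelay doubles the previous delay up to the cap.
func nextRequeueDelay(lastDelay time.Duration) time.Duration {
	next := lastDelay * 2
	if next <= 0 || next > maxRequeueDelay {
		next = maxRequeueDelay
	}
	return next
}

// requeueResult is returned at the end of a successful reconciliation pass;
// because the work is always requeued, skipping a no-op status write can never
// leave the status permanently out of date.
func requeueResult(lastDelay time.Duration) ctrl.Result {
	return ctrl.Result{RequeueAfter: nextRequeueDelay(lastDelay)}
}
```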

return controller.NewAPIServerError(false, err)

// Skip the status update if no change found.
if equality.Semantic.DeepEqual(originalStatus, &appliedWork.Status) {
Contributor


Do we have an idea of how many calls this generates?

Collaborator Author


Hi Ryan! Do you mean the number of status updates this setup will skip, or the time complexity of the DeepEqual calls?

For the former: in the target perf test environment, each member agent was generating roughly 950K status updates per 24 hours (though this number might be a bit biased because the agents were occasionally restarted).

For the latter: deep-equal comparisons are usually expensive, especially when the object is complex. I don't have specific numbers for deep-equaling Work object status data right now, but I could run a mini benchmark if you are interested.
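
Such a mini benchmark would look roughly like the sketch below (the empty status fixture and the import path are placeholders; real numbers depend on how many manifest conditions a Work carries):

```go
package applier

import (
	"testing"

	"k8s.io/apimachinery/pkg/api/equality"

	fleetv1beta1 "go.goms.io/fleet/apis/placement/v1beta1"
)

// BenchmarkWorkStatusDeepEqual measures the cost of the semantic comparison
// used to decide whether a status update can be skipped.
func BenchmarkWorkStatusDeepEqual(b *testing.B) {
	// An empty status is a placeholder; a realistic fixture would carry a
	// representative number of manifest conditions.
	original := &fleetv1beta1.WorkStatus{}
	current := original.DeepCopy()

	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		_ = equality.Semantic.DeepEqual(original, current)
	}
}
```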

}

func shouldSkipStatusUpdate(isDriftedOrDiffed, isStatusBackReportingOn bool, originalStatus, currentStatus *fleetv1beta1.WorkStatus) bool {
if isDriftedOrDiffed || isStatusBackReportingOn {
Contributor


This will reduce the effectiveness of this PR by a lot; is there any way to soften the blow a bit further?

Collaborator Author

@michaelawyu Dec 19, 2025


Hi Ryan! We might need a flag to omit observation timestamps if that becomes necessary. That said, with exponential backoff, as long as the number of drifted/diffed placements plus status back-reporting placements is low and the system does not restart often, the number of calls should be relatively limited (~96 writes per placement) once everything stabilizes.
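
Roughly speaking, the helper shown in the diff decides as in the sketch below (a simplified version, not the exact merged code; only the signature and the first branch appear in the diff above):

```go
package applier

import (
	"k8s.io/apimachinery/pkg/api/equality"

	fleetv1beta1 "go.goms.io/fleet/apis/placement/v1beta1"
)

// shouldSkipStatusUpdate reports whether the status write can be dropped.
func shouldSkipStatusUpdate(isDriftedOrDiffed, isStatusBackReportingOn bool, originalStatus, currentStatus *fleetv1beta1.WorkStatus) bool {
	// Drift/diff details and back-reported status carry observation timestamps
	// that change on every pass, so those updates are never skipped.
	if isDriftedOrDiffed || isStatusBackReportingOn {
		return false
	}
	// Otherwise skip the write when the status is semantically unchanged.
	return equality.Semantic.DeepEqual(originalStatus, currentStatus)
}
```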
