Conversation

@smudge (Member) commented Dec 10, 2025

This does two things:

  1. Avoids querying both locked_count and working_count separately (since they are synonymous)
  2. Splits the (total) count metric into two: failed_count and live_count

Worth noting that when a scan covers the entire table, a sequential scan might actually be better than an index scan. However, since the monitor already produces a count of failed jobs, we can memoize that result to avoid doing that work twice. (Furthermore, in postgres at least, these don't just become index scans -- they become INDEX ONLY scans, which should be cheaper than table scans since the index should be significantly more compact!)
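The memoization described above can be sketched in plain Ruby. This is a hypothetical illustration, not the gem's actual implementation -- `MonitorSketch` and its `counter` callable are stand-ins for the real monitor and its database queries:

```ruby
# Hypothetical sketch (not the gem's actual code): memoize per-run counts
# so the failed-jobs count is computed once and reused when deriving
# other metrics, avoiding a second full-table scan.
class MonitorSketch
  attr_reader :emitted

  def initialize(counter)
    @counter = counter # a callable that performs the (expensive) count
  end

  def run!
    @memo = {} # reset memoized counts at the start of each run
    @emitted = {}
    @emitted[:failed_count] = failed_count
    # live_count reuses the memoized failed count instead of re-scanning:
    @emitted[:live_count] = total_count - failed_count
  end

  private

  def failed_count
    @memo[:failed] ||= @counter.call(:failed)
  end

  def total_count
    @memo[:total] ||= @counter.call(:total)
  end
end
```

Even though `failed_count` is used twice per run, the underlying count runs only once; resetting `@memo` at the top of `run!` keeps each run's metrics fresh.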

/no-platform

@smudge smudge requested a review from effron December 10, 2025 19:03
```ruby
end

def run!
  @memo = {}
```
Contributor:
Maybe a bit of a stretch, but this makes me think that a separate class that can calculate the metrics might be helpful, with this class more in charge of the looping/sleeping mechanics. Then we wouldn't need this overwritable internal state; we'd just new up a metric-calculation class during each call to run!. I see that Runnable handles some of this, but I guess organizing it that way means we have this notion of resettable state, which is a bit smelly.
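The suggested split might look something like this sketch. The class names and shapes here are illustrative only, not the gem's actual API:

```ruby
# Hypothetical sketch of the suggested refactor: the monitor keeps only
# the loop mechanics and news up a fresh calculator on every tick, so
# there is no resettable internal state to manage.
class MetricsCalculation
  def initialize(counts)
    @counts = counts # e.g. a hash or query interface for job counts
    @memo = {}       # state lives only as long as this calculation
  end

  def failed_count
    @memo[:failed] ||= @counts.fetch(:failed)
  end

  def live_count
    @counts.fetch(:total) - failed_count
  end
end

class MonitorLoop
  # One tick of the loop: build a fresh calculator, emit its metrics.
  # A fresh instance per tick means nothing carries over between runs.
  def tick(counts)
    calc = MetricsCalculation.new(counts)
    { failed_count: calc.failed_count, live_count: calc.live_count }
  end
end
```

The memo hash still exists, but it is scoped to one calculation's lifetime rather than being mutable state on the long-lived monitor.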

Member Author (smudge):

Yeah, agreed -- I was also starting to think about how we could reimagine Runnable to take on less of the whole lifecycle of Delayed::Monitor and Delayed::Worker kinds of classes.

If I keep pulling on that thread, I imagine that this smelliness will go away, but I didn't want to refactor too much within this one PR.

effron previously approved these changes Dec 10, 2025
@effron (Contributor) left a comment:
Just some non-blocking thoughts; otherwise the changes look good!

LGTM

> Worth noting that when a scan covers the entire table, **a sequential scan might actually be better than an index scan.**
>
> However, since the monitor already produces a count of failed jobs, we can memoize that result to avoid doing that work twice.
>
> Furthermore, in postgres at least, these don't just become index scans -- they become _INDEX ONLY_ scans, which should be cheaper than table scans since the index should be significantly more compact!

These concepts are now synonymous. I went as far as removing the `working` scope because it's not even truthful. (We don't know whether it's actively working; we only know that it was claimed by a worker and that the claim hasn't yet expired.)
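The distinction can be illustrated in plain Ruby. This is a hypothetical stand-in, not the gem's ActiveRecord scope, and the 4-hour expiry window is an assumed default rather than a value taken from the gem:

```ruby
# Hypothetical illustration of why "working" was a misnomer: a claim is
# just a lock that hasn't expired -- we can't observe whether a worker
# is actively executing the job.
Job = Struct.new(:locked_at, :locked_by, keyword_init: true)

MAX_RUN_TIME = 4 * 60 * 60 # assumed lock-expiry window, in seconds

# "Claimed" means: locked by some worker, and the lock is still fresh.
# Nothing here can tell us whether the job is actually being worked.
def claimed?(job, now: Time.now)
  !job.locked_by.nil? &&
    !job.locked_at.nil? &&
    job.locked_at > now - MAX_RUN_TIME
end
```

A job locked five hours ago is no longer "claimed" under this definition even though nothing has touched its row since, which is exactly why the scope can't honestly be called `working`.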
```
-> Sort (cost=...)
     Output: (CASE WHEN ((priority >= 0) AND (priority < 10)) THEN 0 WHEN ((priority >= 10) AND (priority < 20)) THEN 10 WHEN ((priority >= 20) AND (priority < 30)) THEN 20 WHEN (priority >= 30) THEN 30 ELSE NULL::integer END), queue
     Sort Key: (CASE WHEN ((delayed_jobs.priority >= 0) AND (delayed_jobs.priority < 10)) THEN 0 WHEN ((delayed_jobs.priority >= 10) AND (delayed_jobs.priority < 20)) THEN 10 WHEN ((delayed_jobs.priority >= 20) AND (delayed_jobs.priority < 30)) THEN 20 WHEN (delayed_jobs.priority >= 30) THEN 30 ELSE NULL::integer END), delayed_jobs.queue
     -> Seq Scan on public.delayed_jobs (cost=...)
```
Member Author (smudge):

The [legacy index] case gets two Seq Scans instead of one. However, in practice, the `@memo` means that the monitor is still doing less work, as one of these Seq Scans is identical to a seq scan it already had to do to count failed rows.

@effron (Contributor) left a comment:

LGTM

@smudge smudge merged commit f7c8775 into Betterment:main Dec 11, 2025
25 checks passed
@smudge smudge deleted the query-optimization/13 branch December 11, 2025 18:18