perf: welford's algorithm for mean-var aggregation by ilan-gold · Pull Request #4147 · scverse/scanpy

ilan-gold · 2026-06-08T12:14:02Z

From discussions with @zboldyga.

In theory, #4143 should then reuse this instead of its custom moments calculation.
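For readers unfamiliar with the technique, here is a minimal sketch of Welford's streaming update for per-group mean and variance. It assumes a dense 2D array and plain NumPy; the function name `welford_mean_var` and its parameters are hypothetical and are not taken from this PR, whose kernels operate on scanpy's own data layouts.

```python
import numpy as np


def welford_mean_var(x, groups, n_groups, ddof=1):
    """Per-group mean and variance of the rows of `x` in a single pass.

    Hypothetical helper for illustration only, not the PR's kernel.
    """
    n_cols = x.shape[1]
    count = np.zeros(n_groups, dtype=np.int64)
    mean = np.zeros((n_groups, n_cols), dtype=np.float64)
    m2 = np.zeros((n_groups, n_cols), dtype=np.float64)  # running sum of squared deviations
    for row, g in zip(x, groups):
        count[g] += 1
        delta = row - mean[g]            # deviation from the old mean
        mean[g] += delta / count[g]      # update the running mean
        m2[g] += delta * (row - mean[g])  # uses old and new mean, avoids cancellation
    # Guard the denominator so groups with too few rows yield 0 instead of dividing by 0.
    denom = np.maximum(count - ddof, 1)[:, None]
    return mean, m2 / denom
```

Unlike the sum and sum-of-squares formula, this update never subtracts two large, nearly equal numbers, which is the usual motivation for choosing it.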

Closes #
Tests included or not required because:

Release notes not necessary because:

ilan-gold · 2026-06-08T12:16:20Z

                out[cat, col] += data.data[j]


+@njit


We should make these `nogil`, or provide an option for fau to supply a `nogil` `njit`.
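As a rough illustration of the suggestion, the sketch below compiles a CSR group-sum kernel with numba's `nogil=True` option through a small wrapper. The wrapper `kernel_njit` and the kernel `group_sum_csr` are hypothetical names; this is not fau's API, and the fallback for a missing numba install exists only so the sketch runs anywhere.

```python
import numpy as np

try:
    from numba import njit
except ImportError:
    # Fallback for environments without numba: a no-op decorator factory.
    # It only supports the `njit(**options)` call form used below.
    def njit(*args, **kwargs):
        return lambda func: func


def kernel_njit(func=None, *, nogil=True):
    """Hypothetical wrapper: compile with numba, releasing the GIL by default."""
    def decorate(f):
        return njit(nogil=nogil)(f)
    return decorate if func is None else decorate(func)


@kernel_njit
def group_sum_csr(indptr, indices, data, cats, out):
    # Accumulate each stored CSR value into the output row of its category.
    for row in range(len(indptr) - 1):
        cat = cats[row]
        for j in range(indptr[row], indptr[row + 1]):
            out[cat, indices[j]] += data[j]
```

Releasing the GIL only pays off when the caller runs such kernels from several Python threads, and concurrent calls must not write to the same `out` array.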

codecov · 2026-06-08T12:22:48Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 79.60%. Comparing base (0ac3337) to head (48230af).
✅ All tests successful. No failed tests found.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4147      +/-   ##
==========================================
- Coverage   79.61%   79.60%   -0.02%     
==========================================
  Files         120      120              
  Lines       12786    12780       -6     
==========================================
- Hits        10180    10173       -7     
- Misses       2606     2607       +1

| Flag | Coverage Δ |
|---|---|
| hatch-test.low-vers | `78.84% <100.00%> (-0.01%)` ⬇️ |
| hatch-test.pre | `79.43% <100.00%> (-0.05%)` ⬇️ |

Flags with carried forward coverage won't be shown.

| Files with missing lines | Coverage Δ |
|---|---|
| src/scanpy/get/_aggregated.py | `92.66% <100.00%> (-0.65%)` ⬇️ |
| src/scanpy/get/_kernels.py | `100.00% <ø> (ø)` |

zboldyga

@ilan-gold I reviewed since this plays into our other work

I don't see any issues with the Welford's implementation, LGTM!

Overall I'm new to njit, OMP, and TBB, but I see your point about nogil, and I was able to create some scenarios that trigger that path, so I agree with the suggestion. I also found that it slightly improves the chan njit work when fau falls back on the serial build, and I guess it would generally fix this kind of issue across scanpy, so maybe that does belong in fau?

There's probably some threading stuff I'm not fully grasping with scanpy, fau, those libraries yet though; I will be more confident in that in a few weeks. I figure I will revisit threading and numba as a whole as part of building a good understanding of these libraries, and if I find any thread issues throughout scanpy/fau I will raise them at that point.

ilan-gold · 2026-06-12T09:07:39Z

> so maybe that does belong in fau?

Yes, the issue with nogil is that it means your code is no longer threadsafe with respect to its inputs (in a certain mental model of things, I guess). So introducing this change needs some thought. I don't think anything in FAU actually alters its inputs, though, so we should be good.
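To make the concern concrete: once kernels release the GIL, several threads can read the same input at once, which is safe only while nothing mutates it. The sketch below is one defensive pattern, assuming plain NumPy stands in for a nogil kernel. The names `column_means` and `threaded_column_means` are hypothetical. Workers receive a read-only view of the input and write only to disjoint slices of the output.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def column_means(x, out, start, stop):
    # Reads the shared input; writes only to this worker's own slice of `out`.
    out[start:stop] = x[:, start:stop].mean(axis=0)


def threaded_column_means(x, n_threads=4):
    x_ro = x.view()
    x_ro.flags.writeable = False  # an accidental in-place write now raises ValueError
    out = np.empty(x.shape[1], dtype=np.float64)
    # Split the columns into contiguous, non-overlapping chunks.
    bounds = np.linspace(0, x.shape[1], n_threads + 1).astype(int)
    with ThreadPoolExecutor(n_threads) as pool:
        futures = [
            pool.submit(column_means, x_ro, out, a, b)
            for a, b in zip(bounds[:-1], bounds[1:])
        ]
        for future in futures:
            future.result()  # re-raise any worker exception
    return out
```

The read-only flag lives on the view, so the caller's original array stays writeable after the call.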

> There's probably some threading stuff I'm not fully grasping with scanpy, fau, those libraries yet though; I will be more confident in that in a few weeks. I figure I will revisit threading and numba as a whole as part of building a good understanding of these libraries, and if I find any thread issues throughout scanpy/fau I will raise them at that point.

That would be amazing, because it is something we have (clearly) struggled with.

perf: welford's algorithm for mean-var

21f5ddc

ilan-gold added this to the 1.12.2 milestone Jun 8, 2026

ilan-gold changed the title ~~perf: welford's algorithm for mean-var~~ perf: welford's algorithm for mean-var aggregation Jun 8, 2026

chore: relnote

514bd17

ilan-gold commented Jun 8, 2026


ilan-gold requested a review from flying-sheep June 8, 2026 12:24

Merge branch 'main' into ig/welford

48230af

zboldyga reviewed Jun 12, 2026


Merge branch 'main' into ig/welford

afc24a1

ilan-gold wants to merge 4 commits into main from ig/welford

2 participants
