more efficient slide function #154

pavelkomarov · 2025-08-24T07:19:52Z

To address #146, I asked Chat how to make the function more efficient, and it had some good ideas. Got some strange data reference issue when I copied in some of the code, though, because it was modifying the kernel in place, which is a no no. Some ablation tests and more conversation with Chat eventually got to the bottom of it. It's way better now, not only more efficient but easier to read and interpret.

pavelkomarov · 2025-08-24T07:20:30Z

pynumdiff/utils/utility.py

-    weights = np.zeros((int(np.ceil(len(x)/stride)), len(x))) # Could be more space efficient
-    x_hats = np.zeros(weights.shape)
-    dxdt_hats = np.zeros(weights.shape)
+    x_hat = np.zeros(x.shape)


These are all just $O(N)$ now.

pavelkomarov · 2025-08-24T07:21:09Z

pynumdiff/utils/utility.py

-                        min(len(x), midpoint + half_window_size + 1)) # +1 because slicing is exclusive of end
-        kslice = slice(max(0, half_window_size - midpoint),
-                        min(len(kernel), len(kernel) - (midpoint + half_window_size + 1 - len(x))))
+        start = max(0, midpoint - half_window_size)


Making start and end explicit variables turns out to be really helpful later.

pavelkomarov · 2025-08-24T07:22:13Z

pynumdiff/utils/utility.py

-        weights[i, window] = kernel if kslice.stop - kslice.stop == len(kernel) else kernel[kslice]/np.sum(kernel[kslice])
-        if pass_weights: kwargs['weights'] = weights[i, window]
+        kstart = max(0, half_window_size - midpoint)
+        kend = kstart + (end - start)


This is a way simpler formula than what I had before, though it does the same thing. It's nice to be able to use start and end without having to get the properties from the window.

pavelkomarov · 2025-08-24T07:22:45Z

pynumdiff/utils/utility.py

-        window = slice(max(0, midpoint - half_window_size),
-                        min(len(x), midpoint + half_window_size + 1)) # +1 because slicing is exclusive of end
-        kslice = slice(max(0, half_window_size - midpoint),
-                        min(len(kernel), len(kernel) - (midpoint + half_window_size + 1 - len(x))))


This endpoint formula was a little crazy. Can be done in a simpler expression.

pavelkomarov · 2025-08-24T07:24:12Z

pynumdiff/utils/utility.py

+        window = slice(start, end)

-        # weights need to be renormalized if running off an edge
-        weights[i, window] = kernel if kslice.stop - kslice.stop == len(kernel) else kernel[kslice]/np.sum(kernel[kslice])


I was subtracting .stop from .stop here, rather than .start from .stop, so the else condition was always getting executed. That's fine, because normalization isn't bad, but I wasn't saving the compute I was hoping to save with this conditional.

pavelkomarov · 2025-08-24T07:25:37Z

pynumdiff/utils/utility.py

-    dxdt_hat = np.sum(weights*dxdt_hats, axis=0)
+        # run the function on the window and add weighted results to cumulative answers
+        x_window_hat, dxdt_window_hat = func(x[window], dt, *args, **kwargs)
+        x_hat[window] += w * x_window_hat


This is the real insight. No need to add to a big list or array like we were before. Just take the weighted running sum and keep track of cumulative weights for normalization at the end.

pavelkomarov · 2025-08-24T07:25:56Z

pynumdiff/utils/utility.py

+        weight_sum[window] += w # save sum of weights for normalization at the end

-    return x_hat, dxdt_hat
+    return x_hat/weight_sum, dxdt_hat/weight_sum


Bam, normalization is much easier.

more efficient slide function

cb7fb5b

pavelkomarov commented Aug 24, 2025

View reviewed changes

bounds

1354dd3

pavelkomarov merged commit 18e78d2 into master Aug 24, 2025
1 check passed

pavelkomarov deleted the efficient-slide-function branch August 24, 2025 07:46

pavelkomarov mentioned this pull request Sep 2, 2025

New basis-function-based methods module #155

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

more efficient slide function #154

more efficient slide function #154

Uh oh!

pavelkomarov commented Aug 24, 2025 •

edited

Loading

Uh oh!

pavelkomarov Aug 24, 2025

Uh oh!

pavelkomarov Aug 24, 2025

Uh oh!

pavelkomarov Aug 24, 2025

Uh oh!

pavelkomarov Aug 24, 2025

Uh oh!

pavelkomarov Aug 24, 2025

Uh oh!

pavelkomarov Aug 24, 2025

Uh oh!

pavelkomarov Aug 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

more efficient slide function #154

more efficient slide function #154

Uh oh!

Conversation

pavelkomarov commented Aug 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pavelkomarov Aug 24, 2025

Choose a reason for hiding this comment

Uh oh!

pavelkomarov Aug 24, 2025

Choose a reason for hiding this comment

Uh oh!

pavelkomarov Aug 24, 2025

Choose a reason for hiding this comment

Uh oh!

pavelkomarov Aug 24, 2025

Choose a reason for hiding this comment

Uh oh!

pavelkomarov Aug 24, 2025

Choose a reason for hiding this comment

Uh oh!

pavelkomarov Aug 24, 2025

Choose a reason for hiding this comment

Uh oh!

pavelkomarov Aug 24, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pavelkomarov commented Aug 24, 2025 •

edited

Loading