feat(core): add skiplist to CheckPoint for faster traversal#2048

Open
evanlinjin wants to merge 3 commits into bitcoindevkit:master from evanlinjin:feature/skiplist

Conversation

@evanlinjin
Member

@evanlinjin evanlinjin commented Sep 25, 2025

Description

Adds a skip pointer and index field to CheckPoint to accelerate traversal on dense chains. A single skip pointer is set every 1000 checkpoints (by index), giving a several-hundred-× constant-factor speedup for get(), floor_at(), and range() on chains near the current Bitcoin tip height.

This is a constant-factor speedup, not O(√n) — a fixed interval k gives O(n/k + k), still asymptotically linear. For BDK's realistic size regime, though, the constant factor is ample: on a ~1M-checkpoint dense chain, positional lookups drop from ~1M hops to roughly 1–2k (tens of microseconds), well under any sensible latency budget.
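
As a quick sanity check on that cost model, the worst-case hop count n/k + k can be evaluated directly (a standalone sketch; `worst_case_hops` is an illustrative helper, not a BDK API):

```rust
// Worst-case pointer chases for a single-level skiplist with fixed
// interval k: up to n/k skip hops plus up to k linear hops.
fn worst_case_hops(n: u64, k: u64) -> u64 {
    n / k + k
}

fn main() {
    // Near the current tip height (~1M checkpoints), k = 1000 sits
    // close to sqrt(n): ~2k hops instead of ~1M.
    assert_eq!(worst_case_hops(1_000_000, 1_000), 2_000);
    // The same k on a 10k chain is off the sqrt(n) sweet spot, but
    // the absolute cost is still tiny.
    assert_eq!(worst_case_hops(10_000, 1_000), 1_010);
    println!("ok");
}
```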

Why it's useful

@martinsaposnic reported in this comment that Wallet::transactions() exceeded a 2-second poll budget on LDK-on-mutinynet with bitcoind as chain source, due to O(n) checkpoint traversal over a dense chain. The skiplist pulls this scenario far under budget.

Notes to the reviewers

  • Skip pointers are placed every 1000 checkpoints by index (not height), so the distribution stays consistent regardless of whether the chain is dense or sparse.
  • k = 1000 was chosen because the fixed-interval skiplist's O(n/k + k) traversal cost is minimized at k ≈ √n, and the motivating workload sits around n ≈ 1M (√n ≈ 1000). Smaller values bias the win toward chains of n ≈ 10k, where linear traversal is already fast enough.
  • insert() rebuilds the affected portion of the chain via the existing extend/push paths, so index/skip invariants are maintained automatically.
  • All existing APIs remain unchanged.
  • A multi-level skiplist would give O(log n) but adds meaningful complexity; for BDK's realistic workload, the constant-factor win is sufficient and not worth the extra machinery.
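
The placement-by-index and traversal described above can be sketched as follows (a minimal standalone model; `Node`, its fields, and the anchor placement are illustrative assumptions, not BDK's actual `CheckPoint` internals):

```rust
use std::sync::Arc;

const SKIP_INTERVAL: usize = 1000;

// Illustrative stand-in for CheckPoint's inner node.
struct Node {
    index: usize,            // position in the chain (not height)
    prev: Option<Arc<Node>>, // ordinary back pointer
    skip: Option<Arc<Node>>, // back SKIP_INTERVAL nodes; anchors only
}

// Append-only construction: every SKIP_INTERVAL-th node (by index)
// records a pointer to the previous anchor.
fn build_chain(len: usize) -> Arc<Node> {
    let mut prev: Option<Arc<Node>> = None;
    let mut last_anchor: Option<Arc<Node>> = None;
    let mut tip: Option<Arc<Node>> = None;
    for index in 0..len {
        let skip = if index % SKIP_INTERVAL == 0 {
            last_anchor.clone()
        } else {
            None
        };
        let node = Arc::new(Node { index, prev: prev.clone(), skip });
        if index % SKIP_INTERVAL == 0 {
            last_anchor = Some(Arc::clone(&node));
        }
        prev = Some(Arc::clone(&node));
        tip = Some(node);
    }
    tip.expect("len > 0")
}

// Lookup by index from the tip: ride a skip pointer whenever one
// exists and does not overshoot the target; otherwise step linearly.
fn get(tip: &Arc<Node>, target: usize) -> Option<Arc<Node>> {
    let mut cur = Arc::clone(tip);
    while cur.index > target {
        match cur.skip.clone().filter(|s| s.index >= target) {
            Some(anchor) => cur = anchor,
            None => cur = cur.prev.clone()?,
        }
    }
    (cur.index == target).then_some(cur)
}

fn main() {
    let tip = build_chain(5_000);
    // Anchors sit at indices 0, 1000, ..., 4000; a deep lookup rides
    // them, then walks at most SKIP_INTERVAL - 1 nodes linearly.
    assert_eq!(get(&tip, 100).map(|n| n.index), Some(100));
    assert!(get(&tip, 5_000).is_none());
    println!("ok");
}
```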

Changelog notice

Added

  • skip pointer and index fields on CheckPoint.
  • Constant-factor traversal speedup for get(), floor_at(), and range() on dense chains.
  • Benchmarks covering small, medium, and large chains.

Checklists

All Submissions:

New Features:

  • I've added tests for the new feature
  • I've added docs for the new feature

🤖 Generated with Claude Code

@evanlinjin evanlinjin marked this pull request as draft September 25, 2025 08:08
@evanlinjin
Member Author

Guys, this is purely done by Claude. I haven't reviewed it yet.

@evanlinjin evanlinjin force-pushed the feature/skiplist branch 2 times, most recently from 153c401 to 098c076 Compare September 25, 2025 09:25
@evanlinjin evanlinjin moved this to In Progress in BDK Chain Sep 25, 2025
@evanlinjin
Member Author

Performance Benchmark Comparison

Benchmarks comparing the old O(n) implementation against the new skiplist implementation for a 10,000-checkpoint chain:

🎯 Key Results

| Operation | Old Implementation | Skiplist Implementation | Speedup |
|---|---|---|---|
| `get(100)` - near start | 98.270 μs | 421 ns | 233x faster |
| `get(9000)` - near end | 9.668 μs | 44 ns | 220x faster |
| `linear_traversal(100)` | 56.965 μs | 110.66 μs | 0.5x (expected*) |

📊 Detailed Benchmarks

Finding checkpoint at position 100 (from 10k chain):

  • Old: 98.270 μs - Linear search from tip
  • New: 421 ns - Skip pointers jump directly to target
  • Improvement: 233x faster 🚀

Finding checkpoint at position 9000 (from 10k chain):

  • Old: 9.668 μs - Linear search through 1000 nodes
  • New: 44 ns - Skip pointers minimize traversal
  • Improvement: 220x faster 🚀

* Note: The linear_traversal benchmark shows the new implementation is slightly slower because it's doing the same linear traversal but with additional overhead from the skip/index fields. The real performance gains come from using the skiplist-aware methods like get(), floor_at(), and range().

Summary

The skiplist implementation provides large performance improvements for checkpoint lookups, especially for deep searches in long chains, with 200x+ speedups demonstrated in realistic scenarios.

@evanlinjin evanlinjin self-assigned this Sep 25, 2025
@evanlinjin evanlinjin force-pushed the feature/skiplist branch 2 times, most recently from 5544fee to 4b9ccd1 Compare October 17, 2025 10:36
@evanlinjin evanlinjin marked this pull request as ready for review October 17, 2025 12:14
@evanlinjin
Member Author

evanlinjin commented Oct 17, 2025

Guys, this is purely done by Claude. I haven't reviewed it yet.

It's now fully reviewed by myself! Made many simplifications.

Let's merge #2055 and rebase this on top of that!

@evanlinjin
Member Author

Skiplist Performance Update

After the optimizations, here are the updated benchmark results:

get() Performance

| Benchmark | Time | Notes |
|---|---|---|
| `get_100_near_start` | 475.89 ns | Get checkpoint near start of 100-item chain |
| `get_1000_middle` | 31.07 ns | Get checkpoint in middle of 1000-item chain |
| `get_10000_near_end` | 57.12 ns | Get checkpoint near end of 10000-item chain |
| `get_10000_near_start` | 535.37 ns | Get checkpoint near start of 10000-item chain |

floor_at() Performance

| Benchmark | Time | Notes |
|---|---|---|
| `floor_at_1000` | 286.33 ns | Floor at height 750 in 1000-item chain |
| `floor_at_10000` | 673.27 ns | Floor at height 7500 in 10000-item chain |

range() Performance

| Benchmark | Time | Notes |
|---|---|---|
| `range_1000_middle_10pct` | 1.67 µs | Range 450..=550 in 1000-item chain |
| `range_10000_large_50pct` | 97.59 µs | Range 2500..=7500 in 10000-item chain |
| `range_10000_from_start` | 3.11 µs | Range ..=100 in 10000-item chain |
| `range_10000_near_tip` | 1.21 µs | Range 9900.. in 10000-item chain |
| `range_single_element` | 942.21 ns | Range 5000..=5000 in 10000-item chain |

Traversal Comparison

| Benchmark | Time | Notes |
|---|---|---|
| `linear_traversal_10000` | 140.90 µs | Linear search to height 100 in 10000-item chain |
| `skiplist_get_10000` | 539.80 ns | Skip-enhanced search to height 100 in 10000-item chain |

Speedup: 261x faster with skip pointers!

Summary

The skip list implementation brings search operations down to O(n/k + k) hops. Key improvements from our optimizations:

  1. Cleaner two-phase traversal in get() and range()
  2. Simplified floor_at() from 33 lines to 1 line
  3. Restored elegant insert() implementation (removed 60+ lines)
  4. Refactored push() with clearer skip pointer logic

All tests pass and the implementation is now both performant and maintainable.

@evanlinjin evanlinjin force-pushed the feature/skiplist branch 2 times, most recently from 1455ce9 to 8986465 Compare October 19, 2025 11:41
@ValuedMammal
Collaborator

I think it should have more than a single level to achieve the optimal performance, but I'm not sure if that's possible without implementing a new type.

Contributor

@nymius nymius left a comment


This will be an improvement over the linked list for sure, but I would like to combine efforts with block-graph/skiplist to avoid duplicated work.
I find some components of the skip list too tied to the underlying data, making it difficult to reason about the functionality of the skip list by itself. I would like to detach the skiplist logic from the underlying data.
There is a lot of room for improvement for a skiplist applied to our particular use case, so I'm positive there will be a lot of changes to a structure like this. I would try to make smaller PRs to check the improvements, but get results early.
In that sense, the benchmark is going to be very handy to ensure we are making progress.
A one-level, fixed skip interval is a great starting point for that.

@evanlinjin
Member Author

evanlinjin commented Nov 10, 2025

I think it should have more than a single level to achieve the optimal performance, but I'm not sure if that's possible without implementing a new type.

@ValuedMammal I think the current state of the PR is a good balance between performance and simplicity (for now).

  • We are already achieving queries in the single-digit microsecond range for typical wallet sizes (1k-100k checkpoints). In other words, we can already easily do hundreds of thousands of block-queries in under a second.

  • Doing a multi-level skiplist will increase code complexity significantly and increase memory usage by ~20% (due to Vec overhead for multiple skip pointers - admittedly, that's not a lot).

Here is a performance summary by Claude:

| Chain Size | Checkpoints | Current | Multi-Level | Speedup |
|---|---|---|---|---|
| Small wallet | 1,000 | 23.5 ns | ~4 ns | 6x |
| Medium wallet | 10,000 | 414 ns | ~60 ns | 7x |
| Large wallet | 100,000 | 2.3 μs | ~70 ns | 33x |
| Full chain | 870,000 | 18.2 μs | ~83 ns | 219x |

  • The performance gain for wallets with 10,000 or fewer checkpoints is insignificant (probably the majority of wallets right now).

  • The performance gain for wallets with 100,000 checkpoints is significant - but probably not noticeable for the user.

  • Worst-case scenario: Let's say we have a large wallet with 10,000 transactions and we canonicalize against a full checkpoint chain: That's 182 milliseconds to query for all relevant blocks which is perfectly manageable for a local wallet.
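
The arithmetic behind that worst case, checked explicitly (numbers taken from the table above):

```rust
fn main() {
    // ~18.2 µs per full-chain query, 10,000 relevant-block queries.
    let ns_per_query: u64 = 18_200;
    let queries: u64 = 10_000;
    let total_ms = ns_per_query * queries / 1_000_000;
    assert_eq!(total_ms, 182); // well within a 2-second poll budget
    println!("ok");
}
```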

@evanlinjin
Member Author

@nymius

I find some components of the skip list too tied to the underlying data, making it difficult to reason about the functionality of the skip list by itself. I would like to detach the skiplist logic from the underlying data.

The current state of the PR is self-contained and reviewable - the changes are all internal changes to CheckPoint + tests + benchmarks. We modified two read methods (get, range) and a single write method (push). We added two fields in CheckPoint. The review surface is extremely minimal. A total of 126 additional lines in checkpoint.rs (including comments).

Are you wanting to detach because you see this skiplist logic being used elsewhere? The current skiplist implementation has domain-specific simplifications which won't exist in a general-purpose skiplist (e.g. append-only in our push; immutable, so no need to modify node data). Furthermore, a more generic skiplist implementation would be 1000+ lines of code.

I would try to make smaller PRs to check the improvements, but get results early.

I don't think splitting this PR into multiple PRs makes sense here. This is a single atomic change to CheckPoint. The majority of this PR is tests and benchmarks. The modification surface is tiny.


We may need a new internal type to make multi-level skiplist possible, and thus achieve optimal performance. I'm not opposed to this idea. However, I think this PR is a simple change that achieves good enough performance.

@nymius
Contributor

nymius commented Nov 23, 2025

You've exposed good points and I agree with all of them. Will review again soon.

@martinsaposnic

I was about to open a PR for this. I'm hitting this problem running an LDK Node on mutinynet with bitcoind as the chain source: there's a 2-second poll that calls Wallet::transactions(), and once the checkpoint chain is long enough the operation takes longer than 2 seconds, so the CPU never idles.

I took a different approach adding a BTreeMap<u32, CheckPoint> index on LocalChain for O(log n) lookups. Happy to share it, although I believe this change would suffice too
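
For comparison, the BTreeMap approach mentioned above might look roughly like this (a hypothetical sketch; `Cp` and `floor_at` are illustrative stand-ins, not the actual patch):

```rust
use std::collections::BTreeMap;

// Stand-in for CheckPoint; a real index would map height -> CheckPoint.
#[derive(Clone, Debug, PartialEq)]
struct Cp {
    height: u32,
}

// Highest checkpoint at or below `height`, in O(log n) via the
// ordered map's range query.
fn floor_at(index: &BTreeMap<u32, Cp>, height: u32) -> Option<&Cp> {
    index.range(..=height).next_back().map(|(_, cp)| cp)
}

fn main() {
    let index: BTreeMap<u32, Cp> = [0_u32, 100, 250]
        .into_iter()
        .map(|h| (h, Cp { height: h }))
        .collect();
    assert_eq!(floor_at(&index, 120).map(|cp| cp.height), Some(100));
    assert_eq!(floor_at(&index, 250).map(|cp| cp.height), Some(250));
    println!("ok");
}
```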

@nymius
Contributor

nymius commented Apr 13, 2026

@martinsaposnic would you be open to benchmark your case against this branch?

Contributor

@nymius nymius left a comment


Needs rebase.

Comment thread crates/core/tests/test_checkpoint_skiplist.rs Outdated
Comment thread crates/core/benches/checkpoint_skiplist.rs
@evanlinjin evanlinjin changed the title Add skiplist to CheckPoint for O(√n) traversal feat(core): add skiplist to CheckPoint for faster traversal Apr 22, 2026
evanlinjin added a commit to evanlinjin/bdk that referenced this pull request Apr 22, 2026
Adds a skip pointer (every 100 checkpoints by index) and an index
field to CheckPoint to accelerate get(), floor_at(), and range().
push() and insert() maintain the index/skip invariants on the
rebuilt chain.

This is a ~100x constant-factor speedup on dense chains, not a
true O(sqrt(n)) bound: a fixed interval k gives O(n/k + k), which
is asymptotically linear. For BDK's realistic size regime (up to
~1M dense checkpoints on a server), that constant factor is
ample -- a full-chain get() drops from ~1M pointer chases to ~10k
(tens of microseconds).

Motivated by the server-side scenario reported by @martinsaposnic
in bitcoindevkit#2048 where Wallet::transactions() exceeded a 2-second poll
budget on LDK-on-mutinynet with bitcoind as chain source, due to
O(n) checkpoint traversal over a dense chain. See:
bitcoindevkit#2048 (comment)

Benchmarks show ~265x speedup for deep searches in 10k checkpoint
chains (linear traversal ~108us vs skiplist get ~407ns).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@evanlinjin
Member Author

evanlinjin commented Apr 22, 2026

Pushed a follow-up commit bumping CHECKPOINT_SKIP_INTERVAL from 100 to 1000.

Rationale

Traversal cost with a fixed skiplist interval k is O(n/k + k), minimized at k ≈ √n. The original k=100 was tuned for n ≈ 10,000, which isn't the workload that motivates this PR. The reported issue (comment) is a dense server chain near current tip height (~946k blocks), where √n ≈ 970. Rounding to 1000 aligns the constant with that regime.

Concretely

| n | k=100 hops | k=1000 hops |
|---|---|---|
| 946,000 | ~9,560 | ~1,950 (~5× faster) |
| 464,000 (segwit-era) | ~4,740 | ~1,460 |
| 10,000 | ~200 | ~910 (slower, but still sub-µs) |

Memory

No meaningful difference — Option<Arc<CPInner>> is niche-optimized to 8 bytes regardless of Some/None, so every node carries the same skip field. Skip pointers also reference existing nodes in the chain, so populated pointers don't trigger new heap allocations (just refcount increments on already-allocated ArcInners). k doesn't move the memory needle here.
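
The niche-optimization claim is directly checkable (standalone; `CpInner` here is a stand-in type):

```rust
use std::mem::size_of;
use std::sync::Arc;

struct CpInner {
    _height: u32,
}

fn main() {
    // Arc's internal pointer is non-null, so Option<Arc<_>> uses the
    // null value to encode None: no extra discriminant byte.
    assert_eq!(size_of::<Option<Arc<CpInner>>>(), size_of::<Arc<CpInner>>());
    // Both are a single pointer wide.
    assert_eq!(size_of::<Arc<CpInner>>(), size_of::<usize>());
    println!("ok");
}
```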

Tradeoff

Chains shorter than ~1000 nodes no longer create any skip pointers. That's fine — linear traversal at n ≤ 1000 is already microseconds and wasn't the motivating problem. Small wallets weren't the target.

Left k as a fixed single-level constant rather than going multi-level / adaptive — for BDK's realistic size ceiling (~1M), the constant-factor win is sufficient and not worth the extra machinery. Happy to revisit if that changes.

(Edited: corrected an inaccurate claim about memory overhead — Option<Arc<_>> is niche-optimized so k doesn't meaningfully change memory usage.)

evanlinjin added a commit to evanlinjin/bdk that referenced this pull request Apr 23, 2026
evanlinjin and others added 2 commits April 23, 2026 15:48
The skiplist's fixed interval k trades off as O(n/k + k), minimized
when k ~ sqrt(n). The motivating workload (dense server chains near
the current Bitcoin tip, n ~ 1M) sits far above the k=100 sweet
spot of n ~ 10k. Bumping to k=1000 brings the interval closer to
sqrt(1M) and yields ~5x better worst-case traversal for that case
(roughly 2k hops instead of 10k).

Memory is unchanged: Option<Arc<CPInner>> is niche-optimized to 8
bytes regardless of Some/None, so every node carries the same
skip field, and skip pointers reference existing chain nodes (no
new heap allocations -- just refcount bumps on already-allocated
ArcInners). k only affects traversal performance, not footprint.

Smaller chains (n < 1000) no longer gain anything from the
skiplist, but linear traversal at that scale is already
microseconds -- not a workload we need to optimize.

Update test_skiplist_indices to verify skip pointer placement at
the new interval using a 5000-node chain.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Addresses review feedback from @nymius: existing benches use fixed
targets, which can land favorably or unfavorably relative to skip
pointer positions and don't reflect real query patterns. The new
bench draws 256 targets from a deterministic xorshift sequence and
runs both a skiplist-enhanced get() and a plain linear walk over
a 100k-node chain, so the same query stream exercises both paths.

100k is large enough to show the skiplist win clearly (100× fewer
hops at k=1000) without slowing harness setup.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
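
The deterministic target generation described in that commit might be sketched like this (illustrative; the seed and helper names are assumptions, not the actual bench code):

```rust
// Standard xorshift64 step (shift constants 13, 7, 17).
fn xorshift64(state: &mut u64) -> u64 {
    let mut x = *state;
    x ^= x << 13;
    x ^= x >> 7;
    x ^= x << 17;
    *state = x;
    x
}

// Draw `count` indices in 0..n from a fixed seed, so repeated runs
// (and both bench paths) see the identical query stream.
fn targets(n: u64, count: usize) -> Vec<u64> {
    let mut state = 0x9E37_79B9_7F4A_7C15_u64; // arbitrary nonzero seed
    (0..count).map(|_| xorshift64(&mut state) % n).collect()
}

fn main() {
    let a = targets(100_000, 256);
    let b = targets(100_000, 256);
    assert_eq!(a.len(), 256);
    assert_eq!(a, b); // deterministic: same seed, same stream
    assert!(a.iter().all(|&t| t < 100_000));
    println!("ok");
}
```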
@evanlinjin
Member Author

evanlinjin commented Apr 23, 2026

@nymius For the random access benchmark introduced in d7410c6:

  • random_access_linear_100k: 1.07 ms
  • random_access_skiplist_100k: 11.86 µs

~90x speedup

@evanlinjin evanlinjin requested a review from nymius April 23, 2026 16:42
@ValuedMammal
Collaborator

ValuedMammal commented Apr 27, 2026

I recommend looking at how it's done in block-graph and reading up on the blog post for a conceptual overview.


Projects

Status: In Progress


4 participants