Experiment: Use a bit in RefRandstrobe to indicate whether it is filtered by marcelm · Pull Request #466 · ksahlin/strobealign

marcelm · 2024-12-04T20:53:55Z

Profiling suggested that the Index::is_filtered() call is a bit slow. It checks whether a randstrobe occurs more often than filter_cutoff by accessing randstrobes[i] and randstrobes[i + filter_cutoff] and comparing the hashes.

The slowness could come from two cache misses because two quite far apart memory locations are read. To get rid of the second access, the idea is to use one bit within RefRandstrobe to store whether the item is filtered.

Somewhat unexpectedly, this does not improve speed. It does reduce cache misses according to perf stat -d, but this does not translate to a shorter runtime.

marcelm · 2024-12-06T14:05:45Z

After a couple of measurements on a different (10 years younger) machine, I can measure a difference - this PR makes mapping-only mode about 2% faster. (This comes at the expense of one less bit available for the hash, but this has very little impact.)

ksahlin · 2024-12-11T09:40:51Z

Great! Don't we anyway have B top bits available to store other things because of our prefix vector? This depends of course on that the bit is added after the sorted vector has been produced.

marcelm · 2024-12-17T14:04:10Z

Great! Don't we anyway have B top bits available to store other things because of our prefix vector? This depends of course on that the bit is added after the sorted vector has been produced.

Right, good point! I have the impression the filter bit would better fit in those upper bits anyway. Let me update the PR later.

…ere)

marcelm force-pushed the filterbit branch from dd77e1f to 2787285 Compare December 5, 2024 08:03

Base automatically changed from auxlen to main December 10, 2024 08:27

marcelm added 3 commits January 25, 2025 19:09

Use one less bit for randstrobe hash (TODO: adjust hardcoded 9 somewh…

2fdeafb

…ere)

filterbit

4306810

Update baseline commit

4f492d9

marcelm force-pushed the filterbit branch from 2787285 to 4f492d9 Compare January 26, 2025 20:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experiment: Use a bit in RefRandstrobe to indicate whether it is filtered#466

Experiment: Use a bit in RefRandstrobe to indicate whether it is filtered#466
marcelm wants to merge 3 commits intomainfrom
filterbit

marcelm commented Dec 4, 2024 •

edited

Loading

Uh oh!

marcelm commented Dec 6, 2024

Uh oh!

ksahlin commented Dec 11, 2024

Uh oh!

marcelm commented Dec 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

marcelm commented Dec 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marcelm commented Dec 6, 2024

Uh oh!

ksahlin commented Dec 11, 2024

Uh oh!

marcelm commented Dec 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

marcelm commented Dec 4, 2024 •

edited

Loading