Fix threading #108

peter-reinholdt · 2025-08-18T10:46:25Z

Some fixes for threading:

Previously, the chunk boundary was computed as:

    ceil(sqrt(thread_idx / num_threads) * sideLength);

Because thread_idx / num_threads is an integer division, the argument of the sqrt, and hence the whole expression, always evaluated to 0 for thread_idx < num_threads.
I also removed the sqrt: as far as I can see, end_chunk_idx is used to split "linear" work, not triangular (quadratically growing) work.
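
To illustrate the truncation, here is a minimal sketch rather than the actual PyCI code. The function names `boundary_old` and `boundary_linear`, and the convention that chunk `t` covers `[boundary(t), boundary(t + 1))`, are assumptions made for this example only. The linear variant uses integer arithmetic throughout, which is one possible way to avoid the truncation; the real patch may compute it differently.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

// Old form (reconstructed for illustration): k / num_threads is an integer
// division, so it truncates to 0 for every k < num_threads before sqrt runs.
inline std::size_t boundary_old(std::size_t k, std::size_t num_threads,
                                std::size_t side_length) {
    assert(num_threads > 0);
    return static_cast<std::size_t>(
        std::ceil(std::sqrt(k / num_threads) * side_length));
}

// Linear split: ceil(k * side_length / num_threads) in integer arithmetic.
// Hypothetical helper; k runs from 0 to num_threads inclusive.
inline std::size_t boundary_linear(std::size_t k, std::size_t num_threads,
                                   std::size_t side_length) {
    assert(num_threads > 0);
    return (k * side_length + num_threads - 1) / num_threads;
}
```

With 4 threads and 10 items, `boundary_old` returns 0 for k = 1, 2, 3, so all but one thread get an empty range, whereas `boundary_linear` yields the boundaries 0, 3, 5, 8, 10.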

Additionally, I encountered segfaults on large problem sizes (many determinants), which I eventually tracked down to a heap-use-after-free from pyci/src/hci.cpp:277.
I'm not entirely sure why this happens, but maybe the order in which the threads are joined is not guaranteed to correspond to the order of the v_wfns? The proposed change first joins all threads, then adds the determinants:

    for (auto &thread : v_threads) thread.join();
    for (auto &wf : v_wfns) wfn.add_dets_from_wfn(wf);
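
The join-then-merge pattern can be sketched in isolation as follows. This is a stand-in, not the PyCI implementation: `collect_parallel`, the `partial` buffers, and the use of `std::vector<int>` in place of wavefunction objects are all assumptions made for illustration. Each worker writes only to its own buffer, and nothing reads those buffers until every thread has been joined.

```cpp
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical helper: split [0, n) across num_threads workers, let each
// fill a private buffer, then merge the buffers on the calling thread.
std::vector<int> collect_parallel(std::size_t num_threads, std::size_t n) {
    assert(num_threads > 0);
    std::vector<std::vector<int>> partial(num_threads);  // one buffer per worker
    std::vector<std::thread> workers;
    workers.reserve(num_threads);

    for (std::size_t t = 0; t < num_threads; ++t) {
        const std::size_t begin = t * n / num_threads;
        const std::size_t end = (t + 1) * n / num_threads;
        workers.emplace_back([&partial, t, begin, end] {
            for (std::size_t i = begin; i < end; ++i) {
                partial[t].push_back(static_cast<int>(i));
            }
        });
    }

    // Phase 1: wait for every worker before touching any result buffer.
    for (auto &w : workers) w.join();

    // Phase 2: merge sequentially; no worker is still running at this point.
    std::vector<int> merged;
    for (const auto &p : partial) {
        merged.insert(merged.end(), p.begin(), p.end());
    }
    return merged;
}
```

Separating the two phases means the merge never overlaps with running workers, regardless of the order in which they finish.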

msricher · 2025-08-18T13:06:14Z

Thank you!! I'll merge this soon.

peter-reinholdt added 2 commits August 18, 2025 12:31

Fix integer division in end_chunk_idx

d483b99

Fix data race

8bff23b

msricher merged commit bb546bf into theochem:master Aug 18, 2025
1 check passed
