Claude/optimize packmol turbo wl4 eu by MicheleBonus · Pull Request #122 · m3g/packmol

MicheleBonus · 2026-03-12T07:20:02Z

No description provided.

…-packmol-memgen Optimize hot cell-neighbor loops in computef/computeg

Optimize pairwise collision kernels for faster Packmol iterations

…s-using-offset-helper Refactor computef/computeg to use shared forward cell offsets

…toms

…parc-and-gparc Split pair-interaction kernels into fast/short/fixed paths and prefilter active neighbor heads

…h-arrays-in-compute_data.f90 Add hot-path scalar buffers and use them in pairwise kernels

…-with-per-cell-maxima Add per-cell radius bounds and PBC-aware cell-pair pruning

…e-calculations Optimize squared terms in hot atom-pair kernels

…ions-and-documentation Add build profiles (baseline, perf-native, devel, sanitize, static) and numerics check

…provements-for-computing docs: add Michele Bonus as contributor and update README build/profile and performance notes

…nd-ambiguous-references Fix pgencan build failure caused by ambiguous `x` symbol

…ents-and-errors Import `init1` from `compute_data` in `pgencan`

…ys-in-compute_data.f90 Rename compute_data hot buffers to x_hot/y_hot/z_hot and restrict imports

…gh-risk-fortran-files Narrow compute_data imports in collision-prone routines

…ation-and-warnings-in-gencan.f gencan: initialize CG/trust-region scalars and guard conditional reads

…-arguments-in-subroutines Restore full `evalhd` ABI and mark dummy API arguments as intentionally unused in GENCAN stubs

…-resetcells-subroutine Wrap long short-radius assignment in resetcells

…zation-in-packmol.f90 Defensively handle missing atom-level restriction mapping

…alidate-residue-bounds Validate fixed-molecule residue bounds before residue counting

…n-computef.f90 Reformat long continued assignments in src/computef.f90

…mance-and-refactor-expressions Optimize cell-neighbor evaluation and fix overlong Fortran lines

Three key changes that together yield ~25% speedup: 1. Remove hot buffer abstraction (x_hot, y_hot, z_hot, ibtype_hot, ibmol_hot) - xcart(:,1) is already contiguous in Fortran column-major layout, so separate x_hot(:) arrays provide zero cache benefit for the random linked-list access pattern in fparc/gparc - Eliminates refresh_hot_buffers_full (5 full-array copies per call) and refresh_hot_buffers_atom (per-atom copy overhead) - Use xcart/ibtype/ibmol directly in inner loops 2. Remove dead cell-level radius pruning infrastructure - cell_pair_min_dist2 always returns 0 for the 14 forward neighbor offsets because cells are sized >= interaction radius, making all neighbor pairs adjacent (zero gap distance) - The pruning check (min_cell_dist2 > max_reach2) was never true - Removes cell_max_radius/cell_max_short_radius tracking from both the cell-placement loop and resetcells, plus the reach computation and cell_pair_min_dist2 function from the neighbor-offset loop 3. Incremental cell reset - Walk the previous iteration's occupied-cell linked list to clear only those cells, instead of zeroing all ncells^3 entries - Also skip clearing latomnext since all placed atoms overwrite it 4. Pre-compute fixed_short_marker once at init instead of per-atom per-iteration (fixedatom and use_short_radius are static) Benchmarks (baseline -O2): water_box_pbc: 0.41s -> 0.32s (22% faster) solvprotein_pbc: 6.12s -> 4.56s (26% faster) https://claude.ai/code/session_01SgmhQ2p78sPjFPxZPmLkk7

MicheleBonus added 30 commits March 11, 2026 10:29

Optimize cell neighbor lookup in pairwise loops

d01d2c0

Merge pull request #1 from MicheleBonus/codex/optimize-performance-of…

ec58ec0

…-packmol-memgen Optimize hot cell-neighbor loops in computef/computeg

Optimize pairwise kernels in fparc/gparc

9a03d75

Merge pull request #2 from MicheleBonus/codex/optimize-packmol-for-speed

988029a

Optimize pairwise collision kernels for faster Packmol iterations

Refactor forward cell-neighbor traversal with shared offsets

208f05a

Merge pull request #3 from MicheleBonus/codex/refactor-neighbor-acces…

b10f2c8

…s-using-offset-helper Refactor computef/computeg to use shared forward cell offsets

Split pair kernels into fast/short/fixed paths and prefilter active a…

dcd9691

…toms

Merge pull request #4 from MicheleBonus/codex/refactor-hot-logic-in-f…

eba4d12

…parc-and-gparc Split pair-interaction kernels into fast/short/fixed paths and prefilter active neighbor heads

Add hot-path scalar streams for pairwise kernels

5775ac5

Merge pull request #5 from MicheleBonus/codex/define-explicit-hot-pat…

a579b0e

…h-arrays-in-compute_data.f90 Add hot-path scalar buffers and use them in pairwise kernels

Add per-cell radius bounds and pair-distance pruning

7a0703f

Merge pull request #6 from MicheleBonus/codex/extend-cell-bookkeeping…

0753de9

…-with-per-cell-maxima Add per-cell radius bounds and PBC-aware cell-pair pruning

Optimize hot-loop square operations in pair kernels

6b84f71

Merge pull request #7 from MicheleBonus/codex/refactor-hot-loop-squar…

672f6bf

…e-calculations Optimize squared terms in hot atom-pair kernels

Add explicit build profiles and numerics profile check

99a44a9

Merge pull request #8 from MicheleBonus/codex/update-build-configurat…

f187d25

…ions-and-documentation Add build profiles (baseline, perf-native, devel, sanitize, static) and numerics check

docs: update build/profile docs and add Michele Bonus credentials

8c0615b

Merge pull request #9 from MicheleBonus/codex/plan-adjustments-and-im…

408bfb3

…provements-for-computing docs: add Michele Bonus as contributor and update README build/profile and performance notes

Fix pgencan module import ambiguity for x

c6909bb

Merge pull request #10 from MicheleBonus/codex/fix-unused-arguments-a…

c77f328

…nd-ambiguous-references Fix pgencan build failure caused by ambiguous `x` symbol

Fix missing init1 import in pgencan

dee125a

Merge pull request #11 from MicheleBonus/codex/fix-unused-dummy-argum…

7e43663

…ents-and-errors Import `init1` from `compute_data` in `pgencan`

Rename hot coordinate buffers and tighten compute_data imports

04c7888

Merge pull request #12 from MicheleBonus/codex/rename-hot-buffer-arra…

eb5b78e

…ys-in-compute_data.f90 Rename compute_data hot buffers to x_hot/y_hot/z_hot and restrict imports

Narrow compute_data imports in collision-prone routines

0be9155

Merge pull request #13 from MicheleBonus/codex/refactor-imports-in-hi…

8848245

…gh-risk-fortran-files Narrow compute_data imports in collision-prone routines

Initialize gencan scalars before conditional use

943852a

Merge pull request #14 from MicheleBonus/codex/fix-variable-initializ…

d884d8d

…ation-and-warnings-in-gencan.f gencan: initialize CG/trust-region scalars and guard conditional reads

Fix GENCAN evalhd interface and mark API dummy args used

b164bfa

Merge pull request #15 from MicheleBonus/codex/verify-and-clean-dummy…

0264b8f

…-arguments-in-subroutines Restore full `evalhd` ABI and mark dummy API arguments as intentionally unused in GENCAN stubs

MicheleBonus and others added 11 commits March 11, 2026 16:09

Wrap long short-radius assignment in resetcells

9d85481

Merge pull request #16 from MicheleBonus/codex/refactor-assignment-in…

2689c37

…-resetcells-subroutine Wrap long short-radius assignment in resetcells

Handle missing restriction mapping defensively

a01001b

Merge pull request #17 from MicheleBonus/codex/add-defensive-initiali…

fbe2fe3

…zation-in-packmol.f90 Defensively handle missing atom-level restriction mapping

Validate fixed-molecule residue bounds before nres

41e2f38

Merge pull request #18 from MicheleBonus/codex/initialize-ilres-and-v…

ebbca56

…alidate-residue-bounds Validate fixed-molecule residue bounds before residue counting

Reformat long free-form expressions in computef

d32a71a

Merge pull request #19 from MicheleBonus/codex/reformat-assignments-i…

da192e6

…n-computef.f90 Reformat long continued assignments in src/computef.f90

Optimize neighbor reach checks and split long Fortran lines

a383777

Merge pull request #20 from MicheleBonus/codex/improve-program-perfor…

ac0a56a

…mance-and-refactor-expressions Optimize cell-neighbor evaluation and fix overlong Fortran lines

MicheleBonus closed this by deleting the head repository Mar 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Claude/optimize packmol turbo wl4 eu#122

Claude/optimize packmol turbo wl4 eu#122
MicheleBonus wants to merge 41 commits intom3g:masterfrom
MicheleBonus:claude/optimize-packmol-turbo-Wl4EU

MicheleBonus commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

MicheleBonus commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants