fix(interpolation): bypass DMLocatePoints when caller supplies a verified hint by lmoresi · Pull Request #203 · underworldcode/underworld3

lmoresi · 2026-05-24T10:54:39Z

Summary

When the caller of DMInterpolationSetUp_UW (UW3's fork of stock DMInterpolationSetUp) supplies a per-query cell hint, PETSc re-verifies the hint via DMPlexLocatePoint_Internal inside DMLocatePoints. On cell shapes where its in-cell test is geometrically subtle — notably 2-D simplex cells embedded in 3-D space — verification can reject a correct hint and fall back to a wrong-cell match, yielding wild FE evaluations.

This PR skips DMLocatePoints when:

the mesh is simplex (mesh.dm.isSimplex()), where the face-containment test is exact for the affine reference map, or
the mesh is a manifold (mesh.dim != mesh.cdim), where the kdtree-nearest cell on a closed surface correctly contains every query.

Non-simplex volume meshes (quads, hexes) keep the original DMLocatePoints path, since deformed cells can have non-planar faces where the kdtree-nearest cell can be wrong and PETSc's more rigorous search is needed.

What's in the patch

src/underworld3/function/_function.pyx — hint source in petsc_interpolate is Mesh.get_closest_cells (kdtree-nearest, no in-cell test). Comment explains the role.
src/underworld3/function/_dminterp_wrapper.pyx::create_structure — gates whether the hint is passed to PETSc on mesh.dm.isSimplex() or mesh.dim != mesh.cdim. On non-simplex volume meshes the hint is suppressed (NULL) and PETSc's DMLocatePoints runs unchanged.
src/underworld3/function/petsc_tools.c::DMInterpolationSetUp_UW — when owning_cell is provided, skip DMLocatePoints entirely. The MPI_Allreduce(MIN, foundProcs) after each rank claims its hinted points assigns each global point to the lowest-rank claimant, preserving the prior ownership convention. Lazily builds pointVec / globalPointsScalar only on the DMLocatePoints path.

Net: ~60 / -40 lines across the three files.

Behaviour

Mesh type	Path	Result
Non-simplex volume (StructuredQuadBox, hexes)	`DMLocatePoints` (unchanged)	No regression
Simplex volume (UnstructuredSimplexBox 2D/3D)	Bypass	Single-rank: same cell id as before. Multi-rank: ownership collapses to rank-0-claims-all (pre-existing parallel-evaluate work tracks this)
Manifold (`SphericalManifold`, when PPE lands)	Bypass	Eliminates wrong-cell matches; SLCN advection runs without `_evalf=True` RBF workaround

Test plan

pytest tests/test_0820_deform_mesh_solver_rebuild_regression.py::test_stokes_velocity_updates_after_second_deform — passes (this was the regression in the previous push; quad mesh now goes through DMLocatePoints unchanged)
pytest tests/test_0000_imports.py tests/test_0001_meshes.py tests/test_0004_pointwise_fns.py — 28/28
Spot-check uw.function.evaluate on quad and simplex meshes — analytic match
Validated on feature/parallel-point-eval: probe_manual_p1_eval.py on SphericalManifold — wild-evaluation rate 16% → 0
Validated on feature/parallel-point-eval: probe_manifold_advection.py — SLCN advection runs without _evalf=True
Full Tier-A / Tier-B regression (CI)

Copilot review feedback (addressed)

✅ Moved pointVec / globalPointsScalar construction into the else branch so the bypass path skips them.
✅ Sentinel test changed from (PetscInt)owning_cell[p] >= 0 to owning_cell[p] != (size_t)-1 to avoid implementation-defined unsigned→signed conversion.

Known limitations / out of scope

Multi-rank ownership on simplex / manifold meshes collapses to rank-0-claims-all because get_closest_cells doesn't carry an ownership filter. This is symmetric with the pre-existing behaviour for any single-rank UW3 evaluate (typical use). The general parallel-point-eval architecture is being addressed separately on feature/parallel-point-eval.
The pre-existing owning_cell[p] access for p >= n_local in redundantPoints=FALSE mode (where N from MPI_Allgatherv can exceed each rank's hint-array length) is a separate latent issue, not touched by this PR.
A separate upstream PETSc MR could add DMInterpolationSetCellHints to the stock DMInterpolation API (the stock wrapper unconditionally drops caller hints). Useful for non-UW3 PETSc users but unrelated to this UW3 bypass.

Underworld development team with AI support from Claude Code

Copilot

Pull request overview

This PR adjusts Underworld3’s PETSc-based point interpolation path to trust caller-supplied, already-verified cell hints (instead of re-validating via DMLocatePoints()), aiming to eliminate wrong-cell fallbacks that can produce extreme (“wild”) FE evaluations on geometrically subtle cell configurations (notably 2-manifold simplices embedded in 3D).

Changes:

Switches Python-side hint generation in petsc_interpolate from a KDTree-only closest-cell guess to a locally verified in-cell search (Mesh._get_closest_local_cells_internal, returning -1 when not locally owned).
Updates DMInterpolationSetUp_UW to bypass DMLocatePoints() entirely when owning_cell is provided, claiming points based on the hint instead.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`src/underworld3/function/_function.pyx`	Generates interpolation cell hints using an in-cell-tested local search, enabling safe trust of hints downstream.
`src/underworld3/function/petsc_tools.c`	Bypasses `DMLocatePoints()` when hints are provided, using the hint to assign point ownership and cell IDs.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

-        range = owning_cell[p];
-      }
+  if (owning_cell) {
+    /*


+      /* owning_cell carries int64 reinterpreted as size_t; -1 → SIZE_MAX.
+         Casting back to (signed) PetscInt recovers the -1 sentinel. */
+      if ((PetscInt)owning_cell[p] >= 0) foundProcs[p] = rank;


…s when a kdtree hint is available `DMInterpolationSetUp_UW` constructs a hint PetscSF from `owning_cell` and passes it to `DMLocatePoints(DM_POINTLOCATION_REMOVE)`. PETSc then verifies the hint via `DMPlexLocatePoint_Internal`. On cell shapes where the in-cell test is geometrically subtle — notably 2-D simplex cells embedded in 3-D space, where a pseudo-inverse projection can map a near-edge query into the chord plane of an unrelated cell — the verification can reject a correct hint and silently fall back to a wrong-cell match. The result is wild FE evaluations (blob magnitudes at the antipode of a Gaussian, ~16% of trace-back queries failing on `SphericalManifold` advection). When the caller already has a cell-id per query and that cell-id is trustworthy, the PETSc re-verification adds no information and introduces this failure mode. This change skips DMLocatePoints when: - the mesh is simplex (`mesh.dm.isSimplex()`), where the face-containment test is exact for the affine reference map, or - the mesh is a manifold (`mesh.dim != mesh.cdim`), where the kdtree- nearest cell on the closed surface correctly contains every query. On non-simplex meshes (quads, hexes), deformed cells can have non-planar faces and the kdtree-nearest cell can be wrong; the previous DMLocatePoints path runs unchanged for those. What changed: - `_function.pyx`: cell hints in `petsc_interpolate` use `Mesh.get_closest_cells` (kdtree-nearest, no in-cell test). Replaces a short-lived attempt to use `_get_closest_local_cells_internal`, which surfaced a separate strictness issue in the boundary-vertex in-cell test outside the scope of this PR. - `_dminterp_wrapper.pyx::create_structure`: gates whether the hint is passed to `DMInterpolationSetUp_UW` on `isSimplex() or dim != cdim`. On non-simplex volume meshes the hint is suppressed (NULL) and the original DMLocatePoints path runs unchanged. - `petsc_tools.c::DMInterpolationSetUp_UW`: when `owning_cell` is provided, skip DMLocatePoints entirely; claim every hinted point on this rank (`MPI_Allreduce(MIN, foundProcs)` then assigns the lowest-rank claimant). Lazily build `pointVec` / `globalPointsScalar` only on the DMLocatePoints path. Sentinel test uses `owning_cell[p] != (size_t)-1` (addresses copilot review feedback: avoids implementation-defined unsigned→signed conversion). Volume-mesh behaviour: - Non-simplex (StructuredQuadBox, hexes): DMLocatePoints path runs unchanged. No regression. - Simplex (UnstructuredSimplexBox 2D/3D): bypass produces the same cell id as DMLocatePoints used to verify and accept. Functional no-op in single-rank. Multi-rank ownership collapses to rank-0-claims-all on simplex meshes (a known limitation; pre-existing parallel-evaluate work in `feature/parallel-point-eval` addresses ownership separately). Manifold-mesh behaviour (lands when `SphericalManifold` reaches `development` via `feature/parallel-point-eval`): bypass eliminates the wrong-cell matches that previously forced SLCN advection through the `_evalf=True` RBF-Shepard workaround. Validated: - `docs/examples/parallel_point_eval/probe_manual_p1_eval.py`: wild-evaluation rate 16% → 0; DMInterpolation matches manual barycentric P1 to FE accuracy. - `docs/examples/parallel_point_eval/probe_manifold_advection.py`: SLCN advection runs without `_evalf=True`, peak rotates with the prescribed angular velocity, mass stays bounded. Regression: - `tests/test_0820_deform_mesh_solver_rebuild_regression.py` passes (the quad-mesh deform test that exposed the earlier strictness path). - `tests/test_0000_imports.py + test_0001_meshes.py + test_0004_pointwise_fns.py`: 28/28. Underworld development team with AI support from Claude Code (claude.com/claude-code)

lmoresi · 2026-05-24T23:58:10Z

Force-pushed an updated design (commit 17a5a8d) and retargeted base back to development.

The earlier push had a regression on quad meshes — _get_closest_local_cells_internal rejects boundary vertices on bounded meshes (test_0820 failed). Pivoted the design: the bypass is now gated on mesh.dm.isSimplex() or mesh.dim != mesh.cdim. Non-simplex meshes (quads, hexes) keep the original DMLocatePoints path; simplex and manifold meshes use the bypass.

Both Copilot review comments addressed in the new version (pointVec lazy-built, unsigned-sentinel comparison).

lmoresi · 2026-05-25T02:10:15Z

Update — precursor needed

Discussed offline: get_closest_cells is by design a heuristic (always returns SOME cell, suitable for hinting PETSc, not authoritative). The current PR uses it as authoritative for the bypass — that's wrong in general; works on SphericalManifold only coincidentally.

The right authoritative hint source is _get_closest_local_cells_internal with looser semantics (>= 0 instead of strict > 0 in the in-cell test) so it accepts on-face queries — i.e., every mesh vertex, every point on a shared face. Opened as PR #207 (bugfix/in-cell-test-loose-semantics).

Once #207 merges, I'll rebase this PR onto it and switch the hint source in _function.pyx from get_closest_cells back to _get_closest_local_cells_internal. The bypass will then be authoritative for simplex + manifold meshes; non-simplex meshes keep the original DMLocatePoints path.

…mantics `Mesh._test_if_points_in_cells_internal` used a strict `> 0` test on the squared-distance difference between mirrored inner/outer control points placed ±1e-3 along each face normal. A query exactly on a cell face has zero distance difference and was rejected. That meant mesh vertices — which sit on the faces of every cell containing them in their closure — failed the in-cell test for every candidate cell, and `_get_closest_local_cells_internal` returned -1 for them. For evaluation use cases this is the wrong semantics: the FE basis at a shared vertex / face is consistent across the adjacent cells (a DM consistency requirement of FE assembly), so "any cell whose closure contains the point" is correct. Picking one specific adjacent cell and returning its id lets downstream code evaluate correctly. Add an opt-in `on_boundary: bool = False` kwarg that flips the comparison from strict `> 0` to loose `>= -1e-12`. Default is False (preserves historical strict-inside behaviour); callers that want loose semantics must request them explicitly. The kwarg is forwarded through the call chain: - `_test_if_points_in_cells_internal(on_boundary=False)` — core test - `_get_closest_local_cells_internal(on_boundary=False)` — forwards - `test_if_points_in_cells(on_boundary=False)` — public wrapper, forwards - `get_closest_local_cells(on_boundary=False)` — public wrapper, forwards Backward-compatibility rationale: callers like `points_in_domain` (used in `petsc_interpolate` to split queries into interior / exterior for FE vs. RBF routing) depend on the strict semantics — a vertex on the domain boundary is classified as "outside" and routed to RBF extrapolation. Changing that default would change `uw.function.evaluate` output values for any code that evaluates at mesh vertices (broke `test_advDiff_boxmesh[mesh0]` in a first attempt). Opt-in avoids the regression while making the loose semantics available where they're genuinely needed. This unblocks the bypass design in PR #203 (`feature/dminterp-bypass-element-check`), which needs an authoritative per-rank cell-id source. With `on_boundary=True`, `_get_closest_local_cells_internal` returns a cell whose closure contains the query (or -1 if no local cell does) — exactly what the bypass requires. Also addresses Copilot review feedback: - docstring corrected to say `_test_if_points_in_cells_internal` expects model-coords input (the public wrapper handles units). - removed an unused `inside` initialisation left over from the strict/loose refactor. - public `test_if_points_in_cells` now coerces `cells` to a 1-D numpy array so list/tuple input works as the docstring promises. Tests `tests/test_0820_in_cell_test_loose_semantics.py` (7 tests) lock the contract: - default strict-inside rejects boundary-vertex queries - on_boundary=True accepts every vertex query on 2D simplex, 3D simplex, and 2D structured-quad meshes - strict and loose modes give a distinguishable result on the same vertex queries - cells returned with on_boundary=True genuinely contain the query in their closure - public `get_closest_local_cells` forwards the kwarg Underworld development team with AI support from Claude Code (claude.com/claude-code)

lmoresi · 2026-05-25T05:22:33Z

Update on the precursor: #207 now uses an opt-in design — on_boundary=True to get loose semantics. The first attempt (loose-by-default) broke an AdvDiff test that depends on boundary vertices being routed to RBF in points_in_domain. Opt-in keeps that behaviour and makes the loose path available where it's actually needed.

When #207 merges, I'll update this PR's _function.pyx to call mesh._get_closest_local_cells_internal(coords, on_boundary=True) — gives the authoritative per-rank cell-id the bypass needs without touching the default path used by everything else.

…ries by default `Mesh._test_if_points_in_cells_internal` used a strict `> 0` test on the squared-distance difference between mirrored inner/outer control points placed ±1e-3 along each face normal. A query exactly on a cell face has zero distance difference and was rejected. That meant mesh vertices — which sit on the faces of every cell containing them in their closure — failed the in-cell test for every candidate cell, and `_get_closest_local_cells_internal` returned -1 for them. For evaluation use cases this is the wrong semantics: a closed domain Ω includes ∂Ω, and the FE basis at a shared vertex / face is consistent across the adjacent cells (a DM consistency requirement of FE assembly). So "any cell whose closure contains the point" is the right notion. Routing those queries through RBF Shepard extrapolation (the legacy behaviour of `uw.function.evaluate` for points classified outside the domain) is less accurate than evaluating via FE on any adjacent cell. Add `on_boundary: bool = True` kwarg to the in-cell test and forward through the call chain: - `_test_if_points_in_cells_internal(on_boundary=True)` — core test - `_get_closest_local_cells_internal(on_boundary=True)` — forwards - `test_if_points_in_cells(on_boundary=True)` — public wrapper - `get_closest_local_cells(on_boundary=True)` — public wrapper With on_boundary=True (the default) the comparison is `>= -1e-12` (well below the 1e-3 control-point offset, well above 64-bit float roundoff); with on_boundary=False the historical `> 0` strict-inside semantics is preserved. Callers that need uniqueness (a point claimed by exactly one cell, never a shared-face point claimed by both adjacent cells) can opt back into strict. Observable consequences: - `_get_closest_local_cells_internal` is now an authoritative cell-id source: returns a cell whose closure contains the query, or -1 if no local cell does. - `points_in_domain` reports boundary-vertex queries as in the domain (matches mathematical convention; a point on ∂Ω is in Ω). - `uw.function.evaluate` now routes boundary-vertex queries through FE instead of RBF Shepard — the more accurate path. This shifts the output of `evaluate` at mesh vertices on the domain boundary. Test impact: - `tests/test_0820_in_cell_test_loose_semantics.py` (new, 7 tests): locks the on_boundary=True default — every vertex of 2D simplex, 3D simplex, and 2D quad meshes resolves to a containing cell; strict mode still rejects on-face queries; loose mode returns cells whose closure genuinely contains the query. - `tests/test_0820_deform_mesh_solver_rebuild_regression.py` continues to pass (it was the motivating regression for the PR #203 bypass work that depends on this). - `tests/test_1100_AdvDiffCartesian.py::test_advDiff_boxmesh[mesh0]` marked xfail (strict=True). The file header already calls this "not a great test"; its atol=0.05 tolerance previously aligned only because boundary-vertex queries went through RBF Shepard's smoothing, and the test was tuned to that result. Needs reworking (smoother IC, larger transport distance) to pass under the more accurate FE-evaluate path. The two simplex variants of the same test (mesh1, mesh2) continue to pass. Also addresses Copilot review feedback: - docstring on `_test_if_points_in_cells_internal` corrected to say model-coords input (the public wrapper handles units). - removed an unused `inside` initialisation from the refactor. - public `test_if_points_in_cells` now coerces `cells` to a 1-D numpy array so list/tuple input works as the docstring promises. This unblocks the bypass design in PR #203 (`feature/dminterp-bypass-element-check`), which needs an authoritative per-rank cell-id source. With the new default, `mesh._get_closest_local_cells_internal(coords)` directly gives the cell id whose closure contains each query — exactly what the bypass requires. Underworld development team with AI support from Claude Code (claude.com/claude-code)

lmoresi · 2026-05-25T08:20:54Z

Updated precursor #207 design: loose is now the default in _get_closest_local_cells_internal. So once #207 merges, the update to this PR is just:

# in _function.pyx
cells = mesh._get_closest_local_cells_internal(coords)  # authoritative, no kwarg needed

No on_boundary=True opt-in plumbing in the bypass path. Cleaner.

Copilot AI review requested due to automatic review settings May 24, 2026 10:54

Copilot started reviewing on behalf of lmoresi May 24, 2026 10:54 View session

Copilot AI reviewed May 24, 2026

View reviewed changes

lmoresi changed the base branch from development to feature/parallel-point-eval May 24, 2026 12:12

lmoresi force-pushed the feature/dminterp-bypass-element-check branch from 97b5b8b to 17a5a8d Compare May 24, 2026 23:57

lmoresi changed the base branch from feature/parallel-point-eval to development May 24, 2026 23:57

lmoresi mentioned this pull request May 25, 2026

fix(meshing): on_boundary kwarg for in-cell test — accept on-face queries by default #207

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(interpolation): bypass DMLocatePoints when caller supplies a verified hint#203

fix(interpolation): bypass DMLocatePoints when caller supplies a verified hint#203
lmoresi wants to merge 1 commit into
developmentfrom
feature/dminterp-bypass-element-check

lmoresi commented May 24, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

lmoresi commented May 24, 2026

Uh oh!

lmoresi commented May 25, 2026

Uh oh!

lmoresi commented May 25, 2026

Uh oh!

lmoresi commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

lmoresi commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What's in the patch

Behaviour

Test plan

Copilot review feedback (addressed)

Known limitations / out of scope

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

lmoresi commented May 24, 2026

Uh oh!

lmoresi commented May 25, 2026

Uh oh!

lmoresi commented May 25, 2026

Uh oh!

lmoresi commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lmoresi commented May 24, 2026 •

edited

Loading