Add quantile_uni_extrapolate preprocessor preset by LeoGrin · Pull Request #971 · PriorLabs/TabPFN

LeoGrin · 2026-05-15T17:20:10Z

Summary

Adds the quantile_uni_extrapolate preprocessor preset: instead of clamping quantile-transformed values at the [0, 1] boundary, it linearly extrapolates past the boundary for inputs outside the training range, better preserving out-of-distribution information.

Wired through both the CPU AdaptiveQuantileTransformer and the GPU TorchQuantileTransformer paths so the preset is GPU-eligible. CPU/GPU consistency tests parametrise the new preset across f16/f32/f64.

Motivation

v3 OOD-preprocessing checkpoints use quantile_uni_extrapolate in the regressor preprocessing recipe. Without this preset in the public package those checkpoints fail to load (pydantic ValidationError on the PreprocessorConfig.name literal). This unblocks loading them with the public release.

Changes (7 source + 3 test files, mirrors private #599)

preprocessing/configs.py — register quantile_uni_extrapolate in the name literal
preprocessing/steps/adaptive_quantile_transformer.py — extrapolate_ratio logic (CPU)
preprocessing/steps/reshape_feature_distribution_step.py — preset wiring
preprocessing/torch/{factory,steps,gpu_preprocessing_metadata,torch_quantile_transformer}.py — GPU path
tests for CPU transform, torch transform, and CPU/GPU consistency

Test plan

test_adaptive_quantile_transformer.py — 5/5 pass (incl. NaN-column & constant-feature edge cases)
test_torch_quantile_transformer.py extrapolation suite — 3/3 pass (incl. matches_cpu_on_out_of_range_inputs)

Note: repo policy is "open an issue first" — happy to file/link one; opening this so the diff is reviewable.

🤖 Generated with Claude Code

Note

Medium Risk
Touches core preprocessing behavior for a new quantile preset and changes both CPU (sklearn) and GPU (torch) quantile paths, which can affect model inputs and CPU/GPU parity if edge cases slip through.

Overview
Adds a new preprocessor preset, quantile_uni_extrapolate, that preserves out-of-distribution signal by linearly extrapolating quantile-transformed values beyond the usual [0, 1] clamp (with configurable extrapolate_ratio, defaulted to 1.0 for the preset).

Wires this behavior through both the CPU AdaptiveQuantileTransformer (stores per-feature train min/max and extrapolates at transform time, skipping constant features) and the GPU pipeline (TorchQuantileTransformer + Torch*QuantileTransformerStep/factory), and marks the preset as GPU-eligible. Updates tests to cover extrapolation semantics, validation guards, and CPU/GPU consistency for the new preset.

^{Reviewed by Cursor Bugbot for commit 1569371. Bugbot is set up for automated code reviews on this repo. Configure here.}

Port of internal TabPFN-private #599. Linearly extrapolates quantile-transformed values past [0, 1] for inputs outside the training range instead of clamping at the boundary, to better preserve out-of-distribution information. Wired through both the CPU AdaptiveQuantileTransformer and the GPU TorchQuantileTransformer paths so the preset is GPU-eligible. CPU/GPU consistency tests parametrise the new preset across f16/f32/f64. Needed so v3 OOD-preprocessing checkpoints (which use quantile_uni_extrapolate in the regressor preproc recipe) load with the public package. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

gemini-code-assist

Code Review

This pull request adds a new quantile_uni_extrapolate preset to the preprocessing pipeline, enabling linear extrapolation for values outside the training range in both AdaptiveQuantileTransformer (CPU) and TorchQuantileTransformer (GPU). This is achieved by adding an extrapolate_ratio parameter and corresponding logic to the transform methods. Reviewers suggested adding input validation to ensure the extrapolate_ratio is non-negative and only used with uniform output distributions to avoid incorrect results.

gemini-code-assist · 2026-05-15T17:23:30Z

        self._user_n_quantiles = n_quantiles
        # Initialize parent with this, but it will be adapted in fit
        super().__init__(n_quantiles=n_quantiles, subsample=subsample, **kwargs)
+        self.extrapolate_ratio = extrapolate_ratio


It is recommended to validate that extrapolate_ratio is non-negative and that it is only used when output_distribution is set to "uniform". Linear extrapolation as implemented here is not mathematically appropriate for a normal output distribution and would lead to incorrect results if accidentally configured that way.

self.extrapolate_ratio = extrapolate_ratio if extrapolate_ratio is not None: if extrapolate_ratio < 0: raise ValueError("extrapolate_ratio must be non-negative.") if kwargs.get("output_distribution", "uniform") != "uniform": raise ValueError("extrapolate_ratio is only supported for output_distribution='uniform'.")

gemini-code-assist · 2026-05-15T17:23:30Z

        """
        super().__init__()
        self.n_quantiles = n_quantiles
+        self.extrapolate_ratio = extrapolate_ratio


Consider adding a validation check to ensure extrapolate_ratio is non-negative. While the current presets use 1.0, explicit validation prevents potential issues if the class is used with custom configurations in the future.

Suggested change

self.extrapolate_ratio = extrapolate_ratio

self.extrapolate_ratio = extrapolate_ratio

if extrapolate_ratio is not None and extrapolate_ratio < 0:

raise ValueError("extrapolate_ratio must be non-negative.")

Deliberate deviation from internal #599 (which placed it in the coarse n_samples//10 tier alongside quantile_uni_coarse). The preset is meant to be "the default quantile transform plus boundary extrapolation" — it should differ from quantile_uni by extrapolation ONLY, not also by a coarser quantile grid. Moving it to the n_samples//5 tier makes the default -> extrapolate swap a clean one-variable change. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

- torch_quantile_transformer: compute the normalised tensor once and reuse it for both branches. (x - x_max)/range + 1 == (x - x_min)/range, so the separate norm_above was a redundant full-array recompute. Mirrors the consolidation already done on the CPU side in #599. Output-identical (consistency tests unchanged), ~halves extrapolation arithmetic on GPU. - Validation guards (per review): extrapolate_ratio must be non-negative (CPU + GPU); and only valid with output_distribution="uniform" (CPU), since linear extrapolation is not meaningful for a normal output. - Tests for both guard paths. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

LeoGrin · 2026-05-15T19:17:57Z

Thanks @gemini-code-assist — both addressed in b54cb09:

extrapolate_ratio non-negative + uniform-only (CPU) — added in AdaptiveQuantileTransformer.__init__: raises ValueError for negative ratio, and for output_distribution != "uniform" (linear extrapolation past [0,1] is only meaningful for the uniform output mapping, matching the preset's documented intent).
extrapolate_ratio non-negative (GPU) — added the same non-negative guard in TorchQuantileTransformer.__init__.

Added tests for both guard paths (test__extrapolate_ratio__validation_guards, test__torch_extrapolate_ratio__rejects_negative).

Also in this push, unrelated to the bot comments but worth flagging for review:

quantile_uni_extrapolate now uses the n_samples // 5 tier (same as quantile_uni), not the coarse n//10 tier it had in the internal PR. Rationale: the preset is meant to be "the default quantile transform plus boundary extrapolation" — it should differ from quantile_uni by extrapolation only, not also by a coarser quantile grid. This is an intentional deviation from internal Update the device doc string to include multi-gpu inference. #599.
GPU _apply_extrapolation norm consolidation: the internal port computed norm_below and norm_above as two separate full-array tensors; (x - x_max)/range + 1 is algebraically identical to (x - x_min)/range, so it now computes one tensor and reuses it (the CPU side was already consolidated this way in Update the device doc string to include multi-gpu inference. #599). Output-identical — CPU/GPU consistency tests unchanged.

gemini-code-assist · 2026-05-15T19:18:16Z

Thanks for the update, @LeoGrin. The added validation guards and the consolidation of the GPU _apply_extrapolation logic look correct and improve the robustness of the implementation. The decision to use the n_samples // 5 tier for quantile_uni_extrapolate to maintain consistency with the base quantile_uni preset is also well-reasoned. The updated tests appropriately cover the new validation logic and verify the GPU/CPU consistency. This looks ready for final review.

- test__torch_extrapolate_ratio__rejects_negative used function-level imports (PLC0415); pytest + TorchQuantileTransformer are already imported at module top, so just use those. - ruff format on the two touched files. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

FBPrior

LGTM thanks!

FBPrior · 2026-05-18T10:58:32Z

+            extrapolate_ratio = (
+                1.0 if pconfig.name == "quantile_uni_extrapolate" else None
+            )


Do we ever want to set this to sth other than one? I think it's useful to keep this parameter around because it's a pretty important parameter.

LeoGrin requested a review from a team as a code owner May 15, 2026 17:20

LeoGrin requested review from eliott-kalfon and removed request for a team May 15, 2026 17:20

gemini-code-assist Bot reviewed May 15, 2026

View reviewed changes

LeoGrin and others added 2 commits May 15, 2026 19:10

Drop explanatory comment on quantile_uni_extrapolate tier

deb1a00

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

LeoGrin added the no changelog needed PR does not require a changelog entry label May 15, 2026

LeoGrin requested review from FBPrior and removed request for eliott-kalfon May 15, 2026 19:11

FBPrior approved these changes May 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add quantile_uni_extrapolate preprocessor preset#971

Add quantile_uni_extrapolate preprocessor preset#971
LeoGrin wants to merge 5 commits into
mainfrom
leo/quantile-uni-extrapolate

LeoGrin commented May 15, 2026 •

edited by cursor Bot

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 15, 2026

Uh oh!

gemini-code-assist Bot May 15, 2026

Uh oh!

LeoGrin commented May 15, 2026

Uh oh!

gemini-code-assist Bot commented May 15, 2026

Uh oh!

FBPrior left a comment

Uh oh!

FBPrior May 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

LeoGrin commented May 15, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Changes (7 source + 3 test files, mirrors private #599)

Test plan

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

LeoGrin commented May 15, 2026

Uh oh!

gemini-code-assist Bot commented May 15, 2026

Uh oh!

FBPrior left a comment

Choose a reason for hiding this comment

Uh oh!

FBPrior May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

LeoGrin commented May 15, 2026 •

edited by cursor Bot

Loading

FBPrior May 18, 2026 •

edited

Loading