[BUG] Fix incorrect variance calculation and capability tagging in Hurdle by ANANYA542 · Pull Request #973 · sktime/skpro

ANANYA542 · 2026-03-19T20:54:47Z

Reference Issues/PRs

fixes #972

What does this implement/fix? Explain your changes.

This PR fixes an issue in the Hurdle distribution where the variance was being computed incorrectly.
The current implementation uses p * var_positive + p * (1 - p) * mean_positive, but the mean term should be squared. This was leading to significant underestimation of the variance. The formula has been corrected to use (mean_positive ** 2) in line with the law of total variance.

I also updated how capability tags are determined. Previously, Hurdle checked the base distribution to decide whether mean/var are exact, but since it internally wraps the distribution using LeftTruncated, those values are actually computed via numerical approximation. The tags now reflect this correctly by checking the truncated distribution instead.

Does your contribution introduce a new dependency? If yes, which one?

No

What should a reviewer concentrate their feedback on?

Whether the updated variance formula aligns with expected behavior across different base distributions
Whether capability tagging should consistently be based on the internally used truncated distribution

Did you add any tests for the change?

I verified the fix using Monte Carlo simulation (100k samples), where the empirical variance now matches the analytical result within tolerance. No new formal unit tests have been added yet.

Any other comments?

A screenshot showing the before/after behavior has also been added for clarity.

PR checklist

For all contributions

I've added myself to the list of contributors with any new badges I've earned :-)
How to: add yourself to the all-contributors file in the skpro root directory (not the CONTRIBUTORS.md). Common badges: code - fixing a bug, or adding code logic. doc - writing or improving documentation or docstrings. bug - reporting or diagnosing a bug (get this plus code if you also fixed the bug in the PR).maintenance - CI, test framework, release.
See here for full badge reference
The PR title starts with either [ENH], [MNT], [DOC], or [BUG]. [BUG] - bugfix, [MNT] - CI, test framework, [ENH] - adding or improving code, [DOC] - writing or improving documentation or docstrings.

For new estimators

I've added the estimator to the API reference - in docs/source/api_reference/taskname.rst, follow the pattern.
I've added one or more illustrative usage examples to the docstring, in a pydocstyle compliant Examples section.
If the estimator relies on a soft dependency, I've set the python_dependencies tag and ensured
dependency isolation, see the estimator dependencies guide.

…act capabilities tag

ANANYA542 · 2026-03-27T06:42:00Z

Hi @fkiraly — just following up on this PR. If you could take some time to review it and let me know if any changes or explanation is required, I’d really appreciate it.

[BUG] Fix Hurdle variance incorrect mathematical formula and false ex…

4016b5e

…act capabilities tag

ANANYA542 requested review from felipeangelimvieira and fkiraly as code owners March 19, 2026 20:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Fix incorrect variance calculation and capability tagging in Hurdle#973

[BUG] Fix incorrect variance calculation and capability tagging in Hurdle#973
ANANYA542 wants to merge 1 commit intosktime:mainfrom
ANANYA542:fix-hurdle-variance

ANANYA542 commented Mar 19, 2026

Uh oh!

ANANYA542 commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ANANYA542 commented Mar 19, 2026

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Does your contribution introduce a new dependency? If yes, which one?

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

Any other comments?

PR checklist

For all contributions

For new estimators

Uh oh!

ANANYA542 commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant