Implement ONNX model serialisation phase 2 #227
base: main
Conversation
Pull Request Overview
This PR implements ONNX model serialization (phase 2) by combining model initialization with individual ONNX models into a serialized estimator. Key changes include updating dependencies to include onnxruntime, adding new test functions to verify ONNX predictions, and modifying each model’s to_onnx method (and related functions) to accept an explicit data type parameter and use consistent input/output naming.
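As a rough illustration of the intended round trip, a serialised estimator could be exercised as below. This is a sketch only: `LBRegressor` is legateboost's regressor class, but the `to_onnx(X_dtype=...)` signature, its `ModelProto` return type, and the output shape matching `predict` are assumptions taken from the PR description rather than confirmed API.

```python
import numpy as np
import onnxruntime as ort

import legateboost as lb

# Fit a small model on toy data.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5)).astype(np.float32)
y = X.sum(axis=1)
model = lb.LBRegressor(n_estimators=5).fit(X, y)

# Serialise to a single ONNX graph and run it with onnxruntime.
# to_onnx(X_dtype=...) and the ModelProto return type are assumed from the PR text.
onnx_model = model.to_onnx(X_dtype=X.dtype)
session = ort.InferenceSession(onnx_model.SerializeToString())
input_name = session.get_inputs()[0].name  # avoids guessing the standardized input name
onnx_pred = session.run(None, {input_name: X})[0]

np.testing.assert_allclose(onnx_pred, model.predict(X), rtol=1e-2)
```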
Reviewed Changes
Copilot reviewed 9 out of 9 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| pyproject.toml, dependencies.yaml, conda YAML | Updated dependencies to include onnxruntime>=1.21 |
| legateboost/test/test_onnx.py | Added new functions for ONNX predictions and updated test naming |
| legateboost/models/tree.py, nn.py, linear.py, krr.py | Modified to_onnx methods to accept X_dtype and standardized input/output names |
| legateboost/legateboost.py | Introduced _make_onnx_init and updated to_onnx to merge ONNX models (see the sketch after this table) |
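The internals of `_make_onnx_init` are not shown in this page, so the following is only a hypothetical sketch of how an initialisation term can be fused with a per-model graph using `onnx.compose`; all graph and tensor names (`raw`, `raw_in`, `pred`, the toy `MatMul` model) are invented for illustration.

```python
import numpy as np
import onnx
from onnx import TensorProto, helper, numpy_helper
from onnx.compose import merge_models

# Toy stand-in for one model's graph: raw = X @ w.
w = np.ones((5, 1), dtype=np.float32)
model = helper.make_model(helper.make_graph(
    [helper.make_node("MatMul", ["X", "w"], ["raw"])],
    "model",
    [helper.make_tensor_value_info("X", TensorProto.FLOAT, [None, 5])],
    [helper.make_tensor_value_info("raw", TensorProto.FLOAT, [None, 1])],
    initializer=[numpy_helper.from_array(w, name="w")],
))

# Toy stand-in for the initialisation term: pred = raw + init.
init = np.array([0.5], dtype=np.float32)
init_model = helper.make_model(helper.make_graph(
    [helper.make_node("Add", ["raw_in", "init"], ["pred"])],
    "init",
    [helper.make_tensor_value_info("raw_in", TensorProto.FLOAT, [None, 1])],
    [helper.make_tensor_value_info("pred", TensorProto.FLOAT, [None, 1])],
    initializer=[numpy_helper.from_array(init, name="init")],
))

# Wire the model's output into the init graph's input by name.
merged = merge_models(model, init_model, io_map=[("raw", "raw_in")])
onnx.checker.check_model(merged)
```

`merge_models` stitches graphs together by connecting named outputs to named inputs, which is presumably why the PR standardises input/output names across the models.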
seberg left a comment:
The approach looks good to me; I'm commenting because I suspect the classification path needs a bit of work. This is tricky to test!
I've been thinking about the predict_function= argument, but it seems good to me. (predict_raw seems to duplicate predict somewhat.)
I should probably look more closely at some of the ONNX code.
```python
assert onnx_pred.dtype == pred.dtype
assert pred.shape == onnx_pred.shape
number_wrong = np.sum(
    np.abs(pred - onnx_pred) > (1e-2 if X.dtype == np.float32 else 1e-5)
)
```
Predictions are always similarly sized (i.e. 0-1)? Just curious whether it would make sense to allow a relative deviation.
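For reference, a relative-tolerance variant of the check above could use np.isclose; the rtol/atol values here are illustrative, not taken from the PR.

```python
import numpy as np

# pred, onnx_pred, and X come from the test snippet above.
rtol = 1e-2 if X.dtype == np.float32 else 1e-5
number_wrong = np.sum(~np.isclose(onnx_pred, pred, rtol=rtol, atol=1e-8))
```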
cuPyNumeric test failures here require https://github.com/nv-legate/legate.internal/pull/2177 - we need to wait for nightlies to become available.
Implements #225
This PR combines the model initialisation term and the individual ONNX models into a single serialised estimator.
I will likely only implement the predict_raw method here and leave predict_proba for another PR, as it will require e.g. softmax transforms.
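For context on why predict_proba needs the extra work: probabilities require a normalising transform on top of the raw scores. Below is a hypothetical sketch of appending an ONNX Softmax node to a raw-score graph; the "raw"/"proba" names and the [None, n_classes] output shape are invented for illustration, not taken from the PR.

```python
import onnx
from onnx import TensorProto, helper


def append_softmax(model: onnx.ModelProto, n_classes: int) -> onnx.ModelProto:
    """Replace the raw-score output with softmax probabilities (sketch)."""
    graph = model.graph
    # Assumes the existing graph exposes an output named "raw".
    graph.node.append(helper.make_node("Softmax", ["raw"], ["proba"], axis=-1))
    del graph.output[:]
    graph.output.append(
        helper.make_tensor_value_info("proba", TensorProto.FLOAT, [None, n_classes])
    )
    return model
```

A binary classifier would presumably use a Sigmoid node instead, which is part of why this is being deferred to a separate PR.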