
Conversation


@LeoGrin commented Feb 11, 2025

Fix #97

Downside compared to PyTorch

We don't do the memory estimation + reduction part, so it's easier to hit memory errors with large datasets (TODO).
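A minimal sketch of what a memory-reduction fallback could look like on the ONNX side; this is hypothetical (the function name, chunk size, and single-input assumption are mine), not the PyTorch estimation logic:

```python
import numpy as np

def predict_in_chunks(session, X, chunk_size=1024):
    """Hypothetical fallback: bound peak memory by running the
    onnxruntime InferenceSession on slices of X instead of all at once."""
    input_name = session.get_inputs()[0].name
    outputs = []
    for start in range(0, len(X), chunk_size):
        batch = X[start:start + chunk_size].astype(np.float32)
        outputs.append(session.run(None, {input_name: batch})[0])
    return np.concatenate(outputs, axis=0)
```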

(It seems that if I export the ONNX model on GPU I get fewer memory errors afterward, but I'm not sure.)
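For reference, a minimal sketch of the export-on-GPU path, assuming a `torch.nn.Module` named `model` and illustrative input/output names and shapes:

```python
import torch
import onnxruntime as ort

# Export on GPU: move the model and dummy input to CUDA first.
model = model.cuda().eval()
dummy = torch.randn(1000, 10, device="cuda")
torch.onnx.export(
    model,
    (dummy,),
    "model.onnx",
    input_names=["X"],
    output_names=["out"],
    dynamic_axes={"X": {0: "n_samples"}},  # avoid re-exporting per dataset size
)

# Run with the CUDA provider, falling back to CPU if it is unavailable.
session = ort.InferenceSession(
    "model.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
```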

Quick speed analysis:

For a 1Kx10 dataset on a T4 GPU node (timings seem to be quite hardware-dependent):

  • first fit: 5s for ONNX (we initialize the session), 0.4s for PyTorch
  • first predict: ~2s for ONNX, 6s for PyTorch
  • 2nd fit: 0.05s for ONNX, 0.15s for PyTorch
  • 2nd predict: 0.6-1.2s for ONNX (depending on where you export it; probably some more gains here?), 0.22s for PyTorch

It seems that for smaller datasets, ONNX becomes faster even for the 2nd predict.
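For context, a rough sketch of the kind of harness these numbers could come from (the estimator itself is assumed, not part of this snippet):

```python
import time
import numpy as np

def time_fit_predict(estimator, X, y, rounds=2):
    """Time repeated fit/predict calls, as in the numbers above."""
    for i in range(rounds):
        t0 = time.perf_counter()
        estimator.fit(X, y)
        t1 = time.perf_counter()
        estimator.predict(X)
        t2 = time.perf_counter()
        print(f"round {i + 1}: fit {t1 - t0:.2f}s, predict {t2 - t1:.2f}s")

# 1Kx10 synthetic data, matching the setup above.
rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 10)).astype(np.float32)
y = rng.integers(0, 2, size=1000)
```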

Performance analysis on real datasets

Results seemed good, but I need to rerun.

TODOs

  • Make memory reduction work on ONNX
  • Allow setting the ONNX session from outside so it works with sklearn cloning (see the sketch after this list)
  • Check more carefully that we haven't degraded performance (e.g. on categoricals)
  • Can we make the ONNX model faster by exporting it with the right dummy input? How much does speed vary by hardware?
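On the cloning TODO: sklearn's `clone()` rebuilds an estimator from its constructor parameters only, so a session created inside `fit` is dropped and the expensive ~5s initialization happens again. A hypothetical sketch of the setter idea (class and method names are illustrative, and `session` is assumed to exist, e.g. from the export snippet above):

```python
from sklearn.base import BaseEstimator, clone

class OnnxBackedEstimator(BaseEstimator):
    """Hypothetical sketch: keep the session out of __init__ (clone()
    would try to deep-copy it) and expose a setter to re-attach a
    shared session after cloning."""

    def __init__(self, model_path="model.onnx"):
        self.model_path = model_path
        self._onnx_session = None  # built lazily in fit, or injected below

    def set_onnx_session(self, session):
        self._onnx_session = session
        return self

    def fit(self, X, y):
        if self._onnx_session is None:
            import onnxruntime as ort
            self._onnx_session = ort.InferenceSession(self.model_path)
        # ... run inference through self._onnx_session ...
        return self

# clone() only copies __init__ params, so we re-attach the session by hand:
est = OnnxBackedEstimator().set_onnx_session(session)
est2 = clone(est).set_onnx_session(est._onnx_session)
```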

@LeoGrin marked this pull request as draft February 11, 2025 13:41
@PriorLabs deleted a comment from CLAassistant Feb 12, 2025
@LeoGrin mentioned this pull request Feb 20, 2025
@noahho changed the title Allow using ONNX compiled model in sklearn interface [Paused] Allow using ONNX compiled model in sklearn interface May 20, 2025
@reusyangyang

Hi, does this model currently support training in a GPU environment and saving in ONNX format?
