Implement tofile on tensors to reduce data write time by 40% #210

justinchuby · 2025-10-03T23:35:49Z

This PR introduces a tofile method on tensors, named after the one on NumPy arrays. It allows faster writes and lower memory usage when saving external data, because it bypasses tobytes().
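A minimal sketch of the idea, using only the standard library. `SketchTensor` and `write_both_ways` are hypothetical names for illustration and are not the onnx_ir implementation; `io.BytesIO` stands in for a real file. It contrasts writing through an intermediate `bytes` copy with handing the underlying buffer straight to the file.

```python
import array
import io


class SketchTensor:
    """Hypothetical stand-in for a tensor class (not the onnx_ir code)."""

    def __init__(self, values):
        # array.array owns a contiguous buffer of 32-bit floats.
        self._data = array.array("f", values)

    def tobytes(self) -> bytes:
        # Materializes a second, full-size copy of the data as a bytes object.
        return self._data.tobytes()

    def tofile(self, file) -> None:
        # Exposes the existing buffer as a byte view and writes it directly,
        # so no intermediate bytes object is created.
        file.write(memoryview(self._data).cast("B"))


def write_both_ways(values):
    """Write the same tensor via tobytes() and via tofile()."""
    tensor = SketchTensor(values)
    copied = io.BytesIO()
    copied.write(tensor.tobytes())
    direct = io.BytesIO()
    tensor.tofile(direct)
    return copied.getvalue(), direct.getvalue()
```

Both paths should produce identical bytes on disk; the difference is only in how much temporary memory the write needs.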

Compatibility with existing TensorProtocol implementations is maintained in the external data module by calling tofile only when the class provides it. The TorchTensor class in the PyTorch exporter should be updated accordingly to leverage the new logic when saving.
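The fallback described above can be sketched as follows. `write_tensor`, `LegacyTensor`, and `FastTensor` are hypothetical names chosen for this example; the actual helper in the external data module may be structured differently.

```python
import io


class LegacyTensor:
    """Hypothetical external implementation that only provides tobytes()."""

    def __init__(self, payload: bytes):
        self._payload = payload

    def tobytes(self) -> bytes:
        return self._payload


class FastTensor(LegacyTensor):
    """Hypothetical implementation that also provides tofile()."""

    def tofile(self, file) -> None:
        file.write(self._payload)


def write_tensor(tensor, file) -> str:
    """Write tensor data, preferring tofile() when the class has it.

    Returns the name of the path taken, purely so the example is observable.
    """
    if hasattr(tensor, "tofile"):
        tensor.tofile(file)
        return "tofile"
    # Older implementations keep working through the tobytes() path.
    file.write(tensor.tobytes())
    return "tobytes"
```

Because the check is done with `hasattr` at the call site, an implementation that never adds `tofile` keeps working unchanged.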

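Note that I/O time to disk is reduced by 40%. The timings below are whole-command timings from fish's time, so they also include work other than the disk write.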

Note

TensorProtocol is not updated because we do isinstance() checks on external implementations (PyTorch). Adding the method to the protocol would cause the isinstance() check to fail on implementations that have not yet added tofile.
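To illustrate why extending the protocol would break those checks, here is a small sketch with `typing.Protocol`. It assumes the protocol is checked at runtime via `@runtime_checkable`; `OldProtocol`, `NewProtocol`, and `ExternalTensor` are hypothetical names, not the real onnx_ir classes.

```python
from typing import Protocol, runtime_checkable


@runtime_checkable
class OldProtocol(Protocol):
    """Hypothetical protocol before the change: only tobytes()."""

    def tobytes(self) -> bytes: ...


@runtime_checkable
class NewProtocol(Protocol):
    """Hypothetical protocol if tofile() were added as a required member."""

    def tobytes(self) -> bytes: ...

    def tofile(self, file) -> None: ...


class ExternalTensor:
    """Hypothetical external implementation that has not added tofile()."""

    def tobytes(self) -> bytes:
        return b"\x00"


# A runtime isinstance() check requires every protocol member to be present.
matches_old = isinstance(ExternalTensor(), OldProtocol)
matches_new = isinstance(ExternalTensor(), NewProtocol)
```

Here `matches_old` is `True` and `matches_new` is `False`, which is the failure mode the note describes.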

Reference: https://github.com/microsoft/onnxscript/pull/2241/files/b2381658492510a9bcc8c0a8574db7368e33bceb

Before:

________________________________________________________
Executed in   48.08 secs    fish           external
   usr time   60.54 secs    0.00 millis   60.54 secs
   sys time   23.06 secs    1.22 millis   23.06 secs

After:

________________________________________________________
Executed in   45.69 secs    fish           external
   usr time   60.68 secs  244.00 micros   60.68 secs
   sys time   22.22 secs  518.00 micros   22.22 secs

Fix #207

Signed-off-by: Justin Chu <[email protected]>

codecov · 2025-10-03T23:36:58Z

Codecov Report

❌ Patch coverage is 81.96721% with 11 lines in your changes missing coverage. Please review.
✅ Project coverage is 76.92%. Comparing base (feb51e5) to head (e3df4c9).
⚠️ Report is 3 commits behind head on main.

Files with missing lines	Patch %	Lines
src/onnx_ir/_core.py	80.85%	7 Missing and 2 partials ⚠️
src/onnx_ir/external_data.py	66.66%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #210      +/-   ##
==========================================
+ Coverage   76.83%   76.92%   +0.08%     
==========================================
  Files          40       40              
  Lines        4922     4992      +70     
  Branches      980      996      +16     
==========================================
+ Hits         3782     3840      +58     
- Misses        856      864       +8     
- Partials      284      288       +4

justinchuby · 2025-10-04T00:58:05Z

cc @iksnagreb

sonarqubecloud · 2025-10-04T18:56:40Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

justinchuby · 2025-10-10T03:51:55Z

@titaiwangms @gramalingam this is ready for review, thanks.

titaiwangms · 2025-10-10T17:42:40Z

src/onnx_ir/_core.py

        """Return the bytes of the tensor."""
        return self._evaluate().tobytes()

+    def tofile(self, file) -> None:


titaiwangms: I am wondering whether tofile() makes sense for LazyTensor.

justinchuby: Can you say more?

titaiwangms: I just thought it's not even real until it's evaluated. Intuitively, that does not fit well with tofile(), which is meant to write the tensor to disk. But I guess the general expectation is that all tensors have this method, so it's understandable.

justinchuby: It is actually useful: even when the tensor is lazily evaluated, we still want to avoid tobytes() making a copy of the tensor data before writing to file. The screenshots in the PR description show lazy tensors.
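As a rough sketch of the point made in this thread, a lazy tensor can evaluate on demand and then delegate to the evaluated object's own `tofile`. Only `_evaluate()` appears in the diff above; `SketchLazyTensor`, its constructor, and the `evaluations` counter are assumptions for illustration, and `array.array` stands in for a real tensor.

```python
import array
import io


class SketchLazyTensor:
    """Hypothetical lazy tensor; evaluates its function only when needed."""

    def __init__(self, func):
        self._func = func
        self.evaluations = 0  # Counter used only to make the demo observable.

    def _evaluate(self):
        self.evaluations += 1
        return self._func()

    def tobytes(self) -> bytes:
        # Evaluates, then builds a full bytes copy of the data.
        return self._evaluate().tobytes()

    def tofile(self, file) -> None:
        # Evaluates, then lets the concrete tensor decide how to write itself.
        tensor = self._evaluate()
        if hasattr(tensor, "tofile"):
            tensor.tofile(file)
        else:
            file.write(tensor.tobytes())


def demo() -> bytes:
    lazy = SketchLazyTensor(lambda: array.array("i", [1, 2, 3]))
    buffer = io.BytesIO()  # Stands in for a file opened in binary mode.
    lazy.tofile(buffer)
    return buffer.getvalue()
```

The tensor is still evaluated exactly once per write; what changes is that the write no longer has to go through `tobytes()` on the lazy wrapper.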

justinchuby added 5 commits October 3, 2025 15:55

Fix endian

82b3f58

Signed-off-by: Justin Chu <[email protected]>

nvm

42d8edc

Signed-off-by: Justin Chu <[email protected]>

More implementations

63310c1

Signed-off-by: Justin Chu <[email protected]>

tofile

290ab6c

Signed-off-by: Justin Chu <[email protected]>

hasattr

1b53a6a

Signed-off-by: Justin Chu <[email protected]>

justinchuby added 5 commits October 3, 2025 17:19

tofile!

c05e189

Signed-off-by: Justin Chu <[email protected]>

write

6377435

Signed-off-by: Justin Chu <[email protected]>

always write numpy

3dc5704

Signed-off-by: Justin Chu <[email protected]>

Maintain reference

7fd35d7

Signed-off-by: Justin Chu <[email protected]>

Merge branch 'main' into justinchu/write

40cb60d

justinchuby marked this pull request as ready for review October 4, 2025 00:44

justinchuby requested review from a team and titaiwangms as code owners October 4, 2025 00:44

justinchuby requested a review from gramalingam October 4, 2025 00:44

justinchuby added the module: api label Oct 4, 2025

justinchuby mentioned this pull request Oct 4, 2025

Be smarter about torch tensors jambayk/torch-onnx-models#43

Merged

justinchuby added this to the 0.1.11 milestone Oct 4, 2025

justinchuby changed the title ~~Implement tofile on tensors~~ Implement tofile on tensors to reduce data write time by 40% Oct 6, 2025

justinchuby commented Oct 6, 2025

src/onnx_ir/tensor_adapters.py (outdated)

justinchuby added 4 commits October 9, 2025 13:10

Fix fileno

909344d

Signed-off-by: Justin Chu <[email protected]>

Test

e7dc301

Signed-off-by: Justin Chu <[email protected]>

test

8f832b3

Signed-off-by: Justin Chu <[email protected]>

Create tests

9afc144

Signed-off-by: Justin Chu <[email protected]>

naming

1f87be1

Signed-off-by: Justin Chu <[email protected]>

justinchuby mentioned this pull request Oct 10, 2025

[ONNX] Implement tofile on tensor pytorch/pytorch#165120

Closed

versionadded

2e06f50

Signed-off-by: Justin Chu <[email protected]>

Add tests

dafeaf7

Signed-off-by: Justin Chu <[email protected]>

titaiwangms reviewed Oct 10, 2025

justinchuby added 4 commits October 10, 2025 11:16

docstring

ff2df13

Signed-off-by: Justin Chu <[email protected]>

docs

2220eb3

Signed-off-by: Justin Chu <[email protected]>

docs

24d6e65

Signed-off-by: Justin Chu <[email protected]>

use function

ef9b697

Signed-off-by: Justin Chu <[email protected]>

justinchuby added the topic: bc breaking label Oct 10, 2025

update docs

6036735

Signed-off-by: Justin Chu <[email protected]>

justinchuby force-pushed the justinchu/write branch from d527ac8 to 6036735 Compare October 10, 2025 18:38

titaiwangms approved these changes Oct 10, 2025

justinchuby commented Oct 10, 2025

src/onnx_ir/_core.py (outdated)

Apply suggestion from @justinchuby

e3df4c9

Signed-off-by: Justin Chu <[email protected]>

justinchuby merged commit 43ebf47 into main Oct 10, 2025
23 checks passed

justinchuby deleted the justinchu/write branch October 10, 2025 19:55

justinchuby removed the topic: bc breaking label Oct 15, 2025
