
Conversation

@alinah-amd commented Sep 23, 2025

Describe your changes

Overview

Added the EstimateNPULatency pass under olive/onnx/vitis_ai/estimate_latency.py.
EstimateNPULatency uses the NPU Perf Estimator tool to predict the computational performance of workloads given a set of parameters.

This is an analysis pass: it does not transform the graph at all and is used only for performance analysis.
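For orientation, here is a minimal sketch of what an analysis-only pass of this shape looks like. This is not the PR's implementation: the base-class import and the _run_for_config signature follow Olive's usual Pass interface as I understand it, and the actual call into the estimator is elided.

    import logging

    from olive.passes import Pass

    logger = logging.getLogger(__name__)

    try:
        from estimator.run import run_perf_estimate  # noqa: F401 -- availability probe
        perf_installed = True
    except ImportError:
        perf_installed = False


    class EstimateNPULatency(Pass):
        """Analysis-only pass: estimates NPU latency and returns the model unchanged."""

        def _run_for_config(self, model, config, output_model_path):
            if perf_installed:
                # Run the NPU Perf Estimator and write the concise_summary
                # artifacts; the exact call and arguments are elided in this sketch.
                ...
            return model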

Installation

To install (if not installed through requirements.txt), run the following:
pip install [placeholder for wheel]

Confirm that the installed Python version is >= 3.10 for compatibility.

If the perf estimator package is not installed, the following warning is shown and the pass is simply bypassed:
[screenshot: warning that the estimator module was not found and the EstimateNPULatency pass is being skipped]

Usage

Inputs
EstimateNPULatency takes the model as an OnnxModelHandler object and optional parameters as a dict of PassConfigParams (consistent with all other passes).

Optional Parameters
To pass in optional parameters, list each parameter name and value as a key-value pair in the JSON file. See the example below:

[screenshot: example pass config JSON with optional parameters]
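For instance, a hypothetical pass entry might look like the following (the instance name "estimate_npu_latency" is arbitrary, and the exact nesting should be checked against the actual recipe):

    "passes": {
        "estimate_npu_latency": {
            "type": "EstimateNPULatency",
            "target_device": "stx"
        }
    }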
  1. target_device: Target device type. This is used to provide a default config specific to that device type. Currently, only Strix is supported; support for other devices will be added in the future.
  • Type: str
  • Default: stx
  • Allowed Values: ["stx"]

Adding Pass to Config File
This pass should ideally run last and be listed last in the <model>.json file. For example:

[screenshot: example <model>.json pass list with EstimateNPULatency listed last]
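A sketch of that ordering, with a hypothetical earlier conversion pass shown purely for illustration:

    "passes": {
        "conversion": { "type": "OnnxConversion" },
        "estimate_npu_latency": {
            "type": "EstimateNPULatency",
            "target_device": "stx"
        }
    }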

Output
Generates a concise_summary directory within the run directory with the following files:

[screenshot: files generated in the concise_summary directory]

{model_name}_concise_summary.txt displays the following info on roofline latency, total compute ops, and the conclusion that can be drawn about the performance bottleneck (whether it is DDR Bandwidth bound or Compute bound):

[screenshot: example concise summary text output]
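For context on how that conclusion is drawn, the classification follows the standard roofline argument. The snippet below is illustrative arithmetic only, not code from the estimator:

    def roofline(compute_ops, bytes_moved, peak_ops_per_s, ddr_bytes_per_s):
        # Latency is bounded below by both the compute time and the DDR
        # transfer time; whichever is larger identifies the bottleneck.
        compute_time = compute_ops / peak_ops_per_s
        memory_time = bytes_moved / ddr_bytes_per_s
        bound = "DDR Bandwidth bound" if memory_time > compute_time else "Compute bound"
        return max(compute_time, memory_time), bound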

{model_name}_concise_summary.csv displays the same info broken down per op. Ops are listed in descending order of latency:

[screenshot: example per-op concise summary CSV]

Known Passing Tests

ResNet w/ Perf Estimator
Refer to Olive Recipes Repo

MobileNet w/ Perf Estimator
Refer to Olive Recipes Repo

Unit Test

  • Run python -m pytest test/unit_test/passes/vitis_ai/test_estimate_latency.py
  • This unit test runs a dummy model through the estimate-latency pass (calling it directly instead of via an end-to-end flow) and asserts that a concise_summary directory with dummy csv and txt files is generated (see the sketch after this list).
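For reference, the shape such a test takes, with hypothetical helper names standing in for the actual test utilities:

    def test_estimate_latency_basic(tmp_path):
        model = make_dummy_onnx_model(tmp_path)    # hypothetical helper
        run_estimate_npu_latency(model, tmp_path)  # hypothetical direct call into the pass

        summary_dir = tmp_path / "concise_summary"
        assert summary_dir.is_dir()
        assert any(p.suffix == ".txt" for p in summary_dir.iterdir())
        assert any(p.suffix == ".csv" for p in summary_dir.iterdir())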

Checklist before requesting a review

  • Add unit tests for this change.
  • Make sure all tests can pass.
  • Update documents if necessary.
  • Lint and apply fixes to your code by running lintrunner -a
  • Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

(Optional) Issue link

@alinah-amd (Author)

@microsoft-github-policy-service agree company="Microsoft"

@microsoft-github-policy-service agree company="AMD"

@jambayk (Contributor) commented Oct 1, 2025

/azp run


Azure Pipelines successfully started running 1 pipeline(s).

try:
    from estimator.run import run_perf_estimate
    perf_installed = True
except ImportError:
    perf_installed = False
    logger.warning("Estimator module not found. Skipping EstimateNPULatency pass.")
Contributor:

I think instead of raising a warning, it might be better to fail with a helpful import error which tells the user what package to install. Since Olive caches runs, to rerun the pass with the dependency installed, they would have to clean the cache or delete the cached run that skipped the estimation.
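A sketch of the suggested behavior (the install hint is a placeholder, since the wheel name is not given in this PR):

    try:
        from estimator.run import run_perf_estimate
    except ImportError as e:
        raise ImportError(
            "EstimateNPULatency requires the NPU Perf Estimator package. "
            "Install it with: pip install <estimator-wheel>"  # placeholder name
        ) from e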

"supported_accelerators": [ "*" ],
"supported_precisions": [ "*" ],
"supported_algorithms": [ ],
"supported_quantization_encodings": [ ]
@jambayk (Contributor) commented Oct 3, 2025:

Could you add a "module_dependencies" option, like the one under the AutoAWQQuantizer pass, for the package required to run this estimation?
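For reference, a sketch of how that option could sit next to the fields quoted above (the dependency name is a placeholder, since the wheel name is not given in this PR):

    "module_dependencies": [ "<estimator-package>" ],
    "supported_accelerators": [ "*" ],
    "supported_precisions": [ "*" ],
    "supported_algorithms": [ ],
    "supported_quantization_encodings": [ ]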

class TestEstimateNPULatency:
    """Test cases for EstimateNPULatency pass."""

    def test_estimate_latency_basic(self, tmp_path):
Contributor:

Please also add the required dependency to requirements-test.txt under test/.
