Skip vmm-tests for non-product PRs (Guide/**, repo_support/**/*.py)#2925

Draft
Copilot wants to merge 5 commits into main from copilot/skip-vmm-tests-for-guide-changes

Conversation

Contributor

Copilot AI commented Mar 10, 2026

Today, every PR triggers all six expensive vmm-tests jobs, including PRs that only touch docs or repo scripts. This change adds a reusable Flowey-level classification mechanism that skips vmm-tests for PRs entirely within approved non-product buckets, while keeping the full matrix for anything that touches product code or unclassified paths.

The classification works across all three Flowey backends (GitHub, ADO, local) with no external scripts or GitHub API calls. The same Rust classification code runs on all backends — no backend-specific logic is duplicated.

Flowey node: check_pr_changes

Backend-native Rust node that classifies PR changed files using git diff:

| Backend | Classification method | Cross-job output |
| --- | --- | --- |
| GitHub | `git diff origin/$GITHUB_BASE_REF...HEAD` | Written to `$GITHUB_ENV` as `FLOWEY_IS_NON_PRODUCT`; declared as a job-level output |
| ADO | `git diff origin/$SYSTEM_PULLREQUEST_TARGETBRANCH...HEAD` | Published via `##vso[task.setvariable;isOutput=true]`; step name `classify_pr_changes` |
| Local | Always "product" (conservative) | N/A |

Non-product bucket config: non_product_config.toml

Bucket patterns are defined in flowey/flowey_lib_hvlite/src/non_product_config.toml — a single TOML file that is the authoritative source for all backends. To add or remove a non-product bucket, edit this file only.

```toml
[[bucket]]
prefix = "Guide/"
description = "Docs tree; validated by the separate docs pipeline"

[[bucket]]
prefix = "repo_support/"
suffix = ".py"
description = "Repo automation scripts; no effect on product behavior"
```

The config is embedded at compile time via include_str! and parsed with toml_edit. Unit tests in check_pr_changes.rs cover config parsing, prefix-only matching, prefix+suffix matching, and product/non-product classification for representative paths.
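To make the matching rules concrete, here is a minimal self-contained sketch of the prefix/suffix bucket logic described above. The names (`Bucket`, `is_non_product_path`, `is_non_product_pr`) and the hardcoded bucket list are assumptions for illustration; the real node embeds `non_product_config.toml` via `include_str!` and parses it with `toml_edit` rather than hardcoding buckets.

```rust
// Hypothetical model of the bucket matching; mirrors the two buckets
// shown in the TOML example above.
struct Bucket {
    prefix: &'static str,
    suffix: Option<&'static str>,
}

const BUCKETS: &[Bucket] = &[
    Bucket { prefix: "Guide/", suffix: None },
    Bucket { prefix: "repo_support/", suffix: Some(".py") },
];

// A path is non-product if it matches some bucket's prefix, and the
// bucket's suffix (when present) as well.
fn is_non_product_path(path: &str) -> bool {
    BUCKETS.iter().any(|b| {
        path.starts_with(b.prefix) && b.suffix.map_or(true, |s| path.ends_with(s))
    })
}

// A PR is non-product only if *every* changed file falls in some bucket.
// An empty diff is conservatively treated as product-affecting.
fn is_non_product_pr(changed: &[&str]) -> bool {
    !changed.is_empty() && changed.iter().all(|p| is_non_product_path(p))
}

fn main() {
    assert!(is_non_product_pr(&["Guide/src/intro.md"]));
    assert!(is_non_product_pr(&["repo_support/relabel_backported.py"]));
    // Mixed PRs take the stricter path.
    assert!(!is_non_product_pr(&["Guide/src/intro.md", "vm/devices/foo.rs"]));
    // Suffix must match too: a non-.py file under repo_support/ is product.
    assert!(!is_non_product_pr(&["repo_support/data.json"]));
    // Conservative default for an empty diff.
    assert!(!is_non_product_pr(&[]));
    println!("all classifications ok");
}
```

The same "stricter path wins" rule is what makes mixed lightweight + product PRs run the full matrix in the behavior table below.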

New Flowey infrastructure

New PipelineJob and Pipeline methods:

  • gh_set_job_output_from_env_var(name, env_var) — declares a GitHub job-level output backed by a $GITHUB_ENV variable written by a Rust step
  • ado_dangerous_override_if(condition) — overrides the ADO job-level condition: field (mirrors the existing gh_dangerous_override_if)
  • Pipeline::gh_job_id_of(handle) / Pipeline::ado_job_id_of(handle) — dynamically look up the auto-generated or override job ID from a handle
  • PipelineJobCtx::new_var<T>() — creates typed (ReadVar<T>, WriteVar<T>) pairs for use in dep_on closures

Also adds outputs: support to the Job struct in the GitHub YAML definitions.
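Composed together, a pipeline definition might use these roughly as follows. This is a hand-written sketch against the API names listed above, not the actual checkin_gates.rs code; the `new_job`/`finish` plumbing is elided and exact signatures may differ:

```rust
// Sketch only — assumes the flowey APIs listed above.
// The classify job publishes its result as a GitHub job-level output
// backed by the FLOWEY_IS_NON_PRODUCT env var written by its Rust step.
let classify_job = pipeline
    .new_job(/* platform, arch, */ "classify PR changes")
    .gh_set_job_output_from_env_var("is_non_product", "FLOWEY_IS_NON_PRODUCT")
    .finish();

// Dependent jobs look up the auto-generated job ID from the handle and
// gate their condition on the published output.
let classify_id = pipeline.gh_job_id_of(&classify_job);
let vmm_tests_cond = format!(
    "needs.{classify_id}.outputs.is_non_product != 'true' \
     && github.event.pull_request.draft == false"
);
```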

Pipeline changes (checkin_gates.rs)

For PipelineConfig::Pr on both the GitHub and ADO backends, a new "classify PR changes" job is added. All six "run vmm-tests [...]" jobs take a non-artifact dependency on it and gate on a backend-appropriate condition:

GitHub:

```
needs.<classify-job-id>.outputs.is_non_product != 'true' && github.event.pull_request.draft == false
```

ADO:

```
and(succeeded(), not(canceled()), ne(dependencies.<job-id>.outputs['classify_pr_changes.is_non_product'], 'true'))
```

The classify job is included in all_jobs, so openvmm checkin gates still waits for it and the required-check model is preserved.
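In the regenerated GitHub workflow, the resulting job graph looks roughly like this. The job IDs here (`job18`, `job3`) are placeholders for the auto-generated IDs, and the steps are elided; the `outputs:` shape follows the gh_set_job_output_from_env_var behavior described above:

```yaml
jobs:
  job18: # classify PR changes
    outputs:
      is_non_product: ${{ env.FLOWEY_IS_NON_PRODUCT }}
    # ... checkout + the Rust classification step ...
  job3: # run vmm-tests [...]
    needs: [job18]
    if: needs.job18.outputs.is_non_product != 'true' && github.event.pull_request.draft == false
    # ... test steps ...
```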

Guide documentation

New section in Guide/src/dev_guide/dev_tools/flowey/pipelines.md covering cross-job conditions, the new APIs (gh_job_id_of, gh_set_job_output_from_env_var, ado_dangerous_override_if), the check_pr_changes node (including how to extend non-product buckets via the TOML config), and a pipeline usage example.

Behavior matrix

| PR contents | vmm-tests |
| --- | --- |
| `Guide/**` only | skipped |
| `repo_support/**/*.py` only | skipped |
| mixed lightweight + product | run |
| product only | run |
| draft PR | skipped (unchanged) |
Original prompt

This section details the original issue you should resolve

<issue_title>don't run vmm-tests for Guide changes</issue_title>
<issue_description># Skip vmm-tests for lightweight PRs without weakening PR gates

Summary

I want to stop burning vmm-tests capacity on PRs that clearly are not product changes.

The immediate case is Guide/**, but I don’t think we should solve this as a docs-only special case. We should use this to add a small Flowey-level mechanism for classifying PRs into “product” vs “lightweight” buckets, then use that classification to skip the expensive vmm-tests jobs when the PR is lightweight-only.

That keeps the current PR gate model intact, fixes the docs case, and gives us a clean path to exempt other non-product areas later, like repo_support/**/*.py.


What’s wrong today

Right now the main PR workflow is Flowey-generated from .flowey.toml via ci checkin-gates --config=pr, with the logic in flowey/flowey_hvlite/src/pipelines/checkin_gates.rs.

That workflow always emits the full vmm-tests matrix, including the six heavy run vmm-tests [...] jobs. That makes sense for product changes. It does not make sense for changes that only touch docs or other repo-maintenance surfaces.

For example:

  • Guide/** changes already have their own docs validation path
  • repo_support/relabel_backported.py is repo automation, not product behavior

Those changes do not affect guest behavior, device behavior, save/restore behavior, boot behavior, or VM management semantics, but today they still consume the same scarce self-hosted test capacity as a real product change.

That is the real problem here. We are treating every PR like a product PR because we do not have a reusable way to say “this change is lightweight.”


Why I want to solve this more generally

I don’t want to hardcode one exception for Guide/** and then repeat the same discussion the next time we want to exempt another obviously non-product path.

The better model is:

  • if a PR touches product code, run full product validation
  • if a PR is entirely within approved lightweight buckets, skip vmm-tests
  • if a PR is mixed, take the stricter path and run vmm-tests

That makes the docs case easy, and it also gives us a clean way to handle future cases like repo_support/**/*.py without inventing another one-off rule.

The default should still be conservative: if a path is not explicitly classified as lightweight, it is treated as product-affecting.


Existing validation we already have

We already have a separate Flowey-generated docs PR workflow:

  • .github/workflows/openvmm-docs-pr.yaml
  • generated from .flowey.toml via ci build-docs --config=pr
  • implemented in flowey/flowey_hvlite/src/pipelines/build_docs.rs

That workflow builds the guide via flowey/flowey_lib_hvlite/src/build_guide.rs, which runs:

  • mdbook test
  • mdbook build

So for Guide/**, this is not about removing validation. It is about stopping redundant product validation for a change that already has an appropriate validation path.


Proposed shape

I want to add a small reusable Flowey helper that classifies a PR into named change buckets before we expand the expensive jobs.

Conceptually, the classifier should answer things like:

  • does this PR touch product code?
  • does this PR touch Guide/**?
  • does this PR touch repo_support/**/*.py?
  • is this PR lightweight-only?

I’m fine with backing that helper using a purpose-built GitHub Action. In fact, that is probably the cleanest implementation, as long as the Flowey interface stays simple and the action contract stays small.

Then checkin_gates.rs can use that classification to gate the six vmm-tests jobs on !lightweight_only.

So the behavior becomes:

  • Guide/**-only PR → skip vmm-tests
  • repo_support/**/*.py-only PR → skip vmm-tests
  • Guide/** + product code PR → run vmm-tests
  • product-only PR → run vmm-tests

That is the behavior I want.


Why I do not want to skip the whole PR workflow

The tempting shortcut here is to add workflow-level path filtering and just not run the main PR workflow for guide-only changes.

I don’t think that is the right shape.

The current PR pipeline already has an aggregate required-check job, openvmm checkin gates, which depends on the rest of the pipeline and treats only failure and cancelled as blocking. That is a good model. I want to preserve it.

If we skip only the vmm-tests jobs, we keep the existing required check surface and mixed-change behavior stays correct.

If we skip the entire workflow, required-check behavior gets more fragile and the whole thing becomes harder to reason about.

So I want job-level skipping of the expensive matrix, not workflow disappearance.


Goals

  • Stop scheduling vmm-tests for PRs that are entirely within approved lightweight buckets.
  • Keep running the existing vmm-tests matrix for anything that touches product ...


Co-authored-by: mattkur <13772048+mattkur@users.noreply.github.com>
@mattkur
Contributor

mattkur commented Mar 10, 2026

@copilot: this will need to work in local and ADO backends as well. Meaning: don't make this specific to the GH pipelines.

…ment for draft check

Co-authored-by: mattkur <13772048+mattkur@users.noreply.github.com>
Copilot AI changed the title [WIP] Skip vmm-tests for lightweight PRs without product changes Skip vmm-tests for non-product PRs (Guide/**, repo_support/**/*.py) Mar 10, 2026
use flowey::node::prelude::*;

/// GitHub Actions expression suitable for use in a job-level `outputs:` block.
Contributor

Why do we need a GitHub Actions expression?

Contributor Author

Removed entirely. The GitHub Actions expression is now generated internally by gh_set_job_output_from_env_var(name, env_var) from the env var name constant GH_ENV_IS_NON_PRODUCT. No raw expression is exported from the node. (commit 3897202)

Comment on lines +1140 to +1155
```rust
/// (GitHub Actions only) Declare a job-level output.
///
/// `name` is the output name; `expression` is a GitHub Actions expression
/// whose value becomes the output value, e.g.
/// `"${{ steps.my-step.outputs.my-key }}"` or `"${{ env.MY_VAR }}"`.
///
/// The declared output is accessible to dependent jobs via
/// `needs.<this-job-id>.outputs.<name>`.
///
/// See <https://docs.github.com/en/actions/writing-workflows/workflow-syntax-for-github-actions#jobsjob_idoutputs>
pub fn gh_set_job_output(self, name: impl AsRef<str>, expression: impl AsRef<str>) -> Self {
    self.pipeline.jobs[self.job_idx]
        .gh_job_outputs
        .push((name.as_ref().into(), expression.as_ref().into()));
    self
}
```
Contributor

Can this be a {Read|Write}Var instead?

Contributor Author

Replaced gh_set_job_output(name, raw_expression) with gh_set_job_output_from_env_var(name, env_var) — the env var name (a typed string constant from the node) is now the input, and the ${{ env.VAR }} expression is generated automatically. The job output is declared as outputs: is_non_product: ${{ env.FLOWEY_IS_NON_PRODUCT }} in the regenerated YAML. (commit 3897202)

Comment on lines +1124 to +1138
```rust
/// (GitHub Actions only) Overrides the auto-generated job ID used in the
/// GitHub Actions YAML.
///
/// By default, Flowey assigns job IDs like `job0`, `job1`, etc. Use this
/// method to assign a stable, human-readable ID instead. This is useful
/// when other jobs need to reference this job's outputs via
/// `needs.<job-id>.outputs.<output-name>`.
///
/// Job IDs must be unique across the pipeline.
pub fn gh_override_job_id(self, id: impl AsRef<str>) -> Self {
    self.pipeline
        .gh_job_id_overrides
        .insert(self.job_idx, id.as_ref().into());
    self
}
```
Contributor

Smells wrong

Contributor Author

Removed gh_override_job_id entirely. Added Pipeline::gh_job_id_of(handle) and Pipeline::ado_job_id_of(handle) methods that dynamically look up the auto-generated job IDs from handles. The classify job now gets an auto-generated ID (e.g. job18) and the vmm-tests conditions reference it via needs.job18.outputs.is_non_product. (commit 3897202)

Contributor

This whole file should be replaced by some flowey code that can then be consumed by all backends

Contributor Author

The action.yml is deleted. check_pr_changes::Node now uses emit_rust_step (GitHub/local) and emit_ado_step_with_inline_script (ADO) — pure Flowey code, no external GitHub Action, no GitHub API call. Classification is done via git diff against the base branch, which works across all three backends. (commit 3897202)

@mattkur
Copy link
Contributor

mattkur commented Mar 10, 2026

@copilot: In addition: please make sure that you update the Guide as well: (1) for the sections relevant to this work, and (2) elaborate on the flowey architecture + conventions, etc.

@mattkur mattkur closed this Mar 10, 2026
@mattkur mattkur reopened this Mar 10, 2026
Copilot stopped work on behalf of mattkur due to an error March 10, 2026 21:50
…e gh_override_job_id

Co-authored-by: mattkur <13772048+mattkur@users.noreply.github.com>
Contributor Author

Copilot AI commented Mar 10, 2026

@copilot: In addition: please make sure that you update the Guide as well: (1) for the sections relevant to this work, and (2) elaborate on the flowey architecture + conventions, etc.

Added a new section to Guide/src/dev_guide/dev_tools/flowey/pipelines.md covering: cross-job conditions (gh_job_id_of, ado_job_id_of, gh_set_job_output_from_env_var, ado_dangerous_override_if), the PR change classification node (check_pr_changes), how it works per backend, the non-product bucket table, and a pipeline usage example. The ADO backend is also now wired up in checkin_gates.rs (not just GitHub). (commit 3897202)

Comment on lines +162 to +164
To add a new non-product bucket, update **both** `is_non_product_path` (Rust,
used by the GitHub and local backends) and the equivalent `if` clause in the
ADO bash script inside `check_pr_changes::Node`.
Contributor

Why is ADO so special? Smells like something we should promote architecturally.

Contributor Author

Fixed. The doc now says "edit non_product_config.toml" and the "How it works" table reflects that the same Rust classification code runs on all three backends — ADO is no longer special-cased. (commit 128ecdc)

Comment on lines +4 to +21
```rust
//! Classifies PR changed files to determine whether all changes fall within
//! approved non-product buckets (e.g. `Guide/**`, `repo_support/**/*.py`).
//!
//! This node works across all Flowey backends:
//!
//! - **GitHub**: a Rust step runs `git diff` against `GITHUB_BASE_REF` and
//!   writes the bool result to `$GITHUB_ENV` under [`GH_ENV_IS_NON_PRODUCT`]
//!   so it is accessible as a job-level output via
//!   [`Pipeline::gh_job_id_of`] + `needs.<job>.outputs.is_non_product`.
//!
//! - **ADO**: an ADO step named [`ADO_STEP_NAME`] runs a bash script that
//!   uses `git diff` against `SYSTEM_PULLREQUEST_TARGETBRANCH`, publishes the
//!   result as an ADO output variable (`is_non_product`), and passes it to the
//!   inline Rust snippet which writes it to the Flowey var. Downstream jobs
//!   can gate on [`ado_condition`].
//!
//! - **Local**: always writes `false` (conservative; vmm-tests always run).
```
Contributor

Please make it so that this is easier to maintain, like that this reads from a .toml file.
And, add unit tests for that parsing.

Contributor Author

Done. Bucket patterns now live in flowey/flowey_lib_hvlite/src/non_product_config.toml (embedded via include_str!, parsed with toml_edit). Added 10 unit tests covering: config parsing, Guide/**, repo_support/**/*.py, product code, workflow YAML, prefix-only matching, and prefix+suffix matching. (commit 128ecdc)

Comment on lines +103 to +152
```rust
// The ADO step must have a stable, well-known name so that
// downstream jobs can reference it as:
//   dependencies.<job>.outputs['classify_pr_changes.is_non_product']
//
// `emit_ado_step_with_inline_script` generates a step that:
//   1. Runs the bash classify script (sets IS_NON_PRODUCT + ADO output var).
//   2. Runs the Flowey inline snippet (writes the done signal).
//
// NOTE on format! escaping used in the YAML snippet below:
// In Rust format strings, `{{` and `}}` produce literal `{` and `}`.
//   {{{{ ... }}}} → {{ ... }} (used for the {{FLOWEY_INLINE_SCRIPT}} marker)
//   ${{VAR}}      → ${VAR}    (Rust escaping → bash variable expansion syntax)
ctx.emit_ado_step_with_inline_script("classify PR changes", |ctx| {
    let done = done.claim(ctx);
    (
        |_rt| {
            format!(
                concat!(
                    "- bash: |\n",
                    "    set -euo pipefail\n",
                    "    TARGET_BRANCH=\"${{SYSTEM_PULLREQUEST_TARGETBRANCH:-}}\"\n",
                    "    if [[ -z \"$TARGET_BRANCH\" ]]; then\n",
                    "      echo \"Not a PR run; treating as product change.\"\n",
                    "      IS_NON_PRODUCT=false\n",
                    "    else\n",
                    "      TARGET_BRANCH=\"${{TARGET_BRANCH#refs/heads/}}\"\n",
                    "      echo \"Comparing against: origin/$TARGET_BRANCH\"\n",
                    "      CHANGED=$(git diff --name-only \"origin/$TARGET_BRANCH...HEAD\" 2>/dev/null || true)\n",
                    "      if [[ -z \"$CHANGED\" ]]; then\n",
                    "        echo \"No changed files found; treating as product change.\"\n",
                    "        IS_NON_PRODUCT=false\n",
                    "      else\n",
                    "        IS_NON_PRODUCT=true\n",
                    "        while IFS= read -r F; do\n",
                    "          if [[ \"$F\" == Guide/* ]] || [[ \"$F\" =~ ^repo_support/.*\\.py$ ]]; then\n",
                    "            : # file is in a non-product bucket\n",
                    "          else\n",
                    "            echo \"Product file detected: $F\"\n",
                    "            IS_NON_PRODUCT=false\n",
                    "            break\n",
                    "          fi\n",
                    "        done <<< \"$CHANGED\"\n",
                    "      fi\n",
                    "    fi\n",
                    "    echo \"is_non_product=$IS_NON_PRODUCT\"\n",
                    "    echo \"##vso[task.setvariable variable=is_non_product;isOutput=true]$IS_NON_PRODUCT\"\n",
                    "    {{{{FLOWEY_INLINE_SCRIPT}}}}\n",
                    "  name: {step_name}\n"
                ),
                step_name = ADO_STEP_NAME,
```
Contributor

Yuck. Do we /really/ need this?

Contributor Author

Simplified significantly. The ADO emit_ado_step_with_inline_script is now just a minimal wrapper to set the step name: field (required for ADO cross-job output variable references). The bash template is now just {{FLOWEY_INLINE_SCRIPT}} — all classification logic lives in the same Rust classify() helper shared with the GitHub backend. No more duplicate bash classification script. (commit 128ecdc)

…ests

Co-authored-by: mattkur <13772048+mattkur@users.noreply.github.com>
Successfully merging this pull request may close these issues:

don't run vmm-tests for Guide changes