Shared: Improvements to content-sensitive model generation #20915

hvitved · 2025-11-26T09:36:21Z

Distinguish the access path limit used internally from the limit used in access paths produced by the content flow library. This allows us to recover Rust: Improve handling of implicit derefs/borrows in data flow #20891 (comment).
Exclude models with access paths that use non-supported Contents.
Apply validateAccessPath before checking whether at most 3 models exist for a given parameter.

shared/dataflow/codeql/dataflow/internal/ContentDataFlowImpl.qll

      FlowFeature getAFeature() { result = ContentConfig::getAFeature() }

-      predicate accessPathLimit = ContentConfig::accessPathLimit/0;
+      predicate accessPathLimit = ContentConfig::accessPathLimitInternal/0;


Copilot

Pull request overview

This PR improves content-sensitive model generation by distinguishing between the access path limit used internally and the limit used for generating models, enabling better filtering of invalid access paths.

Introduced separate accessPathLimit() and accessPathLimitInternal() to allow different limits for internal data flow analysis versus model generation
Enhanced validateAccessPath() to exclude models with unsupported content types by validating all content in access paths
Refactored validation to occur earlier in the pipeline by moving validateAccessPath checks into the apiFlow predicate before counting models per parameter

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
shared/mad/codeql/mad/modelgenerator/internal/ModelGeneratorImpl.qll	Enhanced `validateAccessPath` documentation and logic; refactored validation to occur in `apiFlow` predicate before model counting
shared/dataflow/codeql/dataflow/internal/ContentDataFlowImpl.qll	Added `accessPathLimitInternal()` method and `length()` method for access paths to support internal vs external limit distinction
rust/ql/test/utils-tests/modelgenerator/option.rs	Updated test expectations to reflect newly generated summaries that were previously missing due to access path limits

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

michaelnebel

Perhaps we should also run DCA (at least for C# model generation) to check the impact on performance and number of models generated?

michaelnebel · 2025-12-03T07:39:18Z

shared/dataflow/codeql/dataflow/internal/ContentDataFlowImpl.qll

    default int accessPathLimit() { result = Lang::accessPathLimit() }

+    /** Gets the access path limit used in the internal invocation of the standard data flow library. */
+    default int accessPathLimitInternal() { result = Lang::accessPathLimit() }


Why is this needed?

It is not currently used, but I thought I'd add it in case e.g. C# model generation blows up, in which case we can revert it back to 2.

github-actions bot added the DataFlow Library label Nov 26, 2025

github-advanced-security bot found potential problems Nov 26, 2025

View reviewed changes

shared/dataflow/codeql/dataflow/internal/ContentDataFlowImpl.qll Fixed Show fixed Hide fixed

hvitved mentioned this pull request Nov 26, 2025

Rust: Improve handling of implicit derefs/borrows in data flow #20891

Merged

hvitved force-pushed the content-flow-ap-limit branch from 6fb68ee to 8b5dbe2 Compare December 1, 2025 19:57

github-actions bot added the Rust Pull requests that update Rust code label Dec 1, 2025

hvitved changed the title ~~Shared: Do not apply accessPathLimit in content flow~~ Shared: Improvements to content-sensitive model generation Dec 1, 2025

github-advanced-security bot found potential problems Dec 1, 2025

View reviewed changes

Shared: Improvements to content-sensitive model generation

666855d

hvitved force-pushed the content-flow-ap-limit branch from 8b5dbe2 to 666855d Compare December 1, 2025 20:23

hvitved added the no-change-note-required This PR does not need a change note label Dec 2, 2025

hvitved marked this pull request as ready for review December 2, 2025 14:42

hvitved requested review from a team as code owners December 2, 2025 14:42

hvitved requested review from Copilot and michaelnebel December 2, 2025 14:42

Copilot started reviewing on behalf of hvitved December 2, 2025 14:43 View session

Copilot finished reviewing on behalf of hvitved December 2, 2025 14:45

Copilot AI reviewed Dec 2, 2025

View reviewed changes

michaelnebel reviewed Dec 3, 2025

View reviewed changes

hvitved added the C# label Dec 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Shared: Improvements to content-sensitive model generation #20915

Shared: Improvements to content-sensitive model generation #20915

hvitved commented Nov 26, 2025 •

edited

Loading

Uh oh!

Uh oh!

Check warning

Copilot AI left a comment

Uh oh!

michaelnebel left a comment

Uh oh!

michaelnebel Dec 3, 2025

Uh oh!

hvitved Dec 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Shared: Improvements to content-sensitive model generation #20915

Are you sure you want to change the base?

Shared: Improvements to content-sensitive model generation #20915

Conversation

hvitved commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Check warning

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

michaelnebel left a comment

Choose a reason for hiding this comment

Uh oh!

michaelnebel Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

hvitved Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hvitved commented Nov 26, 2025 •

edited

Loading