Prep 2.2.0 release #189

brenns10 · 2025-10-31T23:14:14Z

This is ready for review.

Ksplice cold-patches are a pain because they are kernel modules whose build IDs (and debuginfo) are mismatched, but they otherwise look just like the in-tree module. Previously we tried to detect them, and thus avoid attempting to extract or load their debuginfo. However, in practice this doesn't seem feasible. While I've been able to find some signals that a module may be a cold-patch, none have generalized to all architectures and versions. Instead, we need to just handle the effects of this problem. When cold-patches aren't handled, the ol-download and ol-local-rpm finders will repeat attempting to download & extract these debuginfo files, every time they're used. We already have some safeguards to prevent double-execution (download, then re-extract). But we can extend this safeguard to the case where we've previously extracted the RPM. If we already tried the file from the vmlinux_repo, then there's no point in trying to download or extract that module again. Signed-off-by: Stephen Brennan <[email protected]>

With the module API we can report the actual DWARF file that gets loaded. But CTF wasn't explicitly reported. Given that the Oracle plugin now handles CTF loading, we can also save the file that got loaded, so that we can later report it for the CLI or corelens logs. Signed-off-by: Stephen Brennan <[email protected]>

While a recent commit handled the case where we had extracted files from a downloaded RPM, and there was a build ID mismatch, there was still the case where the debuginfo RPM was installed to the system. Since drgn's standard finder loads those files, we would never have the opportunity to populate the "extracted" set for those modules. Thus, when the debuginfo RPM is installed, it would be possible for us to try to download and extract debuginfo in the presence of a build ID mismatch (e.g. ksplice cold-patch). Avoid this and also report a warning. Signed-off-by: Stephen Brennan <[email protected]>

This will support some of our internal customer debugging environments, by allowing us to extract debuginfo in directories relative to the core dumps that we are debugging. Signed-off-by: Stephen Brennan <[email protected]>

Right now drgn, DRGN, & corelens just delay when extracting. We really should print a status message to let users know what is happening. Signed-off-by: Stephen Brennan <[email protected]>

Maintaining the outfile & report parameters is a bit difficult for a few reasons. First, the "outfile" parameter is a string filename, which means that whenever an output must be written, the file must be opened. Second, the "report" parameter is intended to determine the mode (append vs write), but this becomes less than useful if you need to write multiple things at a time: when report is False, you'll only get the last item printed. The intended use case for these parameters seems to be so that we can easily provide custom RDS scripts to customers. The idea being that many outputs would be too large, so we may need to only run certain functions, and redirect output to several files for ease of access. To support this, let's create a @redirectable decorator. It will take any function, and allow it to accept an "outfile" parameter. When provided, this parameter will redirect the function's output to the file. An optional :w or :a can be appended to the filename in order to specify the mode (it is :w by default). All print statements can simply write to stdout, and it will be redirected appropriately where necessary. For example, a custom script could now be created easily: from drgn_tools import rds rds.rds_conn_info(prog, outfile="conn_info.txt") rds.rds_sock_info(prog, outfile="other_data.txt:a") rds.rdma_resource_usage(prog, outfile="other_data.txt:a") Signed-off-by: Stephen Brennan <[email protected]>

This will soon become moot, as we will likely be adding drgn commands for corelens, that work on 0.0.33 and later. But for now, it's useful: >>> cl("dentrycache -l 50000", outfile="foo.txt") Signed-off-by: Stephen Brennan <[email protected]>

The functions themselves raise appropriate errors, but we don't want the tests to fail on these vmcores. Signed-off-by: Stephen Brennan <[email protected]>

Signed-off-by: Stephen Brennan <[email protected]>

This ensures we have helpers with the latest fixes for the latest upstream kernels. Signed-off-by: Stephen Brennan <[email protected]>

The drgn timekeeping helpers were introduced in drgn 0.0.32 and can be used to replace our existing tk_core / shadow_timekeeper code. What's more, they are kept up-to-date with the latest kernel changes, so long as a recent enough drgn version is used. Signed-off-by: Stephen Brennan <[email protected]>

Signed-off-by: Stephen Brennan <[email protected]>

There are occasional test failures on live systems where the stack changes during a test. Of course there's no guarantee of stability here, but let's give a grace period to reduce the chances and hopefully avoid the test failure. Signed-off-by: Stephen Brennan <[email protected]>

It has been a long time since the readme got touched, and it's a bit out of date. Update it to focus more heavily on Corelens, give CTF a mention, and link to OL documentation. Also, give a bit of description for how to use the debuginfo plugin. Signed-off-by: Stephen Brennan <[email protected]>

The "kvm" corelens module should not run unless the kvm kernel module is loaded and debuginfo is present. Signed-off-by: Stephen Brennan <[email protected]>

When reading logs it's not always obvious which test run resulted in a failure. Log the full details of the test so that it is easier to detect. Signed-off-by: Stephen Brennan <[email protected]>

Signed-off-by: Stephen Brennan <[email protected]>

biger410

Looks good.

brenns10 · 2025-11-04T18:29:16Z

Thank you! The test failure is only due to the UEK7 debuginfo being missing for the latest release, for some reason. I think it's just a race condition and it will be uploaded soon. In any case, I've done quite a bit of other testing so I'm confident that the tests do pass.

brenns10 added 3 commits October 29, 2025 09:42

oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Oct 31, 2025

brenns10 added the allow-missing-latest label Oct 31, 2025

brenns10 added 4 commits October 31, 2025 16:34

debuginfo: Add "extractions" to debuginfo config

61355b0

This will support some of our internal customer debugging environments, by allowing us to extract debuginfo in directories relative to the core dumps that we are debugging. Signed-off-by: Stephen Brennan <[email protected]>

debuginfo: print a message when extracting

01c5cf1

Right now drgn, DRGN, & corelens just delay when extracting. We really should print a status message to let users know what is happening. Signed-off-by: Stephen Brennan <[email protected]>

brenns10 force-pushed the prep-2.2.x branch 4 times, most recently from bb11231 to c4ff3a5 Compare November 1, 2025 07:16

brenns10 added 9 commits November 4, 2025 08:49

tests: vectorinfo: skip for UEK4-5, and aarch64

2b309e4

The functions themselves raise appropriate errors, but we don't want the tests to fail on these vmcores. Signed-off-by: Stephen Brennan <[email protected]>

ci: stick to 3.12 for pre-commit checks

89ad174

Signed-off-by: Stephen Brennan <[email protected]>

ci: use drgn 0.0.33

d7dbc09

This ensures we have helpers with the latest fixes for the latest upstream kernels. Signed-off-by: Stephen Brennan <[email protected]>

meminfo, numastat: handle removed NR_WRITEBACK_TEMP in 6.17

5402efb

Signed-off-by: Stephen Brennan <[email protected]>

kvm: require kvm module loaded

237fcd0

The "kvm" corelens module should not run unless the kvm kernel module is loaded and debuginfo is present. Signed-off-by: Stephen Brennan <[email protected]>

tests: improve logs

5830841

When reading logs it's not always obvious which test run resulted in a failure. Log the full details of the test so that it is easier to detect. Signed-off-by: Stephen Brennan <[email protected]>

brenns10 force-pushed the prep-2.2.x branch from c4ff3a5 to 4c40a13 Compare November 4, 2025 17:01

Release v2.2.0

13ff4ce

Signed-off-by: Stephen Brennan <[email protected]>

brenns10 force-pushed the prep-2.2.x branch from 4c40a13 to 13ff4ce Compare November 4, 2025 17:02

brenns10 requested a review from biger410 November 4, 2025 17:29

biger410 approved these changes Nov 4, 2025

View reviewed changes

brenns10 merged commit 13ff4ce into oracle-samples:main Nov 4, 2025
2 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Prep 2.2.0 release #189

Prep 2.2.0 release #189

Uh oh!

brenns10 commented Oct 31, 2025 •

edited

Loading

Uh oh!

biger410 left a comment

Uh oh!

brenns10 commented Nov 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Prep 2.2.0 release #189

Prep 2.2.0 release #189

Uh oh!

Conversation

brenns10 commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

biger410 left a comment

Choose a reason for hiding this comment

Uh oh!

brenns10 commented Nov 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

brenns10 commented Oct 31, 2025 •

edited

Loading