Make `__cpuid_count()` a pure function #1943

bal-e · 2025-10-22T11:30:17Z

Crates like raw_cpuid use core::arch::x86_64::__cpuid_count() to determine x86 CPU information. It's great that core provides such a function, so that users don't have to write inline assembly everywhere; but core's implementation does not use the asm! options pure and nomem. This means that calls to __cpuid_count() can't be elided or deduplicated. I'm writing some target-feature enhancement code (akin to multiversion), and I'd like to rely on CPUID getting optimized away appropriately.

While CPUID is a serializing instruction, that's not the primary use case for it. There are several possible approaches to separating the primary use case (where it can be treated as a pure function) from secondary use cases (where it needs to be impure):

1. Make __cpuid_count() pure and require inline assembly for secondary use cases. (implemented in this PR)

   Secondary use cases are IMO quite rare, and their users probably don't mind writing inline assembly manually in order to control LLVM thoroughly. But this might be considered a breaking change.

2. Make __cpuid_count() pure and introduce __impure_cpuid_count() for secondary use cases.

   This would simplify updating secondary use cases, but might still be considered a breaking change. It would also require replicating __cpuid() and __get_cpuid_max().

3. Leave __cpuid_count() as-is and introduce __pure_cpuid_count().

   This would not be a breaking change; however, I find it unfortunate that the primary use case for this function would be relegated to a more inconvenient function name. Once an approach is stabilized, it would be harder to transition to an (IMO) ideal world where __cpuid_count() is pure.

I think approach 1 is ideal, but it's a (minor?) breaking change, and I'll leave that judgement to the reviewer.

'__cpuid_count()' is implemented using inline assembly, because LLVM doesn't have an intrinsic for it. It's a pure operation, but this wasn't marked in the 'asm!' invocation; so calls to it couldn't be elided or deduplicated. This change makes it pure. CPUID does have _some_ less-than-pure effects -- e.g. it can be used as a serializing instruction (like a strong memory fence). Users who want to rely on that could use inline assembly themselves instead.
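For readers who want to see what "marking it pure" could look like, here is a minimal sketch for x86_64 only. The name pure_cpuid_count is hypothetical and is not part of core; the register shuffling around rbx is there because inline assembly cannot name rbx as an operand on x86_64. The options(pure, nomem, ...) line is the part this PR is about. The sketch assumes the caller only queries leaves whose results never change, which the rest of this thread shows is not true for every leaf.

```rust
#[cfg(target_arch = "x86_64")]
use core::arch::asm;

/// Hypothetical helper (not part of `core`): CPUID declared `pure` + `nomem`,
/// so the compiler may merge or drop calls with identical inputs.
/// Returns [eax, ebx, ecx, edx].
#[cfg(target_arch = "x86_64")]
fn pure_cpuid_count(leaf: u32, sub_leaf: u32) -> [u32; 4] {
    let eax: u32;
    let ebx: u32;
    let ecx: u32;
    let edx: u32;
    // SAFETY (assumption): the queried leaf behaves like a function of its
    // inputs. Leaves reporting per-core or OS-controlled state violate this.
    unsafe {
        asm!(
            // rbx cannot be used as an operand, so save it in a scratch
            // register, run CPUID, then swap the result out and restore rbx.
            "mov {tmp:r}, rbx",
            "cpuid",
            "xchg {tmp:r}, rbx",
            tmp = out(reg) ebx,
            inout("eax") leaf => eax,
            inout("ecx") sub_leaf => ecx,
            out("edx") edx,
            options(pure, nomem, nostack, preserves_flags),
        );
    }
    [eax, ebx, ecx, edx]
}
```

With these options, two calls such as pure_cpuid_count(0, 0) in the same function are candidates for deduplication; without pure and nomem, each call has to be emitted.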

rustbot · 2025-10-22T11:30:22Z

r? @folkertdev

rustbot has assigned @folkertdev.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

folkertdev

cc @sayantn @Amanieu, I think you just know more about this.

I was also briefly confused why this function is unsafe, but #1935 already attempts to fix that.

bjorn3 · 2025-10-22T12:10:41Z

Cpuid isn't always pure. For example it can be used to query the Local APIC ID, which changes when a thread gets rescheduled to another core. Not sure if you can query that from ring 3 though.
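As an illustration of this point, and only a sketch: CPUID leaf 1 reports the initial APIC ID of the executing logical processor in bits 31..24 of EBX. The helper names below are made up for this example. If the thread migrates between two calls, the second call can legitimately return a different value, which is exactly what a pure annotation would not allow.

```rust
/// Extracts the initial APIC ID field (bits 31..24) from leaf 1's EBX value.
fn initial_apic_id_from_ebx(ebx: u32) -> u8 {
    (ebx >> 24) as u8
}

/// Hypothetical helper: reads the initial APIC ID of whichever logical
/// processor happens to execute the instruction. The result depends on
/// where the OS scheduled the thread, not only on the inputs.
#[cfg(target_arch = "x86_64")]
#[allow(unused_unsafe)] // `__cpuid` is `unsafe` on some toolchains, safe on others
fn current_initial_apic_id() -> u8 {
    let leaf1 = unsafe { core::arch::x86_64::__cpuid(1) };
    initial_apic_id_from_ebx(leaf1.ebx)
}
```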

bal-e · 2025-10-22T14:46:42Z

If CPUID is not always pure, should we look for a way to accommodate pure use cases? Or should users looking for pure information (which is most of them AFAIK) write inline assembly to inform LLVM of that property? This is at least possible with raw_cpuid because it provides an abstraction over the raw CPUID function.

thomcc · 2025-10-24T01:57:47Z

I said this elsewhere, but repeating it here for visibility:

It's possible people wrote code assuming this was a serializing instruction, so changing it could be considered a breaking change. Or at least a change that could break code. Also, it's worth noting, CPUID is very expensive (partially because it is serializing). You probably should not be using it directly in your macro expansion, regardless of whether the compiler can elide repeated calls.

bal-e · 2025-10-24T08:18:28Z

@thomcc I completely understand that this might be seen as a breaking change, which is why I enumerated the other options. I noticed this issue when looking at assembly outputs for small code examples, where CPUID could have been elided but wasn't. I agree with your advice: I'm aware of the performance impact and am not planning to call it very often.

Amanieu · 2025-10-24T15:37:37Z

This conflicts with #1935: we can't have this be both safe and pure because if the same cpuid call (with the same inputs) ever returns different values then it would result in undefined behavior.

hanna-kruppe · 2025-10-24T15:54:58Z

If this were marked as pure and unsafe, what are the conditions that callers can/must uphold to avoid running into UB from it not actually being pure? I can't think of anything other than "don't call this with inputs for which it's not a pure function", but that seems like a breaking change if there are any such inputs.

tgross35 · 2025-10-24T17:36:30Z

I feel like having a safe cpuid that doesn't change its current behavior is easier to reason about for more users than unsafe and pure. Most cases call cpuid sparingly anyway; making users think about whether the values they read are static or OS-controlled doesn't really seem worth the optimization.

Couldn't the application discussed in the top post cache the values? Rather than relying on optimization to possibly elide calls.
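A rough sketch of what such caching might look like, using std::sync::OnceLock. The function names are invented for this example, and it assumes leaf 0 (max basic leaf plus vendor string) is one of the static leaves. On targets other than x86_64 the query is stubbed out so the snippet still compiles.

```rust
use std::sync::OnceLock;

/// Queries CPUID leaf 0 once; returns [eax, ebx, ecx, edx].
#[cfg(target_arch = "x86_64")]
#[allow(unused_unsafe)] // `__cpuid` is `unsafe` on some toolchains, safe on others
fn query_leaf0() -> [u32; 4] {
    let r = unsafe { core::arch::x86_64::__cpuid(0) };
    [r.eax, r.ebx, r.ecx, r.edx]
}

/// Stub for non-x86_64 targets so the example builds everywhere.
#[cfg(not(target_arch = "x86_64"))]
fn query_leaf0() -> [u32; 4] {
    [0; 4]
}

/// Hypothetical cached accessor: the CPUID instruction runs at most once
/// per process; later calls only read the stored value.
fn cached_leaf0() -> &'static [u32; 4] {
    static CACHE: OnceLock<[u32; 4]> = OnceLock::new();
    CACHE.get_or_init(query_leaf0)
}
```

This keeps the intrinsic's current semantics untouched and does not depend on the optimizer: repeated calls to cached_leaf0() cost an atomic load rather than a serializing instruction.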

bal-e · 2025-10-25T20:33:59Z

> I feel like having a safe cpuid that doesn't change its current behavior is easier to reason about for more users than unsafe and pure.

That's a good argument. And you're right that this isn't a major optimization in any way; I had put up this PR because this seemed like a small oversight in the intrinsics set. I was wrong -- the intrinsics have all been pretty ironclad :)

I'll probably close the PR once the discussion dries up.

rustbot assigned folkertdev Oct 22, 2025

folkertdev reviewed Oct 22, 2025
