Add support for C26 atomic reductions (without compiler mappings) by ThomasHaas · Pull Request #985 · hernanponcedeleon/Dat3M

ThomasHaas · 2026-02-12T14:51:11Z

I added a header c26.h with the new atomic reduction operations of C26.
~~I implemented all of them except min and max~~ min/max are supported now, but only the signed versions!
There is also some support for C litmus style versions. @hernan-poncedeleon added them and I don't know how well they work right now.

These atomics generate an RMW pair of events just like a standard fetch_op atomic, but add the Noreturn tag to both of them (naming follows LKMM's non-returning atomics).
There is no compilation scheme to hardware targets yet, so code has to be verified with --target=c11 (default).
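
To make the event shape concrete, here is a toy model of the description above. It is not Dat3M code: the class `ReductionEvents`, its `Event` type, and the string tags are hypothetical stand-ins chosen only for illustration. It shows a reduction producing the same load/store pair as a fetch_op, with the extra tag on both events.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

final class ReductionEvents {
    static final String NORETURN = "NORETURN";

    // Hypothetical stand-in for a program event: a kind plus a set of tags.
    static final class Event {
        final String kind;
        final Set<String> tags = new LinkedHashSet<>();

        Event(String kind) {
            this.kind = kind;
        }
    }

    // A returning fetch_op: one RMW load followed by one RMW store.
    static List<Event> fetchOp() {
        List<Event> events = new ArrayList<>();
        events.add(new Event("RMW_LOAD"));
        events.add(new Event("RMW_STORE"));
        return events;
    }

    // A reduction: the same pair, with NORETURN added to both events.
    static List<Event> reduction() {
        List<Event> events = fetchOp();
        for (Event e : events) {
            e.tags.add(NORETURN);
        }
        return events;
    }

    public static void main(String[] args) {
        for (Event e : reduction()) {
            System.out.println(e.kind + " " + e.tags);
        }
    }
}
```

A memory model can then distinguish the two cases purely by the tag, which is how LKMM treats its non-returning atomics.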

What needs to be done is to relax the memory models of interest: ~~right now atomic_op and atomic_fetch_op provide the same synchronization semantics.~~ EDIT: Although the memory models should probably be adapted, the fact that we currently model the load part of atomic_store_op as a plain load (not even relaxed) makes it weaker than an atomic_fetch_op in terms of ordering.

hernanponcedeleon · 2026-02-17T14:09:35Z

@graymalkin this branch should have everything you need to play around with the model

ThomasHaas · 2026-02-17T14:15:14Z

FYI, atomic_store_min/max are always the signed versions for now.
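
Why the signed/unsigned distinction matters can be seen on a single bit pattern. The sketch below is plain Java arithmetic, not the verifier's implementation; the class and method names are made up for the example.

```java
final class SignedMinSketch {
    // Signed minimum: the behaviour reported for atomic_store_min in this PR.
    static int signedMin(int a, int b) {
        return Math.min(a, b);
    }

    // Unsigned minimum over the same 32-bit patterns, shown only for contrast.
    static int unsignedMin(int a, int b) {
        return Integer.compareUnsigned(a, b) <= 0 ? a : b;
    }

    public static void main(String[] args) {
        int stored = -1;  // bit pattern 0xFFFFFFFF
        int operand = 1;
        System.out.println(signedMin(stored, operand));   // prints -1
        System.out.println(unsignedMin(stored, operand)); // prints 1
    }
}
```

A program that relies on unsigned min/max could therefore be verified against the wrong result as long as only the signed versions exist.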

graymalkin · 2026-02-17T14:16:42Z

Thanks, I'll check it out!

hernanponcedeleon · 2026-02-19T12:59:28Z

Code-wise I think this one is ready to merge. I will wait a few days to see if @graymalkin or @gonzalobg have comments about the memory model part (especially if it makes sense to mark the read part of the reduction as atomic) or @mmalcomson reports any issues when trying the code.

ThomasHaas · 2026-02-19T14:14:30Z

With #986 merged, we could in principle add compiler mappings for atomic reductions to armv8. At least the obvious ones like store_add(... RLX) -> STADD and store_add(... REL) -> STADDL. For SC, it would not be so clear.
I cannot imagine that any real C memory model would require the mapping to be stronger than that.
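
The two mappings named above could be written down roughly as follows. This is only a sketch of the proposal, not an existing compilation scheme in the tool; `StoreAddArmv8Sketch` and its `MemoryOrder` enum are hypothetical, and the SC case is deliberately left unmapped because the comment leaves it open.

```java
import java.util.Optional;

final class StoreAddArmv8Sketch {
    // Memory orders that a reduction can be given, per the discussion in this PR.
    enum MemoryOrder { RLX, REL, SC }

    // Returns the ARMv8 instruction proposed for store_add, if one was proposed.
    static Optional<String> instructionFor(MemoryOrder mo) {
        switch (mo) {
            case RLX:
                return Optional.of("STADD");
            case REL:
                return Optional.of("STADDL");
            default:
                return Optional.empty(); // SC: no mapping suggested yet
        }
    }

    public static void main(String[] args) {
        for (MemoryOrder mo : MemoryOrder.values()) {
            System.out.println(mo + " -> " + instructionFor(mo).orElse("(open)"));
        }
    }
}
```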

dartagnan/src/main/java/com/dat3m/dartagnan/program/event/Tag.java

hernanponcedeleon · 2026-02-26T07:42:39Z

dartagnan/src/main/java/com/dat3m/dartagnan/program/processing/compilation/VisitorC11.java

+        Local localOp = newLocal(dummyReg, expressions.makeIntBinary(dummyReg, e.getOperator(), e.getOperand()));
+        RMWStore store = newRMWStoreWithMo(load, address, dummyReg, Tag.C11.storeMO(mo));
+
+        load.addTags(C11.ATOMIC, Tag.C11.NORETURN); // Note that the load has no mo, but is still atomic!


For consistency with visitAtomicFetchOp I would rather use

Load load = newRMWLoadWithMo(dummyReg, address, Tag.C11.loadMO(mo));

and rather than getting the expected ordering guarantees "by chance" as it currently happens for rc11,
let the model explicitly state if NORETURN events should provide order or not.

It also feels strange to have an atomic event with no memory order.

ThomasHaas · 2026-02-26

I don't like these consistency arguments... those are different operations. Tag.C11.loadMO(mo) will just be RLX or SC because you cannot specify ACQ/ACQ_REL in the first place.
I think the only really sensible options are: the load has no mo, simply because it shouldn't exist in the first place, or the load has the same mo/tags as the store and the WMM removes the tags.
Anything in between seems arbitrary to me.
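
The claim that the load side can only end up as RLX or SC can be checked with a small model of the projection. The `loadMO` below is an assumed re-implementation for illustration (dropping the release half of an order), not a copy of Tag.C11.loadMO, and the enum is hypothetical.

```java
final class LoadMoSketch {
    enum Mo { RLX, ACQ, REL, ACQ_REL, SC }

    // Assumed load-side projection: a load cannot carry release semantics.
    static Mo loadMO(Mo mo) {
        switch (mo) {
            case ACQ_REL:
                return Mo.ACQ;
            case REL:
                return Mo.RLX;
            default:
                return mo;
        }
    }

    public static void main(String[] args) {
        // Orders a reduction can be called with (no ACQ / ACQ_REL).
        Mo[] allowed = { Mo.RLX, Mo.REL, Mo.SC };
        for (Mo mo : allowed) {
            System.out.println(mo + " -> " + loadMO(mo));
        }
    }
}
```

Under these assumptions a release reduction would get a relaxed load and an SC reduction an SC load, which is the "in between" outcome the comment objects to.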

hernanponcedeleon · 2026-02-26

The current solution of hardcoding the atomic tag seems equally arbitrary.

I guess what you are proposing is to completely get rid of Tag.C11.loadMO/storeMO and simply use the mo from the parsing. This would require the memory model to do some "cleanup" as lkmm does, but then we can get rid of these loadMO/storeMO as we already did for lkmm in #893.

ThomasHaas · 2026-02-26

> The current solution of hardcoding the atomic tag seems equally arbitrary.

The atomic tag is not arbitrary, because the whole operation is an atomic one, even by name atomic_store_XYZ.
And if you look at what our compiler does:

boolean canRace = mo == null || mo.value().equals(C11.NONATOMIC);
e.addTags(canRace ? C11.NONATOMIC : C11.ATOMIC);

then every event must be tagged either way, and NONATOMIC is certainly more wrong than ATOMIC.
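
The consequence of that rule for a load without a memory order can be sketched in isolation. The helper below restates the quoted condition on plain strings; `AtomicTagSketch` and `defaultTag` are hypothetical names, and the real code operates on event objects rather than strings.

```java
final class AtomicTagSketch {
    static final String ATOMIC = "ATOMIC";
    static final String NONATOMIC = "NONATOMIC";

    // Restates the quoted rule: no mo, or an explicit NONATOMIC mo, means the event can race.
    static String defaultTag(String mo) {
        boolean canRace = mo == null || mo.equals(NONATOMIC);
        return canRace ? NONATOMIC : ATOMIC;
    }

    public static void main(String[] args) {
        System.out.println(defaultTag(null));  // the mo-less load of a reduction
        System.out.println(defaultTag("RLX")); // a relaxed atomic access
    }
}
```

Since the load of a reduction carries no mo, the default would classify it as NONATOMIC, so the ATOMIC tag has to be added by hand to keep it out of data-race checks.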

> I guess what you are proposing is to completely get rid of Tag.C11.loadMO/storeMO and simply use the mo from the parsing. This would require the memory model to do some "cleanup" as lkmm does, but then we can get rid of these loadMO/storeMO as we already did for lkmm in #893.

I proposed exactly that in #984 or rather suggested it as one possible way to go forward. I think rc11.cat might already adhere to that. That being said, for now, I just took the most natural solution given the current hardcoded one:

- A load must be generated for data-flow modelling (no way around this).
- The load must be ignored in data races. Marking it as atomic is natural as it is part of an atomic operation independent of its memory ordering.
- The load should not provide any orderings -> both plain (no mo) and RLX seem reasonable. Plain is closer to capturing the idea of "the load should not exist" whereas RLX is closer to capturing the idea of "the load exists but it should not give orderings", which is (funnily enough) too much ordering :)

At the end of the day, I'm not the one who writes the C memory models and sets the expectation of what is assumed to happen implicitly and what is assumed to be done in the model.

hernanponcedeleon · 2026-03-24T10:11:54Z

dartagnan/src/main/java/com/dat3m/dartagnan/program/event/Tag.java

            };
        }
+
+        public static String intToMo(int i) {


Why do we need this back?

ThomasHaas · 2026-03-24

It is used in our Intrinsics to get the correct memory ordering for the new atomic reductions from our custom c26 header. I think once LLVM supports those instructions natively, we won't need this anymore.

EDIT: I could move the mapping code into Intrinsics if you prefer.
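
For readers unfamiliar with the helper, a decoder of this kind might look like the sketch below. It assumes the 0..5 integer encoding of memory orders used by GCC and Clang (relaxed = 0 through seq_cst = 5) and returns plain strings; the actual method in the PR may use different tag names, so treat both the class name and the return values as hypothetical.

```java
final class IntToMoSketch {
    // Decodes an integer memory-order argument, assuming the GCC/Clang 0..5 encoding.
    static String intToMo(int i) {
        switch (i) {
            case 0:
                return "relaxed";
            case 1:
                return "consume";
            case 2:
                return "acquire";
            case 3:
                return "release";
            case 4:
                return "acq_rel";
            case 5:
                return "seq_cst";
            default:
                throw new IllegalArgumentException("Unknown memory order: " + i);
        }
    }

    public static void main(String[] args) {
        for (int i = 0; i <= 5; i++) {
            System.out.println(i + " -> " + intToMo(i));
        }
    }
}
```

Such a decoder is needed because a header-defined function receives the memory order as an ordinary integer argument rather than as part of a native instruction.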

hernanponcedeleon · 2026-03-24

> I could move the mapping code into Intrinsics if you prefer.

That might be better. Also, please add a TODO so we remember to get rid of this once LLVM supports the instructions.

ThomasHaas · 2026-03-24

Once LLVM supports those instructions, we will get parser issues anyhow :). The code needs to change, so you cannot really forget it.

ThomasHaas mentioned this pull request Feb 12, 2026

Implement new C++26 atomic operations #984

Open

ThomasHaas force-pushed the atomic-modify-write branch from 15e1a9c to dc83233 Compare February 16, 2026 19:28

hernanponcedeleon force-pushed the atomic-modify-write branch from dc83233 to ad06c85 Compare February 17, 2026 14:05

ThomasHaas changed the title ~~[DRAFT] Add support for C26 atomic reductions~~ Add support for C26 atomic reductions (without compilation) Feb 18, 2026

ThomasHaas changed the title ~~Add support for C26 atomic reductions (without compilation)~~ Add support for C26 atomic reductions (without compiler mappings) Feb 18, 2026

hernanponcedeleon force-pushed the atomic-modify-write branch from f33d0a3 to 2cadfc2 Compare February 19, 2026 12:54

hernanponcedeleon reviewed Feb 25, 2026

dartagnan/src/main/java/com/dat3m/dartagnan/program/event/Tag.java (outdated)

hernanponcedeleon force-pushed the atomic-modify-write branch 2 times, most recently from 7be51db to 1ca78bc Compare February 26, 2026 07:35

hernanponcedeleon reviewed Feb 26, 2026

hernanponcedeleon mentioned this pull request Mar 3, 2026

add store_add tests gonzalobg/cpp_memory_model#8

Open

hernanponcedeleon force-pushed the atomic-modify-write branch from 1ca78bc to db502d6 Compare March 24, 2026 08:35

hernan-poncedeleon and others added 8 commits March 24, 2026 10:11

Prototype support for atomic modify-write operations

c987744

Add c26 header for atomic (reduction) ops

c372a09

Implemented first support for atomic reduction ops (only for C-code yet).

Rename C26 reduction operators

1212820

Add min/max operators to C

894bdfe

Make the loads of atomic_store_op compilation atomic, even though they have no memory order (~plain).

df1b922

Fix accidental compilation error

ba42cff

Noreturn -> NORETURN

4488f3b

Signed-off-by: Hernan Ponce de Leon <hernanl.leon@huawei.com>

Add back C11.intToMo and fix missing source code for atomic reductions

2a5fa5f

ThomasHaas force-pushed the atomic-modify-write branch from db502d6 to 2a5fa5f Compare March 24, 2026 09:12

hernanponcedeleon reviewed Mar 24, 2026

Inline C11.intToMo to its only call-site in Intrinsics

093ee12

Add comments in c26.h

4 participants
