One data point: locally, with OpenMPI 5, the test ran fine on one GPU. There was a discussion elsewhere (maybe @maleadt remembers) about whether that flag is still needed, and which MPI versions can now handle the new memory interface.