feat: add Truthiness datatype for Monte Carlo comparisons #128

wdconinc · 2025-10-29T17:12:19Z

Briefly, what does this PR introduce?

This PR adds a datatype to record the "truthiness" (as mathematically defined...) for a reconstructed event; where truthiness is the "quality of seeming or being felt to be true, even if not necessarily true," or in this case also "the amount of confidently proclaiming the wrong thing to be true."

Mathematically, truthiness is a non-negative value that is zero only for perfectly reconstructed events (positive-definite), and is radially increasing in the error of the reconstruction (greater error leads to greater truthiness).

It is possible to define truthiness in multiple ways, but we will typically use some combination of the following components:

a χ² measure on associated reconstructed and generated particles, with normalization given by the determined uncertainty in the reconstruction (if available) or 1 GeV otherwise,
a positive penalty term for discrete reconstruction errors, such as PID mis-identification (where weighting can be used to penalize some mis-identification more than others),
a positive penalty term for generated particles that should have been reconstructed, but weren't,
a positive penalty term for reconstructed particles that were not part of the original event record.

There are non-reconstruction reasons why the truthiness will be non-zero in realistic scenarios:

multiple-scattering effects will cause the event to lose momentum starting from the true value, deviating both in direction and magnitude in a consistent direction,
secondary particles will be generated in materials or along bent trajectories, leading to additional reconstructed particles corresponding to e.g. hard bremsstrahlung gammas in the electromagnetic calorimeters,
primary particles (in particular are low energies) may be absorbed in support structures, leading to their absence in the reconstructed event.

Nevertheless, the decrease of the overall average event truthiness for the same geometry and input hit collections is intended to indicate an improved reconstruction, and converse.

What kind of change does this PR introduce?

Bug fix (issue #__)
New feature (issue: store truthiness for event reconstruction)
Documentation update
Other: __

Please check if this PR fulfills the following:

Tests for the changes have been added
Documentation has been added / updated
Changes have been communicated to collaborators

Does this PR introduce breaking changes? What changes might users need to make to their code?

No.

Does this PR change default behavior?

No.

wdconinc · 2025-10-29T17:20:10Z

edm4eic.yaml

+       - edm4eic::MCRecoParticleAssociation associations          // Reference to the associated reconstructed particles
+       - edm4hep::MCParticle unassociated_mc_particles            // Reference to the unassociated MC particles
+       - edm4eic::ReconstructedParticle unassociated_rc_particles // Reference to the unassociated reconstructed particles
+


This definition of the truthiness data type excludes vertex terms. I don't think at this point we are ready to compare generated and reconstructed vertices, and they are not easily accessible through individual associations. It may be possible to have some adhoc relation to a MCParticle mean the vertex where that particle was generated. Still, I think that is a harder problem than this first attempt. One thing to keep in mind in the vertexing problem is that it is hard to define what a missing reconstructed vertex should be since some vertices are going to be so close together as to be effectively unresolvable.

ruse-traveler

Interesting! I'm really intrigued by this! I'm curious about how this type would get used in practice, so for my own understanding: the vector members should be 1-to-1 with the relations in the associations field, correct?

wdconinc · 2025-11-04T15:39:13Z

Interesting! I'm really intrigued by this! I'm curious about how this type would get used in practice, so for my own understanding: the vector members should be 1-to-1 with the relations in the associations field, correct?

It's not intended for analyzers but for reconstruction development (so absolutely not for selecting for your analysis only those events that are close to the truth). But I would imagine we can use this to select events that are particularly poorly reconstructed and look at what went wrong, or compare before and after PRs to make sure we don't make things worse.

ruse-traveler · 2025-11-06T18:55:58Z

It's not intended for analyzers but for reconstruction development (so absolutely not for selecting for your analysis only those events that are close to the truth). But I would imagine we can use this to select events that are particularly poorly reconstructed and look at what went wrong, or compare before and after PRs to make sure we don't make things worse.

Gotcha! Makes sense! This could be really useful as a high level summary of the impact of a change...

ruse-traveler · 2025-11-06T19:03:31Z

Then following up on the vectors: I partly ask because I wonder if it would make sense to define a Truthiness component just to make the interface a little easier...

For example, there's an overall, energy, momentum, and PID truthiness at both the event and association level. So we could define a component sort of like:

edm4eic::TruthinessContribution:
  Members:
    - float overall
    - float pid
    - float energy
    - float momentum

So that:

edm4eic::Truthiness:
  <... description ...>
  Members:
    - edm4eic::TruthinessContribution eventContribution
    - float unassociatedMCParticleContribution
    - float unassociatedRecoParticleContribution
  VectorMembers:
    - edm4eic::TruthinessContribution associationContributions
  <... relations ...>
  ```

ruse-traveler

Only comments I have so far are the component-idea for consideration above, and some suggestions for naming consistency between this and other types' fields below!

edm4eic.yaml

ruse-traveler

🙌 Thanks! I think this looks good!

wdconinc requested a review from a team as a code owner October 29, 2025 17:12

wdconinc commented Oct 29, 2025

View reviewed changes

wdconinc mentioned this pull request Oct 29, 2025

feat: add Truthiness algorithm for event quality assessment eic/EICrecon#2163

Open

8 tasks

ruse-traveler reviewed Nov 4, 2025

View reviewed changes

ruse-traveler reviewed Nov 6, 2025

View reviewed changes

edm4eic.yaml Outdated Show resolved Hide resolved

edm4eic.yaml Outdated Show resolved Hide resolved

edm4eic.yaml Outdated Show resolved Hide resolved

wdconinc force-pushed the truthiness branch from 32f7c7e to 0f0a6f0 Compare November 6, 2025 21:01

wdconinc requested a review from ruse-traveler November 6, 2025 21:34

ruse-traveler approved these changes Nov 6, 2025

View reviewed changes

wdconinc added 2 commits November 7, 2025 14:28

feat: add Truthiness datatype for Monte Carlo comparisons

4dcdf4c

CamelCase 🐪🐫🐫🐫🐫🐫🐫🐫 (starting with dromedary)

b397bf9

wdconinc force-pushed the truthiness branch from 9d5bae0 to b397bf9 Compare November 7, 2025 20:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add Truthiness datatype for Monte Carlo comparisons #128

feat: add Truthiness datatype for Monte Carlo comparisons #128

Uh oh!

wdconinc commented Oct 29, 2025

Uh oh!

wdconinc Oct 29, 2025

Uh oh!

ruse-traveler left a comment

Uh oh!

wdconinc commented Nov 4, 2025

Uh oh!

ruse-traveler commented Nov 6, 2025

Uh oh!

ruse-traveler commented Nov 6, 2025

Uh oh!

ruse-traveler left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ruse-traveler left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: add Truthiness datatype for Monte Carlo comparisons #128

Are you sure you want to change the base?

feat: add Truthiness datatype for Monte Carlo comparisons #128

Uh oh!

Conversation

wdconinc commented Oct 29, 2025

Briefly, what does this PR introduce?

What kind of change does this PR introduce?

Please check if this PR fulfills the following:

Does this PR introduce breaking changes? What changes might users need to make to their code?

Does this PR change default behavior?

Uh oh!

wdconinc Oct 29, 2025

Choose a reason for hiding this comment

Uh oh!

ruse-traveler left a comment

Choose a reason for hiding this comment

Uh oh!

wdconinc commented Nov 4, 2025

Uh oh!

ruse-traveler commented Nov 6, 2025

Uh oh!

ruse-traveler commented Nov 6, 2025

Uh oh!

ruse-traveler left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ruse-traveler left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants