[RF] Disable redundant dirty-flag propagation during minimization by guitargeek · Pull Request #21343 · root-project/root

guitargeek · 2026-02-20T21:46:17Z

When a likelihood is evaluated with the new "cpu" backend, the RooFit::Evaluator fully manages dependency tracking and re-evaluation of the computation graph. In this case, RooFit’s built-in dirty flag propagation in RooAbsArg becomes redundant and introduces significant overhead for large models.

This patch disables regular dirty state propagation for all non-fundamental nodes in the Evaluator's computation graph by setting their OperMode to RooAbsArg::ADirty. Fundamental nodes (e.g. RooRealVar, RooCategory) are excluded because they are often shared with other computation graphs outside the Evaluator (usually the original pdf in the RooWorkspace).

To set the OperMode of all RooAbsArgs to ADirty during minimization, while avoiding side effects outside the minimization scope, the dirty flag propagation for the fundamental nodes is only disabled temporarily in the RooMinimizer.

This commit drastically speeds up fits with AD in particular (up to 2 x for large models), because with fast gradients, the dirty flag propagation that determines which part of the compute graph needs to be recomputed becomes the bottleneck. It was also redundant with a faster "dirty state" bookkeeping mechanism in the RooFit::Evaluator class itself.

At this point, there is no performance regression anymore when disabling recursive dirty flag propagation for all evaluated nodes, so the old comment in the code about test 14 in stressRooFit being slow doesn't apply anymore.

See also slide 12 and 13 on my RooFit AD ROOT users workshop talk for the flamegraphs that show how significant the RooFit bookkeeping was for minimizations with AD gradients.

github-actions · 2026-02-21T01:19:54Z

Test Results

22 files 22 suites 3d 11h 22m 14s ⏱️
3 845 tests 3 840 ✅ 1 💤 4 ❌
76 827 runs 76 688 ✅ 135 💤 4 ❌

For more details on these failures, see this check.

Results for commit 53aec08.

♻️ This comment has been updated with latest results.

When a likelihood is evaluated with the new `"cpu"` backend, the `RooFit::Evaluator` fully manages dependency tracking and re-evaluation of the computation graph. In this case, RooFit’s built-in dirty flag propagation in RooAbsArg becomes redundant and introduces significant overhead for large models. This patch disables regular dirty state propagation for all non-fundamental nodes in the Evaluator's computation graph by setting their OperMode to `RooAbsArg::ADirty`. Fundamental nodes (e.g. RooRealVar, RooCategory) are excluded because they are often shared with other computation graphs outside the Evaluator (usually the original pdf in the RooWorkspace). To set the OperMode of *all* RooAbsArgs to `ADirty` during minimization, while avoiding side effects outside the minimization scope, the dirty flag propagation for the fundamental nodes is only disabled temporarily in the RooMinimizer. This commit drastically speeds up fits with AD in particular (up to 2 x for large models), because with fast gradients, the dirty flag propagation that determines which part of the compute graph needs to be recomputed becomes the bottleneck. It was also redundant with a faster "dirty state" bookkeeping mechanism in the `RooFit::Evaluator` class itself. At this point, there is no performance regression anymore when disabling recursive dirty flag propagation for all evaluated nodes, so the old comment in the code about test 14 in stressRooFit being slow doesn't apply anymore.

Several places needed to record a set of operation-mode changes and restore them later as a group, so it's better to have the ChangeOperModeRAII act on groups of RooAbsArg to not have to create one RAII object per arg.

lmoneta

Thank you Jonas for implementing this significant improvement, speeding up performances!

guitargeek self-assigned this Feb 20, 2026

guitargeek added in:RooFit improvement labels Feb 20, 2026

guitargeek force-pushed the ADirty branch from 8eeeb9a to a4e139a Compare February 20, 2026 21:47

guitargeek changed the title ~~[RF] Set OperMode::ADirty for all RooAbsArgs in RooFit::Evaluatur~~ [RF] Set OperMode::ADirty for all RooAbsArgs in RooFit::Evaluator Feb 20, 2026

guitargeek force-pushed the ADirty branch from a4e139a to 63e7741 Compare February 21, 2026 04:22

guitargeek changed the title ~~[RF] Set OperMode::ADirty for all RooAbsArgs in RooFit::Evaluator~~ [RF] Disable redundant dirty-flag propagation during minimization Feb 21, 2026

guitargeek force-pushed the ADirty branch from 63e7741 to f6c7b98 Compare February 21, 2026 14:38

vgvassilev reviewed Feb 21, 2026

View reviewed changes

Comment thread roofit/roofitcore/inc/RooAbsPdf.h Outdated

guitargeek force-pushed the ADirty branch 2 times, most recently from e386697 to 111908f Compare February 21, 2026 16:43

vgvassilev reviewed Feb 21, 2026

View reviewed changes

Comment thread roofit/batchcompute/res/RooNaNPacker.h Outdated

vgvassilev reviewed Feb 21, 2026

View reviewed changes

Comment thread roofit/roofitcore/res/RooFitImplHelpers.h Outdated

vgvassilev reviewed Feb 21, 2026

View reviewed changes

Comment thread roofit/roofitcore/src/RooFit/Evaluator.cxx Outdated

guitargeek force-pushed the ADirty branch from 111908f to 021d722 Compare February 22, 2026 14:41

guitargeek requested a review from couet as a code owner February 22, 2026 14:41

guitargeek commented Feb 22, 2026

View reviewed changes

Comment thread tutorials/roofit/roofit/rf617_simulation_based_inference_multidimensional.py Outdated

guitargeek force-pushed the ADirty branch from 021d722 to c20df61 Compare February 25, 2026 10:12

guitargeek mentioned this pull request Feb 26, 2026

[RF] Don't do dirty state propagation if more than 2 params changed #20044

Closed

guitargeek force-pushed the ADirty branch 2 times, most recently from 6550887 to a2192f4 Compare March 16, 2026 12:13

This was referenced Mar 16, 2026

[RF] RooFFTConvPdf: cache norm. val to avoid bookkeeping during scan #21615

Merged

[RF] Refactor coefficient updating for RooAddPdf and RooAddModel #21616

Merged

guitargeek force-pushed the ADirty branch 3 times, most recently from 4858a3e to 45b179f Compare March 17, 2026 08:37

couet removed their request for review March 27, 2026 16:35

guitargeek force-pushed the ADirty branch 2 times, most recently from 611fce2 to 187e8e3 Compare April 22, 2026 22:10

guitargeek added 2 commits April 23, 2026 00:18

[RF] Refactor ChangeOperModeRAII to work with groups of RooAbsArgs

53aec08

Several places needed to record a set of operation-mode changes and restore them later as a group, so it's better to have the ChangeOperModeRAII act on groups of RooAbsArg to not have to create one RAII object per arg.

guitargeek force-pushed the ADirty branch from 187e8e3 to 53aec08 Compare April 22, 2026 22:19

lmoneta approved these changes Apr 23, 2026

View reviewed changes

guitargeek merged commit d42a27c into root-project:master Apr 23, 2026
51 of 53 checks passed

guitargeek deleted the ADirty branch April 23, 2026 14:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RF] Disable redundant dirty-flag propagation during minimization#21343

[RF] Disable redundant dirty-flag propagation during minimization#21343
guitargeek merged 2 commits intoroot-project:masterfrom
guitargeek:ADirty

guitargeek commented Feb 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Feb 21, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lmoneta left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

guitargeek commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Results

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lmoneta left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

guitargeek commented Feb 20, 2026 •

edited

Loading

github-actions Bot commented Feb 21, 2026 •

edited

Loading