feat: Add public utility for per-sample gradient validation (#484) by chidoziemanagwu · Pull Request #810 · meta-pytorch/opacus

chidoziemanagwu · 2026-03-18T17:13:05Z

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Docs change / refactoring / dependency upgrade

Motivation and Context / Related issue

Fixes #484: Utility to test correctness of per sample gradients

Context:
Currently, users building custom privacy models or extending Opacus with custom grad_samplers have no public-facing mechanism to verify that Opacus computes their per-sample gradients mathematically correctly. The internal validation method check_per_sample_gradients_are_correct swallows all diagnostic data and only returns a boolean flag, making it impossible for engineers to debug why their Layer gradient failed.

The Approach:
I extracted the core comparison logic between Opacus's reference micro-batch computation and the optimized hook computation into a newly exported public API: get_per_sample_gradient_diagnostics.

Rich Diagnostics: This function returns strict mathematical assertions (L1 Loss, Mean Squared Error, L2 Norms, and Shape Mismatches) for every individually named trainable parameter processed by Opacus across both mean and sum loss reduction paradigms.

# Example output dictionary per parameter
'weight': {
    'passed': True,
    'shape_match': True,
    'opacus_shape': (4, 5, 10),
    'microbatch_shape': (4, 5, 10),
    'opacus_l2_norm': 2.45,
    'microbatch_l2_norm': 2.45,
    'mse': 1.2e-15,
    'l1_loss': 3.4e-8
}

Backward Compatibility: I deliberately left the original internal assertions intact to ensure zero testing breakages for upstream Opacus developers. The new APIs are properly documented and exported via opacus/utils/__init__.py.

Community Impact (Scalability & Architecture):
By exposing this tool, external researchers and security engineers can independently verify their custom mechanisms against the Opacus framework. This lowers the technical barrier to entry for developing novel Differential Privacy architectures, directly shifting the validation and debugging burden away from project maintainers.

How Has This Been Tested (if it applies)

I expanded the opacus.tests.per_sample_gradients_utils_test.py suite. Alongside Conv1d and Linear tests, I introduced coverage for LayerNorm arrays, validated the exact structured dictionary returned by the new diagnostic tool, and tested the public import routing. I also updated the README.md to reflect these public enhancements.

Usage Example:

import torch
import torch.nn as nn
from opacus.utils import get_per_sample_gradient_diagnostics

model = nn.Linear(10, 5)
x = torch.randn(4, 10)

report = get_per_sample_gradient_diagnostics(x, model)
if report["passed"]:
    print("All per-sample gradients are mathematically correct.")
else:
    for name, p in report["reductions"]["mean"]["parameters"].items():
         if not p["passed"]:
              print(f"FAILED LAYER: {name} (MSE: {p['mse']:.2e}, Shape Match: {p['shape_match']})")

Test Execution & Visual Proof (Terminal Log):

$ python -m unittest opacus.tests.per_sample_gradients_utils_test
UserWarning: Full backward hook is firing when gradients are computed with respect to module outputs since no inputs require gradients.
  loss.backward()
......
----------------------------------------------------------------------
Ran 6 tests in 11.077s

OK

Checklist

The documentation is up-to-date with the changes I made.
I have read the CONTRIBUTING document and completed the CLA (see CONTRIBUTING).
All tests passed, and additional code has been covered with new tests.

…orch#484)

meta-cla · 2026-03-18T17:13:12Z

Hi @chidoziemanagwu!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@meta.com. Thanks!

meta-cla · 2026-03-18T19:06:39Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

chidoziemanagwu · 2026-03-20T09:59:11Z

hi @iden-kalemaj @alexandresablayrolles the CLA has been signed and all import checks are passing. This PR adds a public diagnostic utility for per-sample gradient verification (Issue #484). Would you be able to approve the workflows and review when you have a moment? Thank you.

meta-codesync · 2026-03-25T16:54:07Z

@facebook-github-bot has imported this pull request. If you are a Meta employee, you can view this in D98158224. (Because this pull request was imported automatically, there will not be any future comments.)

chidoziemanagwu · 2026-04-19T10:25:35Z

Hi @HuanyuZhang, I noticed this PR was imported into Meta's internal system (D98158224) and assigned a few weeks ago.

Just checking in to see if there is any feedback from the internal review or if there are any additional changes needed on my end to move this toward a merge. This utility will be a great help for anyone building custom layers in Opacus!

Thanks for your time.

HuanyuZhang · 2026-05-20T16:31:07Z

Thx @chidoziemanagwu for the reminder. Just left some comments.

HuanyuZhang · 2026-05-20T16:32:56Z

+
+__all__ = [
+    "check_per_sample_gradients_are_correct",
+    "get_per_sample_gradient_diagnostics",


Any reason why we need to expose check_per_sample_gradients_are_correct? I thought get_per_sample_gradient_diagnostics should have contained all the information needed.

HuanyuZhang · 2026-05-20T16:34:25Z

+```
+
+The simpler `check_per_sample_gradients_are_correct` function is also available
+if you only need a boolean pass/fail result.


Let us hide check_per_sample_gradients_are_correct for simplicity.

HuanyuZhang · 2026-05-20T16:35:35Z

+        report = get_per_sample_gradient_diagnostics(x, model)
+        self.assertTrue(report["passed"])
+
+    def test_public_import_path(self):


Any chance we could add a test diagnosing mismatched gradients (i.e., assertFalse rather than assertTrue)?

sure we can

…gradients_are_correct - Remove check_per_sample_gradients_are_correct from public opacus.utils API - Drop README mention of the boolean helper for simplicity - Add diagnostics test exercising the mismatched-gradient (failing) path Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…gradients_are_correct - Remove check_per_sample_gradients_are_correct from public opacus.utils API - Drop README mention of the boolean helper for simplicity - Add diagnostics test exercising the mismatched-gradient (failing) path

chidoziemanagwu · 2026-05-22T21:11:26Z

Hello @HuanyuZhang I made an update pls review :)

feat: Add public utility for per-sample gradient validation (meta-pyt…

212bf22

…orch#484)

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 18, 2026

style: apply black formatting

6476e8f

HuanyuZhang self-assigned this Mar 26, 2026

HuanyuZhang reviewed May 20, 2026

View reviewed changes

chidoziemanagwu force-pushed the feature/issue-484-per-sample-gradient-diagnostics branch from fcaf563 to 69253de Compare May 22, 2026 21:06

chidoziemanagwu requested a review from HuanyuZhang May 22, 2026 21:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add public utility for per-sample gradient validation (#484)#810

feat: Add public utility for per-sample gradient validation (#484)#810
chidoziemanagwu wants to merge 3 commits into
meta-pytorch:mainfrom
chidoziemanagwu:feature/issue-484-per-sample-gradient-diagnostics

chidoziemanagwu commented Mar 18, 2026

Uh oh!

meta-cla Bot commented Mar 18, 2026

Uh oh!

meta-cla Bot commented Mar 18, 2026

Uh oh!

chidoziemanagwu commented Mar 20, 2026

Uh oh!

meta-codesync Bot commented Mar 25, 2026

Uh oh!

chidoziemanagwu commented Apr 19, 2026

Uh oh!

HuanyuZhang commented May 20, 2026

Uh oh!

HuanyuZhang May 20, 2026

Uh oh!

HuanyuZhang May 20, 2026

Uh oh!

HuanyuZhang May 20, 2026

Uh oh!

chidoziemanagwu May 21, 2026

Uh oh!

chidoziemanagwu commented May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

chidoziemanagwu commented Mar 18, 2026

Types of changes

Motivation and Context / Related issue

How Has This Been Tested (if it applies)

Checklist

Uh oh!

meta-cla Bot commented Mar 18, 2026

Action Required

Process

Uh oh!

meta-cla Bot commented Mar 18, 2026

Uh oh!

chidoziemanagwu commented Mar 20, 2026

Uh oh!

meta-codesync Bot commented Mar 25, 2026

Uh oh!

chidoziemanagwu commented Apr 19, 2026

Uh oh!

HuanyuZhang commented May 20, 2026

Uh oh!

HuanyuZhang May 20, 2026

Choose a reason for hiding this comment

Uh oh!

HuanyuZhang May 20, 2026

Choose a reason for hiding this comment

Uh oh!

HuanyuZhang May 20, 2026

Choose a reason for hiding this comment

Uh oh!

chidoziemanagwu May 21, 2026

Choose a reason for hiding this comment

Uh oh!

chidoziemanagwu commented May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants