
test(autogram): Add extra test for batched equivalence #445

Closed
ValerianRey wants to merge 3 commits into main from add-batched-equivalence-test

Conversation

ValerianRey (Contributor) commented Oct 2, 2025

This test can be quite practical for checking that both engines give the same results. For instance, when working on Transformer, autogram is not yet equivalent to autograd_gramian, but the two engine configurations (autogram with batch_dim=None and autogram with batch_dim=0) are equivalent to each other.

It seems that this test does not pass for FreeParam. This is odd, considering that both configurations should be equivalent to autograd_gramian, so we may have a bug here. It could also be caused by having two engines, but even then it would be unexpected behavior.

@ValerianRey ValerianRey added cc: test Conventional commit type for changes to tests. package: autogram labels Oct 2, 2025
@ValerianRey ValerianRey self-assigned this Oct 2, 2025
@ValerianRey ValerianRey changed the title test(autogram): Add extra test for batched equivalance test(autogram): Add extra test for batched equivalence Oct 2, 2025
@ValerianRey (Contributor)
I just investigated: having a second engine makes the result for engine_none different if and only if engine_0.compute_gramian is called before engine_none.compute_gramian.

ValerianRey (Contributor) commented Oct 2, 2025

The problem can be reproduced even without having two engines.

engine_none = Engine(model.modules(), batch_dim=None)

inputs = make_tensors(batch_size, input_shapes)
targets = make_tensors(batch_size, output_shapes)
loss_fn = make_mse_loss_fn(targets)

# Call the model's forward pass twice, to make the hook run twice instead of once
torch.random.manual_seed(0)  # Fix randomness for random models
output = model(inputs)
losses = reduce_to_vector(loss_fn(output))
torch.random.manual_seed(0)  # Fix randomness for random models
output = model(inputs)
losses = reduce_to_vector(loss_fn(output))

gramian_none = engine_none.compute_gramian(losses)

Having two engines (where one does an extra forward pass first) is just a convoluted way of calling the model's forward pass twice, which triggers the same bug.
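For readers unfamiliar with this failure mode, here is a minimal sketch (plain Python, with a hypothetical TinyEngine class standing in for the autogram engine; not the actual API) of how a stateful forward hook that runs on every forward pass accumulates stale state:

```python
class TinyEngine:
    """Stand-in for a hook-based engine; `records` is state mutated by every hook call."""

    def __init__(self):
        self.records = []  # accumulated across ALL forward passes

    def hook(self, activation):
        # Runs on every forward pass of the model, whether or not we intend
        # to use that pass for the Gramian computation.
        self.records.append(activation)

    def compute(self):
        # Pretends to reduce the recorded activations into one result;
        # it silently includes contributions from earlier forward passes.
        return sum(self.records)


engine = TinyEngine()
engine.hook(1.0)  # first forward pass
engine.hook(1.0)  # second forward pass re-runs the hook on the same state
print(engine.compute())  # 2.0, not the 1.0 a single forward pass would give
```

The second call doubles the accumulated state, which mirrors how calling the model's forward pass twice here changes the computed Gramian.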

@ValerianRey (Contributor)

A solution is to reset the state of the engine between the two forward passes. We don't want the first hook call to have any effect on the state of the engine.

Should we add a public reset_state method to the engine? Or at least a private one that we could use in our tests involving multiple engines?
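As a sketch of what such a reset could look like (hypothetical names, using the same TinyEngine stand-in rather than the actual autogram engine), assuming the engine keeps its per-forward accumulation in a list:

```python
class TinyEngine:
    """Stand-in for the autogram engine; `records` is the hook-mutated state."""

    def __init__(self):
        self.records = []

    def hook(self, activation):
        self.records.append(activation)

    def _reset_state(self):
        # Discard everything accumulated by earlier forward passes, so the
        # next forward pass starts from a clean slate.
        self.records.clear()


engine = TinyEngine()
engine.hook(1.0)       # first (throw-away) forward pass pollutes the state
engine._reset_state()  # reset between the two forward passes
engine.hook(2.0)       # second forward pass, the one we actually care about
print(engine.records)  # [2.0]: only the second pass's contribution remains
```

Whether this should be public or private (and test-only) is exactly the open question above.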

@PierreQuinton

codecov bot commented Oct 2, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.


@ValerianRey (Contributor)

I'm not sure this is such a good test, to be honest; I'll probably not merge this.

@PierreQuinton (Contributor)

I like this test. I'm not sure I understand exactly what the problem is with the state. Could we do the reset in the hook, like at the beginning?

ValerianRey (Contributor) commented Oct 3, 2025

> I like this test. I'm not sure I understand exactly what the problem is with the state. Could we do the reset in the hook, like at the beginning?

We don't want to reset at every hook call: we want to reset before the model's forward pass. So we'd need a model-level hook for that (which would change the engine constructor), or we could use a context manager. It would be something like:

engine = ...
with engine.activate_hooks():
    output = ...

losses = ...
gramian = engine.compute_gramian(losses)

The hooks would thus be activated manually, rather than always being active except during the Gramian computation phase. I'm not a big fan of that either, but it's not that bad. IMO we shouldn't change anything for now, but we should still think about it a bit.
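One way such an activate_hooks context manager could be built is with contextlib.contextmanager; this is a sketch under the same TinyEngine stand-in (hypothetical names, not the actual autogram implementation), where entering the context resets the state and enables recording:

```python
from contextlib import contextmanager


class TinyEngine:
    """Stand-in for the autogram engine with manually activated hooks."""

    def __init__(self):
        self.records = []
        self._active = False

    @contextmanager
    def activate_hooks(self):
        # Reset accumulated state on entry, record only inside the block,
        # and deactivate again on exit even if an exception is raised.
        self.records.clear()
        self._active = True
        try:
            yield self
        finally:
            self._active = False

    def hook(self, activation):
        if self._active:
            self.records.append(activation)


engine = TinyEngine()
engine.hook(1.0)  # outside the context: ignored, cannot pollute the state
with engine.activate_hooks():
    engine.hook(2.0)  # inside the context: recorded
print(engine.records)  # [2.0]
```

This keeps stray forward passes (like the throw-away first pass above) from ever touching the engine's state, at the cost of requiring the caller to wrap the relevant forward pass explicitly.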
