
Add HierarchicalWeighting #414

Closed
ValerianRey wants to merge 46 commits into main from add-hierarchical-weighting

Conversation

Contributor

@ValerianRey ValerianRey commented Sep 10, 2025

This PR contains 0a25555, which adds the HierarchicalWeighting.

I think we should wait until we have tested this in proper experiments / speed benchmarks before merging.

PierreQuinton and others added 30 commits August 31, 2025 15:25
Add reshape of jacobian for the scalar output case.

Fix reshape of the Gramian: for the last half of the dimensions, we need to reshape in the same order as the first half, then move the dimensions. We could in principle create a `reshape_gramian` function that does this, as well as a `move_dim_gramian`.
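A minimal sketch of the `reshape_gramian` / `move_dim_gramian` idea described above: a Gramian of an output with shape `s` has shape `(*s, *s)`, so any reshape or movedim applied to the output must be applied identically to both halves. The function names and signatures here are assumptions for illustration, not the PR's actual API.

```python
import torch

def reshape_gramian(gramian: torch.Tensor, shape: list[int]) -> torch.Tensor:
    # Hypothetical helper: reshape both halves (*s, *s) -> (*shape, *shape),
    # applying the same reshape, in the same order, to each half.
    return gramian.reshape(*shape, *shape)

def move_dim_gramian(gramian: torch.Tensor, source: int, destination: int) -> torch.Tensor:
    # Hypothetical helper: move a dimension of the output in both halves at once.
    k = gramian.ndim // 2
    return gramian.movedim([source, source + k], [destination, destination + k])
```

Both operations preserve the quadratic form: for any weighting `w` with the output's shape, `w · G · w` is unchanged when `w` and `G` are reshaped (or dim-moved) consistently.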

Add a value test for all four cases of having a batched/non-batched dimension. Tests that reshape/move-dim work should go in another test.

Remove some tests that do not test anything more than `test_gramian_is_correct`.

Add `_gramian_utils.py`, which contains helpers to `reshape` and `movedim` a Gramian.

Add `generate_vmap_rule = True` for `JacobianAccumulator`. This allows vmapping the forward phase, which enables having several Engines defined on the same module.
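For context, `generate_vmap_rule = True` is PyTorch's mechanism for making a custom `autograd.Function` compatible with `torch.func.vmap` without hand-writing a vmap rule; it requires the new-style `forward`/`setup_context` split. The toy Function below (which just doubles its input, unlike the PR's `JacobianAccumulator`) shows the pattern:

```python
import torch
from torch.func import vmap

class Double(torch.autograd.Function):
    # Ask torch to derive a vmap rule for this Function automatically.
    generate_vmap_rule = True

    @staticmethod
    def forward(x):
        # New-style forward: no ctx argument (required for generated vmap rules).
        return 2 * x

    @staticmethod
    def setup_context(ctx, inputs, output):
        pass  # nothing to save for backward in this toy example

    @staticmethod
    def backward(ctx, grad_output):
        return 2 * grad_output

x = torch.randn(4, 3)
y = vmap(Double.apply)(x)  # vmapping over the custom forward now works
```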

Add `test_reshape_equivariance`

Add tests to verify that gramian utils yield the correct quadratic forms.

Add tests to verify that gramian utils yield the correct quadratic forms.

Add `test_movedim_equivariance`

Fix warning.

Fix warning.

Remove handles from `ModuleHookManager`

Change `batched_dims` to a single optional `batched_dim`. Fix movedim in `compute_gramian` and add `test_movedim_equivariance`

Remove `grad_output`, can be added later, but should be `jac_output` instead.

Make modules with incompatible batched operations compatible with non-batched autogram.

Fix doc tests

Provide the autograd VJP for when no dimension is batched. This enables having a single forward pass in that case, which should be faster.
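A sketch of what such an autograd-backed VJP could look like as a callable class (names and signature are assumptions, not the PR's actual `AutogradVJP`): one forward pass is done up front, and each call computes a vector-Jacobian product with `torch.autograd.grad`.

```python
import torch

class AutogradVJP:
    # Hypothetical sketch: a VJP as a callable class backed by plain autograd,
    # usable when no dimension is batched.
    def __init__(self, inputs: list[torch.Tensor], fn):
        self.inputs = inputs
        self.output = fn(*inputs)  # single forward pass

    def __call__(self, cotangent: torch.Tensor) -> tuple[torch.Tensor, ...]:
        # Vector-Jacobian product of the recorded forward with `cotangent`.
        return torch.autograd.grad(
            self.output,
            self.inputs,
            grad_outputs=cotangent,
            retain_graph=True,  # allow repeated VJP calls on the same graph
        )
```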

Make VJPs into Callable classes.
Co-authored-by: Valérian Rey <31951177+ValerianRey@users.noreply.github.com>
Co-authored-by: Valérian Rey <31951177+ValerianRey@users.noreply.github.com>
Not the final name I think, but at least it's consistent with the method name
Co-authored-by: Pierre Quinton <pierre.quinton@epfl.ch>
ValerianRey and others added 16 commits September 5, 2025 19:16
At this point, 3 architectures fail: SomeFrozenParam, SomeUnusedParam, and MultiOutputWithFrozenBranch
… variables

This fixes non-batched engine on SomeFrozenParams architecture
…grad in AutogradVJP

This fixes non-batched engine on SomeUnusedParam
…adVJP

This fixes non-batched engine on MultiOutputWithFrozenBranch
Maybe not a definitive name, but I think it's clearer
* Small improvement of clarity
…e can also contain (at most) one element set to -1; the size of that dimension is deduced from the total number of elements
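The "-1" rule described above (at most one -1 in the target shape, deduced from the total element count) could be resolved with a small helper like this hypothetical sketch:

```python
def resolve_shape(shape: list[int], numel: int) -> list[int]:
    # Hypothetical helper: replace a single -1 entry by the size deduced
    # from the total number of elements, mirroring torch.reshape semantics.
    if shape.count(-1) > 1:
        raise ValueError("at most one dimension may be set to -1")
    known = 1
    for s in shape:
        if s != -1:
            known *= s
    if -1 in shape:
        if numel % known != 0:
            raise ValueError(f"cannot deduce -1: {numel} is not divisible by {known}")
        return [numel // known if s == -1 else s for s in shape]
    if known != numel:
        raise ValueError("shape does not match the number of elements")
    return list(shape)
```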
@ValerianRey
Contributor Author

Closing this PR in favor of an archive branch (archive/add-hierarchical-weighting). We can start a new PR when we want to experiment on this.

@ValerianRey ValerianRey deleted the add-hierarchical-weighting branch September 25, 2025 22:14