Incorporate GRIT model by ttolhurst · Pull Request #22 · gridfm/gridfm-graphkit

ttolhurst · 2025-11-17T19:26:29Z

Incorporation of GRIT from L. Ma's "Graph Inductive Biases in Transformers without Message Passing".

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

… into mirror_ft1125_incorporate_grit

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

romeokienzler · 2026-06-26T20:52:37Z

Code Review Findings

gridfm_graphkit/models/grit_transformer.py — GritHeteroAdapter.aggregate_pg reads self.grit.mask_value[0].item() to fill fully-masked buses. Since mask_value is registered as a learnable nn.Parameter when learn_mask=True, .item() detaches it: the learnable mask token receives no gradient through the only place it's consumed. learn_mask=True is silently a no-op.
gridfm_graphkit/training/loss.py — PBELoss computes S_injection = torch.diag(V) @ Y_bus_conj @ V_conj. torch.diag(V) materializes a dense N×N matrix and the matmul densifies the sparse Y-bus; on case2000 this is O(N²) memory and defeats the sparse Y-bus construction directly above. Use V * (Y_bus_conj @ V_conj) instead.
gridfm_graphkit/training/loss.py — PBELoss builds Y_diag = Y_diag + bus_orig[:, GS] + 1j * bus_orig[:, BS], pulling shunt admittance from the normalized input x_dict[\"bus\"]. If the normalizer scales GS/BS by a different factor than the Y-bus edge attributes YFF/YFT, the assembled Y-bus is physically inconsistent and the PBE residual is biased.
gridfm_graphkit/datasets/rrwp.py — Walk-length semantics are inconsistent with add_identity: with add_identity=True the PE has walk_length powers [I, A, A², …]; with add_identity=False it has walk_length-1 powers. Downstream RRWPLinearNodeEncoder(emb_dim=ksteps) assumes one fixed dimension, so toggling add_identity produces a silent size mismatch.
gridfm_graphkit/models/grit_layer.py — torch_scatter is imported under try/except and stored as None, but pyg_softmax/propagate_attention call scatter, scatter_max, scatter_add directly without a guard. A missing torch-scatter install raises TypeError: 'NoneType' is not callable instead of the explicit ImportError pattern used in rrwp_encoder._check_scatter.
gridfm_graphkit/training/loss.py — MaskedGenMSE was changed from pred_dict[\"gen\"][mask_dict[\"gen\"][:, :(PG_H+1)]] to slicing pred/target first then masking. This is a behavior change for any config with output_gen_dim > 1 (the old form would broadcast/error, the new form computes a different quantity). There's no test asserting either path; confirm intent and add coverage.
gridfm_graphkit/datasets/rrwp.py — deg = adj.sum(dim=1) followed by deg_inv = 1.0 / adj.sum(dim=1) recomputes the row sum; reuse deg. Minor but executes every PE computation (mitigated by caching).
gridfm_graphkit/models/rrwp_encoder.py — RRWPLinearEdgeEncoder.__init__ accepts fill_value and uses it to build padding, but then unconditionally sets self.fill_value = 0.0. The stored attribute (and __repr__) always reports 0.0 regardless of the argument; either honor it or drop the parameter.
Side note — GritHeteroAdapter.__init__ mutates the shared args object (args.model.input_dim, args.model.gt.dim_hidden, args.model.encoder.posenc_RWSE.kernel.times). Fine for a single-model run, but it's a footgun for any future code that constructs multiple models from the same config.

romeokienzler · 2026-06-26T20:57:06Z

@ttolhurst had a go with claude code, can you please have a look at the found issues and comment?

ttolhurst and others added 30 commits November 17, 2025 14:07

added basic GRIT code

39a5862

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

initial connection of model to config

922d6ce

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

collect model components and replace old register method

e8281ac

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

clean up imported layers and encoders

a67e522

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

flow in basic structure for RRWP calculation

6966f5f

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

clean up

a7bd51d

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

matching up parameters

226f2a3

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

matching up parameters

88d9ca6

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

matching up parameters

b7d9dcf

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

matching up parameters

38cc44a

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

matching up parameters in grit layer

7fded95

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

matching up parameters in grit layer

0f3b803

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

matching up parameters in grit layer

af8ad03

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

matching up parameters in data module

f430f2a

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

flow over parameters from base model

e1c4890

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

verified encodings and data flow to model forward method

36dca00

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

match feature dimensions

a8ec56e

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

match feature dimensions

0868b96

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

reformat decoder to handle batch format

3cc21a3

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

confirmed training loop functions

1783051

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

update toml

c75012f

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

added forward method to transform class

3d3f98b

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

update readme with install instructions

d238e75

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

verifed compat with GPS and GNN

17b0889

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

work on comments and clean up

091f084

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

deep copy in test method

53d5644

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

merge main

272afa6

basic RWSE flown over

e23c9c6

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

tested addition of RWSE

bfe2af0

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

flow over kernel encoders

c1e5721

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

ttolhurst added 20 commits April 16, 2026 10:45

support for scatter and sparse

dbb5d47

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

formatting

101a9f5

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

support for optional scatter and sparse

e900295

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

ruff format

8730d06

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

clean up

dc3f494

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

support for scatter and sparse

526efa9

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

formatting

32e3ac2

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

support for scatter and sparse

f9cb7b2

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

Merge remote-tracking branch 'graphkitMirror/ft1125_incorporate_grit'…

81ac6e3

… into mirror_ft1125_incorporate_grit

update callback

969129e

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

allow posenc caching

4cbd59b

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

allow posenc caching

d4c4725

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

allow posenc caching

6e041e2

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

update caching for RRWP

baa2132

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

topk sparsity

aeedb7d

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

reduce cache memory footprint

db88a7c

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

clean up

fb0dbcf

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

change encoder

d46680f

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

clean up

f7af5be

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

linting

4c09970

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

frmir requested review from albanpuech and romeokienzler June 15, 2026 14:47

ttolhurst added 6 commits June 16, 2026 13:46

settle merge conflicts

ebace59

settle merge conflicts

91ce9f0

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

adjust grit config

c1706e7

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

adjust grit config

af76a9f

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

clamp un-predicted values in predict

34bafe9

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

precommit checks

41d2567

Signed-off-by: Thomas Tolhurst <99353435+ttolhurst@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Incorporate GRIT model#22

Incorporate GRIT model#22
ttolhurst wants to merge 103 commits into
gridfm:mainfrom
ttolhurst:ft1125_incorporate_grit

ttolhurst commented Nov 17, 2025

Uh oh!

romeokienzler commented Jun 26, 2026

Uh oh!

romeokienzler commented Jun 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

ttolhurst commented Nov 17, 2025

Uh oh!

romeokienzler commented Jun 26, 2026

Code Review Findings

Uh oh!

romeokienzler commented Jun 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants