Skip to content

Adding L1-L4 design to RFC-0050#93

Open
can-gaa-hou wants to merge 1 commit intopytorch:masterfrom
can-gaa-hou:cjh
Open

Adding L1-L4 design to RFC-0050#93
can-gaa-hou wants to merge 1 commit intopytorch:masterfrom
can-gaa-hou:cjh

Conversation

@can-gaa-hou
Copy link
Copy Markdown

As @ZainRizvi mentioned here, I am opening this PR for adding L1-L4 design in our RFC. This may help us better develop L2-L4 implementation.

cc @fffrog @ZainRizvi @albanD

@meta-cla meta-cla bot added the cla signed label Apr 14, 2026

#### L4: Always-On PR Checks

`L4` uses the same callback and synchronization behavior as `L3`, but it does not wait for a label. The `L4` scenario is effectively the same as `L3` Scenario 1 except that the downstream result is represented in upstream PR Checks by default, without requiring explicit labeling.
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Was there a plan to allow out of tree testing to block PRs? It seems like we'd need a very tight agreement on availability if out of tree tests can halt merging on pytorch/pytorch.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @groenenboomj, we've discussed this in detail here: pytorch/pytorch#175022 (comment). For L4 level repos, they have the capabilities to gate PyTorch PR, but there will be a very high standard to achieve L4 (and the PyTorch core team remains the right to choose whether a repo is L4 even though it meets all the requirements).

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay, I see that was added in the last level breakdown expansion in March. Has there been more discussion on the requirements for gating?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The main requirement is going to be this one:

Very few accelerators would reach this level. One only becomes L4 if pytorch maintainers are highly invested in it and find it worthwhile to add the signal to every PR.

It's intentionally left somewhat fuzzy, since it's a balance of "how much value will this provide vs how much pain does this inflict" since the idea itself is somewhat subjective. The automatic downgrade will need a firm reliability metric, but even that will likely need fine tuning based on experimentation, so hard numbers here in the short term are likely to be unreliable.

Do you have concerns about the approach @groenenboomj ?

Copy link
Copy Markdown
Contributor

@ZainRizvi ZainRizvi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you please sync this PR with the design specified in this one?

They're both discussing the same goal, so it would be good to keep the discussion in one place and not repeat points across them.

@can-gaa-hou
Copy link
Copy Markdown
Author

Could you please sync this PR with the design specified in this one?

Sure! I will discuss this with @jewelkm89 ASAP and update it to the RFC as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants