CK mha bwd: add sink attention score gradient support by LJ-underdog · Pull Request #2321 · ROCm/aiter

LJ-underdog · 2026-03-18T06:34:52Z

Motivation

This PR extends the CK-backed MHA backward paths (mha_bwd / mha_varlen_bwd) to accept sink attention log-scores and optionally accumulate a sink gradient (d_sink), and adds Python tests to validate d_sink correctness.

Technical Details

Plumbs sink / d_sink through the Torch C++ interfaces, pybind args, and CK kernel argument structs.
Updates CK kernel launch argument packing to pass sink pointers into backward kernels (batch + varlen).
Adds new GPU tests for mha_bwd and mha_varlen_bwd d_sink accumulation vs a PyTorch reference.
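
For readers unfamiliar with the quantity being accumulated, here is a minimal pure-Python sketch of a sink gradient. It is not the PR's test code or the CK kernel. It assumes the common formulation in which the sink is one extra logit per head that joins the softmax but contributes no value vector; under that assumption the gradient for one query row is `-p_sink * dot(d_out, out)`, summed over rows. All function names below are hypothetical and chosen for illustration only.

```python
import math


def attention_with_sink(scores, values, sink):
    """One query row: softmax over key scores plus one extra sink logit.

    Assumption: the sink absorbs probability mass but has no value vector.
    Returns the output vector and the probability assigned to the sink.
    """
    m = max(max(scores), sink)  # subtract the max for numerical stability
    exp_scores = [math.exp(s - m) for s in scores]
    exp_sink = math.exp(sink - m)
    denom = sum(exp_scores) + exp_sink
    probs = [e / denom for e in exp_scores]
    p_sink = exp_sink / denom
    dim = len(values[0])
    out = [sum(p * v[d] for p, v in zip(probs, values)) for d in range(dim)]
    return out, p_sink


def d_sink_closed_form(scores, values, sink, d_out):
    """Closed-form gradient of sum(d_out * out) with respect to the sink logit."""
    out, p_sink = attention_with_sink(scores, values, sink)
    delta = sum(g * o for g, o in zip(d_out, out))  # row-wise dot(d_out, out)
    return -p_sink * delta


def d_sink_finite_difference(scores, values, sink, d_out, eps=1e-6):
    """Central-difference estimate of the same gradient, used as a check."""
    def loss(s):
        out, _ = attention_with_sink(scores, values, s)
        return sum(g * o for g, o in zip(d_out, out))
    return (loss(sink + eps) - loss(sink - eps)) / (2 * eps)


# Toy data: two query rows sharing one head, so they share one sink logit.
rows = [
    ([0.3, -1.2, 0.8], [0.5, -0.25]),   # (key scores, upstream gradient d_out)
    ([1.1, 0.0, -0.4], [-1.0, 0.75]),
]
values = [[1.0, 2.0], [0.5, -1.0], [-0.5, 0.25]]
sink = 0.2

# Accumulate the per-row contributions into a single d_sink value.
analytic = sum(d_sink_closed_form(s, values, sink, g) for s, g in rows)
numeric = sum(d_sink_finite_difference(s, values, sink, g) for s, g in rows)
```

Comparing `analytic` with `numeric` mirrors, at toy scale, the idea of checking an accumulated `d_sink` against an independent reference; the actual tests in this PR compare the kernels against a PyTorch reference instead.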

Test Plan

Add tests covering mha_bwd and mha_varlen_bwd (test_mha_bwd&varlen_bwd.py).

Test Result

Tests passed locally.

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

github-actions · 2026-03-18T06:35:28Z

🏷️ CI Guide

Runs automatically on every PR:

✅ Pre-checks (submodule verification, code formatting)
✅ Aiter op tests (gfx942 + gfx950)
✅ Triton tests (only when aiter/ops/triton/** or related paths are changed)

Extended tests (opt-in via labels):

| Label | Tests |
|---|---|
| `ci:sglang` | SGLang integration tests |
| `ci:atom` | ATOM benchmark (DeepSeek-R1 + GPT-OSS) |
| `ci:vllm` | vLLM benchmark |
| `ci:all` | All of the above |

Add labels via the sidebar or with `gh pr edit 2321 --add-label <label>`.

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Copilot

Pull request overview

This PR extends the CK-backed MHA backward paths (mha_bwd / mha_varlen_bwd) to accept sink attention log-scores and optionally accumulate a sink gradient (d_sink), and adds Python tests to validate d_sink correctness.

Changes:

Plumbs sink / d_sink through the Torch C++ interfaces, pybind args, and CK kernel argument structs.
Updates CK kernel launch argument packing to pass sink pointers into backward kernels (batch + varlen).
Adds new GPU tests for mha_bwd and mha_varlen_bwd d_sink accumulation vs a PyTorch reference.

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 4 comments.

Show a summary per file

| File | Description |
|---|---|
| `op_tests/test_mha_sink_bwd.py` | New tests validating `d_sink` accumulation for batch and varlen backward kernels. |
| `aiter/ops/mha.py` | Updates Python-exposed `mha_bwd` / `mha_varlen_bwd` signatures to accept `sink` / `d_sink`. |
| `csrc/include/torch/mha_bwd.h` | Extends the Torch C++ API for `mha_bwd` to accept `sink` / `d_sink`. |
| `csrc/include/torch/mha_varlen_bwd.h` | Extends the Torch C++ API for `mha_varlen_bwd` to accept `sink` / `d_sink`. |
| `csrc/include/rocm_ops.hpp` | Adds `sink` / `d_sink` parameters to the pybind signatures for backward ops. |
| `csrc/include/mha_bwd.h` | Extends `mha_bwd_args` with sink pointer fields. |
| `csrc/cpp_itfs/mha_bwd.cu` | Passes sink pointers into the CK `fmha_bwd_args` used by the non-asm path. |
| `csrc/py_itfs_ck/mha_bwd_kernels.cu` | Adds optional `sink` / `d_sink` plumbing to the CK batch-mode backward wrapper. |
| `csrc/py_itfs_ck/mha_varlen_bwd_kernels.cu` | Adds optional `sink` / `d_sink` plumbing to the CK varlen backward wrapper. |
| `csrc/py_itfs_cu/fmha_bwd_pre_post_kernel_generate.py` | Updates the codegen template to include `LSEDataType` in pipeline problem typing. |




CK mha bwd: add sink attention score gradient support

d3fd124

LJ-underdog and others added 2 commits March 18, 2026 02:05

test: add varlen sink bwd tests to test_mha_sink_bwd

d974b3e

Update op_tests/test_mha_sink_bwd.py

b065c60

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

LJ-underdog requested a review from Copilot March 18, 2026 07:07

Copilot started reviewing on behalf of LJ-underdog March 18, 2026 07:08

style: apply black formatting to test_mha_sink_bwd

a7bf4ae

Copilot AI reviewed Mar 18, 2026

Resolved review threads:

csrc/py_itfs_ck/mha_varlen_bwd_kernels.cu
aiter/ops/mha.py
csrc/include/mha_bwd.h
csrc/py_itfs_ck/mha_bwd_kernels.cu

LJ-underdog added 2 commits March 18, 2026 02:23

test: move sink bwd tests into test_mha.py and test_mha_varlen.py

4ac64db

style: apply black formatting to sink bwd tests in test_mha and test_mha_varlen

3690ad1


LJ-underdog wants to merge 6 commits into main from lj_ck_sink_bwd_v2


2 participants
