Bugfix: support qk_head_dim != v_head_dim in FMHA #40

Closed

huanghua1994 wants to merge 1 commit into NVIDIA:main from
huanghua1994:bug-fmha-different-qk-v-head-dim

Conversation

@huanghua1994

Description

The original implementation does not support qk_head_dim != v_head_dim, which is needed for Multi-head Latent Attention. Problem sizes in samples/AttentionFMHA.py are updated such that qk_head_dim != v_head_dim and q_num_head != kv_num_head, to exercise a generic GQA case. The parameters and the way PyTorch's scaled_dot_product_attention is called are also updated so that a working backend can be found.

All tests have passed locally on a B200.
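
To make the shape relationships concrete, here is a minimal NumPy reference sketch of attention with qk_head_dim != v_head_dim and grouped (GQA) heads. This is an illustration only, not the kernel in this PR; the function name `attention_ref` and the example head counts and dimensions are assumptions for the sketch.

```python
import numpy as np

def attention_ref(q, k, v):
    """Reference attention where Q/K share qk_head_dim and V has its own v_head_dim.

    Shapes (assumed layout for this sketch):
      q: (q_num_head,  seq, qk_head_dim)
      k: (kv_num_head, seq, qk_head_dim)
      v: (kv_num_head, seq, v_head_dim)
    GQA: each group of q_num_head // kv_num_head query heads shares one KV head.
    """
    q_num_head, seq, qk_head_dim = q.shape
    kv_num_head, _, v_head_dim = v.shape
    group = q_num_head // kv_num_head           # query heads per KV head
    k = np.repeat(k, group, axis=0)             # broadcast KV heads to Q heads
    v = np.repeat(v, group, axis=0)
    # Scores use qk_head_dim for both the dot product and the softmax scale.
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(qk_head_dim)  # (q_num_head, seq, seq)
    scores -= scores.max(axis=-1, keepdims=True)              # numerically stable softmax
    p = np.exp(scores)
    p /= p.sum(axis=-1, keepdims=True)
    # The output inherits v_head_dim, not qk_head_dim.
    return p @ v                                # (q_num_head, seq, v_head_dim)
```

For example, with 8 query heads, 2 KV heads, qk_head_dim = 192, and v_head_dim = 128 (illustrative numbers), the output has shape (8, seq, 128): the value head dimension, which is exactly the case the original kernel did not handle.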

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

The original implementation does not support qk_head_dim != v_head_dim,
which is needed in Multi-head Latent Attention.

Also fix some test code logic.

Signed-off-by: Hua Huang <huah@nvidia.com>
@haijieg
Collaborator

haijieg commented Mar 18, 2026

@huanghua1994 thank you for your contribution. For the moment, we intend to keep the FMHA sample simple for educational purposes. If you are looking to contribute kernels, please check out the TileGym project: https://github.com/nvidia/tilegym

Mind if I close this PR?

@huanghua1994
Author

Sure. Thank you for the pointer to that project.
