CUDA FA: run KV_max mask scan for all Q batch sizes #22137

Open
ssam18 wants to merge 1 commit into ggml-org:master from ssam18:fix/fattn-kv-max-small-batch

Commits

Commits on Apr 20, 2026