Skip to content

Mul mat vec optimization#19

Open
XXjcontiniXX wants to merge 2 commits intoreeselevine:masterfrom
XXjcontiniXX:mul-mat-vec-optimization
Open

Mul mat vec optimization#19
XXjcontiniXX wants to merge 2 commits intoreeselevine:masterfrom
XXjcontiniXX:mul-mat-vec-optimization

Conversation

@XXjcontiniXX
Copy link

Make sure to read the contributing guidelines before submitting a PR

jeffbolznv and others added 2 commits February 2, 2026 14:29
…gml-org#18295)

* vulkan: extend topk_moe to handle sigmoid w/exp_probs_b for nemotron

Also handle GGML_OP_SCALE at the end (nemotron, deepseek2).

Fewer pipeline variants and spec constants, just use push constants.

In test_topk_moe, change exp_probs_b to be 1D, matching real networks.

Update test-backend-ops and ggml-backend to allow verifying multiple outputs
in a fusion test (topk_moe has two outputs). Previously only the final node
was verified.

* change test_topk_moe to allow results in arbitrary order

* disable sigmoid fusion for moltenvk
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants