Refactor kernel by kevssim · Pull Request #237 · modelscope/twinkle

kevssim · 2026-06-29T07:41:13Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

Refactor twinkle.kernel from a registry-based API to a minimal mapping-driven API. The public surface is reduced to kernelize, hub, and npu_builtin.

- kernelize(model, mappings) applies class/attr replacements to a live model; a module matches only when type(m) is target_cls holds exactly
- hub(*entries) declares kernels that are resolved lazily from the optional kernels package
- npu_builtin() returns the standard NPU bundle (RMSNorm, rotary, swiglu, SDPA, MoE, FLA); GMM is opt-in and must be enabled manually
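The exact-type rule in kernelize can be illustrated with a minimal sketch. This is not the code from this PR: the Node tree, the kernelize_sketch name, the {target_cls: new_cls} mapping shape, and the use of __class__ assignment are all assumptions made for illustration, and the example uses plain Python objects rather than torch modules.

```python
class Node:
    """Toy stand-in for a module tree (hypothetical; not torch.nn.Module)."""

    def __init__(self, **children):
        self.children = dict(children)

    def walk(self):
        # Yield this node, then every descendant, depth first.
        yield self
        for child in self.children.values():
            yield from child.walk()


class RMSNorm(Node):
    def forward(self, x):
        return "reference"


class GemmaRMSNorm(RMSNorm):
    """Subclass of RMSNorm; an RMSNorm mapping must not touch it."""


class FastRMSNorm(RMSNorm):
    def forward(self, x):
        return "fast"


def kernelize_sketch(model, mappings):
    """Swap the class of every node whose exact type is a key in mappings.

    The dict lookup on type(node) is equivalent to `type(node) is target_cls`;
    isinstance() is deliberately not used, so subclasses are left alone.
    """
    replaced = 0
    for node in model.walk():
        new_cls = mappings.get(type(node))
        if new_cls is not None:
            node.__class__ = new_cls  # assumed replacement mechanism
            replaced += 1
    return replaced


model = Node(norm=RMSNorm(), other=GemmaRMSNorm())
count = kernelize_sketch(model, {RMSNorm: FastRMSNorm})
```

In this sketch only the plain RMSNorm node is swapped; the GemmaRMSNorm node keeps its original class because its exact type is not a key in the mapping, which is the behavior the exact-match rule is meant to guarantee.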

Deleted legacy registry.py, function.py, layer.py, base.py, monkey_patch_npu.py; added npu_impls/ package. Migrated cookbook/transformers/{fsdp2,sp_fsdp_dense,ep_fsdp2_lora_qwen3_5_moe}.py and rewrote zh/en Kernel docs.


…load
- builtin.py: _install_sdpa() now only runs when torch_npu is importable, preventing the NPU (boolean-mask-inverting) SDPA impl from contaminating the global ALL_ATTENTION_FUNCTIONS['sdpa'] registry on CUDA/CPU hosts.
- builtin.py: drop dead _SdpaPatchSentinel + add/pop scaffolding.
- fla.py: flip is_flash_linear_attention_available only after the MindSpeed kernel imports successfully; previously a MindSpeed-missing NPU host would be left with FLA flagged available but no kernel installed -> Qwen3.5 runtime failure.
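The two guards described in this commit message follow a common pattern: only touch global state once the optional backend is known to be usable. A minimal sketch, assuming hypothetical names throughout (install_sdpa_sketch, enable_fla_sketch, the registry and flags dicts, and the placeholder string values are illustrative and are not the PR's code):

```python
import importlib
import importlib.util


def backend_available(module_name):
    """Return True if a top-level module can be found, without importing it."""
    return importlib.util.find_spec(module_name) is not None


def install_sdpa_sketch(registry, backend="torch_npu"):
    """Override the 'sdpa' entry only when the backend package is present.

    On hosts without the backend the registry is left untouched, so the
    backend-specific implementation cannot leak into other device paths.
    """
    if not backend_available(backend):
        return False
    registry["sdpa"] = "npu_sdpa"  # placeholder for the real callable
    return True


def enable_fla_sketch(flags, kernel_module):
    """Flip the availability flag only after the kernel import succeeds."""
    try:
        importlib.import_module(kernel_module)
    except ImportError:
        return False  # flag stays False: no kernel, so do not advertise one
    flags["fla_available"] = True
    return True


# Hypothetical stand-ins for the global registry and the availability flag.
registry = {"sdpa": "default_sdpa"}
flags = {"fla_available": False}

installed = install_sdpa_sketch(registry, backend="no_such_backend_xyz")
enabled = enable_fla_sketch(flags, "no_such_kernel_pkg_xyz")
```

Ordering is the point of the second function: setting the flag before the import would reproduce the failure mode the commit describes, where a feature is advertised as available but no kernel is installed.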


gemini-code-assist

Code Review

This pull request refactors the Twinkle kernel module to introduce a mapping-driven kernel replacement API, exposing kernelize, hub, and npu_builtin while removing legacy registration and patch helpers. It also modularizes NPU-specific optimizations under src/twinkle/kernel/npu_impls/ and updates documentation and tests. A critical issue was identified in src/twinkle/kernel/core.py where the helper function _infer_device is missing, causing an ImportError in the test suite.



kevssim added 24 commits June 29, 2026 15:15

feat(kernel): add HubRef dataclass and hub() factory

63a1c2b

feat(kernel): add _infer_device helper

7049f6f

feat(kernel): add _resolve_value with device-conditional dispatch

1547d8c

feat(kernel): add _replace_class and _replace_attr helpers

1be64aa

feat(kernel): add _load_hub_ref with lazy kernels import

c72883b

feat(kernel): add kernelize() dispatcher

d4318b1

feat(kernel): add npu_impls/rms_norm module

77165c9

feat(kernel): add npu_impls/rotary module

ac47045

feat(kernel): add npu_impls/swiglu module

f0d0a23

feat(kernel): add npu_impls/attention module

1e9902e

feat(kernel): add npu_impls/moe module

4fc02c1

feat(kernel): add npu_impls/fla module

a5421f9

feat(kernel): add npu_builtin() bundle and class-attr replacement

87e8477

refactor(kernel): expose only kernelize, hub, npu_builtin

39a9225

refactor(kernel): remove legacy registry/function/layer/base/monkey_p…

f4c491f

…atch_npu

refactor(cookbook): migrate to new twinkle.kernel API

3fb7071

docs(kernel): rewrite Chinese doc for new mapping API

109cf24

docs(kernel): rewrite English doc for new mapping API

c7babac

wip

a742827

wip

dc7cb93

wip

398dd37

wip

126efc3

Revert "wip"

a20da5a

This reverts commit 126efc3.

gemini-code-assist Bot reviewed Jun 29, 2026


Comment thread src/twinkle/kernel/core.py

kevssim added 3 commits June 29, 2026 15:44

lint

ff67fc1

Merge remote-tracking branch 'origin/main' into refactor/kernel-mappi…

9603c62

…ng-api

delete

b06cfe5

kevssim changed the title ~~refactor kernel~~ Refactor kernel Jun 30, 2026

wip

1b5f8c9

Refactor kernel #237
kevssim wants to merge 28 commits into modelscope:main from kevssim:refactor/kernel-mapping-api