fix qwen3_next #58
Conversation
Code Review
This pull request updates the `forward` method in `qwen3_next.py` to call `resolve_gdn_attention_mask` instead of `resolve_hf_attention_mask`. A review comment identifies a potential logic error and runtime risk in the `resolve_gdn_attention_mask` implementation: the bitwise NOT operator is applied to what may be an integer tensor, and the function assumes a 4D mask, which can raise an `IndexError` on the standard 2D HuggingFace format.
      else:
          hidden_states = hidden_states.transpose(0, 1)
-         attention_mask = resolve_hf_attention_mask(kwargs)
+         attention_mask = resolve_gdn_attention_mask(kwargs)
The function `resolve_gdn_attention_mask` (called here and defined at line 62) has a potential logic error and runtime risk:

- Type Safety: At line 70, it uses the bitwise NOT operator `~` on `attention_mask`. If the mask is an integer tensor (common for HuggingFace masks), `~1` results in `-2`, which will cause the `sum(...) > 0` check to behave unexpectedly. It should be explicitly cast to boolean: `(~attention_mask.to(torch.bool))`. See the minimal repro after this list.
- Dimensionality: The function assumes a 4D input by using `dim=(1, 2)`. If `kwargs.get('attention_mask')` returns a 2D tensor (the standard HuggingFace format), this will raise an `IndexError`.
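As a hedged illustration of the first point (not code from this repository), a minimal PyTorch repro of how `~` misbehaves on an integer padding mask and how the bool cast restores the intended check:

```python
import torch

# HuggingFace-style 2D padding mask: 1 = real token, 0 = padding.
attention_mask = torch.tensor([[1, 1, 1, 0, 0]])

# On an integer tensor, ~1 == -2 and ~0 == -1, so the sum is negative
# and an "is anything padded?" check silently reports False.
print((~attention_mask).sum() > 0)       # tensor(False), despite two padded positions

# Casting to bool first gives the intended semantics.
bool_mask = attention_mask.to(torch.bool)
print((~bool_mask).sum() > 0)            # tensor(True)
```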
Since this function is now being used to resolve the attention mask, these issues should be addressed in its implementation to ensure robustness.
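A minimal sketch of what a hardened `resolve_gdn_attention_mask` could look like, assuming it should return a boolean padding mask or `None` for a fully unpadded batch; that return contract, the 2D-only guard, and the exact signature are assumptions for illustration, not the repository's actual code:

```python
import torch
from typing import Optional


def resolve_gdn_attention_mask(kwargs: dict) -> Optional[torch.Tensor]:
    """Sketch only: resolve a 2D HuggingFace-style padding mask
    (1 = token, 0 = padding) to a bool mask, or None when unpadded."""
    attention_mask = kwargs.get("attention_mask")
    if attention_mask is None:
        return None

    # Guard the dimensionality assumption explicitly instead of letting
    # a dim=(1, 2) reduction fail with an IndexError on 2D inputs.
    if attention_mask.dim() != 2:
        raise ValueError(
            f"expected a 2D [batch, seq_len] attention mask, got shape "
            f"{tuple(attention_mask.shape)}"
        )

    # Cast to bool before inverting: on an integer mask ~1 == -2 and the
    # padding check below would never fire.
    attention_mask = attention_mask.to(torch.bool)

    # Fully unpadded batch: signal "no mask needed" to the caller.
    if (~attention_mask).sum() == 0:
        return None
    return attention_mask
```

Returning `None` for the unpadded case lets the caller skip the masked code path entirely, which appears to be what the existing `sum(...) > 0` check is aiming for.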