[bugfix] fix gather_from_sp by Jintao-Huang · Pull Request #63 · modelscope/mcore-bridge

Jintao-Huang · 2026-05-05T16:04:46Z

No description provided.

gemini-code-assist

Code Review

This pull request refines sequence parallel operations across several model files. Key updates include replacing reduce_scatter_to_sequence_parallel_region with scatter_to_sequence_parallel_region, which removes the need for manual scaling by the tensor model parallel size. Additionally, the gather_from_sequence_parallel_region calls now explicitly set tensor_parallel_output_grad=False, and forward passes in Qwen models are wrapped with a CUDA RNG tracker fork to ensure consistent randomness. I have no feedback to provide as there were no review comments to evaluate.
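To illustrate why swapping reduce-scatter for scatter removes the manual scaling, here is a minimal single-process sketch. It assumes the tensor entering the sequence-parallel region is already identical on every tensor-parallel rank, which the PR does not state explicitly. The `scatter` and `reduce_scatter` helpers below are hypothetical stand-ins written with plain Python lists, not the Megatron functions named in the review.

```python
# Simulate tensor-parallel ranks in one process with plain Python lists.
# These helpers are illustrative stand-ins, NOT the Megatron implementations.

def scatter(full, rank, world_size):
    """Keep only this rank's contiguous chunk along the sequence dimension."""
    chunk = len(full) // world_size
    return full[rank * chunk:(rank + 1) * chunk]


def reduce_scatter(per_rank_inputs, rank):
    """Sum the inputs element-wise across ranks, then keep this rank's chunk."""
    world_size = len(per_rank_inputs)
    summed = [sum(vals) for vals in zip(*per_rank_inputs)]
    return scatter(summed, rank, world_size)


tp_size = 2
hidden = [1.0, 2.0, 3.0, 4.0]      # 4 tokens; assumed replicated on every rank
replicated = [hidden] * tp_size    # what each simulated rank holds

for rank in range(tp_size):
    via_scatter = scatter(hidden, rank, tp_size)
    via_reduce = reduce_scatter(replicated, rank)
    # With replicated inputs the sum is tp_size times too large,
    # so the reduce-scatter path needs a manual division to match.
    rescaled = [v / tp_size for v in via_reduce]
    assert rescaled == via_scatter
```

Under that replication assumption, the plain scatter returns the correct shard directly, while the reduce-scatter path only matches after dividing by the tensor-parallel size.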

fix

2d0cb4e

gemini-code-assist Bot reviewed May 5, 2026

Merge branch 'main' into fix_gather_from_sp

6488653

Jintao-Huang merged commit dab4f16 into modelscope:main May 5, 2026
1 check passed

Jintao-Huang merged 2 commits into modelscope:main from Jintao-Huang:fix_gather_from_sp
