Fix two bugs in "Fix fsdp"#1

Open
SCZwangxiao wants to merge 3 commits into C1rN09:fix_fsdp from SCZwangxiao:fix_fsdp_sczwangxiao

SCZwangxiao commented Jan 29, 2023

Hi Cirno, it's nice to meet a Touhou fan here! I also really appreciate your work on this, since FSDP is critical for training large models.

However, when I used this commit to train my own model, I found two bugs when resuming from checkpoints. Details are in open-mmlab#553 (comment). This pull request fixes both bugs.
