Skip to content

关于SFT时学习率的问题 #26

@Fu-Fu-Fu-Fu

Description

@Fu-Fu-Fu-Fu

我看到您脚本中设置的1e-5?一般SFT学习率不是用1e-6吗?1e-5会不会有些大?

包括您Video-R1的时候设置的似乎是1e-6

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions