Skip to content

fix: adopt mup/Transformers API for torch2.3#75

Open
emergenz wants to merge 1 commit intomicrosoft:mainfrom
emergenz:mup-transformers-api-update-torch-23
Open

fix: adopt mup/Transformers API for torch2.3#75
emergenz wants to merge 1 commit intomicrosoft:mainfrom
emergenz:mup-transformers-api-update-torch-23

Conversation

@emergenz
Copy link

Adding batch_first as an __init__ argument of MultiHeadAttention is just a quickfix since we are ignoring it.

It does the job, though.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant

Comments