For mutihead-attention, I confused that no mask is passed in. Will it work ?
For mutihead-attention, I confused that no mask is passed in. Will it work ?