Hello Thank you for the work :) May I ask the number of V100s you used for training the model? Trying to estimate the total batch size you used (understand that its 22 per V100 GPU)
Hello
Thank you for the work :)
May I ask the number of V100s you used for training the model?
Trying to estimate the total batch size you used (understand that its 22 per V100 GPU)