[Discussion] If we had enough memory to always do full batch gradient descent, would we still need rmsprop/momentum/adam? Submitted by 029187 (t3_xt0h2k) on October 1, 2022 at 5:01 PM in MachineLearning
Cheap_Meeting t1_iqq8oku wrote on October 2, 2022 at 8:58 AM Adding to the other answers: even if you had enough memory, it would still be computationally inefficient. There are diminishing returns from increasing the batch size in terms of how much the loss improves per step.
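A minimal sketch of the diminishing-returns point (my own toy example, not from the comment): on a least-squares problem, compare the final loss reached by different batch sizes when each run is given the same compute budget, measured as total examples processed. The problem size, learning rate, and budget are arbitrary choices for illustration.

```python
# Toy illustration (assumed setup, not from the thread): per unit of compute,
# small mini-batches often reduce the loss faster than full-batch gradient descent.
import numpy as np

rng = np.random.default_rng(0)
n, d = 4096, 32
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

def loss(w):
    # Mean squared error over the full dataset.
    return 0.5 * np.mean((X @ w - y) ** 2)

def run(batch_size, examples_budget=200_000, lr=0.05):
    # Plain (mini-batch) gradient descent with a fixed budget of examples processed,
    # so full-batch gets far fewer parameter updates than small batches.
    w = np.zeros(d)
    seen = 0
    while seen < examples_budget:
        idx = rng.choice(n, size=batch_size, replace=False)
        grad = X[idx].T @ (X[idx] @ w - y[idx]) / batch_size
        w -= lr * grad
        seen += batch_size
    return loss(w)

for bs in (32, 256, n):  # n == full batch
    print(f"batch={bs:4d}  loss after equal compute: {run(bs):.4f}")
```

With the same number of examples touched, the full-batch run takes very few steps, so the loss per unit of compute improves more slowly than with smaller batches, which is the diminishing-returns effect the comment describes.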