
Cheap_Meeting t1_iqq8oku wrote

Adding to the other answers: even if you had enough memory, it would still be computationally inefficient. Increasing the batch size gives diminishing returns in how much the loss improves per step, so past a certain point each step costs proportionally more compute for very little extra progress.
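To make the diminishing-returns point concrete, here is a rough sketch on a toy problem. Everything in it is an assumption for illustration and not something from the thread: a 1-D quadratic loss, minibatch gradient noise whose variance shrinks as 1/batch size, a learning rate tuned optimally for each batch size, and the made-up function name `expected_loss_drop`. Under those assumptions the expected loss decrease per step works out to `max_drop / (1 + noise_scale / batch_size)`, which saturates as the batch grows.

```python
def expected_loss_drop(batch_size, grad_sq=1.0, noise_var=100.0, curvature=1.0):
    """Expected one-step loss decrease on a toy 1-D quadratic loss.

    Assumptions (illustrative only): loss = 0.5 * curvature * w**2, the
    minibatch gradient is the true gradient plus zero-mean noise with
    variance noise_var / batch_size, and the learning rate is chosen
    optimally for each batch size.
    """
    max_drop = 0.5 * grad_sq / curvature  # noise-free limit (infinitely large batch)
    noise_scale = noise_var / grad_sq     # batch size at which the drop is half of max_drop
    return max_drop / (1.0 + noise_scale / batch_size)


batch_sizes = [2 ** k for k in range(0, 15, 2)]  # 1, 4, 16, ..., 16384
drops = [expected_loss_drop(b) for b in batch_sizes]

for b, d in zip(batch_sizes, drops):
    # Per-step progress flattens out, while progress per example keeps falling.
    print(f"batch={b:6d}  loss drop per step={d:.4f}  per example={d / b:.6f}")
```

In this toy model, going from batch size 1 to 2 nearly doubles the per-step improvement, while going from 1024 to 2048 adds only a few percent even though it doubles the compute per step. Real networks are messier, but the qualitative shape is the same idea.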
