Zondartul t1_j2cthgl wrote
Reply to comment by magpiesonskates in [D] Has any research been done to counteract the fact that each training datapoint "pulls the model in a different direction", partly undoing learning until shared features emerge? by derpderp3200
Would using a bath size of "all your data at once" (so basically no batching) be ideal, if unfeasible?
Viewing a single comment thread. View all comments