Difficult_Ferret2838 t1_ivrom17 wrote
Reply to comment by make3333 in [D] Is there an advantage in learning when taking the average Gradient compared to the Gradient of just one point by CPOOCPOS
>gradient descent takes the direction of the minimum at the step size according to the taylor series of degree n at that point.
No. Gradient descent is first order by definition: the update uses only the gradient, i.e. a degree-1 Taylor approximation, not a degree-n one.
>in a lot of other optimization settings they do second order approx to find the optimal direction
Even then it isn't an "optimal" direction. A second-order (Newton) step is only exact for the local quadratic model, not for the true objective.
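To make the first-order vs. second-order distinction concrete, here is a minimal sketch (my own illustration, not from the thread) on a quadratic f(x) = ½ xᵀAx. The gradient-descent direction uses only first-order information, while the Newton direction folds in the Hessian; the two generally point different ways, and neither is "optimal" for a non-quadratic objective.

```python
import numpy as np

# Ill-conditioned quadratic f(x) = 0.5 * x^T A x; A is its Hessian.
A = np.array([[10.0, 0.0],
              [0.0, 1.0]])
x = np.array([1.0, 1.0])  # current iterate

grad = A @ x                             # first-order information only
gd_dir = -grad                           # gradient-descent direction
newton_dir = -np.linalg.solve(A, grad)   # second-order (Newton) direction

# Normalize so we can compare the directions themselves.
gd_unit = gd_dir / np.linalg.norm(gd_dir)
newton_unit = newton_dir / np.linalg.norm(newton_dir)
print("gradient direction:", gd_unit)
print("newton direction:  ", newton_unit)
```

On this example the Newton direction points straight at the minimum (here, the origin), while the gradient direction is skewed toward the high-curvature axis; gradient descent with a fixed step would zigzag.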
Difficult_Ferret2838 t1_ivrnegq wrote
Reply to comment by make3333 in [D] Is there an advantage in learning when taking the average Gradient compared to the Gradient of just one point by CPOOCPOS
That doesn't mean anything.
Difficult_Ferret2838 t1_ivrnctl wrote
Reply to [D] Is there an advantage in learning when taking the average Gradient compared to the Gradient of just one point by CPOOCPOS
Are you not talking about batch size?
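Since the question ("average gradient vs. the gradient of one point") is really about batch size, here is a hedged sketch (my own example, assuming a least-squares loss) of the standard trade-off: the minibatch gradient is an unbiased estimate of the full gradient either way, but averaging over a larger batch shrinks its variance, at a proportionally higher cost per step.

```python
import numpy as np

# Synthetic least-squares problem: loss = mean((X w - y)^2) / 2.
rng = np.random.default_rng(0)
n, d = 10_000, 5
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)
w = np.zeros(d)  # evaluate gradients at an arbitrary fixed point

def grad(indices, w):
    """Minibatch gradient over the given sample indices."""
    Xb, yb = X[indices], y[indices]
    return Xb.T @ (Xb @ w - yb) / len(indices)

full = grad(np.arange(n), w)  # full-batch (exact) gradient

def est_variance(batch_size, trials=500):
    """Mean squared deviation of the minibatch gradient from the full one."""
    devs = [np.linalg.norm(grad(rng.choice(n, batch_size, replace=False), w) - full) ** 2
            for _ in range(trials)]
    return float(np.mean(devs))

print("batch=1: ", est_variance(1))
print("batch=64:", est_variance(64))
```

The variance drops roughly like 1/batch_size, which is why "averaging the gradient" buys you less noise per step but not more information per sample.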
Difficult_Ferret2838 t1_ivqb6wz wrote
Reply to [D] At what tasks are models better than humans given the same amount of data? by billjames1685
Multivariate nonlinear systems, e.g. reactors.
Difficult_Ferret2838 t1_ivtprrn wrote
Reply to comment by kksnicoh in [D] Is there an advantage in learning when taking the average Gradient compared to the Gradient of just one point by CPOOCPOS
Exactly, that is a meaningless phrase.