Viewing a single comment thread. View all comments

Less-Article1309 t1_irrjhfe wrote

There's plenty of other optimization methods out there, simulated annealing for example. SGD just lends itself well to the massively parallel architecture of Nvidia GPUs, that's the only reason why it's so prevalent in the industry.

1