Noooo!!! Also can’t believe I missed them! I saw them in Portsmouth, NH in 2019 and they were absolutely amazing live, especially in a small venue. Love the fact that they come to smaller towns here on the east coast.
Was just there last week. Staff comment is on-point, but I’d still recommend it. I don’t get the Ganko hate, it must be a seasonal staffing thing? Their ramen is so much better than what I’ve had at Wara Wara.
seacucumber3000 t1_j0fem4u wrote
Reply to [D] Simple Questions Thread by AutoModerator
When tuning hyperparameters, is learning rate (decay, scheduling, etc.) dependent on things like model size and activation function? Or can I search for the ideal model architecture first, then tune learning rate after?