Viewing a single comment thread. View all comments

Co0k1eGal3xy t1_is6xpmg wrote

How does the loss of converged models compare?

Removing parameters is similar to decreasing the learning rate as far as I remember, so you can't compare them during early training stages.

1