Submitted by International_Deer27 t3_10qhscf in deeplearning
like_a_tensor t1_j6q5cs1 wrote
Are you implementing the CNN from scratch? If so, the problem might be in your implementation.
Play with the batch size and batch norm. Try different optimizers. Your learning rate might also be too large; experiment with smaller learning rates or something like torch's ReduceLROnPlateau.
5500 samples is also a pretty small dataset, so maybe try a shallower network.
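For the learning-rate suggestion, here is a minimal sketch of how ReduceLROnPlateau is usually wired in, assuming a PyTorch model and a validation loss computed once per epoch. The tiny model, the starting learning rate, the factor/patience values, and the hard-coded validation losses are all placeholders for illustration, not anything taken from the OP's code.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical small CNN standing in for the real model (assumption).
model = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1),
    nn.BatchNorm2d(8),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(8, 2),
)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Halve the learning rate once the monitored metric has failed to
# improve for more than `patience` consecutive epochs.
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
    optimizer, mode="min", factor=0.5, patience=2
)

# Placeholder validation losses that plateau at 0.9 (made-up numbers).
val_losses = [1.0, 0.9, 0.9, 0.9, 0.9, 0.9]

lrs = []
for val_loss in val_losses:
    # In a real loop: train for one epoch, then compute val_loss here.
    scheduler.step(val_loss)  # the scheduler needs the monitored metric
    lrs.append(optimizer.param_groups[0]["lr"])

print(lrs)
```

Unlike most schedulers, this one has to be passed the metric in `scheduler.step(...)`, so it is normally called once per epoch after validation rather than after every batch.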
International_Deer27 OP t1_j6rxo78 wrote
Yes, I am. I also uploaded the code below in case you can have a look. I'll look into ReduceLROnPlateau.