Viewing a single comment thread. View all comments

like_a_tensor t1_j6q5cs1 wrote

Are you implementing the CNN from scratch? If so, the problem might be in your implementation.

Play with the batch size and batch norm. Try different optimizers. Your learning rate might also be too large; experiment with smaller learning rates or something like torch's ReduceLROnPlateau.

5500 sample is also pretty small, so maybe try a shallower network.

2

International_Deer27 OP t1_j6rxo78 wrote

Yes I am, I also uploaded the code below in case you can have a look. I'll look into ReduceLROnPlateau

1