Submitted by AKavun t3_105na47 in deeplearning
suflaj t1_j3bubtm wrote
Reply to comment by AKavun in Why didn't my convolutional image classifier network learn anything! by AKavun
Another problem you will likely have is that your convolutions are very narrow. Output channels of 8 and 16 are probably only enough to solve something like MNIST. You should probably use something more like 32 and 64, and use larger kernels and strides, so that the convolutions do more of the work and you rely less on the linear layers.
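To make the kernel and stride point concrete, here is a rough back-of-the-envelope sketch. The 64x64 input size, the kernel sizes, and the strides are all assumptions for illustration, not values from the original post, and `conv_out` and `flattened_features` are hypothetical helper names. It uses the standard output-size formula for an unpadded convolution to count how many features get flattened into the first linear layer.

```python
def conv_out(size, kernel, stride=1, padding=0):
    """Output size along one spatial dimension of a convolution."""
    return (size + 2 * padding - kernel) // stride + 1


def flattened_features(size, layers):
    """Number of values fed to the first linear layer after flattening.

    layers is a list of (out_channels, kernel, stride) tuples applied in
    order to a square input of side `size`, with no padding.
    """
    channels = 0
    for out_channels, kernel, stride in layers:
        size = conv_out(size, kernel, stride)
        channels = out_channels
    return channels * size * size


# Assumed configurations, for illustration only.
narrow = [(8, 3, 1), (16, 3, 1)]   # few channels, small kernels, stride 1
wide = [(32, 5, 2), (64, 5, 2)]    # more channels, larger kernels, stride 2

print(flattened_features(64, narrow))  # 57600
print(flattened_features(64, wide))    # 10816
```

Under these assumptions the wider network has four times as many channels but hands the linear layers roughly a fifth as many inputs, which is the sense in which the convolutions take over more of the work.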
Finally, you are not using nonlinear activations between layers. Without them, stacked convolutions and linear layers compose into a single linear map, so your whole network essentially acts like one convolutional layer followed by a flatten and a softmax.
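A small toy demonstration of why that happens, using 1D signals and plain Python rather than your actual model. The input and kernels are made-up numbers, and `conv1d`, `compose`, and `relu` are hypothetical helpers written for this example. Two convolutions applied back to back with no activation give exactly the same output as one convolution with a merged kernel, while putting a ReLU in between breaks that equivalence.

```python
def conv1d(x, k):
    """Valid cross-correlation with stride 1 and no bias."""
    n = len(x) - len(k) + 1
    return [sum(x[i + j] * k[j] for j in range(len(k))) for i in range(n)]


def compose(k1, k2):
    """Single kernel equivalent to applying k1, then k2, with no activation."""
    out = [0] * (len(k1) + len(k2) - 1)
    for j, a in enumerate(k1):
        for m, b in enumerate(k2):
            out[j + m] += a * b
    return out


def relu(values):
    return [max(0, v) for v in values]


# Made-up input and kernels, for illustration only.
x = [1, -2, 3, 0, -1, 2, 4]
k1 = [1, -1, 2]
k2 = [2, 1]

stacked = conv1d(conv1d(x, k1), k2)          # two layers, no activation
single = conv1d(x, compose(k1, k2))          # one layer with merged kernel
with_relu = conv1d(relu(conv1d(x, k1)), k2)  # two layers with a ReLU between

print(stacked == single)     # True
print(with_relu == stacked)  # False
```

The same argument carries over to 2D convolutions and to linear layers: any chain of linear operations is itself linear, so extra depth adds no expressive power until a nonlinearity sits between the layers.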