Submitted by Oceanboi t3_zm6h07 in MachineLearning
Why is it that when I go to create a CNN with 4 layers (output channels: 64, 32, 16, 16), I can do this in PyTorch, but in Tensorflow I get resource errors saying I don't have enough resources?
For reference I am using a stock NVIDIA RTX 3080.
Also, now that I am experimenting with larger models, would I benefit from renting TPU? Does this make the actual models train faster and would it help with larger batches?
schludy t1_j09ukmr wrote
Do you handle the data the same way? Maybe you're loading more data in the tensorflow implementation. It's really hard to tell without seeing the code