Submitted by Nike_Zoldyck t3_yk6jdy in deeplearning

I got a good scare on Halloween🎃, what with my model throwing a CUDA Out of memory error👻. After spending 2 whole days debugging and trying everything under the sun, here's what I learnt and how to fix it. Hope it helps !

Here's the Blog Post . Updated the link so that it is free to access incase it is stuck behind a membership thingy

Feedback and additional suggestions welcome !

1

Comments

You must log in or register to comment.

zalperst t1_iuu828x wrote

Lol, the solution to this will be different for everyone

3

Nike_Zoldyck OP t1_iuvdz7i wrote

>This was mostly an attempt to collect more info from people who might see their usual trick not mentioned there.

Yes, trust me I learnt that the hard way. I tried to include multiple scenarios and will keep updating it

1

[deleted] t1_iuvarxr wrote

[deleted]

2

mr_birrd t1_iuved4p wrote

yeah right that's not a bug it's an error simply

1

Ttttrrrroooowwww t1_iuvceuv wrote

Article which scratches the surface and has been discussed thousands of times without adding any value. Another piece of clutter on the internet.

2

Nike_Zoldyck OP t1_iuvdvsa wrote

Thanks for your insight. We're you even able to access the link? Turns out it was behind a membership thing. I updated the link url so it should be free now. I couldn't find any helpful solutions to my problem and had to try everything, until the last paragraph which finally solved it and I had to figure that out through trial and error. So instead of someone new opening 35 tabs the next time , I figured I'd consolidate everything I attempted into a post that I can keep editing if I come across anything more, or if someone decides to share anything useful about their experience with this issue, along with what sort of models they were running.

This was mostly an attempt to collect more info from people who might see their usual trick not mentioned there. I'm glad I could cover everything you already know

1