Submitted by Fun_Country_4193 t3_za7d1t in MachineLearning
[removed]
Submitted by Fun_Country_4193 t3_za7d1t in MachineLearning
[removed]
No, it is a preset loss function.
How about the Val set?
val set is random batches from large overall dataset, but same source
I’d give it a shot, though I’d probably fail
sure, DM me
Do you use tensorflow or pytorch ?
pytorch
What kind of data is it?
all text data, consists of data from the pile and some other datasets, it's about 1TB total, but you can train on randomly pulled batches from the overall set (about 2GB), which works about as well as trying to train on the whole dataset
« Data from the pile »? Why don’t you organize a Kaggle challenge ?
good idea, thanks!
I just checked, and minimum cost is 50,000. I could probably do like 20k, but 50k is a lot.
Oh ok. I guess they have some costs on their end too. What did you mean by data from the pile? I’m happy to give it a shot if you think ~1 GB of data can be enough.
aloser t1_iyksem4 wrote
Do we get to choose the loss function?