[P] nanoT5 - Inspired by Jonas Geiping's Cramming and Andrej Karpathy's nanoGPT, we fill the gap of a repository for pre-training T5-style "LLMs" under a limited budget in PyTorch

Submitted by korec1234 (t3_11t1857) on March 16, 2023 at 5:53 PM in MachineLearning, 25 comments, 258