[P] nanoT5 - Inspired by Jonas Geiping's Cramming and Andrej Karpathy's nanoGPT, we fill the gap of a repository for pre-training T5-style "LLMs" under a limited budget in PyTorch
Submitted by korec1234 on March 16, 2023 at 5:53 PM in MachineLearning (25 comments)

cathie_burry wrote on March 16, 2023 at 6:36 PM:
How does it compare to current large language models in terms of efficacy, etc.?