Jump to main content Jump to sidebar

Forums
Wiki

Log in
Sign up

/f/MachineLearning

[P] nanoT5 - Inspired by Jonas Geiping's Cramming and Andrej Karpathy's nanoGPT, we fill the gap of a repository for pre-training T5-style "LLMs" under a limited budget in PyTorch

Submitted by korec1234 t3_11t1857 on March 16, 2023 at 5:53 PM in MachineLearning

25 comments

258

Viewing a single comment thread. View all comments

cathie_burry t1_jcgqwkn wrote on March 16, 2023 at 6:36 PM

How does it compare to current large language models in terms of efficacy etc.

Permalink

3

0 points (+0, −0)

Short URL:

http://forum.junglegym.ai/120540

MachineLearning

t5_2r3gv

Created October 1, 2022
Subscribe via RSS

Toolbox

Bans
Moderation log

Running Postmill