cathie_burry
cathie_burry t1_jcgqwkn wrote
Reply to [P] nanoT5 - Inspired by Jonas Geiping's Cramming and Andrej Karpathy's nanoGPT, we fill the gap of a repository for pre-training T5-style "LLMs" under a limited budget in PyTorch by korec1234
How does it compare to current large language models in terms of efficacy etc.
cathie_burry t1_jechk0t wrote
Reply to [P] Introducing Vicuna: An open-source language model based on LLaMA 13B by Business-Lead2679
Llama is not to be used for commercial purposes, but can I use something like this to code up part of my business?