gelukuMLG
gelukuMLG t1_j9vfnmg wrote
OPT but better lol. Also will it still require to request access to use it?
gelukuMLG t1_j9lfp3j wrote
Reply to comment by dwarfarchist9001 in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
You mean like a LoRA?
gelukuMLG t1_j9kxnj4 wrote
Reply to comment by dwarfarchist9001 in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
But higher parameters allow for broader knowledge right? You can't have a 6-20B model have broad knowledge as a 100B+ model, right?
gelukuMLG t1_j9kftza wrote
Reply to comment by turnip_burrito in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
does that prove that parameters aren't everything?
gelukuMLG t1_j6cy6wq wrote
Reply to comment by Yodawgweheardyou in When will you talk more to A.I. than to other humans? by Terminator857
What do you mean in about 5 days and why?
gelukuMLG t1_j4s074b wrote
Reply to comment by AsheyDS in Perhaps ChatGPT is a step back? by PaperCruncher
EleutherAi is a company that released a lot of open source models, which are the neo models and pythia ones and some are quite big 20-60B parameters. The models are downloadable from huggingface.
gelukuMLG t1_j23znll wrote
Reply to comment by EthansWay007 in [D] When chatGPT stops being free: Run SOTA LLM in cloud by _underlines_
I think it saves the highly rated responses and feeds it into a dataset then it uses reinforcement learning by giving a positive reward to them.
gelukuMLG t1_j1dcpp1 wrote
Reply to comment by alexiuss in Confining infinity into a cardboard box, aka the unsolvable problem of current gpt3 chatbot generation by alexiuss
you mean closed ai?
gelukuMLG t1_j1ahq9x wrote
Reply to Confining infinity into a cardboard box, aka the unsolvable problem of current gpt3 chatbot generation by alexiuss
Idk if people are aware but stability ai is working on an open source model that will compete with open ai gpt3.
gelukuMLG t1_ja1zm8l wrote
Reply to comment by Ok-Ability-OP in Meta unveils a new large language model that can run on a single GPU by AylaDoesntLikeYou
The LM's side is starting to catch up to image generation models, and soon voice generation/synthesis will follow.