As the title says, I'm curious about using open source models like GPT-J, GPT-NeoX, Bloom, or OPT to compete with ChatGPT for *specific use-cases* such as explaining what a bit of code does. ChatGPT does this task quite well, but it's closed-source nature prevents it from being useful in documenting or commenting proprietary code. There's also limitations such as the amount of text ChatGPT will read or respond with.

Getting beyond these limitations is something I'm interested in pursuing, perhaps with the help of somewhere in this subreddit. Some assumptions you can safely make:

We can get (lots of) funding for the training, hardware, etc...
The end product should be on-premises
The inference does not actually need to run very quickly. If it costs millions to buy enough GPUs just due to VRAM limitations, we could simply run on CPUs and utilize ram, as long as inference could be done a few times per day.

So I guess my questions are where would we start? What model is best to fine-tune? How would you specifically fine-tune to improve specific use cases?

Comments

avocadoughnut t1_j4m12v2 wrote on January 16, 2023 at 5:51 PM

#1,377,677

There's currently a project in progress called OpenAssistant. It's being organized by Yannic Kilcher and some LAION members, to my understanding. Their current goal is to develop interfaces to gather data, and then train a model using RLHF. You can find a ton of discussion in the LAION discord. There's a channel for this project.

Acceptable-Cress-374 t1_j4m7mee wrote on January 16, 2023 at 6:32 PM

#1,378,021

Replying to avocadoughnut (#1,377,677)

> Their current goal is to develop interfaces to gather data, and then train a model using RLHF

Potentially naive question, as I don't have much experience with LLMs. Has anyone tried using existing SotA (paid) models like davinci / gpt3 instead of RLHF? They seem to be pretty good at a bunch of focused tasks, especially in few-shot. Does that make sense?

Zondartul t1_j4mb6rm wrote on January 16, 2023 at 6:53 PM

#1,378,216

Replying to Acceptable-Cress-374 (#1,378,021)

So using a big network to teach a small network? That's a thing people do. See teacher-student learning, and distillation.

avocadoughnut t1_j4mci2y wrote on January 16, 2023 at 7:01 PM

#1,378,282

Replying to Acceptable-Cress-374 (#1,378,021)

ChatGPT is GPT3 + instructional finetuning + RLHF for alignment. If you're talking about using those models ro gather training data, that's against OpenAI TOS, so I've heard. The goal is to make something that isn't closed source, something you can run yourself.

LetGoAndBeReal t1_j4mihya wrote on January 16, 2023 at 7:39 PM

#1,378,621

Replying to avocadoughnut (#1,377,677)

I looked through their repo, but I'm not understanding something: what is the foundational model that they plan to use and where/how will the model be run?

avocadoughnut t1_j4n5sp8 wrote on January 16, 2023 at 10:05 PM

#1,379,783

Replying to LetGoAndBeReal (#1,378,621)

From what I've heard, they want a model small enough to run on consumer hardware. I don't think that's currently possible (probably not enough knowledge capacity). But I haven't heard that a decision has been made on this end. The most important part of the project at the moment is crowdsourcing good data.

LetGoAndBeReal t1_j4n6rfa wrote on January 16, 2023 at 10:11 PM

#1,379,837

Replying to avocadoughnut (#1,379,783)

Wow, that seems awfully ambitious given that GPT3.5 requires something like 700GB of RAM and the apparent unlikeliness that SoTA model sizes will get smaller anytime soon. Interesting project to watch, though.

avocadoughnut t1_j4n8bp2 wrote on January 16, 2023 at 10:21 PM

#1,379,923

Replying to LetGoAndBeReal (#1,379,837)

Well, there are projects like WebGPT (by OpenAI) that make use of external knowledge sources. I personally think that's the future of these models: moderated databases of documents. The knowledge is much more interpretable and modifiable that way.

Ham05 t1_j4noofg wrote on January 17, 2023 at 12:12 AM

#1,380,835

If the goal is for a commercial endeavor I suggest bringing on an ML-specialized shop. Good ones can knock this out in a few sprints. PM me if you need more info.

MegavirusOfDoom t1_j4oelbd wrote on January 17, 2023 at 3:16 AM

#1,382,310

Replying to LetGoAndBeReal (#1,379,837)

less than 500MB is used for code learning, 690GB is used for culture, geography, history, fiction and non-fiction... 2GB for cats, 2GB bread, horses, dogs, Cheese, Wine, Italy, France, Politics, Television, Music, Japan, Africa. less than 1% of the training is on science and technology, i.e. 300MB is biology, 200MB chemistry, 100MB physics, 400MB maths...

sad_dad_is_a_mad_lad t1_j4ohl7t wrote on January 17, 2023 at 3:38 AM

#1,382,485

Replying to avocadoughnut (#1,378,282)

I don't think there are any laws that protect their data in this way, except perhaps contract law because they have a hidden ToS that you have to accept to use their service. As long as you use it for free though, I'm not sure there is consideration, and well... I don't know how they would go about proving misuse or damages.

Certainly it would not be copyright law, given that GPT3 itself was trained on copyrighted data...

T1METR4VEL t1_j4ol7zx wrote on January 17, 2023 at 4:05 AM

#1,382,670

Replying to Ham05 (#1,380,835)

I sent you at chat

yahma t1_j4owot0 wrote on January 17, 2023 at 5:48 AM

#1,383,208

Replying to MegavirusOfDoom (#1,382,310)

This may be the size of the datasets, but i it's hard to say how many parameters will be needed for a good llm that's just really good at explaining code.

TheTerrasque t1_j4p9kk7 wrote on January 17, 2023 at 8:19 AM

#1,383,766

There is a project called Petals that have BLOOM running for everyone to use. It's distributing the model over many machines and thus allows it to run on consumer hardware. There is a PoC chat at http://chat.petals.ml/

They just converted BLOOMZ and is currently setting up a network for that. That should be more suited for a chat interface. There's still missing gpu's though, so would be great with some more servers if people got some spare compute.

Acceptable-Cress-374 t1_j4pacws wrote on January 17, 2023 at 8:30 AM

#1,383,810

Replying to Zondartul (#1,378,216)

> See teacher-student learning, and distillation.

Thanks, I'll check it out.

theaimlguy t1_j4pceky wrote on January 17, 2023 at 8:58 AM

#1,383,901

If it was possible to try distillation on ChatGPT to produce smaller models which could run on mobile hardware, it would have been great!

MegavirusOfDoom t1_j4pfdi1 wrote on January 17, 2023 at 9:40 AM

#1,384,032

Replying to yahma (#1,383,208)

Then we'd have to crawl all of stack exchange, all of wiki, and 1 terabyte of programming books... This "generalist NLP" is for article writing, for poetry.

I'm a big fan of teaching ChatGPT how to interpret graphs, the origin lines, to record in a vector engine that is couple with the NLP. For a coding engine, I believe NLP should be paired with a compiler, just like a maths specialized NLP should also have a mathlab type engine.

throwaway2676 t1_j4q8zuh wrote on January 17, 2023 at 2:44 PM

#1,385,671

Replying to LetGoAndBeReal (#1,379,837)

Well, can you just run it from an SSD, but more slowly?

[D] Fine-tuning open source models on specific tasks to compete with ChatGPT?

Comments

avocadoughnut t1_j4m12v2 wrote on January 16, 2023 at 5:51 PM

Acceptable-Cress-374 t1_j4m7mee wrote on January 16, 2023 at 6:32 PM

Zondartul t1_j4mb6rm wrote on January 16, 2023 at 6:53 PM

avocadoughnut t1_j4mci2y wrote on January 16, 2023 at 7:01 PM

LetGoAndBeReal t1_j4mihya wrote on January 16, 2023 at 7:39 PM

thomasdarimont t1_j4mp02f wrote on January 16, 2023 at 8:20 PM

avocadoughnut t1_j4n5sp8 wrote on January 16, 2023 at 10:05 PM

LetGoAndBeReal t1_j4n6rfa wrote on January 16, 2023 at 10:11 PM

avocadoughnut t1_j4n8bp2 wrote on January 16, 2023 at 10:21 PM

Ham05 t1_j4noofg wrote on January 17, 2023 at 12:12 AM

MegavirusOfDoom t1_j4oelbd wrote on January 17, 2023 at 3:16 AM

sad_dad_is_a_mad_lad t1_j4ohl7t wrote on January 17, 2023 at 3:38 AM

T1METR4VEL t1_j4ol7zx wrote on January 17, 2023 at 4:05 AM

yahma t1_j4owot0 wrote on January 17, 2023 at 5:48 AM

TheTerrasque t1_j4p9kk7 wrote on January 17, 2023 at 8:19 AM

Acceptable-Cress-374 t1_j4pacws wrote on January 17, 2023 at 8:30 AM

theaimlguy t1_j4pceky wrote on January 17, 2023 at 8:58 AM

MegavirusOfDoom t1_j4pfdi1 wrote on January 17, 2023 at 9:40 AM

throwaway2676 t1_j4q8zuh wrote on January 17, 2023 at 2:44 PM