You mean like this? You just prepend something like "The following is a conversation with [a very intelligent AI | a human expert]" to the prompt. In image generation the trick is to add artist names to the prompt, "in the style of X and Y", sometimes called "style phrases" or "vitamin phrases".
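Both tricks are just string concatenation, something like this (the prefix wording and the artist names here are arbitrary examples, not anything special):

```python
# Hypothetical prompts; the prefix wording and artist names are arbitrary examples.
PREFIX = "The following is a conversation with a very intelligent AI.\n\n"
question = "Q: How do transformers handle long-range dependencies?\nA:"
text_prompt = PREFIX + question  # send this to the text model instead of the bare question

image_prompt = "a lighthouse at dusk"
styled_prompt = image_prompt + ", in the style of Greg Rutkowski and Alphonse Mucha"
```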
DALL-E 2 was tweaked in a similar way to be more diverse: when you asked for a photo of a CEO or some other profession, various race and gender keywords were silently added to the prompt. People were generally upset about having their prompts modified, but prepending a modifier by default might still be useful in some cases.
If you want to extract a specific style or ability from a model more precisely, you can fine-tune it on a small dataset, probably <1000 examples. This is easy to do through the cloud APIs, though not as easy as prompting.
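As a rough sketch of what the cloud route looks like, here is the OpenAI fine-tuning API as one example (the file name, its contents, and the base model are placeholders; other providers have similar endpoints):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# examples.jsonl: a few hundred chat-formatted examples in the target style, e.g.
# {"messages": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}]}
training_file = client.files.create(
    file=open("examples.jsonl", "rb"),
    purpose="fine-tune",
)

job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",  # placeholder base model
)
print(job.id)  # poll until it finishes, then call the returned fine-tuned model by name
```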
Not quite. I think there's value to this technique, but it's still constrained by what GPT predicts an AI would say, based on all the instances of similar text in the data it consumed, which is not quite the same thing.
There is also prompt tuning, which fine-tunes only a few token embeddings while keeping the model itself frozen. This changes the problem from finding that elusive prompt to collecting a few labeled examples and fine-tuning the prompt.
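Roughly, the mechanics look like this (a minimal PyTorch/HuggingFace sketch; the base model and number of virtual tokens are arbitrary, and the training loop is omitted):

```python
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")  # stand-in for any causal LM

# Freeze every model weight; only the soft prompt below gets gradients.
for p in model.parameters():
    p.requires_grad = False

num_virtual_tokens = 20
embed_dim = model.get_input_embeddings().embedding_dim
soft_prompt = torch.nn.Parameter(torch.randn(num_virtual_tokens, embed_dim) * 0.02)
optimizer = torch.optim.Adam([soft_prompt], lr=1e-3)

def forward_with_soft_prompt(input_ids):
    token_embeds = model.get_input_embeddings()(input_ids)           # (batch, seq, dim)
    prefix = soft_prompt.unsqueeze(0).expand(input_ids.size(0), -1, -1)
    inputs_embeds = torch.cat([prefix, token_embeds], dim=1)         # prepend learned "tokens"
    return model(inputs_embeds=inputs_embeds)

# Train as usual on the labeled examples, but only `soft_prompt` is updated;
# the language model itself stays frozen.
```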
Another approach is to use an LLM to generate prompts and filter them by evaluation. The same trick has been used to generate step-by-step reasoning traces for datasets that only have input-output pairs; you then train another model on the examples plus chain of thought for a big jump in accuracy.
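A sketch of the generate-and-filter part, STaR-style (the `complete()` helper, the answer format, and the sampling budget are placeholders, not a real API):

```python
# Generate chain-of-thought traces for (question, answer) pairs and keep only
# the ones whose final answer checks out, then fine-tune a second model on them.

def complete(prompt: str) -> str:
    raise NotImplementedError("call whatever LLM API you use here")

def extract_answer(text: str) -> str:
    # Assume the trace ends with a line like "Answer: 42".
    return text.rsplit("Answer:", 1)[-1].strip()

def build_cot_dataset(pairs, samples_per_question=4):
    kept = []
    for question, gold_answer in pairs:
        for _ in range(samples_per_question):
            trace = complete(f"{question}\nLet's think step by step.")
            if extract_answer(trace) == gold_answer:      # filter by evaluation
                kept.append({"input": question, "target": trace})
                break
    return kept  # validated reasoning traces become the new training set
```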
> fine-tuning on reasoning is critical for self-improvement
I would add that sometimes you can evaluate a result directly, for example when generating math or code. Then you can learn from the validated outputs of the network. Basically what AlphaZero used to reach superhuman level without supervision, but it requires a kind of simulator: a game engine, a Python interpreter, or a symbolic math engine.
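For code, the "simulator" can literally be the Python interpreter: sample candidate solutions, keep only the ones that pass the tests, and train on those. A rough sketch (the `complete()` helper and the assert-based test format are assumptions):

```python
# Keep only generated solutions that pass their tests; use them as new training data.

def complete(prompt: str) -> str:
    raise NotImplementedError("call the code model here")

def passes_tests(solution_code: str, test_code: str) -> bool:
    scope = {}
    try:
        exec(solution_code, scope)   # run the generated function definition
        exec(test_code, scope)       # run assert-based tests against it
        return True
    except Exception:
        return False

def collect_verified_solutions(problems, samples_per_problem=8):
    verified = []
    for prompt, tests in problems:   # tests: e.g. "assert add(2, 3) == 5"
        for _ in range(samples_per_problem):
            candidate = complete(prompt)
            if passes_tests(candidate, tests):
                verified.append({"prompt": prompt, "solution": candidate})
                break
    return verified  # next round of training data, no human labels needed
```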