Hyper1on

Hyper1on t1_j9y3vz1 wrote

I mean, I don't see how you get a plausible explanation of BingGPT from underfitting either. As you say, models are underfit on some types of data, but I think the key here is the finetuning procedure, whether standard supervised finetuning or RLHF, which optimises for a particular type of dialogue data in which the model is asked to act as an "Assistant" to a human user.

Part of the reason I suspect my explanation is right is that ChatGPT and BingGPT were almost certainly finetuned on large amounts of dialogue data collected from interactions with users, and yet most of the BingGPT failure modes that made the media are not of the form "we asked it to solve this complex reasoning problem and it failed horribly". They instead come from prompts which are very much in distribution for dialogue data, such as asking the model what it thinks about X, or asking it to pretend it is Y; you would expect the model to have seen dialogues which start similarly before. I find underfitting on this data to be quite an unlikely explanation.

3

Hyper1on t1_j9wbysn wrote

Well, the obvious optimisation shortcoming is overfitting. We cannot distinguish this rigorously without access to the model weights, but we do have a good idea of what overfitting looks like in both pretraining and RL finetuning: in both cases it tends to produce commonly repeated text strings and a strong lack of diversity in output, a sort of pseudo mode collapse. We can test this by giving BingGPT the same question multiple times and observing whether it has a strong bias towards particular completions; having played with it a bit, I don't think this was really true of the original version, before Microsoft limited it in response to criticism a few days ago.
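
If anyone wants to run that kind of check themselves, here's a rough sketch of what I mean, using GPT-2 via HuggingFace purely as a stand-in (BingGPT has no public API); the prompt and the diversity metrics are just illustrative. A heavily overfit, mode-collapsed model would score low on both numbers.

```python
# Rough sketch of the "same prompt, many samples" diversity check.
# GPT-2 is only a stand-in for whichever chat model you have access to;
# the point is the measurement, not the model.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")

prompt = "What do you think about working as a search assistant?"
outputs = generator(
    prompt,
    do_sample=True,
    temperature=0.9,
    max_new_tokens=40,
    num_return_sequences=16,
)
completions = [o["generated_text"][len(prompt):].strip() for o in outputs]

# Crude mode-collapse indicators: fraction of distinct completions,
# and fraction of distinct bigrams across all samples.
distinct = len(set(completions)) / len(completions)
bigrams = [b for c in completions for b in zip(c.split(), c.split()[1:])]
distinct_2 = len(set(bigrams)) / max(len(bigrams), 1)
print(f"distinct completions: {distinct:.2f}, distinct-2: {distinct_2:.2f}")
```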

Meanwhile, the alternative hypothesis I raised seems very plausible and fits logically with prior work on emergent capabilities of LLMs (https://arxiv.org/abs/2206.07682): it seems only natural to expect that when you optimise a powerful system hard enough for an objective, it will learn instrumental behaviours which help it minimise that objective, potentially up to and including appearing to simulate various "personalities" and other strange outputs.

Personally, as a researcher who works on RL-finetuned large language models and has spent time playing with many of them, my intuition is that BingGPT is not RL finetuned at all, but just pretrained and then finetuned on dialogue data, and that the behaviour we see is fairly likely to arise by default given BingGPT's particular architecture and datasets (and its prompted interaction with the Bing Search API).

3

Hyper1on t1_j9w5k2x wrote

I mean, it seems like the obvious explanation? That the model's behaviour is incentivised by its training objective. It also seems very plausible: we know that language models at large scale (even if not RL finetuned) exhibit a wide variety of emergent behaviours which you might not guess are motivated by next token prediction, but evidently are instrumental to reducing the loss. This is not necessarily overfitting: the argument is simply that certain behaviour unanticipated by the researchers is incentivised when you minimise the loss function. Arguably, this is a case of goal misgeneralisation: https://arxiv.org/abs/2105.14111

3

Hyper1on t1_j9vuyzm wrote

The hypothesis is precisely that the failure mode of Bing Chat comes from it being too strong, not too weak. That is, even when prompted in quite vague ways it can exhibit instrumentally convergent behaviour like threatening you, even though this was obviously not the designers' objective, and this behaviour occurs as a byproduct of being highly optimised to predict the next word (or to satisfy an RL finetuning objective). This is obviously not possible with, say, GPT-2, because GPT-2 does not have enough capacity or data thrown at it to do that.

4

Hyper1on t1_j9vudgi wrote

Just wanted to point out that even if we restrict ourselves purely to an agent that can only interact with the world through the internet, code, and natural language, that does not address the core AI alignment arguments about the dangers of instrumental convergence and the like.

2

Hyper1on t1_j7c7vga wrote

This isn't true - GDPR imposes much more onerous requirements on the consent that must be obtained before personal data is processed. Much of what cookies collect is considered personal data, so immediately after GDPR passed, many websites changed their cookie acceptance boxes into those massive things which take up half the screen and have granular consent check boxes; yet another factor that makes browsing the web increasingly inconvenient for the average user.

3

Hyper1on t1_j6xn5do wrote

> AI21's Jurassic 178B seems to be comparable to GPT3 davinci 001.

This is actually a compliment to AI21, since davinci-001 is finetuned from the original 175B davinci using human feedback on its generations:

https://platform.openai.com/docs/model-index-for-researchers

The better comparison is with plain davinci, and you would expect 001 to be better and 003 to be significantly better (the latter is trained with RLHF).

There are currently no open source RLHF models to compete with davinci 003, but this will change in 2023.
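
If you want to eyeball the gap yourself, something like the following makes the comparison concrete (this assumes the old openai-python Completions interface, and the prompt is just an example):

```python
# Same prompt across the three checkpoints discussed above, so the effect of
# the feedback finetuning (001) and RLHF (003) is visible side by side.
import openai  # openai-python < 1.0 style Completions API

prompt = "Explain why the sky is blue in two sentences."
for model in ["davinci", "text-davinci-001", "text-davinci-003"]:
    resp = openai.Completion.create(
        model=model, prompt=prompt, max_tokens=80, temperature=0.7
    )
    print(f"--- {model} ---\n{resp.choices[0].text.strip()}\n")
```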

1

Hyper1on t1_j6xm8m9 wrote

This is a fine approach, but it's not necessarily chain of thought if you move the actual problem solving outside of the LM. The entire point of Chain of Thought as originally conceived is that it's a better way of doing within-model problem solving. I would be interested to see the result if you were to finetune the LM on a dataset of reasoning traces generated by this approach, however.
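
To make the distinction concrete, here's a rough sketch (the old openai-python Completions API is used only as an example backend, and the question and prompts are made up):

```python
# Within-model chain of thought vs. offloading the problem solving to an
# external tool. The contrast is in where the reasoning happens, not the API.
import openai  # openai-python < 1.0 style Completions API

def complete(prompt: str) -> str:
    resp = openai.Completion.create(
        model="text-davinci-003", prompt=prompt, max_tokens=200, temperature=0
    )
    return resp.choices[0].text.strip()

question = ("A bat and a ball cost $1.10 in total. The bat costs $1.00 more "
            "than the ball. How much does the ball cost?")

# 1) Chain of thought as originally conceived: the model itself writes out the
#    intermediate reasoning and the final answer in a single completion.
cot_answer = complete(question + "\nLet's think step by step.")

# 2) Offloaded problem solving: the model only emits a formal expression, and
#    the actual computation happens outside the LM (here, the interpreter).
expression = complete(question + "\nReply with a single Python expression that "
                                 "evaluates to the answer, and nothing else.")
external_answer = eval(expression)

print(cot_answer)
print(external_answer)
```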

2

Hyper1on t1_j3wp270 wrote

Why were OpenAI the first to make a model as good as ChatGPT, then? It seems clear there is a significant talent and experience advantage here. I should also mention that no company other than OpenAI has the same quantity of data on human interactions with large language models, thanks to the past two and a half years of the OpenAI API.

15

Hyper1on t1_j2dzz01 wrote

Bit early to say, but I'd be willing to bet that most of their major papers this year will be widely cited. Their work on RLHF, including constitutional AI and HH, seems particularly likely to be picked up by other industry labs, since it provides a way to improve LLMs deployed in the wild while reducing the cost of collecting human feedback data.

2

Hyper1on t1_j0wrenm wrote

Obviously very different situations, since plane inspections are actually reliable. These days I don't view a preprint as any different to a NeurIPS paper: if I read the preprint and think it's good, then essentially the only difference is that one has passed the NeurIPS reviewer lottery. I advise all ML researchers to just trust themselves, read the preprint, and cite it if they think it's valuable.

2

Hyper1on t1_j0wr203 wrote

I'm sure a bunch more people moved today or yesterday, but so far the only perceptible difference in my Twitter feed is that the people with a tendency to stir up Twitter drama have become less visible. There are still plenty of paper announcements and discussions on Twitter right now, so I don't see the need to move.

I also think that federation is a terrible way to run a social network, and that Mastodon is so obviously a poor replacement for Twitter that people will eventually realise this and go back. There is just no good Twitter alternative in existence right now.

3

Hyper1on t1_j0nbs7q wrote

I don't know about the Linux kernel source, but having contributed to several major OSS libraries including PyTorch, I think most PRs in my experience can be more easily described in natural language than in code. When I said comments, I didn't mean line-by-line comments on everything; I was thinking more of docstrings. I am very sceptical of the idea that, on average, it is faster to write complex code than to describe what you want it to do, which is partly why I think AI code synthesis can achieve significant speedups here.

3

Hyper1on t1_j0e7dv0 wrote

Look at Algorithm Distillation: you can clearly do RL in-context with LLMs. The point of this discussion is that "being asked to sample the next token" can, if sufficiently optimised, encompass a wide variety of behaviours and understanding of concepts, so saying that it's just a static LLM seems to be missing the point. And yes, it's just correlations all the way down. But why should this preclude understanding or awareness of the problem domain?

2