Necessary-Meringue-1 t1_jdgwjo8 wrote
Reply to comment by Maleficent_Refuse_11 in [D] "Sparks of Artificial General Intelligence: Early experiments with GPT-4" contained unredacted comments by QQII
That's true, but the outputs it produces are eerily persuasive. I'm firmly in the "LLMS are impressive but not AGI" camp. Still, the way it used Java to draw a picture in the style of Kandinsky blew me away. Obviously, a text2image model would be able to do that. But here they prompted GPT-4 to generate code that would generate a picture in a specific style. Which requires an extra level of abstraction and I can't really understand how that came about given that you would not expect a task like this in the training data. (page 12 for reference: https://arxiv.org/pdf/2303.12712.pdf)
I agree that a transformer really should not be considered "intelligent" or AGI, but LLMs really have an uncanny ability to generate output that looks "intelligent". Granted, that's what we built them to do, but still.
Username2upTo20chars t1_jdiesqt wrote
>But here they prompted GPT-4 to generate code that would generate a picture in a specific style.
5 seconds of googling "code which generates random images in the style of the painter Kandinsky":
http://www.cad.zju.edu.cn/home/jhyu/Papers/LeonardoKandinsky.pdf
https://github.com/henrywoody/kandinsky-bot
GPT's trained on the whole of the WWW sensible text are just sophisticated echo/recombination chambers. True, it works far better than most would have predicted, but that doesn't change the way they work. I am also impressed, but GPT-3 got known for parroting content, why should the next generation be fundamentally different? It just gets harder and harder to verify.
Nevertheless I even expect such generative models to be good enough to become very general. Most human work isn't doing novel things either. Just copying up to smart recombination.
inglandation t1_jdij4o8 wrote
> why should the next generation be fundamentally different?
Emergent abilities from scale are the reason. There are many examples of that in nature and many fields of study. The patterns of snowflakes cannot easily be explained by the fundamental properties of water. You need enough water molecules in the right conditions to create the patterns of snowflakes. I suspect that a similar phenomenon is happening with LLMs, but we haven't figured out yet what the patterns are and what are the right conditions for them to materialize.
Username2upTo20chars t1_jdj8a6k wrote
I don't disagree with the phenomena of emergence, it's just that it doesn't explain anything. It is one word for "I have no idea how it works" or better: its magic. The issue I have with that is that you are quick to hide behind that word, using it as an explanation, accepted as the emergence has become.
But in fact you can't model one bit with it, it has no predictive power and it kind of shuts down discussions.
So far I haven't seen any evidence (have you?) that LLMs aren't doing anything else but predicting the next token. Yes there are certain thresholds, where they do overcome the one or other weakness. But in the end they just predict the next token better ... and even better. Impressive what you can do with that (chinese room like), but that doesn't imply that GPT4 is any different than GPT3.5, it's just better.
But as I wrote, you can in theory replace most non-manual work with that somewhere down the line anyway. But no GPT will develop you some ground-breaking Deep Learning architecture or solve important physics problems which need actual thought and not just more compute or...
Not that you claimed that - I do here -, but should GPT-7 or so suddenly do that, then you can hold me to it.
inglandation t1_jdjvmqe wrote
> you can't model one bit with it, it has no predictive power and it kind of shuts down discussions.
For now yes, my statement is not very helpful. But this is a phenomenon that happens in other fields. In physics, waves or snowflakes are an emergent phenomenon, but you can still model them pretty well and make useful predictions about them. Life is another example. We understand life pretty well (yes there are aspects that we don't understand), but it's not clear how we go from organic compounds to living creatures. Put those molecules together in the right amount and in the right conditions for a long time, and they start developing the structures of life. How? We don't know yet, but it doesn't stop us from understanding life and describing it pretty well.
Here we don't really know what we're looking at yet, so it's more difficult. We should figure out what the structures emerging from the training are.
I don't disagree that LLMs "just" predict the next token, but there is an internal structure that will pick the right word that is not trivial. This structure is emergent. My hypothesis here is that understanding this structure will allow us to understand how the AI "thinks". It might also shed some light on how we think, as the human brain probably does something similar (but maybe not very similar). I'm not making any definitive statement, I don't think anyone can. But I don't think we can conclude that the model doesn't understand what it is doing based on the fact that it predicts the next token.
I think that the next decades will be about precisely describing what cognition/intelligence is, and in what conditions exactly it can appear.
CheeseDon t1_jdh83jk wrote
lots of humans too have an uncanny ability to generate output that looks intelligent. most if not all do.
Maleficent_Refuse_11 t1_jdh87r4 wrote
Let's agree to "some".
ShadoWolf t1_jdipal4 wrote
Some of the capabilities of ChatGPT4 .. are spooky. I mean GPT-4 hired someone off of TaskRabbit to solve a Captcha for it in test phases (https://cdn.openai.com/papers/gpt-4.pdf) . I don't think it's at AGI .. but it sort of feels like Where ever we are on the S curve for this technology. we finally on the same continent for getting to AGI
And some of the stuff people are getting to to do using LangChain with ChatGPT is crazy
cyborgsnowflake t1_jdl47n8 wrote
I think the simpler answer is its easier than some people believed to reproduce certain knowledge tasks statistically than the alternative theory that shuffling tensors creates living thinking beings like everyone else on this thread seems to be jumping on board.
Necessary-Meringue-1 t1_jdlezco wrote
Yeah, I think you're on the money there. It's very hard for us to not anthropomorphize this behavior, especially because we literally used RLHF in order to make it more human-like.
mudman13 t1_jdgyr91 wrote
>but LLMs really have an uncanny ability to generate output that looks "intelligent".
Like many of these internet gurus and intellectuals e.g. Eric Weinstein
Viewing a single comment thread. View all comments