Blog: https://nanothoughts.substack.com/p/reflecting-on-reflexion

Github: https://github.com/noahshinn024/reflexion-human-eval

Twitter: https://twitter.com/johnjnay/status/1639362071807549446?s=20

Abstract:

>Recent advancements in decision-making large language model (LLM) agents have demonstrated impressive performance across various benchmarks. However, these state-of-the-art approaches typically necessitate internal model fine-tuning, external model fine-tuning, or policy optimization over a defined state space. Implementing these methods can prove challenging due to the scarcity of high-quality training data or the lack of well-defined state space. Moreover, these agents do not possess certain qualities inherent to human decision-making processes, specifically the ability to learn from mistakes. Self-reflection allows humans to efficiently solve novel problems through a process of trial and error. Building on recent research, we propose Reflexion, an approach that endows an agent with dynamic memory and self-reflection capabilities to enhance its existing reasoning trace and task-specific action choice abilities. To achieve full automation, we introduce a straightforward yet effective heuristic that enables the agent to pinpoint hallucination instances, avoid repetition in action sequences, and, in some environments, construct an internal memory map of the given environment. To assess our approach, we evaluate the agent's ability to complete decision-making tasks in AlfWorld environments and knowledge-intensive, search-based question-and-answer tasks in HotPotQA environments. We observe success rates of 97% and 51%, respectively, and provide a discussion on the emergent property of self-reflection.

https://preview.redd.it/4myf8xso9spa1.png?width=1600&format=png&auto=webp&v=enabled&s=867a16e1114108053d08d4cdf41485c8b29a132c

https://preview.redd.it/bzupwyso9spa1.png?width=1600&format=png&auto=webp&v=enabled&s=95cacfe6b99756e7eed9ec8c40784f8c4cb94cee

https://preview.redd.it/009352to9spa1.jpg?width=1185&format=pjpg&auto=webp&v=enabled&s=5ccc52597d6e001c2ba754fc5f05afd1df09cd63

https://preview.redd.it/ef9ykzso9spa1.jpg?width=1074&format=pjpg&auto=webp&v=enabled&s=2701778aa5a9f3e80f683a1e3d0eaf0160928f54

Comments

You must log in or register to comment.

3deal t1_jdkiao9 wrote on March 25, 2023 at 1:25 AM

#2,346,477

AI is growing faster than our capacity to adapt. We are doomed

Nyanraltotlapun t1_jdkkc6q wrote on March 25, 2023 at 1:41 AM

#2,346,653

Replying to 3deal (#2,346,477)

There is no way for humans to adapt for alien intelligence. The idea of developing general AI is insanely horrifying from the beginning.

3deal t1_jdkmcrb wrote on March 25, 2023 at 1:58 AM

#2,346,882

Replying to Nyanraltotlapun (#2,346,653)

We all know the issue, and we still running on the way.

[deleted] t1_jdknpzb wrote on March 25, 2023 at 2:09 AM

#2,347,024

[removed]

RealSonZoo t1_jdkoq5c wrote on March 25, 2023 at 2:17 AM

#2,347,113

Question, maybe dumb - how are they comparing results to GPT-4, which isn't released yet, and I think is mostly closed source?

t0slink t1_jdkq5c1 wrote on March 25, 2023 at 2:29 AM

#2,347,252

Replying to 3deal (#2,346,882)

Nah, full speed ahead please. With enough development, a cure for cancer, aging, and all manner of devastating human ailments could happen in this decade.

It is senseless to cut off a pathway that could literally save and improve tens of billions of lives over the next few decades because you're scared it can't be done correctly.

metalman123 t1_jdkqd75 wrote on March 25, 2023 at 2:31 AM

#2,347,273

Replying to RealSonZoo (#2,347,113)

Gpt 4 is released......

RealSonZoo t1_jdkqjld wrote on March 25, 2023 at 2:33 AM

#2,347,295

Replying to metalman123 (#2,347,273)

Oh so if I go to the ChatGPT website and start talking with it, that's GPT-4?

tysam_and_co t1_jdkqv3e wrote on March 25, 2023 at 2:35 AM

#2,347,327

Replying to RealSonZoo (#2,347,113)

I would presume that it's a bolt-on external method that utilizes a pretrained model with its own inputs as a dynamically-generated information sieve of sorts. Of course, the inductive prior is encoded in the Reflexion algorithm itself so we are bringing some new information to the table here (not that GPT4+ couldn't somehow do this itself someday, either).

metalman123 t1_jdkqv8i wrote on March 25, 2023 at 2:35 AM

#2,347,328

Replying to RealSonZoo (#2,347,295)

What rock have you been under?

The paid version has gpt 4 access. People have access to the gpt 4 api.

This is old information

addition t1_jdkrd3s wrote on March 25, 2023 at 2:40 AM

#2,347,380

Replying to RealSonZoo (#2,347,295)

You need chatgpt plus to use 4 at the moment

meregizzardavowal t1_jdksro1 wrote on March 25, 2023 at 2:52 AM

#2,347,509

Replying to t0slink (#2,347,252)

I don’t know if people are as much saying we should cut off the pathway because they are scared. What I’m hearing is they think we ought to spend more effort on ensuring it’s safe, because a Pandora’s box moment may come up quickly.

addition t1_jdkssmg wrote on March 25, 2023 at 2:52 AM

#2,347,511

Wow! I was just thinking the other day, now that we have very advanced statistical models of the world the next step is some search algorithm + feedback loop. In other words, a way for the model to use its statistical understanding of the world to guide a search towards a solution while also updating itself along the way. This feels like an important step. Or at least the idea is the first step in this direction.

sweatierorc t1_jdkt9uq wrote on March 25, 2023 at 2:56 AM

#2,347,545

Replying to t0slink (#2,347,252)

A cure for cancer and aging in this decade. AI has gotten really good, but let's not get carried away.

t0slink t1_jdkufvf wrote on March 25, 2023 at 3:06 AM

#2,347,648

Replying to sweatierorc (#2,347,545)

> AI has gotten really good, but let’s not get carried away.

People were saying the same thing five years ago about the generative AI developments we've seen this year.

ertgbnm t1_jdkv8rw wrote on March 25, 2023 at 3:13 AM

#2,347,726

Umm wow! I recommend backing up this GitHub before it gets taken down for "safety"

learn-deeply t1_jdl1bmp wrote on March 25, 2023 at 4:09 AM

#2,348,266

Anyone else tired of papers that obscure a simple concept with endless paragraphs of verbose gibberish? This 17 page could be a few sentences.

Tl;DR the authors wrote prompts to tell GPT-4 to fix code given some unit tests and the output of the broken code. It performs better than GPT-4 that doesn't have access to the output of the code execution.

https://github.com/noahshinn024/reflexion-human-eval/blob/main/reflexion.py#L7-L12

throwaway957280 t1_jdl2cq3 wrote on March 25, 2023 at 4:19 AM

#2,348,365

Replying to RealSonZoo (#2,347,295)

If you pay for ChatGPT plus and manually select the new model, yes. By default, no.

_Arsenie_Boca_ t1_jdlc2ah wrote on March 25, 2023 at 6:11 AM

#2,349,292

Replying to learn-deeply (#2,348,266)

Thanks! If that is really the TL;DR, I have never seen an abstract that beats about the bush so much

brucebay t1_jdlc3ix wrote on March 25, 2023 at 6:12 AM

#2,349,294

Replying to Nyanraltotlapun (#2,346,653)

This is not an alien intelligence yet. We understand how it works how it thinks. But eventually this version can generate an AI that is harder for us to understand, and that version can generate another ai. At some point it will become alien to us because we may not understand the math behind jt,

greenskinmarch t1_jdlc952 wrote on March 25, 2023 at 6:14 AM

#2,349,309

Replying to t0slink (#2,347,252)

I just want humans to stop dying of cancer!

Monkey's paw curls. The humans all die of being shot by drones instead

sweatierorc t1_jdlcwkm wrote on March 25, 2023 at 6:23 AM

#2,349,366

Replying to t0slink (#2,347,648)

True, but with AI more computing power/data means better models. With medicine, things move slower. If we get a cure for one or two cancer this decade, it would be a massive achievement.

nekize t1_jdldodi wrote on March 25, 2023 at 6:33 AM

#2,349,424

Replying to learn-deeply (#2,348,266)

Sadly that is what academia came to. I am doing my phd and 80% od my papers is just padding. And if you don t follow the “template” you can t publish anything

SmLnine t1_jdlgtl8 wrote on March 25, 2023 at 7:17 AM

#2,349,647

Replying to sweatierorc (#2,347,545)

If an intelligence explosion happens, there's really no telling what's possible. Maybe these problems are trivial to a 1 million IQ machine, maybe not. The only question really is if the explosion will happen. Two years ago I would have said 1% in the next ten years, now I'm up to 10%. Maybe in two more years it'll look like 30%.

AI-Pon3 t1_jdlgw1x wrote on March 25, 2023 at 7:18 AM

#2,349,653

Interesting methodology/technology. I realize it's GPT-4+ a refining process but even so, 88% is ~64% fewer errors than 67%, which proves it's a powerful technique even when the underlying model is already fairly capable.

t0slink t1_jdlhf3s wrote on March 25, 2023 at 7:26 AM

#2,349,690

Replying to meregizzardavowal (#2,347,509)

I wish you were right, but people are calling for investment in AGI to cease altogether:

> There is no way for humans to adapt for alien intelligence. The idea of developing general AI is insanely horrifying from the beginning.

One of the parent comments.

Such absolutist comments leave no room whatsoever for venturing into AGI.

sweatierorc t1_jdlhgay wrote on March 25, 2023 at 7:26 AM

#2,349,693

Replying to SmLnine (#2,349,647)

IMHO, I think that cancer and aging are necessary for complex organism. It is more likely that we solve cloning or build the first in vitro womb, than we are at deafeating cancer or aging.

t0slink t1_jdlhje0 wrote on March 25, 2023 at 7:28 AM

#2,349,699

Replying to greenskinmarch (#2,349,309)

Thanks Obama

Cherubin0 t1_jdlif7r wrote on March 25, 2023 at 7:41 AM

#2,349,772

Wow so we can hook it up with cargo --check and it will generate perfect Rust code.

comfytoday t1_jdljrdg wrote on March 25, 2023 at 8:01 AM

#2,349,887

Replying to 3deal (#2,346,882)

I'm a little surprised at the seeming lack of any backlash, tbh. I'm sure it's coming though.

artsybashev t1_jdlml1f wrote on March 25, 2023 at 8:44 AM

#2,350,120

Replying to nekize (#2,349,424)

Sounds like we need a LLM to generate padding for the academia and LLM to write the tldr for the readers. World is dumb.

Deep-Station-1746 t1_jdlmrh5 wrote on March 25, 2023 at 8:46 AM

#2,350,134

Replying to learn-deeply (#2,348,266)

This is actually a very good PR material, as it will save engineers' time. Just opened it and referenced your comment. https://github.com/noahshinn024/reflexion-human-eval/pull/1

MINECRAFT_BIOLOGIST t1_jdlmzvv wrote on March 25, 2023 at 8:50 AM

#2,350,154

Replying to sweatierorc (#2,349,693)

Well cloning and artificial wombs are basically done or very close, we just haven't applied it to humans due to ethical reasons. Six years ago there was already a very premature lamb kept alive in an artificial womb for four weeks.

As for cancer and aging...it seems increasingly clear that part of the process is just that genes necessary for development get dysregulated later on in life. I think the fact that we can rejuvenate our own cells by making sperm and eggs points to the fact that the dysregulation should be fixable, and recent advances in aging research seem to show that this is true. The issue is, of course, pushing that process too far and ending up with cells dedifferentiating or becoming cancerous, but I think it's possible if we're careful.

nonotan t1_jdln1d9 wrote on March 25, 2023 at 8:51 AM

#2,350,158

Replying to sweatierorc (#2,349,693)

We already know of complex organisms that essentially don't age, and also others that are cancer-free or close to it. In any case, "prevent any and all aging and cancer before it happens" is a stupid goalpost. "Be able to quickly and affordably detect, identify and treat arbitrary strains of cancer and/or symptoms of aging" is essentially "just as good", and frankly seems like it could well already be within the reach of current models if they had the adequate "bioengineering I/O" infrastructure, and fast & accurate bioengineering simulations to train on.

ML could plausibly help in getting those online sooner, but unless you take the philosophical stance that "if we just made AGI they'd be able to solve every problem we have, so everything is effectively an ML problem", it doesn't seem like it'd be fair to say the bottlenecks to solving either of those are even related to ML in the first place. It's essentially all a matter of bioengineering coming up with the tools required.

Fal_the_commentator t1_jdlo48r wrote on March 25, 2023 at 9:07 AM

#2,350,245

Replying to nekize (#2,349,424)

Good papers don't need to do that. If papers are self contained, no need for gibberish.

From my experience, it comes from when the paper is not planned before being written, or when results/methodology is either not refined or not interesting enough.

Spud_M314 t1_jdlp71e wrote on March 25, 2023 at 9:23 AM

#2,350,340

Replying to Nyanraltotlapun (#2,346,653)

Genetically alter the human brain to make more neocortical neurons and glia... That make brain more brainy, more gray matter, more smart stuff... A biological (human) superintelligence is more likely...

gmork_13 t1_jdlpq90 wrote on March 25, 2023 at 9:31 AM

#2,350,373

Replying to learn-deeply (#2,348,266)

Sometimes I feel like a toddler for doing it, but I always scroll to the images first and for most papers that’s the TLDR.

Normal_Antelope_2556 t1_jdlqc42 wrote on March 25, 2023 at 9:40 AM

#2,350,415

Replying to nekize (#2,349,424)

as a person who inspires to go into research in this field,how bad is it? Can people even do their own research?

theotherquantumjim t1_jdlre84 wrote on March 25, 2023 at 9:55 AM

#2,350,491

Replying to greenskinmarch (#2,349,309)

No! Not like that!

nekize t1_jdlrqnt wrote on March 25, 2023 at 10:00 AM

#2,350,513

Replying to Normal_Antelope_2556 (#2,350,415)

Of course you can. Depending in which group you end up, there is a lot of cool stuff being done outside of NLP and Computer vision (if you consider these two “solved”).

maskedpaki t1_jdlu3k1 wrote on March 25, 2023 at 10:34 AM

#2,350,704

Replying to nekize (#2,349,424)

well at least you can use gpt4 for padding now.

[deleted] t1_jdlurrw wrote on March 25, 2023 at 10:43 AM

#2,350,753

Replying to Deep-Station-1746 (#2,350,134)

[deleted]

Puzzleheaded_Acadia1 t1_jdlw72w wrote on March 25, 2023 at 11:02 AM

#2,350,855

Can someone explain to me what this paper is about

SmLnine t1_jdlwhtu wrote on March 25, 2023 at 11:06 AM

#2,350,876

Replying to nonotan (#2,350,158)

>but unless you take the philosophical stance that "if we just made AGI they'd be able to solve every problem we have, so everything is effectively an ML problem", it doesn't seem like it'd be fair to say the bottlenecks to solving either of those are even related to ML in the first place. It's essentially all a matter of bioengineering coming up with the tools required.

We're currently using our brains (a general problem solver) to build bioengineering tools that can cheaply and easily edit the DNA of a living organism. 30 years ago this would have sounded like magic. But there's no magic here. This potential tool has always existed, we just didn't understand it.

It's possible that there are other tools in the table that we simply don't understand yet. Maybe what we've been doing the last 60 years is the bioengineering equivalent of bashing rocks together. Or maybe it's close to optimal. We don't know, and we can't know until we aim an intellectual superpower at it.

SmLnine t1_jdlxego wrote on March 25, 2023 at 11:17 AM

#2,350,973

Replying to sweatierorc (#2,349,693)

There are complex mammals that effectively don't get cancer, and there are less complex animals and organisms that effectively don't age. So I'm curious what your opinion is based on.

MarmonRzohr t1_jdlyfub wrote on March 25, 2023 at 11:30 AM

#2,351,091

Replying to MINECRAFT_BIOLOGIST (#2,350,154)

>artificial wombs are basically done or very close

Bruh... put down the hopium pipe. There's a bit more work to be done there - especially if you think "artifical womb" as in from conception to term, not artifical womb as in device intended from prematurely born babies.

The second one was what was demonstrated with the lamb.

MINECRAFT_BIOLOGIST t1_jdlz2nr wrote on March 25, 2023 at 11:37 AM

#2,351,151

Replying to MarmonRzohr (#2,351,091)

Hmm, perhaps I was being a bit hyperbolic, but check this out (from 2021):

https://www.science.org/content/article/mouse-embryos-grown-bottles-form-organs-and-limbs

Nyanraltotlapun t1_jdm0r15 wrote on March 25, 2023 at 11:56 AM

#2,351,308

Replying to brucebay (#2,349,294)

>This is not an alien intelligence yet. We understand how it works how it thinks.

Its alien not because we don't understand It, but because It is not protein life form. It have nothing common with humans, It does not feel hunger, does not need sex, does not feel love or pain. It is metal plastic and silicone. It is something completely nonhuman that can think and reason. It is the true horror, wont you see?

>We understand how it works how it thinks

Sort of partially. And also, it is false to assume in general. Long story short, main property of complex systems is the ability to pretend and mimic. You cannot properly study something that can pretend and mimic.

Nyanraltotlapun t1_jdm1505 wrote on March 25, 2023 at 12:00 PM

#2,351,352

Replying to t0slink (#2,347,252)

I'M Jaded

WonderFactory t1_jdm1slk wrote on March 25, 2023 at 12:07 PM

#2,351,433

Replying to brucebay (#2,349,294)

We don't understand how it works. We understand how it's trained but we don't really understand the result of the training and exactly how it arrives at a particular output. The trained model is an incredibly complex system.

light24bulbs t1_jdm413r wrote on March 25, 2023 at 12:31 PM

#2,351,654

Replying to learn-deeply (#2,348,266)

This is an insane way to communicate knowledge.

sweatierorc t1_jdm83bv wrote on March 25, 2023 at 1:09 PM

#2,352,073

Replying to SmLnine (#2,350,973)

which one ? do they not get cancer or are they more resistant to it ?

danielbln t1_jdm967m wrote on March 25, 2023 at 1:19 PM

#2,352,192

Replying to artsybashev (#2,350,120)

Relevant: https://i.imgur.com/D8WFIMZ.png

SpaceCadetIowa t1_jdmcfga wrote on March 25, 2023 at 1:47 PM

#2,352,537

No need, the government makes up new ones to keep the people thinking we need them.

ambient_temp_xeno t1_jdmdh2i wrote on March 25, 2023 at 1:55 PM

#2,352,639

Replying to Nyanraltotlapun (#2,351,308)

There is work done on how to even start interacting with an extraterrestrial civilization, and it would probably be a vast amount harder than whatever intelligence is contained in a human-data-filled, human-trained model. https://www.nasa.gov/connect/ebooks/archaeology_anthropology_and_interstellar_communication.html

That said, it is the closest we have to that so you're not 'wrong'.

Art10001 t1_jdmff0b wrote on March 25, 2023 at 2:11 PM

#2,352,848

Replying to sweatierorc (#2,349,366)

More intelligence, more time (AIs are at different time scales) = faster rate of discoveries

SmLnine t1_jdmftzs wrote on March 25, 2023 at 2:14 PM

#2,352,897

Replying to sweatierorc (#2,352,073)

I said "effectively" because a blanked statement would be unwarranted. There has probably been at least one naked mole rate in the history of the universe that got cancer.

https://www.cam.ac.uk/research/news/secrets-of-naked-mole-rat-cancer-resistance-unearthed

[deleted] t1_jdmgnya wrote on March 25, 2023 at 2:21 PM

#2,352,993

[removed]

lego3410 t1_jdmi0hv wrote on March 25, 2023 at 2:31 PM

#2,353,169

Replying to learn-deeply (#2,348,266)

Yes! But GPT-4 could summarize it for me.

sweatierorc t1_jdmilbm wrote on March 25, 2023 at 2:36 PM

#2,353,234

Replying to Art10001 (#2,352,848)

Do we know that ? E.g. with quantum computing, we know that it won't really revolutionize our lives despite the fact that it can solve a new class of problem.

MarmonRzohr t1_jdmj8th wrote on March 25, 2023 at 2:41 PM

#2,353,310

Replying to SmLnine (#2,350,973)

>There are complex mammals that effectively don't get cancer

You got a source for that ?

That's not true at all according everything I know, but maybe what I know is outdated.

AFAIK there are only mammals that seem to develop cancer much less than they should - namely large mamals like whales. Other than that every animal above and including Cnidaria deveop tumors. E.g. even the famously immortal Hydras develop tumors over time.

That's what makes cancer so tricky. There is good chance that far, far back in evolution there was a selection between longevity and rate of change or something else. Therefore may be nothing we can do to prevent cancer and can only hope for suppression / cures when / if it happens.

Again, this may be outdated.

sweatierorc t1_jdmkacg wrote on March 25, 2023 at 2:48 PM

#2,353,441

Replying to SmLnine (#2,352,897)

Sure, humans under 40 are also very resistant to cancer. My point was that cancer comes with old age, and aging seems to be a way for us to die before cancer or dementia kill us. There are "weak" evidence that people who have dementia are less likely to get a cancer. I understand that some mammals like whales or elephant seems to be very resistant to cancer, but if we were to double or triple their average life expectancy, other disease may become more prevalent, maybe even cancer.

artsybashev t1_jdmpwwd wrote on March 25, 2023 at 3:29 PM

#2,354,141

Replying to danielbln (#2,352,192)

The fluffy overly complex writing around your main message has worked as a barrier or prefilter to filter out bad job candidates or unqualified contributions to scientific discussion. LLMs are destroying this part. Interesting to see what this leads to.

DiscussionGrouchy322 t1_jdmrq88 wrote on March 25, 2023 at 3:42 PM

#2,354,346

Wow so many words to try and say you're applying test driven design to prompt engineering. I will keep this as example of how not to write technical content. (I was reading the "blog post")

Maybe this is a joke posting that was also written by the chat gpt.

When you make those charts with the weights and things... Are they meant to convey information or do you just follow previous template where you saw information presented that way and you just try and match the shape?

kim_en t1_jdmu877 wrote on March 25, 2023 at 4:00 PM

#2,354,603

Replying to Puzzleheaded_Acadia1 (#2,350,855)

me too

massimosclaw2 t1_jdmvjlp wrote on March 25, 2023 at 4:09 PM

#2,354,752

Replying to learn-deeply (#2,348,266)

When you haven’t done much, best to obscure it in some complicated language /s

Art10001 t1_jdmyazo wrote on March 25, 2023 at 4:29 PM

#2,355,042

Replying to sweatierorc (#2,353,234)

Quantum computing solves new types of problems, and their resolution, or findings from them, improves our lives.

mrfreeman93 t1_jdnebrv wrote on March 25, 2023 at 6:22 PM

#2,356,797

I think it was aleady well known that it would fix its own errors when provided the error message, this is not a breakthrough

learn-deeply t1_jdnkaw7 wrote on March 25, 2023 at 7:04 PM

#2,357,399

Replying to nekize (#2,349,424)

If you need to pad your paper, that means there hasn't been enough original research done.

farmingvillein t1_jdo16sz wrote on March 25, 2023 at 9:08 PM

#2,359,246

Replying to learn-deeply (#2,348,266)

> This 17 page could be a few sentences.

> Tl;DR the authors wrote prompts to tell GPT-4 to fix code given some unit tests and the output of the broken code. It performs better than GPT-4 that doesn't have access to the output of the code execution.

I agree with your overall sentiment--the paper IMO could be, in the very least, substantially re-organized for clarity--but your summary isn't actually accurate, since the paper itself has nothing to do with coding(!).

The coding work is all in their blog post...

...which also suffers from the same issue: a long preamble to scroll down and find the core nugget.

yaosio t1_jdomvtr wrote on March 25, 2023 at 11:52 PM

#2,361,392

Replying to Puzzleheaded_Acadia1 (#2,350,855)

I think they give GPT-4 a task, GPT-4 attempts to complete it and is told if it worked or not, then GPT-4 looks at what happened and determines why it failed, and then tries again with this new knowledge. This is all done through natural language prompts, the model isn't being changed.

I saw somebody else in either this sub or /r/openai using a very similar method to get GPT-4 to write and deploy a webpage that could accept valid email addresses. Of course, I can't find it, and neither can Bing Chat, so maybe I dreamed it. I distinctly remember asking if it could do QA, and then the person asked what I meant, and I said have it check for bugs. I post a lot so I can't find it in my post history.

I remember the way it worked was they gave it the task, then GPT-4 would write out what it was going to do, what it predicted would happen, write the code, and then check if what it did worked. If it didn't work it would write out why it didn't work, plan again, then act again. So it went plan->predict->act->check->plan. This successfully worked as it went from nothing to a working and deployed webpage without any human intervention other than setting the task.

sneakpeekbot t1_jdomx0y wrote on March 25, 2023 at 11:52 PM

#2,361,401

Replying to yaosio (#2,361,392)

Here's a sneak peek of /r/OpenAI using the top posts of the year!

#1: meme | 110 comments
#2: ChatGPT transforming data and running SQL queries | 119 comments
#3: [Official] ChatGPT now supports plugins!!! | 270 comments

ellev3n11 t1_jdp7evr wrote on March 26, 2023 at 2:30 AM

#2,363,604

Replying to learn-deeply (#2,348,266)

That is not what the paper is about. The paper has nothing to do with code actually. Why are people here so obtuse?

noobgolang t1_jdpmeq9 wrote on March 26, 2023 at 4:51 AM

#2,365,202

Replying to learn-deeply (#2,348,266)

Stop gate keeping researchhhh!!!! It is already that bad

rsha256 t1_jdq13w4 wrote on March 26, 2023 at 8:05 AM

#2,366,687

Replying to nekize (#2,350,513)

What does CV have That makes it “solved”? Stable Diffusion?

Jeffy29 t1_jdquk5v wrote on March 26, 2023 at 2:00 PM

#2,370,223

Replying to 3deal (#2,346,477)

Literally doomsayer. I know I know “bUt ThIs TiMe iTs dIfFeRenT”. I am sure you guys will be right one day.

[deleted] t1_jdrj49x wrote on March 26, 2023 at 5:00 PM

#2,373,224

Replying to [deleted] (#2,350,753)

[removed]

[deleted] t1_jdrjkx4 wrote on March 26, 2023 at 5:04 PM

#2,373,265

Replying to yaosio (#2,361,392)

[removed]

afreydoa t1_jdrvs8d wrote on March 26, 2023 at 6:30 PM

#2,374,630

I wonder if combining LLMs with planning would enhance the creation of poems or that example task, of creating sentences that end with a specific letter.

My thinking is that poem generation often struggles when the LLM can't find a suitable ending, as the initial part of the line or paragraph, is already locked and can't be altered. However, when directing ChatGPT to rework the response by modifying the starting point, it seems to often produce better outcomes.

Dry_Percentage_1399 t1_jds1o8b wrote on March 26, 2023 at 7:12 PM

#2,375,335

Replying to metalman123 (#2,347,328)

Really？ I have paid to access gpt-4, but only by website. How can I use gpt-4 api?

VelveteenAmbush t1_jdsjab4 wrote on March 26, 2023 at 9:15 PM

#2,377,528

Replying to artsybashev (#2,350,120)

Also an LLM to read all of the tldrs and tell me which of them I should pay attention to.

SzilvasiPeter t1_jdudjj3 wrote on March 27, 2023 at 7:17 AM

#2,386,317

Replying to brucebay (#2,349,294)

Well, our own body is alien to us. The brain, the gut, the endocrine system, and so on. There are emergent complexities everywhere from giant black holes to a pile of dirt. It is the same with conceptual things like math or computer science. Simple axioms and logic gates lead to beautiful complex systems.

I guess, we should get used to "not understanding" at this point.

fnordstar t1_jdv0sl3 wrote on March 27, 2023 at 12:19 PM

#2,389,308

Replying to artsybashev (#2,354,141)

That just seems like elitism. Like rejecting someone for having an accent instead of speaking oxford english.

pm_me_your_pay_slips t1_jdv6l50 wrote on March 27, 2023 at 1:10 PM

#2,390,266

Replying to ellev3n11 (#2,363,604)

while the paper doesn't mention any code, there is no practical difference: replace RL environment with compiler/interpreter, and action selection with prompt engineering.

pm_me_your_pay_slips t1_jdv748e wrote on March 27, 2023 at 1:15 PM

#2,390,348

Replying to yaosio (#2,361,392)

this is literally what gdb did during the GPT-4 launch livestream