[R] Reflexion: an autonomous agent with dynamic memory and self-reflection - Noah Shinn et al. 2023, Northeastern University, Boston - Outperforms GPT-4 on HumanEval accuracy (0.67 --> 0.88)!
Submitted by Singularian2501 on March 25, 2023 at 1:00 AM in MachineLearning (85 comments)
AI-Pon3 wrote on March 25, 2023 at 7:18 AM: Interesting methodology/technology. I realize it's GPT-4 plus a refining process, but even so, going from 67% to 88% accuracy means ~64% fewer errors (the error rate drops from 33% to 12%), which proves it's a powerful technique even when the underlying model is already fairly capable.
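For anyone checking the ~64% figure: it compares error rates rather than accuracies. A minimal sketch of that arithmetic, assuming the 0.67 and 0.88 HumanEval accuracies quoted in the title (the function name `relative_error_reduction` is just an illustrative helper, not something from the paper):

```python
def relative_error_reduction(acc_before: float, acc_after: float) -> float:
    """Return the fraction of the original errors that were eliminated."""
    err_before = 1.0 - acc_before  # error rate before (0.33 for 0.67 accuracy)
    err_after = 1.0 - acc_after    # error rate after (0.12 for 0.88 accuracy)
    return (err_before - err_after) / err_before


# Accuracies taken from the post title.
reduction = relative_error_reduction(0.67, 0.88)
print(f"Relative error reduction: {reduction:.1%}")
```

So the absolute gain is 21 percentage points, but measured against the 33% of problems the baseline got wrong, roughly 21/33 of those failures are gone.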