uristmcderp t1_j8db0gw wrote
Reply to comment by diviludicrum in [R] [N] Toolformer: Language Models Can Teach Themselves to Use Tools - paper by Meta AI Research by radi-cho
The model assessing its own success is the bottleneck for most interesting problems. You can't have a feedback loop unless the model can accurately evaluate whether it's doing better or worse. This isn't a trivial problem either, since humans aren't all that great at using absolute metrics to describe quality once past a minimum threshold.
ksatriamelayu t1_j8ebpx4 wrote
Do people use things like evolutionary fitness + changing environments to describe that quality? Maybe a dynamic environment is the answer?
Oat-is-the-Best t1_j8ef5x0 wrote
How do you calculate your fitness? That runs into the same problem: the model can't assess its own success.
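To make the circularity concrete, here's a toy sketch (names and parameters are hypothetical, not from any paper) of an evolutionary loop over bitstrings. Notice that the selection pressure comes entirely from a `fitness` callable handed in from outside; the loop sidesteps self-assessment rather than solving it.

```python
import random

def evolve(fitness, genome_len=8, pop_size=20, generations=50):
    """Toy elitist evolutionary loop over bitstrings.

    `fitness` is an *external* oracle supplied by the caller; the loop
    never evaluates its own success -- which is exactly the circularity
    problem raised above.
    """
    pop = [[random.randint(0, 1) for _ in range(genome_len)]
           for _ in range(pop_size)]
    for _ in range(generations):
        # Rank by the externally supplied fitness and keep the top half.
        scored = sorted(pop, key=fitness, reverse=True)
        parents = scored[: pop_size // 2]
        # Refill the population with mutated copies (10% bit-flip rate).
        pop = parents + [
            [bit ^ (random.random() < 0.1) for bit in random.choice(parents)]
            for _ in range(pop_size - len(parents))
        ]
    return max(pop, key=fitness)

# Here "fitness" is trivially defined outside the loop (count of 1s);
# defining it for open-ended quality is the hard part.
best = evolve(fitness=sum)
```

The toy `fitness=sum` works because counting 1s is easy to specify; for "is this answer better?" there is no such closed-form oracle, which is the point being made above.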