currentscurrents t1_jdrt3gv wrote on March 26, 2023 at 6:11 PM

Reply to comment by liqui_date_me in [D] GPT4 and coding problems by enryu42

I think all tests designed for humans are worthless here.

They're all meant to compare humans against each other, so they assume you don't have the ability to read and remember the entire internet. You can make up for a lack of reasoning with an abundance of data. We need synthetic tests designed specifically for LLMs.

Yecuken t1_jdsm4w1 wrote on March 26, 2023 at 9:36 PM

Tests would not help against optimization; models will just learn how to pass the test. Optimization will always win against any problem with a known solution.