TehDing
TehDing t1_je08lpg wrote
Reply to comment by sebzim4500 in [P] two copies of gpt-3.5 (one playing as the oracle, and another as the guesser) performs poorly on the game of 20 Questions (68/1823). by evanthebouncy
You can ask GPT to spell a word, or provide the words as individual "S P A C E D" characters, and it will still do poorly; it has nothing to do with tokenization. GPT is capable of spelling, and it can even identify that it is not playing well if you ask whether something is a good guess, but it continues to give poor answers.
In terms of 'solving' a game like this 20 questions example, there are only 12000 valid words to guess from, or at worst 26^5 (about 11.9 million) possible answers, which still makes this a smaller problem than the blog experiment (or, in the worst case, on par with it).
Want an easier game? It sucks at Hangman too. It'll guess letters by frequency, but not well enough to bring a word together. Even guessing on the basis of common n-grams would probably be a good enough strategy.
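To make "good enough strategy" concrete, here is a minimal sketch of a Hangman guesser. It is not n-gram based: it uses a simpler stand-in, filtering a word list down to the candidates that still fit and guessing the letter that appears in the most of them. The word list, the `_` placeholder for hidden letters, and the names `consistent` and `next_guess` are all assumptions made for illustration.

```python
from collections import Counter

# Tiny illustrative word list (an assumption); a real run would load a dictionary.
WORDS = ["crane", "crate", "grape", "trace", "brace", "place"]


def consistent(word, pattern, wrong):
    """True if `word` could still be the answer given the revealed pattern and misses."""
    if len(word) != len(pattern):
        return False
    revealed = set(pattern) - {"_"}
    for w, p in zip(word, pattern):
        if p == "_":
            # A hidden slot cannot hold a letter that is already revealed or known wrong.
            if w in revealed or w in wrong:
                return False
        elif w != p:
            return False
    return True


def next_guess(pattern, wrong, words=WORDS):
    """Pick the unguessed letter that appears in the most remaining candidates."""
    candidates = [w for w in words if consistent(w, pattern, wrong)]
    guessed = (set(pattern) - {"_"}) | set(wrong)
    counts = Counter()
    for w in candidates:
        # Count each letter once per word, skipping letters already tried.
        counts.update(set(w) - guessed)
    if not counts:
        return None
    # Break ties alphabetically so the result is deterministic.
    return min(counts, key=lambda c: (-counts[c], c))
```

Calling `next_guess("__a_e", {"c"})` on this toy list leaves only "grape" as a candidate, so the guesser proposes one of its remaining letters. The point is just that a few lines of bookkeeping already use the revealed pattern, which is the part the model seems to struggle with.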
My experience is that LLMs are poor at novel reasoning. This makes sense: RLHF isn't giving these things a consciousness. Maybe with tweaks/tools we'll actually see some "thinking", but for now (this may change next week at the rate things are going) they're not very good at games in general as a result (another example: I haven't tried it with GPT4, but GPT3 cheats at chess).
TehDing t1_jdz3anl wrote
Reply to [P] two copies of gpt-3.5 (one playing as the oracle, and another as the guesser) performs poorly on the game of 20 Questions (68/1823). by evanthebouncy
Similarly, it sucks at Wordle.
TehDing t1_jdtwa6k wrote
Reply to [D] GPT4 and coding problems by enryu42
I have not been impressed with LLMs' reasoning when solving novel puzzles/challenges. Ask any model to play Wordle with you. They are not good.
TehDing t1_j0z5wyi wrote
Reply to comment by ferrel_hadley in Which astronauts would be your picks for Artemis III? by Emble12
Kim hasn't had any spaceflight time yet, right?
There's time between now and '24, but my bet will be on someone who's flown.
TehDing t1_jeb7rvq wrote
Reply to [D] Directed Graph-based Machine Learning Pipeline tool? by Driiper
Like the TensorBoard graph view?