LLMs as basis for general problem-solving

jwatte · May 24, 2023, 11:45pm

Yes, absolutely!

The model doesn’t “try to generate” some sequence of tokens, then run a hypothesis checker on that trial, and then roll back that sequence if the checker says “not good enough.” The search is single-token. It’s not even equivalent to a multi-step planner or a minimax optimizer, much less a full theorem prover. And neither planners, minimax optimizers, nor theorem provers are, in the end, “reasoning” (but I argue they are closer in many regards.)

If you describe a new, chess-like, game, to this model, and then ask it to play the game, it won’t be able to do well, and frequently doesn’t even follow the rules. This is despite the fact that simple 64 kilobyte 8-bit MCUs are able to play a reasonable (though not impressive) game of chess. And chess is an open-knowledge game, it doesn’t even add the complication of estimating hidden state of the world.

It can fake it pretty convincingly in simple cases, though

Topic		Replies	Views
Discussion thread for "Foundational must read GPT/LLM papers" Community gpt-4 , gpt-35-turbo , chatgpt , research	75	10546	September 3, 2024
Foundational must read GPT/LLM papers Community research , large-language-model	79	69775	May 16, 2024
Getting a function call + textual response in the same call API gpt-4 , function-calling	10	1131	April 8, 2025
Reverse Engineer: creative answers through step-by-step in reverse Prompting prompt-engineering	18	666	August 31, 2024
Is API completion model davinci-002 and its 16385 context useful for ANYTHING? API davinci	1	2554	December 14, 2023

LLMs as basis for general problem-solving

Related topics