LLMs as basis for general problem-solving

bruce.dambrosio · May 24, 2023, 7:40pm

Perhaps a better way to say this, using the MCMC analogy, is that, when generating a token, all previous tokens in the chain have been sampled, that is, the new token is being generated given instance values of all previous output tokens as well as the original input

jwatte · May 24, 2023, 11:45pm

Yes, absolutely!

The model doesn’t “try to generate” some sequence of tokens, then run a hypothesis checker on that trial, and then roll back that sequence if the checker says “not good enough.” The search is single-token. It’s not even equivalent to a multi-step planner or a minimax optimizer, much less a full theorem prover. And neither planners, minimax optimizers, nor theorem provers are, in the end, “reasoning” (but I argue they are closer in many regards.)

If you describe a new, chess-like, game, to this model, and then ask it to play the game, it won’t be able to do well, and frequently doesn’t even follow the rules. This is despite the fact that simple 64 kilobyte 8-bit MCUs are able to play a reasonable (though not impressive) game of chess. And chess is an open-knowledge game, it doesn’t even add the complication of estimating hidden state of the world.

It can fake it pretty convincingly in simple cases, though

Topic		Replies	Views
Discussion thread for "Foundational must read GPT/LLM papers" Community gpt-4 , gpt-35-turbo , chatgpt , research	75	10658	September 3, 2024
Foundational must read GPT/LLM papers Community research , large-language-model	79	71533	May 16, 2024
Getting a function call + textual response in the same call API gpt-4 , function-calling	10	1492	April 8, 2025
Reverse Engineer: creative answers through step-by-step in reverse Prompting prompt-engineering	18	757	August 31, 2024
Is API completion model davinci-002 and its 16385 context useful for ANYTHING? API davinci	1	2576	December 14, 2023

LLMs as basis for general problem-solving

Related topics