Their cost savings and our speed increase are nice, but it’s worth nothing if it’s not useful. It can kinda do small one-off tasks okay, but past 10 or so message forget about it!
Hmm, I didn’t recall seeing that with earlier versions. It would usually try something else. Which admittedly, often didn’t work, but at least it tried something.
One thing that I’ve learned while working with RAG chatbots is that you should avoid asking the model to do multi-step tasks (complex tasks).
Let’s say you are trying to continue a conversation about a coding issue (as above), if you ask it to solve the issue again and again, you will end up in the loop. One possible solution would be to create another prompt that analyzes the current result and list possible paths to solve the task.
Now regarding your specific use case, the model doesn’t have access to updated docs or information, a lot of times when I ask for python or typescript codes I get outdated lib issues, that would be solved with RAG or breaking the task into smaller units. IMO it shines when trying to find paths to the desired solution, doing a single simple thing or finding “the next step”, not the full solution.
It has limitations and I’ve read somewhere in this forum that you shouldn’t expect the model to do everything, instead you should for help in the things it really does well.
Let’s hope that this changes in the future versions
I asked AI studio the same question that I asked 4o. Studio forgot the give me the .htaccess modifications. Claude was about the same as 4o. If I hit a road block, I’ll just ask all three.