Teknium says what we're all thinking

qrdl · June 22, 2024, 8:27pm

Teknium, a well known OS AI developer, made this comment about gpto

Here is his comment :

@petrroyce It ****** cannot debug anything it does wrong and instead just repeats its same past attempt at a solution ad infinitum

I absolutely agree with it and have been running into the same behavior. I’ve wanted to post something similar many times in this community forums.

I’ve edited out the profanity as @vb reasonably asked me to in private mail.

qrdl · June 22, 2024, 8:30pm

I don’t know what it is but it’s become useless for me when trying to do anything with an API it’s not super trained on.

I think there needs to be a great deal of clever RAG going on in order to get this to work.

Eg, something that can pull up the right APIs and sample code would be quite effective I think.

anon10827405 · June 22, 2024, 10:18pm

It’s interesting to see how it did so well on the charts but from what I’ve seen, not many people really like it.

It just reminds me so much working in construction. Theoretical measurements do not conform well to reality. That’s where the applied learners excel.

I agree. It feels over fitted. It fails in conversations. I moved back to GPT-4 but it’s throwing spelling errors and is also unusable.

For API I’ve been very happy with gpt-4o though. The pricing and capabilities are amazing.

PaulBellow · June 22, 2024, 10:25pm

Sure,. I’ll repeat exactly the same thing again!

really

Their cost savings and our speed increase are nice, but it’s worth nothing if it’s not useful. It can kinda do small one-off tasks okay, but past 10 or so message forget about it!

Diet · June 22, 2024, 10:27pm

All raw models do that though. It’s just a question of how many of its attempts it can keep in attention…

I guess the answer for omni is 1

PaulBellow · June 22, 2024, 10:30pm

I’m hoping that recent OpenAI RAG company acquisition recently helps.

https://openai.com/index/openai-acquires-rockset/

qrdl · June 22, 2024, 10:42pm

LOL perfect meme for gpto. I dont’ know how many times I’ve had to resist typing obscenities to chatgpt when it spits out exactly the same thing.

“Chat, what freaking changed!?”

qrdl · June 22, 2024, 10:43pm

Hmm, I didn’t recall seeing that with earlier versions. It would usually try something else. Which admittedly, often didn’t work, but at least it tried something.

Diet · June 22, 2024, 10:44pm

hey, just a nitpick, maybe it’s just me, but could you say gpt4o or omni rather than gpto? thanks a bunch

qrdl · June 22, 2024, 10:45pm

Blame OpenAI for introducing it as gpt2

Diet · June 22, 2024, 10:58pm

I do

giovanneafonso · June 22, 2024, 11:11pm

One thing that I’ve learned while working with RAG chatbots is that you should avoid asking the model to do multi-step tasks (complex tasks).

Let’s say you are trying to continue a conversation about a coding issue (as above), if you ask it to solve the issue again and again, you will end up in the loop. One possible solution would be to create another prompt that analyzes the current result and list possible paths to solve the task.

Now regarding your specific use case, the model doesn’t have access to updated docs or information, a lot of times when I ask for python or typescript codes I get outdated lib issues, that would be solved with RAG or breaking the task into smaller units. IMO it shines when trying to find paths to the desired solution, doing a single simple thing or finding “the next step”, not the full solution.

It has limitations and I’ve read somewhere in this forum that you shouldn’t expect the model to do everything, instead you should for help in the things it really does well.

Let’s hope that this changes in the future versions

mynewspressreleases · June 25, 2024, 7:32pm

I asked AI studio the same question that I asked 4o. Studio forgot the give me the .htaccess modifications. Claude was about the same as 4o. If I hit a road block, I’ll just ask all three.

Topic		Replies	Views
ChatGPT 4o model feedback after a few months of usage (coding) Feedback gpt-4	8	1500	December 3, 2024
GPT-4o performing poorly for code related tasks! Why? Feedback gpt-4 , chatgpt , api , lost-user	39	4658	July 19, 2024
GPT-4o has been bad for my GPT; anyways to switch back to GPT-4 Plugins / Actions builders gpt-4 , gpts , gpt-4o	45	3490	June 18, 2024
Chat GPT 4 getting worse? API	8	5439	December 17, 2023
Custom GPTs cannot even retrieve information from its custom knowledge? GPT builders	11	1090	February 27, 2025

Teknium says what we're all thinking

Related topics