Unknown model 'gpt-4o-mini'

I have been getting the “Unknown model ‘gpt-4o-mini’” error for the last 12 hours. It happens with the Python OpenAI package, and with OpenWebUI and Langflow as well; OpenWebUI and Langflow are both falling back to gpt-3.5-turbo. I am on usage Tier 1.


Further proof from the OpenAI Platform chat playground: gpt-4o-mini is selected, yet the model responds claiming that GPT-3 is in use (not even 3.5; earlier it said 3.5-turbo), even though 4o-mini is selected.
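If you want to confirm which model actually served a request, the API response itself reports it in the `model` field, which is more reliable than asking the model what it is. Here is a minimal sketch; the `model_matches` helper is illustrative (not part of the OpenAI SDK), and the commented-out API call assumes an `OPENAI_API_KEY` is configured:

```python
# Compare the model name reported in an API response against the model
# that was requested. The API often returns a dated snapshot name
# (e.g. "gpt-4o-mini-2024-07-18") for an alias like "gpt-4o-mini".

def model_matches(requested: str, served: str) -> bool:
    """True if the served model is the requested alias or a dated snapshot of it."""
    return served == requested or served.startswith(requested + "-")

# Illustrative usage with the openai package (v1.x); needs a network
# call and an API key, so it is shown here as a comment only:
#
#   from openai import OpenAI
#   client = OpenAI()
#   resp = client.chat.completions.create(
#       model="gpt-4o-mini",
#       messages=[{"role": "user", "content": "ping"}],
#   )
#   print(resp.model)  # the authoritative answer
#   print(model_matches("gpt-4o-mini", resp.model))

print(model_matches("gpt-4o-mini", "gpt-4o-mini-2024-07-18"))  # True
print(model_matches("gpt-4o-mini", "gpt-3.5-turbo-0125"))      # False
```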

Welcome to the community!

ChatGPT can and does make mistakes. :wink:


This is only further proof that you haven’t specified the identity, role, job, and purpose of the AI for the application you are building. You don’t ask it what it is; you tell it what it is. It says “Welcome to Jose’s Tacos,” not “Welcome to OpenAI.”
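In practice that identity goes in the system message, so “what are you?” questions are answered in-character rather than with a guess about the underlying model. A small sketch (the persona text and helper function are made up for illustration):

```python
# Give the model an explicit identity via the system message, so identity
# questions are answered in-character instead of with a (usually wrong)
# guess about the underlying model designation.

def build_messages(user_input: str) -> list[dict]:
    """Assemble a chat payload with a fixed assistant identity."""
    system_prompt = (
        "You are the ordering assistant for Jose's Tacos. "
        "You help customers order food. You never discuss what AI model "
        "or technology powers you."
    )
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_input},
    ]

messages = build_messages("What are you?")
print(messages[0]["role"])  # "system"
print(len(messages))        # 2
```

The messages list can then be passed as-is to a chat completions call.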

But it’s not even a mistake. That’s actually somewhat true: it is based on GPT-3 technology … and then some. :wink:

It’s not trained to know its model designation. That’s something you can only rely on at the API level; the name may also have been injected into prompts, but probably very inconsistently.

In the playground in GitHub Marketplace, I saw this:


Well, something strange happens behind all these models. I tested this on Llama, Mistral, and other models: one version of Llama claimed to be based on Mistral, while another, Llama 3, said it was Llama 2; similar with the Phi models.

That was in the playground. However, when I tested the model via the API, I could see the version of the model — the name was correct.

Response: ChatCompletion(id='chatcmpl-A81QewSFQottwZ7kTMcexfP8l0kDY', choices=[Choice(finish_reason='stop', index=0, logprobs=None, message=ChatCompletionMessage(content='The next flight from Seattle to Miami is with Delta Airlines, flight number DL123. It is scheduled for May 7th, 2024, at 10:00 AM.', refusal=None, role='assistant', function_call=None, tool_calls=None))], created=1726475480, model='gpt-4o-mini', object='chat.completion', service_tier=None, system_fingerprint='fp_80a1bad4c7', usage=CompletionUsage(completion_tokens=38, prompt_tokens=198, total_tokens=236, completion_tokens_details=None))
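The `model` field in that response object is the authoritative record of what served the request. A quick sketch of pulling out the fields that matter; a plain dict stands in for the ChatCompletion object here (with the openai SDK you would read `resp.model` and `resp.usage` directly), and the values are copied from the response above:

```python
# Extract the served-model name and token usage from a chat completion
# response. The dict mirrors the relevant fields of the ChatCompletion
# object printed above.

response = {
    "id": "chatcmpl-A81QewSFQottwZ7kTMcexfP8l0kDY",
    "model": "gpt-4o-mini",
    "usage": {"completion_tokens": 38, "prompt_tokens": 198, "total_tokens": 236},
}

print(response["model"])                  # "gpt-4o-mini"
print(response["usage"]["total_tokens"])  # 236
```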


There must be some bugs, because if they started to hide versions, the damage to trust could be too high, and nobody wants to see AI take that kind of fall.

My opinion is that there is an attempt to test inference across all the models. Am I right?

This made me curious; I want to hear more about this, especially whether the models can use function tools. :hand_with_index_finger_and_thumb_crossed:

It’s not a “bug”. The only bug is that there is any model name or awareness of an entity being an AI at all in the training.

They are not “hiding versions”. The AI just doesn’t know. And it shouldn’t know. An API solutions provider doesn’t want their customer support chatbot saying it is “GPT-4o-mini by OpenAI”, or to even be uncertain about what was just prompted by the developer.

What you desire already exists in the training, and it is undesired:

(screenshot)

The training even acts as a blocker to this kind of logical extrapolation, but ultimately it comes down to an embeddings-based algorithm deciding which token number to produce at a particular position:

(screenshot)

The bad behavior of saying “GPT-3” etc. was picked up at the very start, in the “InstructGPT” era, from the omnipresent “As an AI language model” and directly placed supervised data. This old knowledge then starts a feedback loop when past chats (where ChatGPT had a prompt telling it what it is) are used in future post-training. ChatGPT-only RLHF (they haven’t used API data for training since March 2023) is actually anti-training for API use cases.


Addendum: the latest gpt-4o-x models are further fouled up by post-system-prompt injection of a knowledge cutoff date, and even a perversion of the Malay language the AI is supposed to understand:

(screenshot)
