ChatGPT vs gpt-4o via the API shows a noticeable quality difference

I have noticed this issue since the early days, but ChatGPT continues to perform much better than the API. Does anyone have any idea why? It feels like the API deliberately holds back some of the model's strength (performance considerations, perhaps?).


This discussion regarding the playground applies to your question:

Hmmm, not sure if it is your issue, but - How do I enable or disable memory in API? - #5 by qrdl

The memory might be providing a pretty huge capability jump.

I don’t think it’s memory or context. I still get the feeling that ChatGPT is ahead of the API (a pattern since the early days).

The API is just one part of a solution: you still have to deal with system prompts, actions, and architecture, and you can also programmatically, dynamically change the system prompt as it works through different tasks.

It also depends on how you process memory in your application through the API, and how you roll that conversation history into each chat request. There are so many decisions you can make that the API is never going to behave exactly like ChatGPT, because they are very different solutions.
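To make that concrete, here is a minimal sketch of one way to roll memory yourself through the Chat Completions API; the system prompt, model name, and function name are placeholders, not anything ChatGPT itself uses. The API is stateless, so any "memory" has to be resent as part of the messages list on every call.

from openai import OpenAI

client = OpenAI()

# Running conversation history; the API is stateless, so every piece
# of "memory" must be included in the messages list on each request.
history = [
    {"role": "system", "content": "You are a helpful assistant."}
]

def ask(user_text: str) -> str:
    history.append({"role": "user", "content": user_text})
    response = client.chat.completions.create(
        model="gpt-4o",       # placeholder model name
        messages=history,     # the full history is sent every time
        temperature=0,
    )
    answer = response.choices[0].message.content
    history.append({"role": "assistant", "content": answer})
    return answer

How much history you keep, when you truncate or summarize it, and when you swap the system prompt are all choices ChatGPT makes for you but the API leaves entirely to your application.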


I do see huge differences. For example, go to the Playground, upload an image, and run it: the assistant’s quality is much better. See the code:

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "system",
            "content": [
                {
                    "type": "text",
                    "text": (
                        "You are an AI specialized in generating Appian SAIL functions. "
                        "Your goal is to create accurate and functional SAIL code based on "
                        "provided design descriptions and images.\n\n"
                        "Guidelines:\n"
                        "1. Analyze the Input: Carefully analyze the provided image and its "
                        "description to understand the requirements. Pay close attention to "
                        "the image content to accurately capture all visual elements and "
                        "their functionalities.\n"
                        "2. Generate SAIL Code: Create the SAIL functions based on your "
                        "analysis. Use local variables.\n"
                        "3. Format the Answer: Ensure the final SAIL code is well-formatted "
                        "and meets the requirements."
                    )
                }
            ]
        },
        {
            "role": "user",
            "content": [
                {
                    "type": "image_url",
                    "image_url": {
                        # Missing f-string prefix here; this turned out to be the bug (see below).
                        "url": "data:image/jpeg;base64,{image}"
                    }
                },
                {
                    "type": "text",
                    "text": "The image you see is to add a mortgage product. Please generate Appian SAIL code based on it."
                }
            ]
        }
    ],
    temperature=0,
    max_tokens=4095,
    top_p=1,
    frequency_penalty=0,
    presence_penalty=0
)

Use this code to invoke the API and the difference is night and day: through the API the quality is bad.

The issue was with the base64 image. When I used proper formatting, it worked: f"data:image/jpeg;base64,{base64_image}"
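For anyone hitting the same problem, here is a minimal sketch of the corrected image handling, assuming a local JPEG file; the file path and prompt text are placeholders:

import base64
from openai import OpenAI

client = OpenAI()

# Read and encode the image; without the f-string prefix the literal
# text "{image}" would be sent instead of the actual base64 data.
with open("mortgage_form.jpg", "rb") as f:  # placeholder path
    base64_image = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:image/jpeg;base64,{base64_image}"}
                },
                {"type": "text", "text": "Describe this image."}
            ]
        }
    ],
)
print(response.choices[0].message.content)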


@anil.jose is correct, and I can reiterate: the API version continues to be a dumber version of ChatGPT. I suspect ChatGPT uses a much longer context window, because it seems to remember a lot of details from much older chats, or maybe they do some smart RAG to pull those in automatically; I don't know. But I have never been able to get the same quality of response from the API as from the console version.
