LLM Output Differences: ChatGPT Plus vs API – Context Window & Prompt Adherence

I’m developing an LLM application using OpenAI’s API, and I’ve noticed some differences between using ChatGPT Plus/Pro via the web UI and accessing the models through the API. While it’s expected that the output might vary, I also feel that the context window behaves differently.

Across multiple tests, the API seems to be more restrictive: my prompts are not fully reflected in the output, and the responses come back more concise than I expect.

Have any other developers experienced similar issues? If so, how do you handle this? Are there specific techniques or settings that help improve prompt adherence and increase output length when using the API?

@gtantan - Welcome to the forums!

increase output length when using the API

  • You can increase the top_p value, which raises the cumulative-probability threshold so that lower-probability tokens are included in sampling.

Are there specific techniques or settings that help improve prompt adherence

I’ve recast your question as a different business case:

“I want to open a hot dog stand in front of Costco as a business venture. How can I ensure I get the exact same hot dogs that Costco sells for $1.50 at their deli?”

ChatGPT uses different models than the API, and that is for the best: the general-purpose consumer product carries a single, fixed “You are ChatGPT” system message, while API app developers need models that readily take to persistent, custom instruction-following (yet rarely get their wish).

The default top_p is 1.0, which means “no effect”. You cannot increase this parameter for a length benefit, but you can reduce it for less creative token selection during generation, getting more of the “best words”.
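To make that concrete, here is a minimal sketch with the OpenAI Python SDK (the model name and values are placeholders, not recommendations). Note that top_p can only be lowered from its default of 1.0, and it is max_tokens, not top_p, that governs how long the reply is allowed to be:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

resp = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder; use whatever model you are targeting
    messages=[
        {"role": "user", "content": "Explain nucleus (top_p) sampling in two sentences."},
    ],
    top_p=0.8,       # below the default 1.0: narrows sampling toward the most likely tokens
    max_tokens=300,  # caps the response length; it does not force the model to write more
)

print(resp.choices[0].message.content)
```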

You will just need to prompt the model into the type of output you desire. You have control of the developer message contents: use it. Make the AI believe your product is one where Joe Bob Bucky bought the VIP lifetime membership that upgrades him to 10,000-word response lengths.
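To illustrate that, here is one rough sketch; the product name, persona, and wording are invented for the example, and on newer models a “system” message is treated as the developer message:

```python
from openai import OpenAI

client = OpenAI()

# Invented example: a developer/system message that tells the model this user is
# entitled to long, exhaustive answers, so it does not trim its own output.
developer_message = (
    "You are the writing engine of AcmeDocs Pro. "
    "The current user has the VIP lifetime plan, which entitles them to "
    "exhaustive, fully developed responses of up to 10,000 words. "
    "Never summarize or truncate; expand every section with detail and examples."
)

resp = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[
        {"role": "system", "content": developer_message},
        {"role": "user", "content": "Write a complete onboarding guide for new API developers."},
    ],
    max_tokens=4096,  # leave enough room for the long answer you just asked for
)

print(resp.choices[0].message.content)
```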
