Understanding which GPT model is ready for production

kreut · January 26, 2024, 3:09pm

Hi,

I’d like to upgrade to one of the less expensive turbo models but I’m not sure which to use. From the docs, it says that:

gpt-4-1106-preview GPT-4 Turbo model featuring improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Returns a maximum of 4,096 output tokens. This preview model is not yet suited for production traffic

At the same time, there’s a new Turbo model that was just launched which doesn’t have the warning:

gpt-4-0125-preview New

GPT-4 Turbo
The latest GPT-4 model intended to reduce cases of “laziness” where the model doesn’t complete a task.

Is there any guidance in terms of which could/should be used in a production environment?

Thanks!

Topic		Replies	Views
Can we use GPT 4-preview (Turbo) in production? API gpt-4 , api , gpt-4-turbo	4	3117	January 30, 2024
Trying to understand if 4 turbo is out and available for feature creation API gpt-4 , gpt-4-turbo	4	723	March 18, 2024
Production stable version of gpt-4-turbo API gpt-4-turbo , gpt-4-128k	1	1860	January 29, 2024
GPT 4 turbo for production use API gpt-4-turbo	3	2425	April 9, 2024
Any plans to deprecate the GPT-4 Turbo preview models? API	0	483	May 8, 2024

Understanding which GPT model is ready for production

Related topics