By default the gpt-3.5-turbo models has been have been coming with 16k context since the release of gpt-3.5-turbo-1106 which itself has 16k context length.
2023-11-06: Chat model updates
On November 6th, 2023, we announced the release of an updated GPT-3.5-Turbo model (which now comes by default with 16k context) along with deprecation of
gpt-3.5-turbo-0613andgpt-3.5-turbo-16k-0613. As of June 17, 2024, only existing users of these models will be able to continue using them.
The depreciation docs show the shutdown date of the following gpt-3.5-turbo models:
| SHUTDOWN DATE | DEPRECATED MODEL | DEPRECATED MODEL PRICE | RECOMMENDED REPLACEMENT |
|---|---|---|---|
| 2024-09-13 | gpt-3.5-turbo-0613 |
$1.50 / 1M input tokens + $2.00 / 1M output tokens | gpt-3.5-turbo |
| 2024-09-13 | gpt-3.5-turbo-16k-0613 |
$3.00 / 1M input tokens + $4.00 / 1M output tokens | gpt-3.5-turbo |
It’s already recommended by by OpenAI to migrate to gpt-4o-mini if you’re still using the gpt-3.5-turbo models, because it’s more economic and powerful.
