What does the 32K context window actually mean, does anyone know?
For example, GPT-4’s current 8K context window doesn’t actually behave like 8K of context to me. It’s more like ~7K of input and at most ~1K of output, depending on the prompt.
I.e., any output longer than ~1K tokens seems to get cut off, unless it’s a fairly simple encoding task.
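To illustrate the arithmetic I’m assuming here: the context window is a shared budget, so whatever the prompt consumes is subtracted from the room left for the completion. (A rough sketch; the function names and the ~4-characters-per-token heuristic are my own, not anything official.)

```python
def estimate_tokens(text: str) -> int:
    # Very rough heuristic: roughly 4 characters per token for English text.
    return max(1, len(text) // 4)

def max_output_budget(prompt: str, context_window: int = 8192) -> int:
    # The window is shared between input and output:
    # output budget = total window minus tokens used by the prompt.
    return context_window - estimate_tokens(prompt)

# A ~28K-character prompt (~7K tokens) leaves only ~1K tokens of output
# in an 8K window, which matches what I'm seeing.
prompt = "x" * 28000
print(max_output_budget(prompt))  # ~1192 tokens left for the completion
```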
The pricing scheme of 2x for output tokens makes me even more curious.
This isn’t a huge issue for my use cases, and I can work around it, but I feel the “32K context” claim is a bit vague.
I looked through the technical report but didn’t see anything on this topic. [2303.08774] GPT-4 Technical Report