@theevildays What were you hoping to use 32k for?
Realize that these transformer/attention architectures are quadratic in time and memory with the sequence length (prompt plus generated tokens), since attention compares every token against every other one. So it’s more of a technology bottleneck, and these servers don’t grow on trees!
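If it helps, here’s a minimal sketch of where the quadratic term comes from (plain NumPy, toy single-head scaled dot-product attention, made-up dimensions, no batching or masking): the score matrix has one entry per pair of tokens, so it’s n × n.

```python
import numpy as np

# Toy single-head scaled dot-product attention. The (n, n) score
# matrix is the quadratic part: one entry per pair of tokens.
def attention(q, k, v):
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)        # shape (n, n) -> O(n^2)
    w = np.exp(scores - scores.max(-1, keepdims=True))
    w /= w.sum(-1, keepdims=True)        # softmax over each row
    return w @ v                         # shape (n, d)

n, d = 512, 64
q, k, v = (np.random.randn(n, d) for _ in range(3))
out = attention(q, k, v)                 # fine at n = 512

# Scale n up and the score matrix alone gets ugly, per head, per layer:
for n in (2048, 8192, 32768):
    print(f"{n:>6} tokens -> {n*n:>13,} score entries "
          f"(~{n*n*4/2**30:.1f} GiB in fp32)")
```

At 32k tokens that’s roughly a billion score entries, about 4 GiB in fp32, for a single head in a single layer, which is why long contexts are so expensive to serve.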