It’s obviously a huge leap forward in speed and context size, but the slight degradation in accuracy compared to normal GPT-4 makes me wonder: could OpenAI have taken notes from the open-source community and quantized the model for speed and scalability?
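For anyone unfamiliar with what I mean by quantization: in the open-source world it usually means storing weights at lower precision (e.g. int8 instead of float32), trading a small amount of accuracy for big memory and throughput wins. A minimal sketch of symmetric int8 weight quantization with NumPy (the matrix here is just random data standing in for real weights):

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.standard_normal((4, 4)).astype(np.float32)  # stand-in weight matrix

# Symmetric int8 quantization: map [-max|w|, max|w|] onto [-127, 127].
scale = np.abs(w).max() / 127.0
q = np.round(w / scale).astype(np.int8)   # 4x smaller than float32
w_hat = q.astype(np.float32) * scale      # dequantize to compare

err = np.abs(w - w_hat).max()
print(err)  # small but nonzero: this is the accuracy cost of the compression
```

That tiny reconstruction error, accumulated across billions of weights, is exactly the kind of thing that could show up as the slight accuracy dip people are noticing.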
I know it’s a bit of a random topic, but I’m curious what you all think. This is all just speculation on my part, of course.