I guess the way I look at it is that the LLM is essentially a curve fit.
So many data points go into the training, and what you are left with are the smoothing coefficients, which occupy far fewer bits.
I would say that if the LLM is bigger than, or the same size as, its training data, the LLM is actually under-trained, because of this phenomenon.
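To make the analogy concrete, here's a toy sketch (plain numpy, all numbers made up for illustration): fit a cubic to thousands of noisy points and compare what the "model" takes to store versus the raw data.

```python
# Toy version of the curve-fit analogy: many training points in,
# a handful of coefficients out.
import numpy as np

rng = np.random.default_rng(0)

# 10,000 noisy "training" samples of an underlying cubic.
x = np.linspace(-1, 1, 10_000)
y = 2 * x**3 - x + rng.normal(scale=0.1, size=x.size)

# Fit a degree-3 polynomial: the "model" is just 4 coefficients.
coeffs = np.polyfit(x, y, deg=3)

data_bytes = y.nbytes        # raw training targets
model_bytes = coeffs.nbytes  # fitted "smoothing coefficients"
print(f"data: {data_bytes} bytes, model: {model_bytes} bytes")
# data: 80000 bytes, model: 32 bytes -- the fit occupies far fewer
# bits, at the cost of only reproducing the data approximately.
```

If instead your "model" had as many free parameters as data points, it could just memorize everything, which is the sense in which a model as big as its training data hasn't been forced to compress (i.e., is under-trained relative to its capacity).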