What is the size of the training set for GPT-3?

I guess the way I look at it is that an LLM is essentially a curve fit.

So many data points go into the training, and what you are left with are the smoothing coefficients, which take up far fewer bits.
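A toy sketch of that idea (the function and the numbers here are made up, nothing GPT-specific): fit thousands of noisy points with a low-order polynomial and compare how many bytes each side takes.

    import numpy as np

    # 10,000 noisy samples of a smooth curve (hypothetical toy data).
    x = np.linspace(0, np.pi, 10_000)
    y = np.sin(x) + 0.1 * np.random.randn(x.size)

    # A cubic fit: 4 coefficients stand in for 10,000 points.
    coeffs = np.polyfit(x, y, deg=3)

    print(f"data:         {y.nbytes} bytes")      # 80,000 bytes (float64)
    print(f"coefficients: {coeffs.nbytes} bytes")  # 32 bytes

The fit can reproduce the overall shape of the data but not the individual points, which is the lossy-compression flavor of the analogy.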

I would say that if the LLM is bigger than, or the same size as, its training data, the LLM is actually under-trained, because of this phenomenon.
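For scale, here is a rough back-of-the-envelope using GPT-3's published numbers (175B parameters trained on roughly 300B tokens, per Brown et al., 2020); the bytes-per-token figure below is my own rough assumption:

    # GPT-3 (Brown et al., 2020): 175B parameters, ~300B training tokens.
    params = 175e9
    tokens = 300e9

    weight_gb = params * 2 / 1e9  # fp16: 2 bytes per parameter -> ~350 GB
    text_gb   = tokens * 4 / 1e9  # assume ~4 bytes of text per token -> ~1,200 GB

    print(f"weights:       ~{weight_gb:,.0f} GB")
    print(f"training text: ~{text_gb:,.0f} GB")
    print(f"tokens/param:  ~{tokens / params:.1f}")

At roughly 1.7 tokens per parameter, GPT-3 sits well below the ~20 tokens per parameter that the later Chinchilla work (Hoffmann et al., 2022) found compute-optimal, which is consistent with calling it under-trained.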