Hi everyone, how did ChatGPT learn to distinguish between different text varieties such as song lyrics, news, and research articles? When you prompt it to produce a text in a particular variety or style, it shapes the output so that it resembles the desired variety or style (to some extent). Were (at least some of) the texts it was trained on labeled with tags representing the general variety of the text? If not, how is that knowledge acquired or passed on to it?
Nobody has a complete answer to this; that is part of why people call these neural network architectures a "black box".
LLMs are trained on huge amounts of textual data; some of it is tagged, some isn't. It's not quite that simple, though, and each company has its own approach to curating that data and training on it.
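To make the "just trained on text" point concrete, here is a minimal, hypothetical sketch of the core training signal: next-token prediction over raw, unlabeled text. The tiny PyTorch model, toy corpus, and character-level tokenization are illustrative stand-ins, nothing like ChatGPT's actual architecture or data; the point is only that no genre label appears anywhere in the objective.

```python
import torch
import torch.nn as nn

# Toy corpus mixing different "varieties" of text -- note there are no labels.
corpus = (
    "Verse 1: I walked alone beneath the neon rain...\n"
    "BREAKING: Markets fell sharply on Tuesday amid trade fears...\n"
    "Abstract: We propose a novel method for sequence modeling...\n"
)

# Character-level tokenization just to keep the sketch self-contained.
vocab = sorted(set(corpus))
stoi = {ch: i for i, ch in enumerate(vocab)}
ids = torch.tensor([stoi[ch] for ch in corpus])

class TinyLM(nn.Module):
    """A deliberately tiny language model (GRU, not a transformer)."""
    def __init__(self, vocab_size, dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.rnn = nn.GRU(dim, dim, batch_first=True)
        self.head = nn.Linear(dim, vocab_size)

    def forward(self, x):
        h, _ = self.rnn(self.embed(x))
        return self.head(h)

model = TinyLM(len(vocab))
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# The entire objective: given tokens 0..t, predict token t+1.
x, y = ids[:-1].unsqueeze(0), ids[1:].unsqueeze(0)
for step in range(200):
    logits = model(x)                                  # (1, T, vocab)
    loss = loss_fn(logits.reshape(-1, len(vocab)), y.reshape(-1))
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Even this toy setup only learns which characters tend to follow which others; scaled up by many orders of magnitude, the same objective lets a large model absorb the conventions of lyrics, news copy, and paper abstracts without explicit tags.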
The ability to identify and replicate patterns across wildly different kinds of linguistic data is a byproduct of that training objective: by learning to predict the next token over such a corpus, the model's weights end up encoding which words, structures, and registers tend to co-occur, so a prompt that signals "song lyrics" or "research article" steers the output toward the corresponding patterns.
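As a rough demonstration of that steering effect, here is a small sketch using the Hugging Face transformers library and the public gpt2 checkpoint (an assumption that it is installed and downloadable); gpt2 is far smaller and weaker than ChatGPT, but it was trained with the same next-token objective on unlabeled web text.

```python
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")

# Each prompt carries a stylistic cue but no explicit instruction or label.
prompts = [
    "Song lyrics:\n",
    "BREAKING NEWS:",
    "Abstract. In this paper we",
]
for p in prompts:
    out = generator(p, max_new_tokens=30, do_sample=True)[0]["generated_text"]
    print(repr(out), "\n")
```

The continuations tend to follow the register of each cue, simply because those cues co-occurred with particular kinds of text during training.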