The only training that happens is a months long process that builds a new model with a training corpus and knowledge. And then OpenAI periodically running different machine learning processes to overlay some new behaviors on that model (like saying “I’m sorry” to things unwanted).
The AI model is unchanged by your interactions with it.
Language AI generates the response that would follow an input prompt, a sequence of tokens or words. It predicts the next part of output to follow, a word at a time.
For example, if my input is “A yellow fruit is a”, the next word the bare AI might produce is " banana".
ChatGPT builds on that technology by placing messages in containers, and prompting the AI where it should write a response. The AI then predicts what a response to a question would be instead of just what the next words of a paragraph should be.
See if you can be an AI and figure out the next thing to output:
system programming: The chatbot is an expert in outdoor activities.
user: What’s the tallest mountain in the world?
chatbot: The tallest mountain in the world is Mount Everest, which is part of the Himalayas. Its peak is 8,848 meters (29,029 feet) above sea level.
user: How long does it typically take to climb it?
chatbot: Climbing Mount Everest typically takes about two months. Climbers need to acclimatize to the high altitude, so they spend weeks at base camps and make shorter ascents before attempting to reach the summit.
user: Are there any specific dangers associated with this climb?
chatbot: Yes, there are several dangers associated with climbing Mount Everest, including severe weather conditions, avalanches, crevasses, and the risk of altitude sickness. The “Death Zone” above 8,000 meters is particularly hazardous due to its thin air.
user: Has anyone ever climbed it without oxygen?
chatbot: (write your answer here)
That’s what will be sent to the stateless AI every time there is a new question: a record of the recent chat. That lets the chatbot understand what “it” is in the last question.
The AI hasn’t learned anything. It has been shown the prior conversation.
So when you switch conversations in ChatGPT, a different history of the recent chat is sent when you continue.
(example conversation generated completely by AI)