This is part of a recent OpenAI blog post:
After testing out prompt engineering, RAG, and fine-tuning, Harvey worked with our team to add the depth of context needed to the model—the equivalent of 10 billion tokens worth of data. Our team modified every step of the model training process, from domain-specific mid-training to customizing post-training processes and incorporating expert attorney feedback.
What is mid-training?
Can we do this ourselves? Is it like doing a first fine-tune and then a second one on top of the first? (Other users' posts have mentioned that this method is doable, but complained that GPT was forgetting…)
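To be concrete about what I mean by a "second fine-tune": as far as I understand, the fine-tuning API lets you pass a previously fine-tuned model's ID as the `model` parameter of a new job. Here is a minimal sketch of the two stages; the file IDs and model names are placeholders, and the real calls would go through `client.fine_tuning.jobs.create(...)`:

```python
# Sketch of two-stage ("sequential") fine-tuning with the OpenAI API.
# File IDs and model names below are hypothetical placeholders; a real
# run would submit these payloads via client.fine_tuning.jobs.create().

def build_finetune_request(base_model: str, training_file: str) -> dict:
    """Assemble the payload for one fine-tuning job."""
    return {"model": base_model, "training_file": training_file}

# Stage 1: adapt the base model on broad domain data.
stage1 = build_finetune_request("gpt-4o-mini-2024-07-18", "file-domain-corpus")

# When stage 1 finishes, the API returns a fine-tuned model ID
# (illustrative value):
stage1_model = "ft:gpt-4o-mini-2024-07-18:my-org::abc123"

# Stage 2: fine-tune the stage-1 checkpoint on task-specific data by
# passing its ID as the `model` parameter of the next job.
stage2 = build_finetune_request(stage1_model, "file-task-examples")
```

My question is whether stacking jobs this way actually behaves like the "mid-training then post-training" pipeline the blog describes, or whether the second stage tends to overwrite what the first stage learned.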