Segmenting training data for fine-tune

KasugaiSakura · January 9, 2024, 9:28am

I had the same confusion and found an answer here: Fine-tuning with conversation format: Which messages are used for training? , it seems we still need to segment the conversation despite we provided it as a message list. Please note this is from a cookbook’s comment and not guaranteed to be true, I didn’t find any references in the official docs, TBH the fine tuning API is so poorly documented.

Topic		Replies	Views
Fine-tuning with conversation format: Which messages are used for training? API	9	6302	November 16, 2023
Correct format for dataset in chat model fine-tuning API fine-tuning , documentation	4	2188	January 9, 2024
How to structure fine tuned data API	8	5097	September 2, 2023
Fine tuning data format for chatting history API chatgpt	2	462	March 20, 2024
OpenAI Fine-Tuning: Multi-turn Dataset Examples API openapi , fine-tuning , gpt-3	5	10642	November 2, 2023

Segmenting training data for fine-tune

Related topics