I had the same confusion and found an answer here: Fine-tuning with conversation format: Which messages are used for training? , it seems we still need to segment the conversation despite we provided it as a message list. Please note this is from a cookbook’s comment and not guaranteed to be true, I didn’t find any references in the official docs, TBH the fine tuning API is so poorly documented.
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Fine-tuning with conversation format: Which messages are used for training? | 9 | 6302 | November 16, 2023 | |
| Correct format for dataset in chat model fine-tuning | 4 | 2188 | January 9, 2024 | |
| How to structure fine tuned data | 8 | 5097 | September 2, 2023 | |
| Fine tuning data format for chatting history | 2 | 462 | March 20, 2024 | |
| OpenAI Fine-Tuning: Multi-turn Dataset Examples | 5 | 10642 | November 2, 2023 |