Fine-tuning a model on full chats instead of a single prompt-response pair

Hi, can we fine-tune gpt-3.5-turbo on a list of messages instead of a single system, user, and assistant message? Has anyone tried that? Does it improve results?
Example JSON:

```json
{"messages": [
  {"role": "system", "content": sys_prompt},
  {"role": "user", "content": user_prompt_1},
  {"role": "assistant", "content": as_response_1},
  {"role": "user", "content": user_prompt_2},
  {"role": "assistant", "content": as_response_2}
]}
```
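
To make that concrete, here is a minimal sketch of writing one such multi-turn conversation as a single JSONL training line. The message contents and file name are made-up placeholders:

```python
import json

# Made-up placeholder conversation; in practice these come from your own chat logs
conversation = [
    {"role": "system", "content": "You are a helpful support assistant."},
    {"role": "user", "content": "My order hasn't arrived yet."},
    {"role": "assistant", "content": "Sorry about that. Could you share your order number?"},
    {"role": "user", "content": "It's 12345."},
    {"role": "assistant", "content": "Thanks! It shipped yesterday and should arrive tomorrow."},
]

# Each line in the JSONL file is one complete training example
with open("train.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps({"messages": conversation}) + "\n")
```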

I ran into the same task, and I followed this video to generate the fine-tuning dataset from messages:

https://www.youtube.com/watch?v=ceSu1w_CzXA&t=334

This is a different problem. What I actually want is something like:

```json
{"messages": [
  {"role": "system", "content": sys_prompt},
  {"role": "user", "content": user_prompt_1},
  {"role": "assistant", "content": as_response_1},
  {"role": "user", "content": user_prompt_2},
  {"role": "assistant", "content": as_response_2}
]}
```

The goal is to train the model to respond in a specific way given a specific situation.

This is what I did based on that video, and I completed my fine-tuning script the way you describe. It took me about 3-4 hours to finish.

Yes, you can fine-tune with many different messages inside each conversation. Your dataset should ideally reflect the kinds of conversations you would expect to occur.
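
If it helps, here is a rough sketch of uploading such a JSONL file and starting the job, assuming the v1 OpenAI Python SDK. The file name and model string are just examples:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload the multi-turn JSONL training file
training_file = client.files.create(
    file=open("train.jsonl", "rb"),
    purpose="fine-tune",
)

# Start a fine-tuning job on the uploaded file
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",
)
print(job.id, job.status)
```

Each JSONL line is treated as one full conversation, so include as many user/assistant turns per example as your real use case calls for.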