Autoregressive Fine-Tuning for Chat Models

I’m running experiments on LLMs that involve fine-tuning on research papers (for knowledge acquisition). I did this autoregressively with davinci-002, where the data were formatted as prompt-completion pairs with an empty prompt (technically a single whitespace, because empty strings aren’t allowed anymore) and the completion was the content of the paper. For example:

{"prompt": " ", "completion": "<text from research paper>"}

I would like to know if there’s any way to do autoregressive fine-tuning like the above for chat models, instead of the typical supervised fine-tuning.

I imagine I would have to format the fine-tuning data as:
{"messages": [{"role": "user", "content": " "}, {"role": "assistant", "content": "<text from research paper>"}]}

Is this sensible?
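
In case it’s useful, here’s a sketch of how the existing prompt-completion records could be converted into that chat layout (filenames are the same placeholders as above):

import json

# Read the legacy prompt-completion records and rewrite them in the
# proposed chat format.
with open("davinci_autoregressive.jsonl") as src, \
        open("chat_autoregressive.jsonl", "w") as dst:
    for line in src:
        record = json.loads(line)
        chat_record = {
            "messages": [
                # The single-whitespace prompt becomes the user turn...
                {"role": "user", "content": record["prompt"]},
                # ...and the paper text becomes the assistant turn.
                {"role": "assistant", "content": record["completion"]},
            ]
        }
        dst.write(json.dumps(chat_record) + "\n")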

Additionally, is there anything unprincipled about doing autoregressive fine-tuning on chat models? For instance, do the chat capabilities decrease performance on a task like this (since the model “expects” question-answer formatting), or does this kind of tuning cause the chat capabilities to be lost?