Causal/autoregressive fine tuning?

cfox · February 8, 2023, 2:59am

I was surprised to see that the language models fine tuning is based on providing prompt-completion. My understanding was the the base models available for fine tuning (Davinci, Curie, Babbage, Ada) on trained using a next word prediction task (causal/autoregressive language modeling). For that type of training, you would expect the input data to be a list of texts, not text pairs. Presumably loss is being computed over the completion tokens only. That seems a bit inefficient. You can already see this in the “Case study: Customer support chatbot” example. Previous conversation messages are repeated across different prompts. It seems it would be better in that case to provide a list of conversations as the input (less expensive for the API user). But perhaps they are doing something else for fine tuning that I didn’t expect.

Does anyone on here know more about what the approach they are taking for fine tuning?

ruby_coder · February 8, 2023, 3:06am

As I understand things, the fine-tuning function occurs in the decoding component of the GPT output architecture, and not in the model pre-training component.

Topic		Replies	Views
Fine tuning - how exactly does it work? API	6	2390	December 23, 2023
Autoregressive Fine-Tuning for Chat Models API fine-tuning	0	105	July 10, 2024
What exactly and technically happens with fine-tuning? API	10	5394	January 3, 2024
Fine-tuning only available for 'base models'? API	6	1388	December 23, 2023
Is the GPT 3.5 fine-tuning service prompt tuning? Community gpt-35-turbo , chatgpt , fine-tuning	8	4393	September 13, 2023

Causal/autoregressive fine tuning?

Related topics