Train back and forth dialogues

lechnerf · February 23, 2022, 3:57pm

Hello everyone,

i am currently working on my master’s thesis and wanted to fine tune a GPT-3 Curie chatbot.
I suceeded, but because i used Q&A style dialogues a single input provoces a total “conversation finishing answer”.
But i wanted to give it a more chatbotty feel and found a dataset, where the answer part actually asks questions back and the conversation goes on for a couple of iterations between two parties.
Because the finetuning model requires a “prompt/completion” style input thogh i don’t know how to finetune curie to keep the context of the conversation.

I hope this is in any way understandable, it’s quite the complicated problem.

Thank you for your help

sps · February 23, 2022, 4:46pm

Hi @lechnerf

This “memory” problem of GPT-3 is very common for the chatbot scenario. There’ve been quite some post on the community with potential solutions involving using GPT-3 to condense/summarize the previous conversation to retain context, so as to make the most out of it.

My hypothesis is that it can be solved with a “rolling memory” i.e. remember the most recent N tokens. I haven’t got a chance to test it out though as my grant expired in October last year.

lechnerf · February 23, 2022, 5:14pm

@sps Thank you for the answer.

I will now lable the entire conversation before the last sentence as “prompt” and try to finetune a curie model. We’ll see how this works out.

sps · February 23, 2022, 5:40pm

In my last response, the potential solutions I suggested are to be used for the conversation and not for fine-tuning the model.

Yes, do this with the existing fine-tuned model you have. Though you will quickly run out of tokens with every single dialogue between user and bot. Hence the need to condense.

marc.gehring · March 1, 2022, 4:26pm

hey, marcbe we can get in touch. marc.gehring@icloud.com

antonio.ciolino · March 2, 2022, 5:25pm

I’ve been using the rolling memory idea - it’s “good”, but it’s surprising how far back a conversation really goes, even with summarization, which quickly renders this process obsolete after 4 or 5 “events” from my ad-hoc experiments.

sps · March 2, 2022, 5:48pm

Yes that’s kinda expected, given the token limit. Another one of my hypothesis would be using embeddings, this should give significant room for memory.
Here’s how this would go:

The latest conversation is appended to an archive.
At every point the human’s message is used to search and rank N semantically similar lnes from the archive.
Then these N lines are given to completion engine as context to generate the response.
The generated response is sent back to human as the message reply.

Note that this will require figuring out the prompt(s) that get the completion engine to generate appropriate response.

antonio.ciolino · March 2, 2022, 6:10pm

I started down a similar path but abandoned it for the same reason I don’t use the /answers endpoint: too many calls (and the costs get high fast), as well as poor performance (out of /answers). Now, I haven’t tried embeddings, so there might be something to this I haven’t experimented with, but if it requires an upload of files to be the data source, that could take too long to realistically use.

sps · March 2, 2022, 6:16pm

True. But if it’s done in a chat interface, the wait time can be covered with some kind of typing... or other UX

antonio.ciolino · March 2, 2022, 6:17pm

I meant file upload is painfully slow, and having the data processed takes time. If done in batches over 30 mins or so…maybe.

sps · March 2, 2022, 6:22pm

I don’t know how is it taking ~30min on your end but on my end it takes like ~10 seconds to upload a file and get a response when I’m using I the /answers api. My guess is that using embeddings and doing the whole procedure, shouldn’t take more than a minute.

Also you can cut the majority of that time when the data is saved on cloud like AWS/Azure etc.

antonio.ciolino · March 2, 2022, 6:23pm

Possibly because I’m thinking of the fine tuning indexing process. File uploads are fast, though the answers API is, as I mentioned, slightly costlier.

sps · March 2, 2022, 6:26pm

The /answers API I mentioned is just for reference. Also one definitely should not fine-tune a model every time the user sends a message.

brkrabac · March 15, 2022, 8:26pm

One of the things I’ve found to be helpful is to pass the chat history so far, plus the new interaction and ask it to rewrite the new interaction to leverage the context from the chat history. This is a separate prompt/api call. Then you get a single question you can pass through the rest of your pipeline, avoid it becoming self-reinforcing, but still including the important references.

Topic		Replies	Views
Finetuning for shortening prompts Documentation fine-tuning	10	3825	December 24, 2023
Own model fine tuning for communication platform API chatgpt , fine-tuning	16	1892	December 24, 2023
Do I need to send prompt every time for a same task? API	10	14773	February 15, 2024
Best prompt engineering to simulate the remembrance of the conversation Prompting	2	2647	December 19, 2023
Context aware context for follow-up question Prompting embeddings , gpt-4 , api	13	9042	October 16, 2024

Train back and forth dialogues

Related topics