Classify whether a question can be answered from the provided data

I am fairly new to the OpenAI API.
We have a custom company AI chat and we are using an embedding model for factual checks.

We don’t want to keep the full context history in memory, because in some cases it can contain a lot of tokens.

  • We want to decide, for each incoming question, whether it is actually worth querying the embedding model.
  • We want to avoid adding duplicate data to the context.
  • There are cases where a question is asked about context that has already been retrieved.
  • Currently, every question the user asks triggers an embedding lookup and adds the retrieved context to the chat.
    – Even a simple “hello, how are you” pulls in context from the embedding model, which is not necessary.
  • We want to avoid this and only look up the embedding table when it is actually needed.
  • For now we have worked around this by keeping only ONE context in memory.
    – This is a problem, because if you ask a follow-up question within the context of the previous one, the chat loses that data.

In order to do that, we need some way to classify whether the data in the context plus the chat history is sufficient to answer the question the user is asking.
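Concretely, the kind of check I have in mind would look something like this; this is just a rough sketch using the pre-1.0 `openai` Python package, and the model choice and prompt wording are placeholders rather than anything we actually run:

```python
import openai  # pre-1.0 openai package; openai.api_key is set elsewhere

def context_is_sufficient(context: str, history: str, question: str) -> bool:
    """Ask the model for a yes/no verdict on whether the question can be
    answered from the already-retrieved context plus the chat history,
    i.e. without doing another embedding lookup."""
    prompt = (
        "Context:\n" + context + "\n\n"
        "Chat history:\n" + history + "\n\n"
        "Question: " + question + "\n\n"
        "Can the question be answered using only the context and chat history "
        "above? Answer yes or no:"
    )
    resp = openai.Completion.create(
        model="text-davinci-003",  # placeholder; any instruct-style GPT-3 model
        prompt=prompt,
        max_tokens=1,
        temperature=0,
    )
    return resp["choices"][0]["text"].strip().lower().startswith("yes")
```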

I tried experimenting with a GPT-3 prompt to classify the data, but I failed miserably. Is GPT-3 even capable of doing this?

Could you suggest how I would go about solving this issue?


Instead of classifying the context itself, you could classify whether the incoming user input (or the last few messages in the chat log) needs the context from your corpus.
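A rough sketch of that gating step, assuming the pre-1.0 `openai` package and a placeholder instruct model (the prompt wording is only illustrative, and a fine-tuned classifier could do the same job more cheaply):

```python
import openai  # pre-1.0 openai package

GATE_PROMPT = (
    "Decide whether the latest user message needs a lookup in the company "
    "knowledge base to be answered, or whether it is small talk / answerable "
    "from the recent conversation alone.\n\n"
    "Recent messages:\n{history}\n\n"
    "Latest user message: {message}\n\n"
    "Needs knowledge-base lookup (yes or no):"
)

def needs_corpus_lookup(history: str, message: str) -> bool:
    resp = openai.Completion.create(
        model="text-davinci-003",  # placeholder model choice
        prompt=GATE_PROMPT.format(history=history, message=message),
        max_tokens=1,
        temperature=0,
    )
    return resp["choices"][0]["text"].strip().lower().startswith("yes")
```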

Also, I don’t find it necessary to store the past context from embeddings in the chat message history, since the assistant’s answer should provide a sufficient summary for use as context in future messages. This will reduce your token use substantially.

Lastly, if you’re inserting only one piece of context, and that context is over 250 tokens (~1,000 characters), then you would probably benefit from smarter “chunking” of your embedding inputs.
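For the chunking point, even a naive splitter that keeps each piece under a rough token budget helps; this sketch assumes 1 token ≈ 4 characters, and a real tokenizer would be more precise:

```python
def chunk_text(text: str, max_tokens: int = 250) -> list:
    """Naively split text into whitespace-delimited chunks of roughly
    max_tokens tokens each, assuming ~4 characters per token."""
    max_chars = max_tokens * 4
    chunks, current = [], ""
    for word in text.split():
        if current and len(current) + 1 + len(word) > max_chars:
            chunks.append(current)
            current = word
        else:
            current = (current + " " + word).strip()
    if current:
        chunks.append(current)
    return chunks
```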

Most of what you have said is already implemented that way. The issue we are facing is this:

Instead of classifying the context itself, you could classify whether the incoming user input (or the last few messages in the chat log) needs the context from your corpus.

That is the idea; the question is how to go about it. My idea was to build a secondary embedding database from the chat history and check whether a new question has a good match there. Is there a more efficient way to do this?
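Roughly what I am picturing, as a sketch with the pre-1.0 `openai` package, numpy, and an arbitrary similarity threshold that would still need tuning:

```python
import numpy as np
import openai  # pre-1.0 openai package

def embed(text: str) -> np.ndarray:
    resp = openai.Embedding.create(model="text-embedding-ada-002", input=[text])
    return np.array(resp["data"][0]["embedding"])

def best_history_match(question: str, history_vectors: list) -> float:
    """Return the highest cosine similarity between the new question and the
    embedded chat-history turns stored in the secondary database."""
    q = embed(question)
    best = 0.0
    for v in history_vectors:
        best = max(best, float(np.dot(q, v) / (np.linalg.norm(q) * np.linalg.norm(v))))
    return best

# If the match is strong enough, reuse the existing context instead of
# querying the main embedding table (0.85 is an arbitrary placeholder):
# if best_history_match(user_question, history_vectors) > 0.85:
#     answer_from_existing_context()
```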

You can certainly do it without embeddings by using GPT-3 with a fine-tuned Ada or Babbage. More details below, but I do this regularly, and it is one of the hidden gems of GPT-3 compared to other solutions out there. It’s quick, accurate, and relatively inexpensive to run these fine-tuned models.
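To make that concrete: a classifier fine-tune of this kind is just prompt/completion pairs with single-token labels, and inference is a one-token completion. Everything below (labels, separator, file and model names) is a placeholder sketch of the legacy completion-model fine-tuning format, not a drop-in recipe:

```python
# Training data: one JSONL line per example, e.g.
#   {"prompt": "User: hello, how are you\n\n###\n\n", "completion": " no"}
#   {"prompt": "User: what is our refund policy?\n\n###\n\n", "completion": " yes"}
# Fine-tune with the legacy CLI, e.g.:
#   openai api fine_tunes.create -t lookup_classifier.jsonl -m ada

import openai  # pre-1.0 openai package

def needs_lookup(message: str) -> bool:
    """One-token classification with the fine-tuned model."""
    resp = openai.Completion.create(
        model="ada:ft-your-org-2023-01-01-00-00-00",  # placeholder fine-tuned model id
        prompt="User: " + message + "\n\n###\n\n",
        max_tokens=1,
        temperature=0,
    )
    return resp["choices"][0]["text"].strip().lower() == "yes"
```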
