One relatively easy way to solve this is to have a Codex model transform the user request into a query for your database (e.g. SQL): give it a pseudo-schema of your database, run the generated query, and return the results to ChatGPT as the context for the user request (this is roughly how Bing Chat works as well):
User → App: “Does the onion soup have mushrooms in it?”
App → OpenAI API (using codex model completion): “Given this database schema: [insert the structure of your tables], write me a SQL query for this question: [insert user question]”.
OpenAI API → App: “Here is your SELECT statement: SELECT x, y, z FROM table1 JOIN table2 ON…” etc.
App → Restaurant Database: [execute SQL and retrieve results]
App → OpenAI API (using text model completion this time): “Act as the manager of a restaurant, who wants to answer user queries about the restaurant and only about the restaurant. Given the following user question and the related search results, respond to the user using only the results as context: User question: ‘Does the onion soup have mushrooms in it?’ - Search Results: [insert the results table of what the codex model gave you]”
OpenAI API → App: “Yes, our onion soup does contain traces of mushrooms. Do you not like them? Would you like me to recommend alternative dishes without mushrooms?”
… and so on.
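The steps above can be sketched in code. This is only a skeleton of the two prompt-building steps; the schema, prompt wording, and function names are my own placeholders, and the actual calls to the completion endpoints (and the SQL execution in between) are left to your app:

```python
# Hypothetical pseudo-schema for the restaurant database (placeholder).
SCHEMA = "CREATE TABLE dishes (id INT, name TEXT, ingredients TEXT);"

def build_sql_prompt(schema: str, question: str) -> str:
    """Step 1: prompt for the code-completion model that writes the SQL."""
    return (
        f"Given this database schema:\n{schema}\n"
        f"Write me a SQL query for this question: {question}"
    )

def build_answer_prompt(question: str, results: str) -> str:
    """Step 3: prompt for the text-completion model that answers the user."""
    return (
        "Act as the manager of a restaurant, who wants to answer user "
        "queries about the restaurant and only about the restaurant. "
        "Given the following user question and the related search results, "
        "respond to the user using only the results as context.\n"
        f"User question: {question}\n"
        f"Search results: {results}"
    )
```

You'd send the first prompt to a code model, execute the returned SQL against your database, then feed the result rows into the second prompt for a text completion.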
This is simplistic, and you'd need to construct and tweak the right prompts, add error handling, and maybe also use the moderation endpoint that OpenAI offers, etc.
But roughly, this is how I’d do it.
You can probably cache the returned SQL query, e.g. in Redis, based on the user question.
But the problem you'll have is that a given user question won't be repeated all that often (and when it is, it may be phrased differently, making it hard to tell whether the cached SQL statement still applies).
So while you may save the occasional API call this way, cache hits will probably be rare and you'll still make real API calls most of the time.
But at least in the case of a restaurant, you won’t expect that many people interacting with your chatbot so the volumes will be small and affordable.
For much larger use cases, you could perhaps work on extracting some sort of normalised version of every user request, to make it more likely to match a cached SQL query. That may also decrease your API calls a bit, but it's trickier and won't be foolproof, unlike caching exact API requests in the non-ML world.
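A toy sketch of that normalise-then-cache idea, using a plain dict as a stand-in for Redis. The normalisation here (lowercasing, stripping punctuation, collapsing whitespace) is deliberately naive, which is exactly why it isn't foolproof: genuinely different phrasings still miss the cache.

```python
import re

sql_cache = {}  # stand-in for Redis: normalised question -> cached SQL

def normalise(question: str) -> str:
    """Naive normalisation so near-identical questions share a cache key."""
    q = question.lower().strip()
    q = re.sub(r"[^\w\s]", "", q)  # drop punctuation
    q = re.sub(r"\s+", " ", q)     # collapse whitespace
    return q

def cached_sql(question: str):
    """Return a previously generated SQL statement, or None on a miss."""
    return sql_cache.get(normalise(question))
```

On a miss you'd fall back to the Codex call and store the result under the normalised key.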
To offer another idea that's probably harder but more token-efficient and safer than generating SQL: you can get embeddings for your menu items and the services you offer, store them in a database, then chunk up the user input and get embeddings for that. Using cosine distance, you find the k nearest items/services among your menu embeddings and include those in your system prompt.
Yes, each time you change your menu database you'll have to update the embeddings. Ideally menu changes don't happen that often, but no menu is so large that you couldn't chunk and embed the whole thing every day for more than a few pennies. The chats are going to be your token-cost leader, with both an embedding request for the user input and then a completion request once you've put together your system message. That being said, I think this would overall be cheaper than making Codex create queries, because then you'd need to send your whole schema along with every code-completion request (every user message). Plus going through a remote SQL server is going to add a ton of latency, if that was the plan.
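The nearest-neighbour step above can be sketched without any external dependencies. The two-dimensional vectors below are tiny fakes; in practice each menu item's vector would come from an embeddings endpoint and be stored alongside the item text:

```python
import math

def cosine_sim(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def top_k(query_vec, items, k=2):
    """items: list of (menu_text, embedding) pairs.
    Returns the k menu texts closest to the query embedding."""
    ranked = sorted(items, key=lambda it: cosine_sim(query_vec, it[1]),
                    reverse=True)
    return [text for text, _ in ranked[:k]]
```

The returned texts are what you'd splice into the system prompt as context.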
Ah I see.
Yes, I’m afraid you will probably have to pass in your schema every time, with every new completion. Your schema may not change, but the model API doesn’t remember it.
Even with the new chat model, the API is “stateless”, i.e. it requires you to re-send your entire context with every request, even if you are continuing an existing conversation.
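A minimal sketch of what "stateless" means in practice, assuming the chat-completions message format (a list of role/content dicts): your app owns the transcript and resends the whole list with every request.

```python
def add_turn(history, role, content):
    """Append one turn; the ENTIRE list is sent with every API request,
    since the API itself remembers nothing between calls."""
    history.append({"role": role, "content": content})
    return history
```

Each new completion call would receive the full `history` list, system message included, which is why long conversations keep costing more tokens per turn.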
You could create a new “fine-tuned” model, using OpenAI’s API to send training completions based on your menu and restaurant information, and then just query that.
But I don’t know how accurate that would be.
No. ChatGPT cannot search a DB; but as mentioned, you can take the response from the API and use that response to drive actions which query the DB.
I do not recommend having the chatbot generate the SQL queries on the fly, however. It's better, in my view wearing my systems engineering hat, to write your own SQL queries, because you cannot trust a text-generating language model to produce error-free queries. Generating them adds a layer of complexity which is not necessary.
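That hand-written alternative could look like the sketch below: a small map of known intents to parameterised SQL, so no model-generated SQL ever touches the database. The table and column names are placeholders for illustration.

```python
import sqlite3

# Hand-written, parameterised queries keyed by intent (placeholders).
QUERIES = {
    "ingredients_of": "SELECT ingredients FROM dishes WHERE name = ?",
}

def run_intent(conn, intent, *params):
    """Execute a pre-written query; params are bound, never interpolated."""
    return conn.execute(QUERIES[intent], params).fetchall()
```

The model's job then shrinks to classifying the user question into one of the known intents and extracting its parameters, which is far easier to validate than arbitrary SQL.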
Can we add our own dataset over here? I have a CSV file… In the above link they have added a dataset from the Hugging Face datasets hub. So is it possible to modify the code slightly so that we can add our own dataset? If yes, then please tell me how to go about it. Thanks!
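For what it's worth, the Hugging Face `datasets` library does accept local files directly, e.g. `load_dataset("csv", data_files="my_menu.csv")`, so swapping the hub dataset for your own CSV is usually a one-line change. If you'd rather avoid the dependency for a quick test, the standard library can turn a CSV into the same kind of list-of-dicts rows (file name and columns below are placeholders):

```python
import csv
import io

def rows_from_csv(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))
```

Either way, the rest of the pipeline can stay the same as long as it iterates over rows with named fields.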