How to structure the embeddings?

Remember the embeddings all correlate and map back to YOUR DATA!

So all this is trying to do is smooth out the interface between <Random Question> and <Company Approved Answer>.

A big problem is this lack of correlation of the question (or input) and your data. And you don’t want a “weak question” to correlate with off-topic data either. You can also tell GPT to respond with “I don’t know” if the answer doesn’t lie within the data that was pulled from the embedding correlations.

Not if you fed the answer back to the prompt!

Example input: “What color sky!”

Your database:
“What color is the sky?”
“Is the earth flat?”
“Where is Waldo?”

The correlation locks onto the first one. So you look this up, and also the answer in your database (in another field), and put this in the prompt:

Use the following context to answer the question:

Q: What color is the sky?
A: Blue

Q: What color sky!
A:

########

The Q/A pair was from your embedded data, and the original question is fed back in and GPT will reference your data to answer it!

1 Like