Reducing the cost of GPT-4 by using embeddings

LlamaIndex is a good resource:

https://gpt-index.readthedocs.io/en/latest/guides/primer/usage_pattern.html

Ultimately, at scale you’ll want a vector DB like Pinecone to store the embeddings, but LlamaIndex’s built-in in-memory index makes it trivially simple to store and query:

# store docs to an index
from llama_index import GPTSimpleVectorIndex, SimpleDirectoryReader

# load your taxonomy documents (here from a local "data" folder)
documents = SimpleDirectoryReader("data").load_data()

index = GPTSimpleVectorIndex([])
for doc in documents:
    index.insert(doc)
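Each insert call chunks the document and embeds it (with OpenAI’s text-embedding-ada-002 by default, at the time of writing), so this one-time indexing step is where the embedding cost is paid; lookups afterwards are cheap.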

Then query the index:

# the index embeds the question, retrieves the most similar chunks,
# and passes only those to the LLM to synthesize an answer
response = index.query("What did the author do growing up?")
print(response)
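Since you won’t want to re-embed the taxonomy on every run, you can persist the index to disk and reload it later. A minimal sketch using the save_to_disk/load_from_disk methods from the same LlamaIndex API (the file name is illustrative):

# build once, persist, and reuse across runs
index.save_to_disk("taxonomy_index.json")

# later, in the processing job
from llama_index import GPTSimpleVectorIndex
index = GPTSimpleVectorIndex.load_from_disk("taxonomy_index.json")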

That’s the simplest case. Store your taxonomy mappings as documents in the index. Then, for each batch of a few hundred lines you need to process, the first query goes to the index to retrieve only the relevant taxonomies, using a similarity threshold you can tune, and finally you feed those into the GPT-4 API. There’s no need to send the full taxonomy to the model each time; only the matching taxonomy entries.
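To make that two-step flow concrete, here’s a minimal sketch assuming the pre-0.5 LlamaIndex query API (similarity_top_k, response_mode="no_text", source_nodes) and the openai-python ChatCompletion endpoint; classify_lines, the 0.75 cutoff, and the prompt wording are illustrative, not part of the original recipe:

import openai

SIMILARITY_CUTOFF = 0.75  # illustrative; tune against your own data

def classify_lines(index, lines):
    # Step 1: retrieve only the taxonomy entries relevant to this batch.
    # response_mode="no_text" skips answer synthesis, so this step costs
    # an embedding lookup rather than an LLM call.
    retrieved = index.query(
        "\n".join(lines),
        similarity_top_k=10,
        response_mode="no_text",
    )
    matched = [
        node.source_text
        for node in retrieved.source_nodes
        if node.similarity is None or node.similarity >= SIMILARITY_CUTOFF
    ]

    # Step 2: send just the matching entries plus the lines to GPT-4,
    # instead of the full taxonomy every time.
    completion = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[
            {
                "role": "system",
                "content": "Classify each input line against these taxonomies:\n"
                + "\n".join(matched),
            },
            {"role": "user", "content": "\n".join(lines)},
        ],
    )
    return completion.choices[0].message["content"]

The tighter the cutoff, the fewer taxonomy entries go into the prompt, which is exactly where the GPT-4 token savings come from.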
