ChatGPT & llamaindex & embeddings

curt.kennedy · March 16, 2023, 2:40pm

Your GPU might pair well with the open-source Facebook AI Similarity Search (FAISS) library. But if you have fewer than 1 million embeddings, as discussed above, you can do this “by hand” with a naive search like this:

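import numpy as np

def mips_naive(q, vecs):
    # Maximum inner product search: linear scan over every stored vector.
    mip = -1e10  # best inner product seen so far
    idx = -1     # index of the best match (stays -1 if vecs is empty)
    for i, v in enumerate(vecs):
        c = np.dot(q, v)  # dot product is the same as cosine similarity for unit vectors
        if c > mip:
            mip = c
            idx = i
    return idx, mip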
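
If the Python loop gets slow, the same search can be written as a single matrix-vector product. This is only a rough sketch: it assumes the embeddings fit in memory as one 2-D NumPy array with one row per embedding, and that they are unit length, so the dot product is the cosine similarity. The names mips_vectorized and mips_top_k are made up for illustration and are not from any library.

```python
import numpy as np

def mips_vectorized(q, vecs):
    """Return (index, score) of the row of vecs with the largest inner product with q."""
    scores = np.asarray(vecs) @ np.asarray(q)  # all inner products in one call
    idx = int(np.argmax(scores))
    return idx, float(scores[idx])

def mips_top_k(q, vecs, k=5):
    """Return (indices, scores) of the k largest inner products, best match first."""
    scores = np.asarray(vecs) @ np.asarray(q)
    k = min(k, scores.shape[0])
    # argpartition picks out the k largest scores without sorting everything;
    # only those k candidates are then sorted.
    top = np.argpartition(-scores, k - 1)[:k]
    top = top[np.argsort(-scores[top])]
    return top, scores[top]
```

For a retrieval setup you would probably want mips_top_k, so you can pass several of the closest chunks to the model instead of just the single best match.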

You could also use Redis; see this thread: Using Redis for embeddings
