Improving Semantic Search Engine Accuracy Using OpenAI Embeddings and Llama VectorStoreIndex

anon10827405 · May 17, 2024, 9:23pm

You need to reduce the noise.

Instead of grouping everything together, simply embed the reviews on their own and then link each embedding to its product name.

Separate the concerns.
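Here's a rough sketch of what I mean, not a definitive implementation. Only the review text gets embedded; the product name rides along as plain metadata. `toy_embed` is a made-up stand-in (hashed bag-of-words) so the example runs offline; you'd swap in your real embedding call. `ReviewRecord`, `build_index`, and `search` are hypothetical names too.

```python
import hashlib
import math
import re
from dataclasses import dataclass


def toy_embed(text: str, dim: int = 256) -> list[float]:
    """Stand-in embedder (assumption): hashed bag-of-words, L2-normalised.
    Replace with a call to your real embedding model."""
    vec = [0.0] * dim
    for token in re.findall(r"[a-z0-9]+", text.lower()):
        bucket = int(hashlib.md5(token.encode("utf-8")).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


@dataclass
class ReviewRecord:
    product_name: str    # metadata only, never embedded
    review_text: str     # the single concern that gets embedded
    vector: list[float]


def build_index(reviews, embed=toy_embed):
    """reviews: iterable of (product_name, review_text) pairs."""
    return [ReviewRecord(name, text, embed(text)) for name, text in reviews]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(index, query, embed=toy_embed, top_k=3):
    """Rank reviews by similarity to the query, then report the linked product."""
    query_vec = embed(query)
    ranked = sorted(index, key=lambda r: cosine(query_vec, r.vector), reverse=True)
    return [(r.product_name, r.review_text) for r in ranked[:top_k]]
```

Calling `search(build_index(reviews), "battery lasts")` returns `(product_name, review_text)` pairs, so the product name shows up in the results without ever adding noise to the vectors.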

You can combine embeddings in many ways, so it’s better to create groupings of single-concern embeddings. Product names carry no meaning for review search, but you could do some fun things like see how the embedding engine “feels” about the product names.
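One possible way to combine them, purely as an illustration: average the review vectors for each product to get a single product-level vector. This assumes you already have `(product_name, vector)` pairs with vectors of equal length; mean pooling is just one option among many, and `mean_vector` and `group_by_product` are hypothetical helper names.

```python
import math
from collections import defaultdict


def mean_vector(vectors):
    """Average equal-length vectors, then L2-normalise the result."""
    dim = len(vectors[0])
    summed = [0.0] * dim
    for vec in vectors:
        for i, value in enumerate(vec):
            summed[i] += value
    mean = [s / len(vectors) for s in summed]
    norm = math.sqrt(sum(x * x for x in mean)) or 1.0
    return [x / norm for x in mean]


def group_by_product(records):
    """records: iterable of (product_name, review_vector) pairs.
    Returns one pooled vector per product name."""
    grouped = defaultdict(list)
    for name, vec in records:
        grouped[name].append(vec)
    return {name: mean_vector(vecs) for name, vecs in grouped.items()}
```

Because every review was embedded on its own, you can regroup the same vectors later (per product, per category, per time window) without re-embedding anything.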

I’d recommend Weaviate. They offer a database that accepts your schema and can embed items both individually and in groups.
