RAG and Embeddings - Is it better to embed text with labels or not?

DevGirl · January 27, 2024, 6:54pm

@almosnow - I may be misunderstanding (apologies if so, please elaborate and I’ll help where I can) – but this architecture sounds backwards at first glance. One of those situations where “When you’re a hammer, everything looks like a nail.” (in this case, an LLM being the hammer).

Because I may be misunderstanding, I’ll explain how I would go about this. If I’m wrong, I think that in your explanation, it will help constructively answer your queston with the best solution.

How I’d architect this:

Create a scattered ingestion of enough “documents” (questionnaires) to build your classification dataset; a simple list of categories / etc.
Build a prompt to convert each of the freeform questionnaires into structured data, which will be stored along with the original questionnaire text.
…
With the data now in-place, your application (the ability to search/analyze/report) on the data will hit a relational database. Or better yet, a relational DB with a vector store as well.

In other words, you should be able to gain far more capability by first processing the data into something more usable for your purpose, than using an LLM in place of the RDBMS/SQL component. Even if you need an LLM to replace the “interface” (aka human-data-middleware), you’re still better off with the data being processed and the RDBMS/vector RB doing the bulk of the filtering.

Does that make any sense?

Topic		Replies	Views
Scaling RAG chatbot system to millions of documents API gpt-4 , prompt-engineering , rag	18	6076	February 28, 2024
RAG and Embeddings - When to embed context (Follow-Up) Community embeddings , rag	5	2925	January 29, 2024
Example incorporation into query formulation API	14	1320	December 16, 2023
Should I modify user queries before semantic search? API chatgpt , api	22	3349	June 28, 2024
Mixing embedding services? API api	37	4909	July 1, 2024

RAG and Embeddings - Is it better to embed text with labels or not?

Related topics