The length of the embedding contents

AgusPG · March 22, 2023, 7:43pm

What I don’t understand is point 3. What do you mean by “I use title and subtitle to index embedding, and I store the content as meta_data to get to the content of the embed. I use the content as the text of the prompt context.“?

How is your actual call to the embedding endpoint? Can you share it with us?

If I were you, I’d def incorporate “title” and “subtitle” as global context of each chunk prior to embed it. I’d incorporate some other global metadata as well (timestamps? Author of each document? Short summary of the whole doc or at least key entities extracted via NER?)

The local context is a little bit more tricky, so I’d probably leave it for later .

Topic		Replies	Views
How can I use Embeddings with Chat GPT 3-5 Turbo Prompting	38	49586	September 6, 2023
Train (fine-tune) a model with text from books or articles API	62	29466	November 30, 2023
Poor quality response on trained LLM with pdf files Community gpt-4	29	7239	May 1, 2024
Creating a Chatbot using the data stored in my huge database Community embeddings , chatgpt , fine-tuning , api	90	95050	November 24, 2023
RAG is failing when the number of documents increase API	35	22091	December 17, 2024

The length of the embedding contents

Related topics