Is it good practice to include HTML tags in context?

When retrieving chunks by cosine similarity, is it better to keep the HTML tags in the stored content, so the extra structure helps the LLM answer better when the chunk is passed in as context?

The issue is that these tags increase the distance in vector space between user queries and the stored chunks.

As a general rule, your stored embeddings should be as close to your search terms as possible. With text, this typically means stripping anything that might distort the semantic meaning, such as markup. Better models always help with this, but you should still minimise all external factors where possible.
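A minimal sketch of this idea, using a naive stdlib regex to strip tags before embedding (the function name and sample chunk are illustrative; for production HTML you would more likely use a real parser such as BeautifulSoup's `get_text()`):

```python
import re

def strip_html(raw: str) -> str:
    """Replace tags with spaces, then collapse runs of whitespace.

    Naive by design: it does not handle comments, scripts, or '>'
    inside attribute values. Good enough to illustrate the point.
    """
    text = re.sub(r"<[^>]+>", " ", raw)
    return " ".join(text.split())

# Hypothetical chunk as it might appear in a scraped document.
chunk = "<div><h2>Returns</h2><p>Items can be returned within 30 days.</p></div>"

print(strip_html(chunk))
# -> Returns Items can be returned within 30 days.
```

A common pattern is to embed the stripped text (so queries and chunks sit close in vector space) while keeping the original HTML in the chunk's metadata, so you can still hand the tagged version to the LLM at answer time if the structure helps.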