I have built a RAG-based chatbot. I want to perform section-wise chunking on my document, which consists of data in the following order: header, description, image URL, and graph markdown. Let’s refer to this as a section. In this order, I have multiple sections in my document. I want to chunk my document so that each chunk consists of one complete section, from the header to the graph markdown. I am using a semantic splitter node parser from the Llama Index to perform the chunking.
Related Topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Optimal way to chunk word document for RAG(semantic chunking giving bad results) | 5 | 2831 | May 15, 2024 | |
What is the best way to chunk a PDF file for RAG in a smart way that preserves the meaning during retrieval? | 3 | 7013 | May 30, 2024 | |
Source document chunk identification and highlighting for RAG usecase | 1 | 339 | August 13, 2024 | |
Strategy for chunking rag input | 0 | 223 | May 27, 2024 | |
Automating Chunking for Customized GPT Knowledge in Vector Databases | 2 | 928 | April 27, 2024 |