I have built a RAG-based chatbot. I want to perform section-wise chunking on my document, which consists of data in the following order: header, description, image URL, and graph markdown. Let’s refer to this as a section. In this order, I have multiple sections in my document. I want to chunk my document so that each chunk consists of one complete section, from the header to the graph markdown. I am using a semantic splitter node parser from the Llama Index to perform the chunking.
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| What is the best way to chunk a PDF file for RAG in a smart way that preserves the meaning during retrieval? | 5 | 18942 | October 28, 2024 | |
| Strategy for chunking rag input | 0 | 265 | May 27, 2024 | |
| Optimal way to chunk word document for RAG(semantic chunking giving bad results) | 5 | 5124 | May 15, 2024 | |
| How to tackle similar section names in RAG? | 1 | 222 | December 18, 2024 | |
| How do i chunk PDFs with complex layout in RAG application? | 1 | 903 | December 4, 2024 |