Document Sections: Better rendering of chunks for long documents

shuntley · April 8, 2024, 11:30pm

I just implemented a customised version of the semantic chunking from this video. I wrote it in TS and it works pretty well.

Had to customise a few different things to get it to work well, including writing a custom retriever to make sure retrieval worked with semantic chunks.

I kept my semantic chunk format to be composed of the ‘sentences’ and I actually to retrieval against all my sentences, but then score all my semantic chunks based on sentence scoring for the query. IE I just have an algorithm for ranking my semantic chunks for each query.

But otherwise everything in the video is pretty close to what I did.

Topic		Replies	Views
RAG is failing when the number of documents increase API	35	17757	December 17, 2024
Using gpt-4 API to Semantically Chunk Documents API embeddings	186	22312	April 2, 2025
The length of the embedding contents API	48	33752	December 13, 2023
What's the most accurate? Fine tunning vs Prompt Stuffing Community fine-tuning	13	4994	October 2, 2023
Creating a Chatbot using the data stored in my huge database Community embeddings , chatgpt , fine-tuning , api	93	86175	November 25, 2023

Document Sections: Better rendering of chunks for long documents

Related topics