Using gpt-4 API to Semantically Chunk Documents

SomebodySysop · August 27, 2024, 10:41pm

I use Weaviate also, didn’t know about the clustering option – will need to look into that.

What I’ve been doing are two things:

Small-to-Big Retrieval, where I programmatically retrieve x chunks before and after each chunk that is returned in the cosine similarity search. See: Advanced RAG 01: Small-to-Big Retrieval | by Sophia Yang, Ph.D. | Towards Data Science
Chunk Retrieval Rating: I rate (0 - 10) each retrieved chunk as to its relationship to the query submitted. I remove the chunks with low ratings and return to the model only those with the highest likelihood of answering the query. This process is neither as time-consuming nor as expensive as I originally thought it would be.
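For anyone who wants to try the two steps above, here is a rough sketch of how they might look in Python. It is not my production code. It assumes chunks are stored in document order with an integer index, and that the rating comes from some callable (in practice an LLM call). The names `expand_with_neighbors`, `filter_by_rating`, `window`, and `min_rating`, and the cutoff of 7, are all made up for illustration.

```python
from typing import Callable, Iterable, List


def expand_with_neighbors(hit_indices: Iterable[int],
                          total_chunks: int,
                          window: int = 1) -> List[int]:
    """Small-to-big: add `window` chunks before and after every hit.

    Assumes chunk i-1 and chunk i+1 are the document neighbors of chunk i.
    Returns sorted, de-duplicated indices clamped to the document bounds.
    """
    expanded = set()
    for i in hit_indices:
        lo = max(0, i - window)
        hi = min(total_chunks - 1, i + window)
        expanded.update(range(lo, hi + 1))
    return sorted(expanded)


def filter_by_rating(query: str,
                     chunks: Iterable[str],
                     rate: Callable[[str, str], int],
                     min_rating: int = 7) -> List[str]:
    """Rate each chunk 0-10 against the query and drop the low scorers.

    `rate` is a placeholder for whatever does the scoring (e.g. a model
    prompt that returns an integer). `min_rating` is an assumed cutoff.
    Surviving chunks come back highest-rated first.
    """
    scored = [(rate(query, chunk), chunk) for chunk in chunks]
    kept = [pair for pair in scored if pair[0] >= min_rating]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in kept]
```

For example, `expand_with_neighbors([2, 5], total_chunks=7, window=1)` returns `[1, 2, 3, 4, 5, 6]`: overlapping neighborhoods are merged, so no chunk is sent to the model twice. Passing the rater in as a function keeps the filtering logic independent of whichever model does the rating.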

These two methodologies, along with the Hierarchical/Semantic chunking process discussed here, and Weaviate using the OpenAI text-embedding-3-large embedding model, are giving me the best responses I’ve ever received.
