How to do retrieval and return ID from ChromaDB

JuanjoAlm · October 20, 2023, 11:00am

Im using ChromaDB for storing my embedded texts and I need to know the ID of the embedded chunk into the ChromaDB when I try to do retrieval. I need it to reference the generated text with the used document.

My code for retrieval is:

    def __setRetrieval(self, vectordb, llm):
        memory = ConversationBufferMemory(
            memory_key=MEMORY_KEY,
            return_messages=True
        )
        return ConversationalRetrievalChain.from_llm(
            llm,
            retriever=vectordb.as_retriever(search_type=RETRIEVAL_TYPE),
            memory=memory
        )

And my code for embedding is:

        # Dividir el documento en fragmentos
        splitter = RecursiveCharacterTextSplitter(
            chunk_size=SPLIT_CHUNK_SIZE_CHARS,
            chunk_overlap=SPLIT_CHUNK_OVERLAP_CHARS)
        docs_split = splitter.split_documents(docs_txt)
        logger.info(f""" 
                    Chunks: "{len(docs_split)}" 
                """)

        # Guardar los datos en Chroma
        openai_lc_client = Chroma.from_documents(
            docs_split, 
            embeddings, 
            client=chroma_client, 
            collection_name=COLLECTION_NAME_DEFAULT
        )

Thanks!

sk33 · February 23, 2024, 5:40pm

Hi @JuanjoAlm , were you able to figure it out ? I am looking for similar solution.

Topic		Replies	Views
How to use chroma db as retriever API chromadb	2	4162	May 22, 2024
How do I use ChromaDB is Create Embeddings API chatgpt , chromadb	0	1236	March 22, 2024
Identify chunks with langchain and ChromaDB Community chatgpt , api	2	2750	May 23, 2024
Need Help with RAG and Embeddings Community embeddings , chatgpt , chromadb	0	675	April 2, 2024
Load embedding from disk - Langchain Chroma DB API embeddings , langchain	6	22698	February 6, 2024

How to do retrieval and return ID from ChromaDB

Related topics