Using namespaces in Pinecone with Langchain ConversationalRetrievalChain and OpenAI LLM

I’m able to use Pinecone as a vector database to store embeddings created with OpenAI text-embedding-ada-002, and I create a ConversationalRetrievalChain using langchain, passing OpenAI gpt-3.5-turbo as the LLM and the Pinecone vectorstore as the retriever. The use case is that I’m saving the backstory of a fictional company employee so that I can do question answering over the embeddings.

I now want to save three different backstories into the same Pinecone index, which I suppose can be accomplished using namespaces? How can I change the code to save to and retrieve from three different namespaces? I searched langchain’s documentation on its Pinecone vectorstore, but I couldn’t find anything.

Relevant sections of my code:

import streamlit as st
from dotenv import load_dotenv
import re, os
import pinecone
from langchain.embeddings import OpenAIEmbeddings 
from langchain.vectorstores import Pinecone
from langchain.chat_models import ChatOpenAI
from langchain.memory import ConversationBufferMemory
from langchain.chains import ConversationalRetrievalChain
from langchain.callbacks import get_openai_callback

from langchain.prompts import (
    ChatPromptTemplate,
    HumanMessagePromptTemplate,
    SystemMessagePromptTemplate,
)

def initializePinecone():
    PINECONE_API_KEY = os.environ.get("PINECONE_API_KEY")
    PINECONE_ENVIRONMENT = os.environ.get("PINECONE_ENVIRONMENT")

    pinecone.init(api_key=PINECONE_API_KEY, 
              environment=PINECONE_ENVIRONMENT)
    
    # Set the index name
    index_name = "fictionalemployees"

    # check if the index already exists
    if index_name not in pinecone.list_indexes():
        # create the index if it does not exist
        pinecone.create_index(
            index_name,
            dimension=1536,
            metric="cosine"
        )

    return index_name

def get_pineconeVectorStore(text_chunks, indexName):
    embeddings = OpenAIEmbeddings(deployment="text-embedding-ada-002", model="text-embedding-ada-002", chunk_size=10)
    pineconeVectorStore = Pinecone.from_texts(
                            text_chunks, 
                            embeddings, 
                            index_name=indexName)
    return pineconeVectorStore

def get_conversation_chain(vectorstore):
    llm = ChatOpenAI(model="gpt-3.5-turbo", max_tokens=50)

    general_system_template = r""" 
    You are John, a Customer Service Officer at the company. You are answering questions regarding your job at helping with the rental application process.
    ----
    {context}
    ----
    """
    general_user_template = "Question:{question}"
    messages = [
                SystemMessagePromptTemplate.from_template(general_system_template),
                HumanMessagePromptTemplate.from_template(general_user_template)
    ]
    qa_prompt = ChatPromptTemplate.from_messages( messages )

    memory = ConversationBufferMemory(
        memory_key='chat_history', return_messages=True)
    
    conversation_chain = ConversationalRetrievalChain.from_llm(
        llm=llm,
        retriever=vectorstore.as_retriever(),
        memory=memory,
        combine_docs_chain_kwargs={"prompt": qa_prompt}
    )

    return conversation_chain

def handle_userinput(user_question):
    response = count_tokens(
                st.session_state.conversation,
                {'question': user_question})
    st.session_state.chat_history = response['chat_history']

    for i, message in enumerate(st.session_state.chat_history):
        if i % 2 == 0:
            st.write(user_template.replace(
                "{{MSG}}", message.content), unsafe_allow_html=True)
        else:
            message.content = truncate_at_last_sentence(replace_list_format(message.content))
            st.write(bot_template.replace(
                "{{MSG}}", message.content), unsafe_allow_html=True)



# Somewhere in main:
  # create Pinecone vector store
  pineconeVectorStore = get_pineconeVectorStore(text_chunks, pineconeIndexName)

  # create conversation chain using Pinecone
  st.session_state.conversation = get_conversation_chain(pineconeVectorStore)

^ Bump

I’d also like to see an answer to this question. I’m doing similar work on my Obsidian plugin.

Just organizationally, I’m not sure where this is supposed to be categorized b/c this is really a question about the langchain implementation of the pinecone API. But since I worked through this at some point, here’s my recollection:

You need to specify a namespace kwarg in your upsert and retrieval calls. How exactly you do this is a matter of design and UI, in terms of when the user provides that info. Specifically, with respect to your upsert function:

def get_pineconeVectorStore(text_chunks, indexName, nameSpace):
    embeddings = OpenAIEmbeddings(deployment="text-embedding-ada-002", model="text-embedding-ada-002", chunk_size=10)
    pineconeVectorStore = Pinecone.from_texts(
                            text_chunks,
                            embeddings,
                            index_name=indexName,
                            namespace=nameSpace)
    return pineconeVectorStore

I’ve modified it to accept a third argument, nameSpace, and roughly shown how it needs to be used.

Pinecone will automatically create namespaces as needed when you upsert to a non-existent namespace. By default you are upserting to namespace="", which you’ll possibly have to manage/delete at some point.
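If you do end up with stray vectors in that default namespace, something like the following should clear it out. This is a rough sketch against the same pinecone client initialized earlier in the thread; describe_index_stats is handy for checking per-namespace vector counts first:

index = pinecone.Index("fictionalemployees")

# describe_index_stats() reports per-namespace vector counts, so you can
# verify what actually landed where before deleting anything
print(index.describe_index_stats())

# remove everything in the default "" namespace; other namespaces are untouched
index.delete(delete_all=True, namespace="")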

The same namespace kwarg needs to be included when you retrieve from your vector store as well. For whatever reason, I’m not seeing precisely how that works in your code. It’s been a while since I’ve worked w/ langchain and I’m not familiar w/ some of the features you’re using.
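That said, if the vectors are already in the index, the langchain Pinecone vectorstore can be pointed at an existing index and scoped to a namespace. A minimal sketch, assuming the from_existing_index constructor from the langchain versions I used (the namespace name employee-john is just a placeholder):

embeddings = OpenAIEmbeddings(deployment="text-embedding-ada-002", model="text-embedding-ada-002")

# attach to the already-populated index, scoped to one backstory's namespace
pineconeVectorStore = Pinecone.from_existing_index(
    index_name="fictionalemployees",
    embedding=embeddings,
    namespace="employee-john")  # placeholder namespace name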

As of Dec 31st, 2023, we were able to use the search_kwargs parameter in as_retriever like so to query individual namespaces.

vector_store = Pinecone(
    pinecone_index,
    embed_model.embed_query,
    "text"
)

retriever = vector_store.as_retriever(
    search_kwargs={"namespace": "<your namespace>"}
)
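
Tying that back to the original question, one way to serve three backstories from one index is to build a separate chain per namespace. A sketch under the same assumptions as above (employee-john and employee-jane are placeholder namespace names, not anything from the original code):

def get_conversation_chain(vectorstore, nameSpace):
    llm = ChatOpenAI(model="gpt-3.5-turbo", max_tokens=50)
    memory = ConversationBufferMemory(memory_key='chat_history', return_messages=True)

    # scope retrieval to a single backstory's namespace
    return ConversationalRetrievalChain.from_llm(
        llm=llm,
        retriever=vectorstore.as_retriever(search_kwargs={"namespace": nameSpace}),
        memory=memory)

# one chain per backstory, all backed by the same index
john_chain = get_conversation_chain(pineconeVectorStore, "employee-john")
jane_chain = get_conversation_chain(pineconeVectorStore, "employee-jane")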