I'm creating an embedding application using LangChain, Pinecone, and OpenAI embeddings. While I was using the davinci model, I didn't experience any problems. When I switched to text-embedding-ada-002 because of davinci's very high cost, I stopped receiving normal responses.
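For reference, the embeddings instance can also be pinned to the model explicitly instead of relying on the library default (which, as far as I know, is already text-embedding-ada-002 in LangChain JS); a minimal sketch, where the environment variable name is my own placeholder:

```typescript
import { OpenAIEmbeddings } from 'langchain/embeddings/openai';

// Explicitly select the embedding model rather than relying on the default.
// text-embedding-ada-002 returns 1536-dimensional vectors, which matches the
// Pinecone index dimension used in the code below.
const embeddings = new OpenAIEmbeddings({
  openAIApiKey: process.env.OPENAI_API_KEY, // placeholder env var name
  modelName: 'text-embedding-ada-002',
});
```

Here is my full code: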
```typescript
import { OpenAIEmbeddings } from 'langchain/embeddings/openai';
import { RecursiveCharacterTextSplitter } from 'langchain/text_splitter';
import { OpenAI } from 'langchain/llms/openai';
import { loadQAStuffChain } from 'langchain/chains';
import { Document } from 'langchain/document';
import { timeout } from './config';

export const queryPineconeVectorStoreAndQueryLLM = async (
  client,
  indexName,
  question,
) => {
  console.log('=>(utils.ts:14) question', question);
  // 1. Start query process
  console.log('Querying Pinecone vector store...');
  // 2. Retrieve the Pinecone index
  const index = client.Index(indexName);
  // 3. Create query embedding
  const queryEmbedding = await new OpenAIEmbeddings().embedQuery(question);
  // 4. Query Pinecone index and return top 10 matches
  let queryResponse = await index.query({
    queryRequest: {
      topK: 10,
      vector: queryEmbedding,
      includeMetadata: true,
      includeValues: true,
    },
  });
  // 5. Log the number of matches
  console.log(`Found ${queryResponse.matches.length} matches...`);
  // 6. Log the question being asked
  console.log(`Asking question: ${question}...`);
  if (queryResponse.matches.length) {
    // 7. Create an OpenAI instance and load the QAStuffChain
    const llm = new OpenAI({
      modelName: 'text-embedding-ada-002',
    });
    const chain = loadQAStuffChain(llm);
    // 8. Extract and concatenate page content from matched documents
    const concatenatedPageContent = queryResponse.matches
      .map((match) => match.metadata.pageContent)
      .join(' ');
    // 9. Execute the chain with input documents and question
    const result = await chain.call({
      input_documents: [new Document({ pageContent: concatenatedPageContent })],
      question: question,
    });
    console.log('=>(utils.ts:48) result', result);
    // 10. Log the answer
    //console.log(`Answer: ${result.text}`);
    return result.text;
  } else {
    // 11. Log that there are no matches, so GPT-3 will not be queried
    console.log('Since there are no matches, GPT-3 will not be queried.');
  }
};
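
// Usage sketch for the function above (placeholder index name and question,
// not my real data):
//   const answer = await queryPineconeVectorStoreAndQueryLLM(
//     client, 'test-index', 'What is the document about?');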
export const createPineconeIndex = async (
  client,
  indexName,
  vectorDimension,
) => {
  // 1. Initiate index existence check
  console.log(`Checking "${indexName}"...`);
  // 2. Get list of existing indexes
  const existingIndexes = await client.listIndexes();
  // 3. If index doesn't exist, create it
  if (!existingIndexes.includes(indexName)) {
    // 4. Log index creation initiation
    console.log(`Creating "${indexName}"...`);
    // 5. Create index
    await client.createIndex({
      createRequest: {
        name: indexName,
        dimension: vectorDimension,
        metric: 'cosine',
      },
    });
    // 6. Log that the index is initializing
    console.log(
      `Creating index.... please wait for it to finish initializing.`,
    );
    // 7. Wait for index initialization
    await new Promise((resolve) => setTimeout(resolve, timeout));
  } else {
    // 8. Log if index already exists
    console.log(`"${indexName}" already exists.`);
  }
};
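
// Usage sketch for the function above (placeholder index name; 1536 is the
// output dimension of text-embedding-ada-002):
//   await createPineconeIndex(client, 'test-index', 1536);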
export const updatePinecone = async (client, indexName, docs) => {
  console.log('Retrieving Pinecone index...');
  // 1. Retrieve Pinecone index
  const index = client.Index(indexName);
  // 2. Log the retrieved index name
  console.log(`Pinecone index retrieved: ${indexName}`);
  // 3. Process each document in the docs array
  for (const doc of docs) {
    console.log(`Processing document: ${doc.metadata.source}`);
    const txtPath = doc.metadata.source;
    const text = doc.pageContent;
    // 4. Create RecursiveCharacterTextSplitter instance
    const textSplitter = new RecursiveCharacterTextSplitter({
      chunkSize: 1000,
    });
    console.log('Splitting text into chunks...');
    // 5. Split text into chunks (documents)
    const chunks = await textSplitter.createDocuments([text]);
    console.log(`Text split into ${chunks.length} chunks`);
    console.log(
      `Calling OpenAI's embedding endpoint with ${chunks.length} text chunks...`,
    );
    // 6. Create OpenAI embeddings for documents
    const embeddingsArrays = await new OpenAIEmbeddings().embedDocuments(
      chunks.map((chunk) => chunk.pageContent.replace(/\n/g, ' ')),
    );
    console.log('Finished embedding documents');
    console.log(
      `Creating ${chunks.length} vectors array with id, values, and metadata...`,
    );
    // 7. Create and upsert vectors in batches of `batchSize`
    const batchSize = 32;
    let batch: any[] = [];
    for (let idx = 0; idx < chunks.length; idx++) {
      const chunk = chunks[idx];
      const vector = {
        id: `${txtPath}_${idx}`,
        values: embeddingsArrays[idx],
        metadata: {
          ...chunk.metadata,
          loc: JSON.stringify(chunk.metadata.loc),
          pageContent: chunk.pageContent,
          txtPath: txtPath,
        },
      };
      batch = [...batch, vector];
      // When the batch is full or it's the last item, upsert the vectors
      if (batch.length === batchSize || idx === chunks.length - 1) {
        await index.upsert({
          upsertRequest: {
            vectors: batch,
          },
        });
        // Empty the batch
        batch = [];
      }
    }
    // 8. Log the number of vectors upserted
    console.log(`Pinecone index updated with ${chunks.length} vectors`);
  }
};
```
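This is roughly how I wire the three functions together (a sketch; the index name, file path, and environment variable names are placeholders, and the client is the legacy `PineconeClient` from `@pinecone-database/pinecone`, which matches the `queryRequest`/`upsertRequest` shapes above):

```typescript
import { PineconeClient } from '@pinecone-database/pinecone';
import { TextLoader } from 'langchain/document_loaders/fs/text';
import {
  createPineconeIndex,
  updatePinecone,
  queryPineconeVectorStoreAndQueryLLM,
} from './utils';

(async () => {
  // Initialize the legacy Pinecone client (v0.x API).
  const client = new PineconeClient();
  await client.init({
    apiKey: process.env.PINECONE_API_KEY!, // placeholder env var name
    environment: process.env.PINECONE_ENVIRONMENT!, // placeholder env var name
  });

  // 1536 is the output dimension of text-embedding-ada-002.
  await createPineconeIndex(client, 'test-index', 1536);

  // Load a plain-text document (placeholder path) and upsert its chunks.
  const docs = await new TextLoader('./docs/example.txt').load();
  await updatePinecone(client, 'test-index', docs);

  // Ask a question against the index.
  const answer = await queryPineconeVectorStoreAndQueryLLM(
    client,
    'test-index',
    'What is the document about?',
  );
  console.log(answer);
})();
```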
**An example of the response I receive from ada-002:**

```
'I, S,I,S,1,1,1,1\\n' +
'1 B,1,1 1 1S1theN 1 1\\n' +
'\\n' +
' 1 1 1 1 1\\n' +
'\\n' +
' 1,1,1,1\\n' +
'\\n' +
'1,4, 1,1,1\\n' +
'\\n' +
'1,1, 1,1\\n' +
'\\n' +
'1,1,1,1S2 1,1 1S2 1,1\\n' +
'\\n' +
'1,1,1,1,1,1,1,1,1\\n' +
'\\n' +
'1,1,1,1,1 1,1 1\\n' +
' 1,1,1,1,1,1,1,1,1,1, 1 1,1,1,1,1 1 1,1,1\\n' +
'1,1,1,1,1,1,1\\n' +
'\\n' +
'3 1\\n' +
'\\n' +
' 1,1,1,1 1,1,1,1,1,1,1\\n' +
' 1,1,1,1,1,1\\n' +
'\\n' +
' 1,1,1,1,1\\n' +
'\\n' +
', 1,1 1 1\\n' +
'\\n' +
' 1,1,1'
```

The response from davinci was “Test_Flatcity”.
I tried reducing and increasing the batch size, and the dimension (1536) is unchanged from the docs. How can I get a normal response from this model?