GPTs hanging on "Searching my knowledge"

0xeb · November 13, 2023, 4:16pm

Hello team,

I started experimenting with creating my own GPTs. I purely configure by hand.
I uploaded 2 PDFs with a total of 3mb size.
The system prompt instructs the model to first and foremost always consult the knowledge before answering.

70% of the times, after I type my query, the model hangs on “Searching my knowledge”, generates nothing at all, or just errors out: “Message not found in conversation” error.

A bunch of questions apart from this report:

Why the slowdown when querying with retrieval enabled?
How much time, in the background, does it take for my documents to be considered fully indexed, embedded, etc. before I am able to fully query them via my custom GPT?
What kind of chunking is happening behind the scenes?
Since, OAI’s retrieval is a blackbox for us, what is the best advise to format our documents to be consumed by the retrieval tool?

Thanks

MaxZieb · November 15, 2023, 10:53am

Same, here… gets stuck on retrieving knowledge.

I deleted my PDF and uploaded md files… now I get stuff like this:

mrmaxpepper · November 15, 2023, 11:05am

This certainly is a global issue, hope they resolve it asap

AILu · November 15, 2023, 11:07am

0xeb:

Hello team,

I started experimenting with creating my own GPTs. I purely configure by hand.
I uploaded 2 PDFs with a total of 3mb size.
The system prompt instructs the model to first and foremost always consult the knowledge before answering.

70% of the times, after I type my query, the model hangs on “Searching my knowledge”, generates nothing at all, or just errors out: “Message not found in conversation” error.

A bunch of questions apart from this report:

Why the slowdown when querying with retrieval enabled?

How much time, in the background, does it take for my documents to be considered fully indexed, embedded, etc. before I am able to fully query them via my custom GPT?

What kind of chunking is happening behind the scenes?

Since, OAI’s retrieval is a blackbox for us, what is the best advise to format our documents to be consumed by the retrieval tool?

Thanks

I am experiencing the same issue

EDIT: I add that it would be nice (I’d even say “fair”) that in case of error from GPT, the question is not counted towards the usage cap for GPT-4.

prozac · November 16, 2023, 10:00am

Same issue - My GPTs crashes after saying searching my knowledge

jobterhaar · November 18, 2023, 11:08am

My GPT is searching the PDFs I uploaded, and so far it is not crashing - but it doesn’t seem to be very good at actually finding stuff. In a few cases, its responses reflect the content of the knowledge base I uploaded (a manual for artistic research), but more often it says it can’t find sections or topics which are most certainly in the PDFs I uploaded. Could this be related to the formatting of these documents? Are there ways to optimize the PDFs for searching by my GPT?

Slashdir · November 18, 2023, 5:39pm

Yeah, i have the same issue with regards to PDF’s as knowledge. Either the search takes a very long time and provides only sections of the pdf. Most of the time though it will reply with “message in conversation not found”, or replies it’s not able to search it’s knowledgebase. In the edit section I will have it check if it has access to the documents and it will confirm it does. However if I ask to give me the first 5 sentences of each document it will go into 'message in conversation not found.

0xeb · November 19, 2023, 5:31pm

Ok, I may not be the first, but it seems all our issues stem from the fact that: we are making assumptions.

Who told us that OAI does embeddings at all? Or uses vector dbs or even goes through the trouble of implementing RAG.

Not sure how to repro this, but I caught OAI emit some code as with it was “Searching my knowledge”.

It showed me its search() function

import fitz  # PyMuPDF
import re

# Open the documentation
sdk_doc = fitz.open("/mnt/data/sdk.pdf")

# Function to search for information in the document
def search_document(doc, query, max_pages=5):
    results = []
    for page_num in range(min(max_pages, len(doc))):  # Limiting the number of pages to scan
        page = doc[page_num]
        text = page.get_text()
        if re.search(query, text, re.IGNORECASE):
            results.append((page_num, text))
    return results

# Search for information about enumerating structures and their names
search_results = search_document(sdk_doc, r"enumerate structures|structure name")

search_results

Now the user asked: “how to enumerate structures and their name”

Note how the user question got translated to a regular term(s) search:

search_document(sdk_doc, r"enumerate structures|structure name")

And if this document search function is invoked each time the LLM wants to access the knowledge base, then that explains why it is too slow to respond.

So I feel, for now, OAI implemented ‘Knowledge’ quick and dirty. Later they can perhaps improve it.

Topic		Replies	Views
Custom GPT says "no info" despite actively searching database for said info GPT builders gpt-4 , chatgpt , custom-gpt	30	1533	February 20, 2024
Errors in reading knowledge base in the custom GPT Documentation chatgpt	18	2510	December 13, 2023
What are the limitations of GPT-4 in analyzing PDF text? Prompting gpt-4	6	26482	March 12, 2024
How to best use GPTs with PDF files? Plugins / Actions builders plugin-development	14	16823	September 18, 2024
Speeding up knowledge base searches in build a GPT? Plugins / Actions builders gpt-4 , chatgpt , gpt-builder	1	1385	November 14, 2023

GPTs hanging on "Searching my knowledge"

Related topics