Retrieval makes no sense - Only first message retrieves reliably

anon-dev-72 · July 24, 2024, 6:49pm

Hello everyone,

I’m currently using the Assistants API with specific instructions to always retrieve context to answer the user’s questions accurately.

I’ve noticed that when the question is the first message in the thread, retrieval works every time: GPT-4o fetches the necessary context from the vector store and provides the correct answer.

However, if the question isn’t the first message in the thread, retrieval works only about 50% of the time. I’m puzzled by this inconsistency. Could it be that the initial exchange influences the model’s decision on whether to retrieve additional context? Does the model assume that the initial retrieval was sufficient for the subsequent messages?

Has anyone else experienced this or have any insights into why this might be happening?

Thank you!
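
For anyone who wants to test the "model decides not to retrieve" theory, here is a minimal sketch that forces the retrieval tool on every run rather than leaving the choice to the model. It assumes the Assistants API v2 with the `file_search` tool and the `tool_choice` run parameter. The helper name `build_run_kwargs` and the placeholder IDs are hypothetical, and nothing in this thread confirms that this resolves the inconsistency.

```python
from typing import Any


def build_run_kwargs(
    thread_id: str,
    assistant_id: str,
    force_retrieval: bool = True,
) -> dict[str, Any]:
    """Build keyword arguments for creating a run on an existing thread.

    Hypothetical helper: it only assembles the arguments, so it can be
    inspected without making a network call.
    """
    kwargs: dict[str, Any] = {
        "thread_id": thread_id,
        "assistant_id": assistant_id,
    }
    if force_retrieval:
        # Assumption: asking for the file_search tool explicitly makes the
        # model call it on this run instead of deciding for itself.
        kwargs["tool_choice"] = {"type": "file_search"}
    return kwargs


# Usage with the official Python SDK (placeholder IDs, not real ones):
# from openai import OpenAI
# client = OpenAI()
# run = client.beta.threads.runs.create_and_poll(
#     **build_run_kwargs("thread_abc123", "asst_abc123")
# )
```

If retrieval becomes consistent on later messages with `tool_choice` set, that would suggest the model was skipping the tool call on its own, rather than the vector store failing to return results.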
