Fluctuating Relevance Scores in OpenAI Vector Store — Bug or Intended Behavior?

Hey everyone,

I’ve recently been running into something strange with OpenAI’s vector store behavior, and I wanted to see if anyone else has noticed it or has insight into what’s going on.

The Issue

When I upload documents to a vector store (using OpenAI’s file_search/vector_store flow), I immediately try querying them with questions that should match directly. But the relevance scores I get back are oddly low — like 0.03, 0.028, or similarly tiny values — even when the document is clearly the best match.

The really weird part? When I ran the exact same query the next day, the score jumped to 0.9+. Then the day after, it was back to 0.03. It’s inconsistent, almost as if some background indexing or reranking process is changing or failing intermittently.

Test Case

To be sure, I ran a controlled test:

  • Created a brand new vector store through the API.
  • Uploaded a single document (a simple text file about a random topic).
  • Asked a clear question that the document directly answers.

Despite it being the only document in the store, I still got a relevance score of around 0.02. But other times, this same test yields 0.99.
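In case it helps anyone reproduce this, here’s roughly the flow I used as a minimal sketch. It assumes a recent openai Python SDK (on older versions the vector store methods live under `client.beta.vector_stores`), and the file name and query text are just placeholders.

```python
# Minimal sketch of the controlled test above.
# Assumes a recent openai Python SDK; on older versions these methods
# live under client.beta.vector_stores instead of client.vector_stores.
from openai import OpenAI

client = OpenAI()

# 1. Create a brand new vector store.
store = client.vector_stores.create(name="score-fluctuation-test")

# 2. Upload a single text file and wait for it to finish processing.
with open("random_topic.txt", "rb") as f:  # placeholder file
    client.vector_stores.files.upload_and_poll(vector_store_id=store.id, file=f)

# 3. Ask a question the document directly answers and print the scores.
results = client.vector_stores.search(
    vector_store_id=store.id,
    query="A clear question the document directly answers",  # placeholder query
)
for r in results.data:
    print(r.filename, r.score)  # sometimes ~0.02, other runs ~0.99
```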

I first started noticing this right after the major OpenAI outage a few days ago (when OpenAI was down for hours). Ever since then, these fluctuations have been frequent. Could something have broken in the underlying embedding or reranking pipeline?

What I’m Wondering

  • Is anyone else seeing this inconsistency in relevance scores from vector store queries?
  • Does OpenAI do any delayed processing or background optimization after initial upload that would affect results?
  • Is this just a glitch post-outage, or is this an intended behavior we should account for?

Any thoughts, similar experiences, or official responses would be super helpful. This is making reliable vector-based retrieval kind of shaky for production apps right now.

Thanks!
— Dominic

7 Likes

We are using the Responses API with file search. We retrieve the scores from the search results to show each score next to its citation when it is output in the response. Today we noticed that these scores have changed dramatically: we used to get scores between 0.5 and 0.9, but now the highest scores we see are around 0.03. What gives? I’ve been scouring the web and these forums to learn what has happened and am coming up empty. Can anyone shed some light on this for us?
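For context, this is roughly how we pull the scores out. It’s a hedged sketch rather than our exact code; the model name, vector store ID, and question are placeholders.

```python
# Rough sketch of reading per-citation scores from a Responses API call
# with file_search. Model name and vector store ID are placeholders.
from openai import OpenAI

client = OpenAI()

response = client.responses.create(
    model="gpt-4o-mini",
    input="What does the uploaded document say about this topic?",
    tools=[{"type": "file_search", "vector_store_ids": ["vs_123"]}],
    include=["file_search_call.results"],  # return the search results (with scores) in the output
)

# Each file_search call in the output carries the retrieved chunks and their
# relevance scores; we display the score next to the corresponding citation.
for item in response.output:
    if item.type == "file_search_call" and item.results:
        for result in item.results:
            print(result.filename, result.score)  # used to be 0.5-0.9, now topping out around 0.03
```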

3 Likes

Hi @dlovric2, it’s great to see you back.

Just a quick question: you’re referencing ChatGPT, but are you actually using the API?

3 Likes

Apologies for the confusion - I am referring to the creation and usage of the vector_store through the API. I just edited the post.

2 Likes

I see my post was merged into this one. I’ll be following. For us, the problem started yesterday afternoon, 6/17/2025 around 5:30-6:30p central. Once the low scores started, they stayed that way for us. We didn’t experience the inconsistency the OP reported.

2 Likes

Thank you for your understanding!
I saw the two similar, new reports within a single day and decided to merge the topics before informing staff about your issues.

Hope this will be resolved soon!

4 Likes

Some additional feedback: we are also seeing this issue when using the search function within a vector store. The max score from a recent sample query was 0.0322, while a useful result for our use case would typically be > 0.5. At first I assumed it was something on our end, but after reproducing the issue with a new vector store and freshly uploaded data, I suspect it’s not.
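For reference, the sample query was along these lines (a sketch only; the vector store ID and query text are placeholders):

```python
# Sketch of the sample query against the vector store search endpoint.
# The vector store ID and query are placeholders.
from openai import OpenAI

client = OpenAI()

results = client.vector_stores.search(
    vector_store_id="vs_abc123",
    query="a question that should clearly match the uploaded data",
    max_num_results=5,
)

scores = [r.score for r in results.data]
print(max(scores) if scores else None)  # recently ~0.0322; a useful match for us is usually > 0.5
```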

1 Like

Thanks for sharing this detailed breakdown. If you can send this to [my email], we can triage it to the best-placed team to speak to this.

1 Like

The scoring appears to have returned to normal.

Are there any explanations as to what happened?

1 Like