High-accuracy chatbot trained on our docs - not happy with the responses

Hi everyone,

This is my first post on this forum. I need a bit of advice on how to create an accurate chatbot trained on our data. My current approach is:

  1. export pages / files from our docs
  2. store them as files in OpenAI and attach them to a vector store
  3. create an assistant with the file_search tool, model gpt-4.1-nano / mini (sketched in code after this list)
  4. test, test, test
  5. result: lots of hallucinations, e.g. when I ask what our latest release is and what its features are. Releases are stored as separate files / pages with detailed information for each LTS release.
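Roughly, the setup looks like this (a minimal sketch; the file name, model, and instructions are placeholders, and on older openai-python versions the vector store calls live under client.beta):

```python
from openai import OpenAI

client = OpenAI()

# step 2: upload an exported doc and attach it to a vector store
doc = client.files.create(file=open("release-notes-lts.md", "rb"),
                          purpose="assistants")
store = client.vector_stores.create(name="product-docs")
client.vector_stores.files.create(vector_store_id=store.id, file_id=doc.id)

# step 3: create an assistant with file_search pointed at that store
assistant = client.beta.assistants.create(
    model="gpt-4.1-mini",
    instructions="Answer questions about our product from the attached docs.",
    tools=[{"type": "file_search"}],
    tool_resources={"file_search": {"vector_store_ids": [store.id]}},
)
```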

I am not an AI / ML expert, so a bit of guidance on how to approach this problem would be helpful. Thank you,
j.

Hi and welcome to the dev forum.

Personally, I wouldn’t go with the vector store / Assistants API, especially if precision is a must.

I would start by defining my relational database with all the data the bot needs to have access to.

I would add procedures to embed the stored data so it can be accessed via vector search.
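A minimal sketch of that embedding step, assuming the OpenAI embeddings endpoint (the model name and table are placeholders):

```python
from openai import OpenAI

client = OpenAI()

def embed(text: str) -> list[float]:
    # one embedding per stored row/chunk
    resp = client.embeddings.create(model="text-embedding-3-small", input=text)
    return resp.data[0].embedding

# then, for each row in e.g. a releases table, store the vector
# alongside the text (pgvector or similar):
# UPDATE releases SET embedding = %s WHERE id = %s
```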

Then I would wrap that database in a REST API and define access to the information via tools.
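A tool definition for that could look something like this (the function name and parameters are hypothetical; this is the flat tool format the Responses API expects):

```python
# hypothetical tool backed by the REST API around the database
tools = [{
    "type": "function",
    "name": "search_releases",
    "description": "Semantic search over the release-notes table.",
    "parameters": {
        "type": "object",
        "properties": {
            "query": {"type": "string", "description": "the user's question"},
            "top_k": {"type": "integer", "description": "rows to return"},
        },
        "required": ["query"],
    },
}]
```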

Then I would use the Responses API for the bot.
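Putting it together, a rough sketch of the bot loop, assuming the tools list above and a hypothetical run_search() that calls the REST API:

```python
import json

from openai import OpenAI

client = OpenAI()

response = client.responses.create(
    model="gpt-4.1",
    input=[{"role": "user", "content": "What is our latest LTS release?"}],
    tools=tools,
)

# if the model requested a tool call, execute it and send the result back
follow_up = []
for item in response.output:
    if item.type == "function_call":
        rows = run_search(**json.loads(item.arguments))  # your REST call
        follow_up.append({"type": "function_call_output",
                          "call_id": item.call_id,
                          "output": json.dumps(rows)})

if follow_up:
    final = client.responses.create(
        model="gpt-4.1",
        previous_response_id=response.id,
        input=follow_up,
        tools=tools,
    )
    print(final.output_text)
```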

Have you tried the regular gpt-4.1 model?

I would recommend playing with temperature and top_p, specifically lowering those in small increments, to see if this improves your results.
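For example (illustrative values; change one parameter at a time so you can tell which one helped):

```python
response = client.responses.create(
    model="gpt-4.1",
    input="What is our latest LTS release and its features?",
    temperature=0.2,  # lower = more deterministic output
    top_p=0.9,        # slightly narrows the sampling pool
)
```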

Also, prompt engineering is an overlooked aspect of LLM integration and may actually be a major factor in these issues. OpenAI has a good resource on prompting 4.1: GPT-4.1 Prompting Guide
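As a concrete example, an instruction block along these lines (the wording is just illustrative) tends to cut hallucinations on questions like the release one:

```python
instructions = """
Answer ONLY from the documentation retrieved via file search.
If the answer is not in the retrieved documents, say you don't know.
Never guess release numbers, dates, or feature lists.
When asked about the latest release, cite the release-notes file you used.
"""
```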

How is this any different from OpenAI’s vector store?


Personally, having full control over the whole system and each of the elements in it lets me get far better precision and an exact fit for my apps. It's more work at the beginning, but you get far better results and less work in the long run, especially if you plan to scale or reuse.