Mini or Nano? 4o or 4.1? RAG MODEL

JVillarrubia9 · August 12, 2025, 10:46am

Based on your experience with chatbots using RAG, which model has worked best for you?
Both when responding and when interpreting the prompt, because I see that if you want to add a new line to the prompt, such as “respond in the language of the user’s question,” it doesn’t take this new rule into account, and it also happens that if you tell it how to act, it takes some things into account, but not others.
The main problem I’m having is that I can’t get it to not refer to the information I give it when it answers me. It always says things like “in the information I have” or “in the documentation provided.”

What have been your conclusions? Thank you very much.

_j · August 12, 2025, 10:56am

manage your own chat
use your own functions
after a function return message add a system message
describe the needed output behavior
(be OpenAI, do this for web_search and file_search on developer APIs because models fail to follow directions: break tool iterations and developer applications)

If the AI is constantly told by OpenAI before every user message sent “the user has uploaded files”, then RAG applications will also break and the AI will talk to the users.

Solutions: Get off OpenAI internal tools. Get off Responses. Build portable apps that can follow to any developer-friendly AI inference provider that has good AI models, turn-by-turn.

Topic		Replies	Views
Designing a Custom Chatbot with RAG and Function Calling GPT builders	4	1363	January 22, 2025
How to structure system prompt, RAG context, and user input for multi-turn RAG-based chatbots using OpenAI Chat Completions API lost-user	1	631	June 20, 2025
Custom chatbot says that it's developed by OpenAI API gpt-4	33	2285	April 2, 2024
Issues and training when updating the LLM model on a project GPT builders gpt-4 , azure	3	515	June 13, 2024
Best approach for adding knowledge to base model API fine-tuning , rag	4	1543	February 7, 2024

Mini or Nano? 4o or 4.1? RAG MODEL

Related topics