I have uploaded a 70mb german PDF to a vector store. It is a rule PDF of a role playing game.
Then I associate an assistant and ask questions in the playground. I use a system prompt that assistant should only use the pdf and not make anything up. I asked 4o-mini three questions and all answers were wrong/hallucinated. 4-mini hasnt produced better results.
In comparison I have tried ChatGPT and use the same file by onedrive connection. I have got much better answers there.
The text in the PDF can be copy/pasted in acrobat reader. So no image text. The PDF contains images, but they are just artwork.
First I have tried gpt-4o-mini
Then gpt-4o
system content: “You are a helpful assistant for a vampire role playing game. Dont make something up which is not in the provided text. If you dont know tell the user”
user content: “what is obfuscate 3?”
or
user content: "how much blood does obfuscate 3 cost3?
I have the PDF connected by a vector store
I have tried 3, 10, 30 search results
Other settings:
temperature: 0.55
top p: 0.55
I have reduced the values since I want rules not a fantasy story.