QA fine-tuned chatbot gives nonfactual answers instead of answering from the trained data

exactly. it is not the number of examples.

you need “high quality” examples!

10k examples just (maybe) gives you a statistical chance of having enough high quality ones

but what is high quality?

high quality = examples that have “aware” distinctions built into them

they ask x, the answer is y

but if they ask x, excluding t, but maybe c, the answer is a

etc
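
a rough sketch of what that pattern could look like as training data. the prompt/completion JSONL layout, the field names, and the return-policy content are all made up for illustration; use whatever format your fine-tuning setup actually expects.

```python
import json

# Hypothetical examples: the same base question (x) with and without
# qualifiers, each paired with the answer that fits that exact situation.
examples = [
    # they ask x, the answer is y
    {"prompt": "What is the return window?",
     "completion": "30 days from delivery."},
    # they ask x, excluding t
    {"prompt": "What is the return window, not counting sale items?",
     "completion": "30 days from delivery for all full-price items."},
    # they ask x, excluding t, but maybe c
    {"prompt": "What is the return window, not counting sale items, if the box was opened?",
     "completion": "14 days from delivery for opened full-price items."},
]


def to_jsonl(rows):
    """Serialize one example per line, the usual layout for fine-tuning files."""
    return "\n".join(json.dumps(row, ensure_ascii=False) for row in rows)


jsonl_text = to_jsonl(examples)
print(jsonl_text)
```

the point is that each qualifier changes the answer, so the model sees the distinction instead of one flat question/answer pair.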

examples for every situation, giving the AI an “understanding” of the way answers a, b, y, and d all semantically relate to each other

then it “understands” what to say when asked about a, b, y, and d (and the more closely a new question is semantically related to a, b, y, and d, even if it is not one of them, the closer it can get to a good answer)
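
a toy way to picture “semantically closer”. this uses plain word-overlap cosine similarity as a stand-in, which is only an assumption for illustration; a real model's sense of similarity is far richer than shared words. the sample questions are invented.

```python
import math
from collections import Counter


def bow(text):
    """Bag-of-words vector: lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between the word-count vectors of two strings."""
    va, vb = bow(a), bow(b)
    dot = sum(va[w] * vb[w] for w in va)
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical questions: one from the training data, one nearby, one unrelated.
trained = "what is the return window for sale items"
near = "what is the return window for clearance items"
far = "how do i reset my password"

print(cosine(trained, near))  # high: most words are shared
print(cosine(trained, far))   # zero: no words are shared
```

a question like `near` sits close to something the model was trained on, so it has a chance of answering well; a question like `far` has nothing nearby to lean on.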

yes you are right - ~90% of the problem is in the data, ~10% of the problem is in the request. IMO.

need better data/prompt

understand how it thinks: chat with it for a while and ask it questions about what it “thinks”. i have spent hours and hours doing this
