exactly. it is not the number of examples.
you need “high quality” examples!
10k just (maybe) gives you a chance of having enough high quality examples, statistically
but what is high quality?
high quality = examples that have “aware” distinctions built into them
if they ask x, say y
but if they ask x, excluding t, but maybe c, say a
etc
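a rough sketch of what a "distinction built in" could look like as data. everything here is hypothetical (the `Example` class, the `has_contrast` helper, and the rule that the "base question" is whatever comes before the first comma); it only shows the idea of pairing the same question with different qualifiers and different answers:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Example:
    prompt: str
    answer: str


# Hypothetical contrast set: the same base question "x",
# once plain and once with qualifiers that change the answer.
CONTRAST_SET = [
    Example("x", "y"),
    Example("x, excluding t, but maybe c", "a"),
]


def has_contrast(examples):
    """True if some base question appears with more than one distinct answer.

    Assumption: the base question is the text before the first comma.
    """
    answers_by_base = {}
    for ex in examples:
        base = ex.prompt.split(",")[0].strip()
        answers_by_base.setdefault(base, set()).add(ex.answer)
    return any(len(answers) > 1 for answers in answers_by_base.values())
```

a set with only `Example("x", "y")` has no contrast; adding the qualified variant is what gives it one.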
examples for every situation, giving the AI an “understanding” of the way answers a, b, y, and d all semantically relate to each other
then it “understands” what to say when asked about a, b, y, and d (and the closer a new question is semantically to a, b, y, and d, even when it is not one of them, the closer its answer can get)
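a toy way to picture "the closer the question, the closer the answer". this is not how a model works internally; it is a nearest-example lookup that uses word overlap (Jaccard similarity) as a crude stand-in for semantic similarity. the function names and the sample prompts are made up for illustration:

```python
def jaccard(a: str, b: str) -> float:
    """Word-overlap similarity in [0, 1]; a crude proxy for semantic closeness."""
    words_a, words_b = set(a.lower().split()), set(b.lower().split())
    if not words_a and not words_b:
        return 1.0
    return len(words_a & words_b) / len(words_a | words_b)


def closest_example(question, examples):
    """Return (prompt, answer, score) for the known prompt most similar to question."""
    prompt, answer = max(examples.items(), key=lambda kv: jaccard(question, kv[0]))
    return prompt, answer, jaccard(question, prompt)


# Hypothetical known examples (prompt -> answer).
EXAMPLES = {
    "how do i reset my password": "use the reset link",
    "how do i delete my account": "open account settings",
}
```

a question that is worded a little differently from a known prompt still lands on that prompt's answer, just with a lower score; the further away the wording, the weaker the match.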
yes you are right - ~90% of the problem is in the data, ~10% of the problem is in the request. IMO.
need better data/prompt
understand how it thinks: chat with it for a while and ask it questions about what it “thinks”. i have spent hours and hours doing this