Thank you for your thoughtful response.
I still have a question.
From your explanation, I understand that fine-tuning does not equate to developing inference skills. However, I assumed that GPT-4o already possesses a strong enough inference capability to deduce grandparent-grandchild relationships.
As proof of this, if I provide GPT-4o with the following prompt, it correctly answers:
“A’s child is B. B’s child is C. What is the relationship between A and C?”
So, what is the key difference between these two cases?
- When I provide the relationships between A and B, and B and C through fine-tuning, the model is unable to infer the relationship between A and C.
- When I provide the same relationships via a prompt, the model can correctly deduce the relationship between A and C.