I am trying to fine tune a model where I am trying to qualify a record by matching two json objects criteria and requirements.
I am trying gpt-turbo to calculate score or return how many criteria are met with requirements provided
for above we should have met 2 criteria name and industry
What should be some consideration to keep in mind? As of now I am getting 50% accuracy with 75 training instance
Okay, here’s the issue with parsing the values and comparing them manually: Some fields in the JSON are not matched one-to-one. For example, if the ‘location’ in the requirements JSON has a value like [“USA, MEXICO”] and the criteria have ‘Boston’ or ‘Guadalajara’ as locations, in that case, they should match.
There are other values like these for ex. pharma should match with healthcare etc.
Okay, here’s the issue with parsing the values and comparing them manually: Some fields in the JSON are not matched one-to-one. For example, if the ‘location’ in the requirements JSON has a value like [“USA, MEXICO”] and the criteria have ‘Boston’ or ‘Guadalajara’ as locations, in that case, they should match.
There are other values like these forex. pharma should match with healthcare etc.
Few more ex.
criteria
{“name”:“joe”, “industry”:“tech”,“location”,“boston”}
requirement
{“name”:[any],“industry”:“techonology”:location":[“usa”,“mexico”}
In this case answer should be 3 or 100% all fields met the criteria
In this case answer should be 3 or 75% as the industry does not match.
On the flip side gpt-4 is behaving quite well for the calculation of criteria met and comparison but that is just 1:1 comparison. gpt-3.5 is just awful. I have tried short prompt , large prompt but no luck
And do you always provide two json to compare? And each pair of json can have different attributes all the time or is it always the same attributes or is it a changing number of attributes?
And so do you expect it to itself decide that pharma is
And just to be sure you need it to answer how many matching arguments?
It can have different number of attributes. I would say out of list of 25 attributes we can have any combination.
I always provide 2 json one is criteria and other is requirements it has to meet.
3.Number of attributes can change.
I want it to decide things like Boston is a city in USA. Pharma and healthcare are similar or under same umbrella.
I want actually score (number of matching args/total args)*100 but I understand it is a LLM not calculator so I am fine with matching args and can calculate score.
Till now I have seen issues where model has hard time keeping track of matching criteria
In your Assistant prompt do you have instructions for each of the possible 25 attributes? That would be needed I think for consistent results. And this is certainly a GPT-4 not 3.5 task.
Feel free to email me some examples that are often failing.