I’ve come across a scenario where I’m trying to extract key details from candidates interview notes. Using the latest GPT-4-Turbo preview model available on Azure OpenAI & function calling to describe as a JSON schema the notes I want pulled out.
The notes contain a reference to growing revenue from $25 - $50 million over the time period.
The output of the function call response consistently contains a reference to this, but says $24 - $50 million instead.
It does this every time.
I’ve even added a “citation” property to the JSON schema and updated the prompt with instructions to include a snippet of text extracted exactly from the notes that supports it’s notes, and in that property it extracts the correct sentence but with the $25 changed to $24 in that reference as well.
I’ve rarely seen a “mistake” like this in extraction, and never one that is wrong in the exact same way so consistently.
Has anyone encountered a similar case? If so, what (if anything) helped?