I am building a web app for a customer service company and would like to use gpt-3.5-turbo to do basic sentiment analyses on the conversations.
The basic idea is i want to label each line of the transcription with either a Positive , Negative or Neutral label , and a Clarity label either Clear or Unclear based on whether the customer understood the request.
My text is in Arabic and if formatted as follows
[hh:mm:ss] Speaker(Agent/Customer) : Content
I pass the whole content of the transcription in one go to the API with a this prompt as initialization
prompt : Your job is to extract the timestamp at the start each line then read the content of the line and based of the sentiment expressed in the content of the line add to it one of the following tags {Positive} , {Negative} and {Neutral} label each line based on the sentiment expressed in the line additionally include a {Unclear} label if the text in the line is unclear or was not understood by the customer or Include {Clear} label if the conversation the sentence was understood and was clear include only the timestamp, sentiment and clarity for that line as follows make sure to analyze the sentiment and clarity for every line in the input and every timestamp don’t skip anything in the input
My problem is it has a problem identifying clarity especially when the context of the misunderstanding comes more than 2 time blocks away , also it choice of sentiment label is sometime random how would i solve this?
Also would passing the input line by line instead of one big block help at all and would it cost more?
I am fairly new in using Openai’s API, so any guidance would be greatly appreciated.