Hi,
Totally new here. I am working on a project to classify text statements as a question or not a question.
Here is what I am doing now:
examples_question = [
    ['How are you?', 'question'],
    ['What is your name?', 'question'],
    [' My name is Bob', 'not a question'],
    [' My cat is running.', 'not a question'],
    ['What time is it', 'question'],
    ['Can you fix this sentence for me', 'question'],
    [' This is my house', 'not a question'],
    ['Is this your house', 'question']
]
Then I am using something like this to call the OpenAI API:
response = openai.Classification.create(
    examples=examples_question,
    labels=["question", "not a question"],
    query=sentence,
    search_model="davinci",
    model="davinci"
)
When I run this on hundreds of tweets, the classification does not perform well.
A statement like this “Too bad that he was injured, they would have won if he was able to play! Glad the surgery went well!” is getting classified as a question.
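To put a number on "not performing well", a batch run could be checked against a small hand-labeled sample, roughly like the sketch below. Everything in it is an assumption made for illustration: `evaluate`, `stub_classifier` and `sample` are hypothetical names, and `stub_classifier` is only a placeholder so the snippet runs on its own. In the real run it would be replaced by a function that wraps the API call above and returns the predicted label.

```python
from collections import Counter


def evaluate(labeled_tweets, classify_fn):
    """Run classify_fn on each tweet and compare it with the hand label."""
    counts = Counter()   # how often each label was predicted
    mistakes = []        # (text, expected, predicted) for every wrong answer
    for text, expected in labeled_tweets:
        predicted = classify_fn(text)
        counts[predicted] += 1
        if predicted != expected:
            mistakes.append((text, expected, predicted))
    total = len(labeled_tweets)
    accuracy = (total - len(mistakes)) / total if total else 0.0
    return accuracy, counts, mistakes


def stub_classifier(text):
    # Placeholder only: stands in for the API call, NOT the real model.
    return "question" if text.strip().endswith("?") else "not a question"


# Hypothetical hand-labeled sample; the real one would be a few dozen tweets.
sample = [
    ("How are you?", "question"),
    ("My cat is running.", "not a question"),
    ("What time is it", "question"),
]

accuracy, counts, mistakes = evaluate(sample, stub_classifier)
print(f"accuracy={accuracy:.2f}", dict(counts))
for text, expected, predicted in mistakes:
    print(f"expected {expected!r}, got {predicted!r}: {text}")
```

Listing the mistakes next to the accuracy makes it easier to see which kinds of statements go wrong, such as the exclamation-heavy tweet above.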
I am pretty sure I am making a mistake here. Should I be giving more examples to train on?
I am pretty new to ML in general, so if someone could help me out, that would be great.
Thanks