We are trying to create the most accurate way to extract any valid dates from random sentences and format them in a common format.
has anyone done this before?
nice, testing it out now. also, the dates in the random sentences may not necessarily be in a normal date format: they could be 2/12/09, or 1809, feb 12th, or 02/09/1809, etc.
perfecto! testing it out now.
we are dealing with speech-to-text transcriptions, so sometimes the dates are messy. this is what we have so far:
Extract any dates from the following sentence and write them in the format of MM/DD/YYYY and remove any extra words. If there are no dates then display No-Date-Found. Examples:
Query:
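For anyone wanting to reproduce this, here is a minimal sketch of how such a prompt could be assembled in Python. The instruction text is the one quoted above. Everything else is an assumption for illustration: the example sentences are made up (not the ones used in this thread), and the `Answer:` label and the `build_prompt` helper are hypothetical names.

```python
# Instruction text quoted from the post above.
INSTRUCTION = (
    "Extract any dates from the following sentence and write them in the "
    "format of MM/DD/YYYY and remove any extra words. "
    "If there are no dates then display No-Date-Found."
)

# Hypothetical example pairs, made up for illustration (not from the thread).
EXAMPLES = [
    ("he was born on feb 12th 1809 in kentucky", "02/12/1809"),
    ("please call me back when you get a chance", "No-Date-Found"),
]


def build_prompt(sentence, examples=EXAMPLES):
    """Assemble the instruction, the example pairs, and the new query."""
    lines = [INSTRUCTION, "Examples:"]
    for query, answer in examples:
        lines.append(f"Query: {query}")
        lines.append(f"Answer: {answer}")  # "Answer:" label is an assumption
    # Leave the final answer open so the model fills it in.
    lines.append(f"Query: {sentence}")
    lines.append("Answer:")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_prompt("the meeting moved to 2/12/09"))
```

Keeping the examples in a list makes it easy to add or remove cases while testing which ones actually help.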
if you have any further suggestions, let me know.
we are going to give fine-tuning a go. if anyone has any words of wisdom, let us know.
will post results here once tested.
yup, read that. the issue is that, unless we do this, the prompt gets bigger and bigger:
Query:
BUT IT WORKS GREAT! so it will work; it is just a matter of figuring out the most efficient way to do it in OpenAI.
What do you hope to achieve with fine-tuning? Faster? Cheaper? Better performance?
we just want it to work. so whatever method works is the one we want to use.
for sure it can work, but just like everything else, the most accurate method is not currently known.
we are going to try similar experiments on plain old NLP.
but this is our preferred method.
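For comparison, a plain rule-based baseline could look roughly like the sketch below. It only handles numeric dates such as 2/12/09 or 02/09/1809, and it makes two loud assumptions: the order is month/day/year, and two-digit years belong to the 2000s. It does not handle spoken forms like "feb 12th" or a bare year like 1809, which is exactly where messy transcriptions get hard. The name `extract_dates` is hypothetical.

```python
import re
from datetime import datetime

# Numeric dates such as 2/12/09 or 02/09/1809 (month/day/year order assumed).
NUMERIC_DATE = re.compile(r"\b(\d{1,2})/(\d{1,2})/(\d{2}|\d{4})\b")


def extract_dates(sentence):
    """Return MM/DD/YYYY strings for the numeric dates found in a sentence."""
    found = []
    for month, day, year in NUMERIC_DATE.findall(sentence):
        if len(year) == 2:
            # Assumption: two-digit years belong to the 2000s.
            year = "20" + year
        try:
            # datetime() rejects impossible dates such as month 13 or day 45.
            parsed = datetime(int(year), int(month), int(day))
        except ValueError:
            continue
        found.append(f"{parsed.month:02d}/{parsed.day:02d}/{parsed.year:04d}")
    return found or ["No-Date-Found"]


if __name__ == "__main__":
    print(extract_dates("we met on 2/12/09 and again on 02/09/1809"))
    print(extract_dates("call me at 5 tomorrow"))
```

A baseline like this is also handy as a sanity check on the model output, since anything the model returns can be compared against what the rules find.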
question: do the examples above count against our quota and tokens, and are we charged for them?
Yes, the examples are part of the prompt, so they count on every request. If you want it to not pull certain numbers (i.e., non-date numbers), I would give it a few of those examples too. Good luck!
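To get a feel for what the examples cost, here is a rough sketch that estimates how many tokens an example block adds to each request. It uses the common rule of thumb of about 4 characters per token for English text, which is only an approximation; a real tokenizer would give exact counts. The example pairs (including the non-date numbers), the `Answer:` label, and the helper names are all made up for illustration.

```python
import math

# Hypothetical examples, including non-date numbers that should be ignored.
EXAMPLES = [
    ("the appointment is on 02/09/2009", "02/09/2009"),
    ("we ordered 1200 units for room 12", "No-Date-Found"),
    ("call extension 4521 about order 77", "No-Date-Found"),
]


def estimate_tokens(text):
    """Rough estimate only: about 4 characters per token for English text."""
    return math.ceil(len(text) / 4)


def example_overhead(examples):
    """Estimate the tokens the example block adds to every single request."""
    block = "\n".join(f"Query: {q}\nAnswer: {a}" for q, a in examples)
    return estimate_tokens(block)


if __name__ == "__main__":
    print("approx. tokens added per request:", example_overhead(EXAMPLES))
```

Since the whole block is sent with every query, each extra example raises the cost of every call, which is the trade-off against fine-tuning.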
good point. will try that too. whatever we find we will be posting results here.