I have a product which is suppose to get text from different sources, sales calls, docs, webinars… and in that text to find sentences which belong to specific categories.
For each category we can give sentence examples. Things complicate a bit when we want to allow users to give their own sentence example and create their own rules for categories.
We started using gpt-4
API with okish results. But now prompt becomes larger as we try to feed it with more examples for each category and slow, ~15s for analyzing 300 words of text and it gets too expensive to afford it and I’m not sure we need a LLM for our case.
We are trying to move away from gpt maybe and find less expensive, faster, more reliable solution.
What options do we have ? After talking with some people we consider the following:
- Spacy
- SetFit - few shots learning ( gets complex when people start adding their own rules )
- LoRa - training our own llm ( gets complex when people start adding their own rules )
- LLama Index (with GPT-3.5-turbo ) - RAG - fixes the problem with people having their own rules, but is it a good fit for this use case ? still dependent on GPT, slow, limited requests/second.
Thank you in advance for any opinion or idea on this one.