Hello, I’m thinking of building a Q&A bot where the LLM can handle questions that depend on deep business context. So far I’ve been putting that context in the system message, but I got stuck on the token limit.
What would you suggest I learn for this? Fine-tuning, embedding search, or semantic search?
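A fine-tuned model might not be a bad shout, but embeddings would be the way to go: you embed your business documents once, then for each question pull in only the most relevant pieces, so the prompt stays under the token limit. Also, alongside embeddings, giving a few sample question-and-answer pairs in the prompt will give you more control over the type of output as well.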
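To make that concrete, here is a rough sketch of the retrieve-then-prompt idea. Everything in it is an assumption for illustration: the `embed` function is a toy word-count stand-in for a real embedding model, and `CHUNKS`, `EXAMPLES`, `top_k`, and `build_prompt` are made-up names, not part of any library. In practice you would swap `embed` for calls to whatever embedding model you use and send the resulting prompt to your LLM.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for a real embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k(question, chunks, k=2):
    """Return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(question, chunks, examples, k=2):
    """Assemble a prompt from retrieved context plus sample Q&A pairs."""
    context = "\n".join(f"- {c}" for c in top_k(question, chunks, k))
    shots = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in examples)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"{shots}\n\n"
        f"Q: {question}\nA:"
    )


# Hypothetical business documents, already split into small chunks.
CHUNKS = [
    "Refunds are issued within 14 days of a return request.",
    "Enterprise plans include a dedicated account manager.",
    "Invoices are sent on the first business day of each month.",
]

# Hypothetical sample pair showing the answer style you want.
EXAMPLES = [("Who manages enterprise accounts?", "A dedicated account manager.")]

prompt = build_prompt("How long do refunds take?", CHUNKS, EXAMPLES, k=1)
print(prompt)
```

Only the retrieved chunk goes into the prompt, so the size of your full document set no longer counts against the token limit; `k` controls how much context you spend per question.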