Just a fast OTTOMH rough example, @arun.raja, you can take your training data prompts and put them in an array of prompts as follow:
@array_of_prompts[
"fine_tune_prompt",
"fine_tune_prompt",
"fine_tune_prompt",
"fine_tune_prompt",
"fine_tune_prompt",
"fine_tune_prompt",
"fine_tune_prompt",
]
You might store these in a DB along with the embeddings for each prompt, from a array like this:
@array_of_prompts[
{"fine_tune_prompt", "embedding_vector"},
{"fine_tune_prompt", "embedding_vector"},
{"fine_tune_prompt", "embedding_vector"},
{"fine_tune_prompt", "embedding_vector"},
{"fine_tune_prompt", "embedding_vector"},
{"fine_tune_prompt", "embedding_vector"},
{"fine_tune_prompt", "embedding_vector"},
{"fine_tune_prompt", "embedding_vector"},
]
Then, when your chatbot user enters a prompt, you can take the embedding of that prompt and compare it to the DB of embedding vectors and if there is a match with high score, you can route the query to your fine-tuned model. If there is no match with a high score threshold (which you can get by experimentation), then you can route to another model.
The above is just an outline, you can modify this based on the programming language you are using, etc.
Hope this helps.
