Better approach to build a chatbot on llama2

kiranvadaga16 · October 25, 2023, 10:33pm

I’m trying to build a custom chatbot over enterprise data for information retrieval. I’m currently following the approach below. Can anyone suggest a better one?

I embed all the documents with the pre-trained “all-mpnet-base-v2” model and store the resulting vector embeddings. At query time, I retrieve the documents most similar to the user query and send the top results to llama2 to generate the final response.
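To make the retrieval pipeline concrete, here is a minimal sketch of it, not my actual code. The names `toy_embed`, `retrieve`, and `build_prompt` are hypothetical, and `toy_embed` is only a bag-of-words stand-in so the example runs without downloading a model. In the real setup the embedding step would come from the sentence-transformers library, e.g. `SentenceTransformer("all-mpnet-base-v2").encode(texts)`, and the final prompt would be passed to llama2.

```python
import math
import re
from collections import Counter
from typing import Callable, Dict, List, Tuple

Vector = Dict[str, float]


def toy_embed(text: str) -> Vector:
    """Hypothetical stand-in for a real embedding model: word counts."""
    return dict(Counter(re.findall(r"[a-z0-9]+", text.lower())))


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(
    query: str,
    docs: List[str],
    embed: Callable[[str], Vector] = toy_embed,
    k: int = 2,
) -> List[Tuple[float, str]]:
    """Return the k documents most similar to the query, best first."""
    query_vec = embed(query)
    scored = [(cosine(query_vec, embed(doc)), doc) for doc in docs]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


def build_prompt(query: str, passages: List[str]) -> str:
    """Assemble the retrieved passages and the question into one prompt."""
    context = "\n\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


# Made-up sample documents, for illustration only.
DOCS = [
    "The VPN requires two-factor authentication.",
    "Expense reports are due on the fifth of each month.",
    "The cafeteria opens at 8am.",
]
```

Calling `retrieve("When are expense reports due?", DOCS, k=1)` ranks the expense-report document first, and `build_prompt` turns the hits into the text that would be sent to the model. With a real embedding model, the document vectors should be computed once and cached rather than re-embedded on every query as this sketch does.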

Another approach could be fine-tuning llama2 on the entire document set. For that, I’m not sure how I should represent my data, and I only have one T4 GPU.
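On the single-T4 constraint, a rough back-of-the-envelope estimate helps frame what is feasible. The sketch below assumes the 7B variant of llama2 (about 7 billion parameters), a nominal 16 GB of memory on the T4, and the common rule of thumb of roughly 16 bytes per parameter for full fine-tuning with Adam in mixed precision (fp16 weights and gradients plus fp32 master weights and two optimizer states). It ignores activations and framework overhead, so the real numbers would be higher. All names are hypothetical.

```python
GIB = 1024 ** 3

# Assumptions, not measurements.
N_PARAMS = 7e9   # llama2 7B, approximate parameter count
T4_GIB = 16.0    # nominal T4 memory; usable memory is somewhat lower


def memory_gib(n_params: float, bytes_per_param: float) -> float:
    """Memory needed for n_params at the given storage cost per parameter."""
    return n_params * bytes_per_param / GIB


# fp16 weights only (inference): 2 bytes per parameter.
fp16_weights = memory_gib(N_PARAMS, 2)

# Full fine-tuning with Adam in mixed precision: about 16 bytes per parameter.
full_finetune = memory_gib(N_PARAMS, 16)

# Base weights quantized to 4 bits: 0.5 bytes per parameter.
int4_weights = memory_gib(N_PARAMS, 0.5)

for label, size in [
    ("fp16 weights", fp16_weights),
    ("full fine-tune", full_finetune),
    ("4-bit weights", int4_weights),
]:
    verdict = "fits" if size < T4_GIB else "does not fit"
    print(f"{label}: {size:.1f} GiB ({verdict} in {T4_GIB:.0f} GiB)")
```

Under these assumptions, full fine-tuning is far out of reach on one T4, while a 4-bit quantized base model leaves room for a parameter-efficient method such as LoRA adapters. That is a different technique from fine-tuning all the weights, and whether it beats the retrieval approach for this data is still an open question.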

Any other approaches will be much appreciated. Please correct me if I’m doing anything wrong. Thanks!
