Use Fine Tuning or Prompt Engineering or both?

We have a dataset of legal cases, academic papers, etc., which we will load into a vector database. We want to develop an agent that lets a user enter a specific legal issue; the agent then searches for all related documents (or portions of documents) and provides the user with a summary of each, along with its citation. Responses should be based only on our domain.
Can this be done with prompt engineering alone? Would fine-tuning improve the quality of the responses? Is there anything else I should be investigating? TIA


Hi @gill.metcalf

Welcome to the OpenAI community.

IMO fine-tuning isn’t required for this use case.

As I understand your requirements, you want a summary and citation of the relevant docs based on the user’s query. Here’s an outline of the process:

  1. Retrieve the relevant docs from your vector DB.
  2. Programmatically generate a prompt containing the doc to be summarized. Depending on your requirements and document size, this can be done per doc or for all docs in one call. Use gpt-3.5-turbo to minimize token costs.
  3. Programmatically assemble the reply to the user by concatenating the model’s summary response(s) with the citations of the retrieved docs.
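The prompt-building and reply-assembly parts of the outline above (steps 2 and 3) can be sketched roughly like this. The function names and prompt wording are my own assumptions, not a fixed API; the retrieval step and the actual model call are left to your vector DB client and the OpenAI SDK:

```python
# Hypothetical sketch of steps 2 and 3. The retrieval (step 1) and the
# gpt-3.5-turbo call are omitted; plug in your own vector-DB client and
# OpenAI SDK call where indicated.

def build_summary_prompt(doc_text: str, issue: str) -> str:
    """Step 2: programmatically generate the per-document summarization prompt."""
    return (
        f"Summarize the following document strictly in the context of this legal issue: {issue}\n"
        "Answer only from the document; if it is not relevant, say so.\n\n"
        f"Document:\n{doc_text}"
    )

def assemble_reply(summaries_and_citations: list[tuple[str, str]]) -> str:
    """Step 3: concatenate model summaries with the citation of each retrieved doc."""
    return "\n\n".join(
        f"{summary}\nCitation: {citation}"
        for summary, citation in summaries_and_citations
    )

# Usage sketch (the summarize call is where you'd invoke gpt-3.5-turbo):
#   for doc in retrieve_docs(user_issue):                  # step 1, your vector DB
#       prompt = build_summary_prompt(doc.text, user_issue)
#       summary = call_model(prompt)                       # your OpenAI SDK call
#       results.append((summary, doc.citation))
#   reply = assemble_reply(results)
```

Whether you summarize per doc or batch several docs into one prompt mainly trades latency and cost against the context-window limit, so check your typical document length first.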