Fine tuning the molecular answers with papers training

vladimir.a.gimenez.r · March 27, 2023, 7:04am

Hello everyone,

I am new here. I am trying to use our paper data repository to train the bot to answer more topic-specific questions. I saw that it is impossible to load PDFs, but you need to do it with JSON. However, before I start this journey, which will take certainly weeks to optimize, I want to be sure that this is the right way to go.

I want to use the data repository of peer-reviewed papers so that allows me to fine-tune the IPA bot to give more insightful answers about how proteins interact with other proteins that are contained in the user question. Is that even possible?

Thanks in advance,

Andrey.

sps · March 27, 2023, 9:41am

Hi @vladimir.a.gimenez.r

Welcome to the community.

It looks like embeddings will be a much better approach than fine-tuning for your use case.

A lot of projects have also been launched lately that enable question answering based on provided documents.

Topic		Replies	Views
Fine-tuning or using embeddings? Small dataset API chatgpt	5	1674	December 17, 2023
Use case: asking questions about a specific document API	7	2561	June 12, 2023
OpenAI Embeddings - Search through ~1000 PDFs API embeddings	3	3872	August 28, 2024
Fine Tuning a Chatbot to provide answers from a specific dataset API	6	4255	December 17, 2023
I need help buildning a chatbot on my own data API fine-tuning	4	2463	August 24, 2023

Fine tuning the molecular answers with papers training

Related topics