I am trying to figure out the best route to load a long text document (think a 60-page lease or medical paper) and then ask questions about the text. Is this fine-tuning? It seems like fine-tuning would only work if I had sample responses.
I have a pipeline written down that creates embeddings for subdocuments, uses semantic search to find the relevant subdocument, and then uses text-davinci-003 to rephrase the subdocument for a specific audience. Does that seem helpful for your use case?
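Here is a minimal sketch of that kind of pipeline, assuming the pre-1.0 openai Python library (since text-davinci-003 is a Completions model). The chunk size, embedding model, and prompt wording are placeholders for illustration, not an exact workflow:

```python
import openai
import numpy as np

openai.api_key = "YOUR_API_KEY"  # placeholder; set however you normally do


def chunk_document(text, chunk_size=1500):
    """Naively split a long document into fixed-size subdocuments."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]


def embed(texts):
    """Embed a list of strings and return them as a numpy array."""
    resp = openai.Embedding.create(model="text-embedding-ada-002", input=texts)
    return np.array([d["embedding"] for d in resp["data"]])


def answer_question(document, question, audience="a layperson"):
    chunks = chunk_document(document)
    chunk_vecs = embed(chunks)
    query_vec = embed([question])[0]

    # Semantic search: cosine similarity between the question and each chunk.
    sims = chunk_vecs @ query_vec / (
        np.linalg.norm(chunk_vecs, axis=1) * np.linalg.norm(query_vec)
    )
    best_chunk = chunks[int(np.argmax(sims))]

    # Ask text-davinci-003 to answer/rephrase using only the retrieved chunk.
    prompt = (
        f"Using only the excerpt below, answer the question for {audience}.\n\n"
        f"Excerpt:\n{best_chunk}\n\nQuestion: {question}\nAnswer:"
    )
    resp = openai.Completion.create(
        model="text-davinci-003", prompt=prompt, max_tokens=300, temperature=0
    )
    return resp["choices"][0]["text"].strip()
```

For a 60-page document you would likely want smarter chunking (by section or paragraph) and to retrieve the top few chunks rather than just one, but the shape of the flow is the same.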
I have replied in a private chat message with details of a workflow you can use.
We have also resolved breaking documents down for fine-tuning, the hallucination issue, providing accurate citations, and including video/HTML in responses. It is a mix of embeddings and fine-tuning.
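For reference, one common way to get accurate citations out of an embeddings-based retrieval step (a generic illustration, not necessarily our implementation) is to keep source metadata attached to each chunk and pass it through to the prompt, so the model can echo back where an answer came from:

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    text: str
    source: str   # e.g. "lease.pdf" (hypothetical file name)
    page: int     # page number used for the citation


def format_context(chunks):
    """Build a prompt context block with citation tags the model can
    repeat in its answer, e.g. [lease.pdf p.12]."""
    return "\n\n".join(f"[{c.source} p.{c.page}]\n{c.text}" for c in chunks)
```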
We are not quite ready for a public announcement yet and are providing limited beta access on a case-by-case basis, depending on the use case.
I will share more with the community in the coming days.