Can I use my own pdf/text documents to train to get an article out

ajdude · November 19, 2021, 12:29pm

Hi,
My use case is as follows. I would like to generate an article based on a set of text/PDF documents (on a very specific niche topic) that reside on my desktop. Is it possible? I was wondering if someone could help me understand either the limitations in that context or how it can be done. I would appreciate it very much.
Thank you,
AJ

daveshapautomator · November 19, 2021, 2:00pm

You would just need to scrape it and prepare a JSONL file, but yeah. Check out fine-tuning in the docs.
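
For anyone wondering what "prepare a JSONL file" might look like in practice, here is a minimal sketch using only the Python standard library. It assumes the documents have already been converted to plain `.txt` files (pulling text out of PDFs needs a separate library) and that the fine-tuning docs ask for one JSON object per line with `prompt` and `completion` keys; please check the current docs for the exact format. The function names, the prompt wording, the `###` separator, and the ` END` stop marker are all illustrative choices, not anything official.

```python
import json
from pathlib import Path


def split_paragraphs(text):
    """Split raw text into non-empty paragraphs on blank lines."""
    return [p.strip() for p in text.split("\n\n") if p.strip()]


def build_records(title, text):
    """Turn one document into prompt/completion pairs, one per paragraph.

    The prompt wording, the "###" separator, and the " END" stop marker
    are assumptions made for this example.
    """
    records = []
    for paragraph in split_paragraphs(text):
        records.append({
            "prompt": f"Write a paragraph for an article about {title}.\n\n###\n\n",
            "completion": " " + paragraph + " END",
        })
    return records


def write_jsonl(records, path):
    """Write one JSON object per line (the JSONL layout)."""
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            # json.dumps escapes embedded newlines, so each record stays on one line.
            f.write(json.dumps(record, ensure_ascii=False) + "\n")


def convert_folder(folder, out_path):
    """Read every .txt file in `folder` and write a single JSONL file."""
    records = []
    for txt_file in sorted(Path(folder).glob("*.txt")):
        text = txt_file.read_text(encoding="utf-8")
        # The file name (without extension) stands in for the topic.
        records.extend(build_records(txt_file.stem, text))
    write_jsonl(records, out_path)
    return len(records)
```

Calling something like `convert_folder("my_docs", "train.jsonl")` would then produce a file to feed into the fine-tuning workflow described in the docs. How the prompts are worded matters a lot for the result, so treat the template above as a starting point only.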

DutytoDevelop · November 19, 2021, 2:15pm

To build on @daveshapautomator's reply: if you’re familiar with Python, you can try following this article to scrape text from PDFs!

luca.salvatori · January 9, 2023, 2:30pm

Hi there! Is there a way to use a set of PDF or Word documents to teach GPT-3 Davinci? I would use this to synthesize paragraphs, extract quotes, and so on…
Unfortunately, the Medium link posted by @daveshapautomator is not working!

Thank you

kanzariyamihir · April 8, 2023, 6:06pm

You can use PDFGPT.IO; this tool summarizes PDFs.

090520mz · June 12, 2023, 3:33pm

Did you try chatdochub.com? It seems powerful, and it provides an API.
