Can I use my own pdf/text documents to train to get an article out

Hi,
My use case is as follows. I would like to generate an article based on a set of text/pdf documents (that belong to a very specific nice topic) that reside on my desktop. Is it possible? I was wondering if someone can help me understand either the limitation in that context or how it can be done. Would appreciate it very much.
Thank you,
AJ

1 Like

You would just need to scrape it and prepare a JSONL file, but yeah. Check out fine-tuning in the docs.

2 Likes

To build off @daveshapautomator, you can try following this article if you’re familiar with Python to scrape text from PDFs!

3 Likes