Creating Fine-Tune Model from PDF Data in Node.js: Need Advice and Recommendations

vaibhavchaudhary · November 23, 2023, 6:07pm

Hey everyone!

I’m currently working on a project where I have a book in PDF format, and I want to fine-tune a GPT model on its contents using Node.js. Fine-tuning expects the training data in JSONL format.
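
To make the target concrete, here is a minimal sketch of the JSONL side of the pipeline. It assumes the chat-style training format (one JSON object per line with a `messages` array of `system`/`user`/`assistant` turns) and that the book text has already been extracted from the PDF by some other tool. The helper names `chunkText` and `toJsonl`, the `{ question, answer }` example shape, and the `maxChars` default are all hypothetical, chosen only for illustration.

```javascript
// Sketch only: assumes the PDF text was already extracted into a plain string.

// Group paragraphs into chunks of roughly maxChars characters.
// A single paragraph longer than maxChars is kept whole as its own chunk.
function chunkText(text, maxChars = 1000) {
  const paragraphs = text
    .split(/\n\s*\n/)
    .map((p) => p.trim())
    .filter(Boolean);

  const chunks = [];
  let current = "";
  for (const p of paragraphs) {
    // +2 accounts for the blank line used to join paragraphs.
    if (current && current.length + p.length + 2 > maxChars) {
      chunks.push(current);
      current = p;
    } else {
      current = current ? current + "\n\n" + p : p;
    }
  }
  if (current) chunks.push(current);
  return chunks;
}

// Serialize { question, answer } pairs (hypothetical shape) as JSONL:
// one chat-format training example per line.
function toJsonl(examples, systemPrompt) {
  return (
    examples
      .map(({ question, answer }) =>
        JSON.stringify({
          messages: [
            { role: "system", content: systemPrompt },
            { role: "user", content: question },
            { role: "assistant", content: answer },
          ],
        })
      )
      .join("\n") + "\n"
  );
}
```

The chunks would still need to be turned into question/answer pairs (by hand or with another model) before being passed to `toJsonl`; the resulting string can then be written to disk with `fs.writeFileSync`. `JSON.stringify` escapes embedded newlines, so each example stays on a single line.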

I’m seeking advice and recommendations from the community on the best approach for converting and using PDF data for fine-tuning in Node.js. Specifically, I’m interested in insights on data structuring, tools for extracting text from PDFs, and any specific libraries or techniques that have proven helpful in similar projects.

Your experiences and expertise are highly valued! Thanks in advance for your insights and recommendations!
