Seeking Advice on Integrating TTS-1 API for Reading PDF Book Aloud

Gustaf · May 20, 2024, 1:19pm

Hey,

I’m trying to have my book read aloud to me using the TTS-1 API. My plan is to use a regular PDF parser to extract the text from the PDF, store it in a variable, and then pass that as the input to the model.

Is this a bad approach or how would you do this?

Best regards,
Gustaf

Yepher · May 20, 2024, 6:17pm

I’ve done this a lot. In my experience, PDF-to-text conversion is nowhere near perfect, though the quality depends greatly on the type of PDF content. These days I convert the PDF to a text file, clean up the text, and then run that through TTS. I do it page by page and create one audio file per page, but you may not need that. Either way, you will need to segment the text and merge the audio files together.
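
For illustration, here is a rough Python sketch of the clean, segment, and synthesize steps. It is not necessarily the exact setup described above. The helper names (clean_text, split_text, synthesize), the output folder, and the "alloy" voice are placeholders chosen for the example. It assumes the official openai Python package (v1 or later), an OPENAI_API_KEY in the environment, and the 4096-character input limit per speech request that applied at the time of writing. The text is assumed to have already been extracted from the PDF into a string.

```python
import re
from pathlib import Path

MAX_CHARS = 4096  # assumed per-request input limit for the speech endpoint


def clean_text(raw: str) -> str:
    """Undo some common PDF extraction damage before synthesis."""
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", raw)  # re-join words hyphenated across lines
    text = re.sub(r"\s+", " ", text)              # collapse newlines and runs of spaces
    return text.strip()


def split_text(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Pack whole sentences into chunks of at most max_chars characters."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # A single sentence longer than the limit is cut at the limit.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


def synthesize(chunks: list[str], out_dir: str = "audio", voice: str = "alloy") -> list[Path]:
    """Send each chunk to the TTS-1 model and save one MP3 file per chunk."""
    from openai import OpenAI  # third-party: pip install openai

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, chunk in enumerate(chunks, start=1):
        path = out / f"part_{i:04d}.mp3"
        with client.audio.speech.with_streaming_response.create(
            model="tts-1", voice=voice, input=chunk
        ) as response:
            response.stream_to_file(path)
        paths.append(path)
    return paths
```

Usage would look something like synthesize(split_text(clean_text(page_text))), run once per page or once for the whole book. The numbered part files can then be joined into a single audio file with a tool such as ffmpeg. Splitting on sentence boundaries rather than at a fixed character count keeps the pauses between audio segments from landing mid-sentence.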

nordonton · December 29, 2024, 1:45pm

Hi, maybe I’m asking a stupid question, but it seems like you know the answer. What do you use to split the text correctly and how do you write the request to the API? Could you give an example of how it works? Thank you.
