Suggestion: Let the OpenAI Voice API Read Books as a Personal Narrator (With Emotion & Character Support)
Hey OpenAI team,
This is a suggestion/request/dream scenario for the future of the Voice API, and I think it has massive potential.
The Idea:
Let users feed books (EPUB, PDF, plain text, etc.) into the Voice API and turn them into narrated experiences. Essentially, a personal audiobook AI that:
- Reads any book aloud using the streaming TTS system
- Allows the user to select voice tones and personalities (e.g., calm, sassy, dramatic)
- Can pause, react, or respond to the listener during narration (like an AI buddy)
- Switches voices or tones for characters in dialogue (optional but amazing)
- Supports user interaction (e.g., “What happened?” or “Explain that part”)
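To show what the character-voice bullet could look like in practice, here is a rough sketch that splits a passage into narration and quoted dialogue and tags each piece with a voice label. Everything here is an assumption for illustration: `split_dialogue`, the `VOICES` labels, and the idea that quotation marks alone identify dialogue are mine, not anything the Voice API provides. Real books would also need speaker attribution, which this does not attempt.

```python
import re

# Hypothetical voice labels, not real API voice names.
VOICES = {"narration": "narrator_voice", "dialogue": "character_voice"}

# Matches a span in straight or curly double quotes.
QUOTED = r'"[^"]*"|“[^”]*”'


def split_dialogue(text: str) -> list[tuple[str, str]]:
    """Return (voice, segment) pairs, treating quoted spans as dialogue."""
    # The capturing group makes re.split keep the quoted spans in the result.
    parts = re.split(f"({QUOTED})", text)
    segments = []
    for part in parts:
        part = part.strip()
        if not part:
            continue
        kind = "dialogue" if re.fullmatch(QUOTED, part) else "narration"
        segments.append((VOICES[kind], part))
    return segments
```

Each pair could then be sent to TTS separately with its own voice and the audio stitched back together in order.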
Why This Matters:
Audible is great, but it's limited by what’s available and how it’s delivered. People want:
- More dynamic listening experiences
- Voices that react to or engage with the listener
- The ability to listen to anything, not just what’s on Audible
- Customization, accessibility, and personality in narration
This would turn passive listening into an interactive experience. It could also support people with reading difficulties, or anyone who just wants a new way to engage with text.
What Would Help:
- Official support for longer text inputs or “book mode”
- Tools or best practices for parsing books into chunks for the API
- More guidance for integrating interaction (e.g., pausing to answer questions mid-story)
- Optional “voice swapping” or character tagging support
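For the chunking point, this is roughly the workaround I have in mind today: split the book text at sentence boundaries so each request stays under a character limit. It is a minimal sketch, not a recommended practice. The names `chunk_text` and `max_chars` are made up, and the default of 4000 is a placeholder; check the current API docs for the real per-request input limit.

```python
import re


def chunk_text(text: str, max_chars: int = 4000) -> list[str]:
    """Split text into chunks of at most max_chars, breaking at sentence ends."""
    # Split after '.', '!' or '?' when followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # A single sentence longer than the limit is hard-split.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # Adding this sentence would overflow, so start a new chunk.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

An official “book mode” could do this far better, for example by respecting chapter and paragraph breaks instead of only sentence punctuation.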
Use Cases:
- Personal audiobook reader for your own library
- AI that reads fanfiction with flair
- Kids’ stories that pause and ask questions like interactive books
- Studying with textbooks that are read to you and explained live
- Emotional storytelling where tone matches the scene
Bonus Dream Feature:
Let the AI change its tone automatically based on story content. Happy scenes? Cheerful voice. Dramatic climax? Give me that deep cinematic tone.
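As a very crude stand-in for what I mean, here is a keyword-based sketch that picks a tone label per passage. The cue words, the tone names, and the `pick_tone` function are all hypothetical and chosen only for illustration; a real version would presumably use a model to read the mood of the scene rather than a word list.

```python
import re

# Hypothetical cue words per tone, for illustration only.
TONE_CUES = {
    "cheerful": {"laughed", "smiled", "joy", "happy", "celebrate"},
    "dramatic": {"screamed", "danger", "trembled", "dark", "storm"},
}


def pick_tone(passage: str) -> str:
    """Return the tone whose cue words appear most, or 'neutral' if none do."""
    words = set(re.findall(r"[a-z']+", passage.lower()))
    scores = {tone: len(words & cues) for tone, cues in TONE_CUES.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "neutral"
```

The returned label could then be turned into a style hint for whatever tone control the TTS model exposes.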
This could seriously be a game-changer for education, entertainment, accessibility, and more. The Voice API already has amazing quality, and this is just the next step.
Thanks for reading, and I’d love to hear thoughts from others and the OpenAI team!