Hi there,
Does anyone have experience combining text-to-speech with the Assistants API so that the audio is generated dynamically? So far I have only managed to produce a single MP3 file. What I want is to keep sending speech to the user as the API produces its text output, so that the audio matches the text.
amve
March 3, 2024, 6:04pm
I did it here; you can review the code in the repository:
amve
March 3, 2024, 6:08pm
Specifically in this file:
/**
* mai_ | Whisper to ChatGPT and Claude.ai
* Copyright (C) 2023 mai_ (Fundacja Reborn) | https://mai.net.pl | info@mai.net.pl
*
* This program is free software: you can redistribute it and/or modify
* it under the terms of the GNU Affero General Public License as
* published by the Free Software Foundation, either version 3 of the
* License, or (at your option) any later version.
*
* This program is distributed in the hope that it will be useful,
* but WITHOUT ANY WARRANTY; without even the implied warranty of
* MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
* GNU Affero General Public License for more details.
*
* You should have received a copy of the GNU Affero General Public License
* along with this program. If not, see <https://www.gnu.org/licenses/>.
*/
/**
* Class responsible for handling Text-to-Speech (TTS) operations.
This file has been truncated.
You can see it in action here: Whisper to ChatGPT and Claude.ai (voice interaction with chat) - YouTube
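Since the file above is truncated, here is a rough sketch of one common way to approach the original question. It is not the code from the repository. The idea is to buffer the text fragments as they stream in, cut them at sentence boundaries, and synthesize each sentence as soon as it is complete instead of waiting for one big MP3. The names `sentence_chunks`, `speak_stream`, and `synthesize` are hypothetical; `synthesize` stands in for whatever TTS request you actually make, and the source of the text fragments (for example a streamed API response) is assumed, not shown.

```python
import re
from typing import Callable, Iterable, Iterator

# Assumed sentence boundary: '.', '!' or '?' followed by whitespace.
# This is a simple heuristic and will also split after abbreviations like "e.g.".
_BOUNDARY = re.compile(r"(?<=[.!?])\s+")


def sentence_chunks(deltas: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as they appear in a stream of text fragments."""
    buffer = ""
    for delta in deltas:
        buffer += delta
        parts = _BOUNDARY.split(buffer)
        # Every part except the last is a finished sentence; the last may still grow.
        for sentence in parts[:-1]:
            if sentence.strip():
                yield sentence.strip()
        buffer = parts[-1]
    # Flush whatever is left once the stream ends.
    if buffer.strip():
        yield buffer.strip()


def speak_stream(
    deltas: Iterable[str],
    synthesize: Callable[[str], bytes],  # hypothetical: text in, audio bytes out
) -> Iterator[bytes]:
    """Turn a stream of text fragments into a stream of per-sentence audio clips."""
    for sentence in sentence_chunks(deltas):
        yield synthesize(sentence)


if __name__ == "__main__":
    fragments = ["Hello the", "re. How are", " you? I am", " fine"]
    for sentence in sentence_chunks(fragments):
        print(sentence)
```

Each clip yielded by `speak_stream` can be sent to the client and queued for playback in order, so the user starts hearing audio after the first sentence and does not have to wait for the whole reply.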
Thanks, @amve! I will try it out.