Extracting emotion or tone of speech

sps · November 15, 2023, 8:15pm

Is it possible to extract the emotion or tone of speech from a voice recording using the audio transcription models available on the API, viz. whisper-1 and canary-whisper, via the prompt param?

Currently it only does STT, but I’d also like to extract the tone of the speech.

Foxalabs · November 15, 2023, 8:19pm

Interesting idea! That would be a great way to get more bandwidth from a recording.

sps · November 15, 2023, 8:43pm

Yes, it would be really interesting to see if this is possible.

Though in my experiments so far, I have been unable to get this from the model(s).
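
For anyone who wants to repeat the experiment, here is a minimal sketch of passing a prompt to the transcription endpoint. It assumes the official `openai` Python package (v1 or later), where the call is `client.audio.transcriptions.create(...)`. The helper name `transcribe_with_prompt`, the file name, and the prompt wording are all made up for illustration. Note that the documentation describes `prompt` as a way to guide transcription style or continue a previous segment, not as an instruction channel, so there is no guarantee that tone labels will come back.

```python
from pathlib import Path


def transcribe_with_prompt(client, audio_path, prompt, model="whisper-1"):
    """Transcribe one audio file, forwarding `prompt` to the transcription call.

    `client` is assumed to behave like an `openai.OpenAI()` instance, i.e. it
    exposes `client.audio.transcriptions.create(...)`. This helper is
    hypothetical and not part of any library.
    """
    with Path(audio_path).open("rb") as audio_file:
        result = client.audio.transcriptions.create(
            model=model,
            file=audio_file,
            prompt=prompt,
        )
    # The default JSON response exposes the transcript as `.text`.
    return result.text


# Example usage (needs `pip install openai` and OPENAI_API_KEY set);
# "recording.mp3" and the prompt text are placeholders:
#
#   from openai import OpenAI
#   text = transcribe_with_prompt(
#       OpenAI(),
#       "recording.mp3",
#       "Transcript with tone annotations such as [angry] or [calm].",
#   )
#   print(text)
```

Comparing the output with and without the prompt on the same recording is a quick way to see whether the prompt changes anything beyond spelling and formatting.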

_j · November 15, 2023, 9:18pm

Here, figure out if this is a scam to fleece VC investors

“just 20 seconds of speech - plus a quiz about your mood”
