Request for Improved Persian Accent Support in Text-to-Speech Service


Hello OpenAI Team,

I’m reaching out to share feedback on the text-to-speech (TTS) functionality, specifically its performance in Persian. The technology handles Persian text and converts it to speech, but the output sounds closer to an Afghan accent than to a standard Persian (Iranian) accent.

I hypothesize that this issue might stem from the fact that the TTS system does not have an explicit input for specifying the desired language accent. Since Persian texts are linguistically similar across different regions (like Iran and Afghanistan), the system might not differentiate between the accents and defaults to an Afghan style, which differs significantly from the Tehrani accent or other regional Persian accents in Iran.

In an attempt to guide the TTS towards a Tehrani accent, I tried prefacing my texts with phrases like “this is a Persian text with a Tehrani accent,” but the results did not align with expectations. This leads me to suggest a possible enhancement: the introduction of a feature to explicitly specify the desired accent or dialect for a given language. Such a feature could enable users to choose between variations like Tehrani Persian, Afghan Persian, etc., thus providing more accurate and culturally relevant outputs.
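To make the suggestion concrete, here is a minimal Python sketch of what such a request could look like. It only builds the request body and sends nothing. The `model`, `input`, and `voice` fields are the ones I understand the speech endpoint to accept today; the `accent` field, its `fa-IR` / `fa-AF` values, and the helper names `build_speech_request` and `unsupported_fields` are my own assumptions for illustration and are not part of the real API.

```python
import json

# Fields I understand the speech endpoint to accept at the time of writing.
# Assumption: check the current API reference before relying on this list.
SUPPORTED_FIELDS = {"model", "input", "voice", "response_format", "speed"}


def build_speech_request(text, voice="alloy", model="tts-1", accent=None):
    """Build the JSON body for a text-to-speech request.

    `accent` is the HYPOTHETICAL parameter proposed in this post
    (e.g. "fa-IR" for Iranian Persian, "fa-AF" for Afghan Persian).
    It does not exist in the API today.
    """
    body = {"model": model, "input": text, "voice": voice}
    if accent is not None:
        body["accent"] = accent  # proposed field, not part of the real API
    return body


def unsupported_fields(body):
    """Return the fields in `body` that the current field list does not define."""
    return sorted(set(body) - SUPPORTED_FIELDS)


if __name__ == "__main__":
    today = build_speech_request("سلام، حال شما چطور است؟")
    proposed = build_speech_request("سلام، حال شما چطور است؟", accent="fa-IR")
    print(json.dumps(today, ensure_ascii=False))
    print(unsupported_fields(proposed))
```

In the request as it stands, the only place to express an accent preference is the `input` text itself, which is why I resorted to prefacing my texts with a description of the accent I wanted.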

The ability to fine-tune the accent in TTS is crucial for applications requiring linguistic and cultural precision, such as educational tools, localized content creation, and user interfaces designed for specific Persian-speaking communities.

I appreciate the complexity involved in developing nuanced TTS systems capable of capturing the subtleties of every language and dialect. However, enhancing the Persian TTS to include an option for selecting specific accents would be a significant step forward in making the technology more inclusive and widely applicable.

Thank you for your ongoing efforts to improve AI technology. I am looking forward to future developments that might address these nuances in TTS services.

Best regards,

I hope you see a response in some way. Being able to fine tune accents would be amazing.

Hi bat.man 🙂

I don’t think the cause of this problem is an Afghan accent. Rather, the model was trained on data that was artificially generated by an existing TTS system, and that data contains many mistakes.

To solve this problem, you can train your own model on Common Voice data or on your own correctly recorded data. Using good grapheme-to-phoneme (G2P) models will also help significantly.

Good luck,

I came here to say the exact same thing… I loveee using the audio feature but it is annoying that the accent is not right!

I wrote up a request with the help of ChatGPT!

Feedback on Persian Accent in Voice Feature

Dear [Support Team/Development Team],

I hope this message finds you well. I would like to provide some feedback regarding the Persian voice feature in your app. I appreciate the effort put into recognizing different dialects, but I’ve noticed that the voice assistant often uses an Afghan accent when responding in Persian. Afghan and Iranian accents are both valid and important in their own right, yet they are distinctly different, so I believe it’s crucial to give users the option to choose between them.

As an Iranian speaker, I feel that using an Afghan accent instead of an Iranian one can be misleading and may not resonate well with Persian speakers from Iran. It’s akin to speaking in a different dialect than expected, which could unintentionally lead to misunderstandings or feelings of disrespect, even though I’m sure that’s not the intention.

I kindly request that you consider offering a more accurate Iranian Persian accent in future updates, or at least give users the option to select their preferred Persian dialect.

Thank you for your attention to this matter, and I look forward to seeing continued improvements in your product.

Best regards,