What is difference between TTS HD vs TTS?

tom.tyiu · January 6, 2024, 12:41am

What are the differences between TTS HD vs TTS? Any frequency difference? Trained differently? I am going to use it in applications! It will be TTS HD only. So I need to know.

PaulBellow · January 6, 2024, 1:37am

I’m not sure if it’s trained differently or if HD is just saved at higher bitrate, etc. I’d imagine the latter?

_j · January 6, 2024, 2:26am

Twice the price for optimized for quality instead of optimized for speed.

They have the same audio bandwidth and sample rate the last I checked, and since the generations of successive runs are not identical, are hard to compare for quality.

It may be something subtle, like the number of internal dimensions or parameters of AI models. Training quantity or perplexity. Such would account for less “speed”, although that doesn’t seem markedly different either.

sps · January 6, 2024, 3:17am

Agree with @_j

The docs state:

For real-time applications, the standard tts-1 model provides the lowest latency but at a lower quality than the tts-1-hd model. Due to the way the audio is generated, tts-1 is likely to generate content that has more static in certain situations than tts-1-hd . In some cases, the audio may not have noticeable differences depending on your listening device and the individual person.

tom.tyiu · January 6, 2024, 5:36am

What do general public prefer?
TTS or TTS-HD? or both? Should I charge by 1k character or by monthly? Just wonder what is the best?

PaulBellow · January 6, 2024, 5:41am

I’d offer them both at different prices depending on what they prefer

dignity_for_all · January 6, 2024, 7:18am

According to the document description, tts-1 is optimized for speed, while tts-1-hd is optimized for quality. However, in about 30 Japanese text-to-speech tests that I conducted, tts-1-hd often read parts of the Japanese text with a strange pronunciation that was neither Japanese nor English.

Therefore, it is likely that tts-1 and tts-1-HD were trained on different datasets.

I have not confirmed whether this applies to languages other than Japanese, but which one to prefer may vary depending on what language is being used for the text-to-speech.

The cost is indicated as per character, so it is probably not per token.

tom.tyiu · January 6, 2024, 4:48pm

That will be nice if OpenAI trains better for Japanese.

jwatte · January 6, 2024, 6:34pm

If you charge monthly, you should make sure to implement some kind of upper usage cap, so users can’t drive you into bankruptcy.
For example, “$X per month, includes up to Y hours of generated audio, after which we charge you $Z per additional hour.”
Also, monthly charges have the benefit that you don’t need to account for pre-payment that “stores value” you have to keep valid for a long amount of time. Also, you will have a more regular revenue stream, with montly subscriptions coming in every month.

dignity_for_all · January 7, 2024, 1:15pm

It’s clear that determining how to secure profits is a challenge, as can be seen from OpenAI’s own difficulties in balancing the provision of services through ChatGPT and via their API.

Personally, I believe there is a certain rationale in combining a monthly subscription model with a pay-as-you-go system.

By adopting a monthly subscription, you can ensure a steady profit regardless of whether users utilize the service or not.

Additionally, by setting limits on the amount of service provided in a subscription model, you can protect the provider’s profits without detriment.

Should users require more service usage, you can accommodate this by offering an additional pay-as-you-go plan.

As mentioned above, user preferences between tts-1 and tts-1-HD may vary, so it would be advisable to make both options available.

tom.tyiu · January 9, 2024, 5:39pm

Thanks everyone, It was really helpful!

tom.tyiu · January 18, 2024, 10:17pm

This is a related issue. Will OpenAI have more voices and more professional voices? For chrome extensions, what is the best way for OpenAI API to call for extensions? How can I hide the OpenAI key and quick API call?

Topic		Replies	Views
Feedback about tts-1-HD vs tts-1 API tts	1	1514	December 12, 2023
Difference between tts-1-hd-1106 and tts-1-hd API tts	1	1901	March 6, 2024
TTS API service usability API tts	17	7397	December 16, 2023
New TTS API pricing and gotchas API	8	3893	March 25, 2025
TTS API Speed and Quality Issues API api , tts	5	4370	February 6, 2024

What is difference between TTS HD vs TTS?

Related topics