如何使用 FliteController 类中的 withVoice: 参数减慢语音速度？

发布于 2024-11-13 08:42:41 字数 480 浏览 4 评论 0原文

我正在使用 OpenEars FliteController 类将文本转换为语音。

我正在使用以下方法：

[self.fliteController say:@"Aphrase Id like my app to say outloud." withVoice:@"cmu_us_awb8k"];

在 withVoice: 之后可以输入的参数选项如下：

cmu_us_awb8k
cmu_us_rms8k
cmu_us_slt8k
cmu_time_awb
cmu_us_awb
cmu_us_kal
cmu_us_kal16
cmu_us_rms
cmu_us_slt

我用 withVoice: 尝试了所有这些参数，

但是，我的客户对这些声音不满意。他告诉我这些都很快。因此，我需要一种使用 withVoice: 参数来减慢语音速度的方法。我该怎么做呢？还有其他语音可用吗？

原文

I am using the OpenEars FliteController class to convert text to speech.

I am using the method:

[self.fliteController say:@"A phrase I'd like my app to speak out loud." withVoice:@"cmu_us_awb8k"];

The options for arguments one can enter after withVoice: are as follows:

cmu_us_awb8k
cmu_us_rms8k
cmu_us_slt8k
cmu_time_awb
cmu_us_awb
cmu_us_kal
cmu_us_kal16
cmu_us_rms
cmu_us_slt

I tried all of these arguments with withVoice:

However, my client is not happy with the voices. He told me these all are fast. So, I need a way to slow down speech using the withVoice: argument. How can I do it? Are there any other speech voices available?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

坦然微笑 2024-11-20 08:42:41

自 0.911 版本起，速度/音高/方差变换现已成为 OpenEars 的一项功能。您可以按如下方式使用它：

使用 FliteController 的以下属性更改声音的速度、音调和变化性：

duration_stretch // Duration of speech
target_mean // Average pitch of speech
target_stddev // Variance

例如，在发送此消息之前：

[self.fliteController say:@"Aphrase Id like my app大声说话。” withVoice:@"cmu_us_awb8k"];

您可以对 self.fliteController 进行以下设置：

self.fliteController.duration_stretch = 1.5; // Slow down the speed a bit
self.fliteController.target_mean = 1.2; // Raise the pitch
self.fliteController.target_stddev = 1.5; // Increase the variance

1.0 是默认值，0.0 是最小值，2.0 是可能的最大有用值（尽管您可以高于 2.0，但它可能不会是一个有用的值）。

uration_stretch 为 0.5 时，速度是 1.0 的两倍，duration_stretch 为 2.0 时，速度是 1.0 的一半。 target_mean 或 target_stddev 为 2.0 将使平均音调频率或方差加倍，而 0.5 会将它们减少一半。

您不必设置或覆盖这些设置 - 如果您不使用它们，它们将被设置为语音的默认值。如果您想在覆盖它们后将它们恢复为默认值，只需将它们全部设置为 1.0：

self.fliteController.duration_stretch = 1.0; // Reset the speed
self.fliteController.target_mean = 1.0; // Reset the pitch
self.fliteController.target_stddev = 1.0; // Reset the variance

Speed/pitch/variance shift is now a feature of OpenEars as of version .911. You can use it as follows:

Change the speed, pitch and variability of a voice by using the following properties of FliteController:

duration_stretch // Duration of speech
target_mean // Average pitch of speech
target_stddev // Variance

For instance, right before sending this message:

[self.fliteController say:@"A phrase I'd like my app to speak out loud." withVoice:@"cmu_us_awb8k"];

You could make the following settings to self.fliteController:

self.fliteController.duration_stretch = 1.5; // Slow down the speed a bit
self.fliteController.target_mean = 1.2; // Raise the pitch
self.fliteController.target_stddev = 1.5; // Increase the variance

1.0 is the default, 0.0 is the minimum value and 2.0 is a likely maximum useful value (although you can go higher than 2.0, it probably isn't going to be a useful value).

A duration_stretch of .5 will be twice as fast as 1.0, a duration_stretch of 2.0 will be half the speed of 1.0. A target_mean or target_stddev of 2.0 will double the average pitch frequency or double the variance, while 0.5 will reduce them by half.

You do not have to set or override these settings — if you don’t use them, they will be set to the defaults for the voice. If you want to return them to the defaults after overriding them, simply set them all to 1.0:

self.fliteController.duration_stretch = 1.0; // Reset the speed
self.fliteController.target_mean = 1.0; // Reset the pitch
self.fliteController.target_stddev = 1.0; // Reset the variance

回复收藏 0 原文