如何使用 FliteController 类中的 withVoice: 参数减慢语音速度?
我正在使用 OpenEars FliteController 类将文本转换为语音。
我正在使用以下方法:
[self.fliteController say:@"Aphrase Id like my app to say outloud." withVoice:@"cmu_us_awb8k"];
在 withVoice: 之后可以输入的参数选项如下:
cmu_us_awb8k
cmu_us_rms8k
cmu_us_slt8k
cmu_time_awb
cmu_us_awb
cmu_us_kal
cmu_us_kal16
cmu_us_rms
cmu_us_slt
我用 withVoice: 尝试了所有这些参数,
但是,我的客户对这些声音不满意。他告诉我这些都很快。因此,我需要一种使用 withVoice: 参数来减慢语音速度的方法。我该怎么做呢?还有其他语音可用吗?
I am using the OpenEars FliteController class to convert text to speech.
I am using the method:
[self.fliteController say:@"A phrase I'd like my app to speak out loud." withVoice:@"cmu_us_awb8k"];
The options for arguments one can enter after withVoice: are as follows:
cmu_us_awb8k
cmu_us_rms8k
cmu_us_slt8k
cmu_time_awb
cmu_us_awb
cmu_us_kal
cmu_us_kal16
cmu_us_rms
cmu_us_slt
I tried all of these arguments with withVoice:
However, my client is not happy with the voices. He told me these all are fast. So, I need a way to slow down speech using the withVoice: argument. How can I do it? Are there any other speech voices available?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
自 0.911 版本起,速度/音高/方差变换现已成为 OpenEars 的一项功能。您可以按如下方式使用它:
使用 FliteController 的以下属性更改声音的速度、音调和变化性:
例如,在发送此消息之前:
[self.fliteController say:@"Aphrase Id like my app大声说话。” withVoice:@"cmu_us_awb8k"];
您可以对 self.fliteController 进行以下设置:
1.0 是默认值,0.0 是最小值,2.0 是可能的最大有用值(尽管您可以高于 2.0,但它可能不会是一个有用的值) 。
uration_stretch 为 0.5 时,速度是 1.0 的两倍,duration_stretch 为 2.0 时,速度是 1.0 的一半。 target_mean 或 target_stddev 为 2.0 将使平均音调频率或方差加倍,而 0.5 会将它们减少一半。
您不必设置或覆盖这些设置 - 如果您不使用它们,它们将被设置为语音的默认值。如果您想在覆盖它们后将它们恢复为默认值,只需将它们全部设置为 1.0:
Speed/pitch/variance shift is now a feature of OpenEars as of version .911. You can use it as follows:
Change the speed, pitch and variability of a voice by using the following properties of FliteController:
For instance, right before sending this message:
[self.fliteController say:@"A phrase I'd like my app to speak out loud." withVoice:@"cmu_us_awb8k"];
You could make the following settings to self.fliteController:
1.0 is the default, 0.0 is the minimum value and 2.0 is a likely maximum useful value (although you can go higher than 2.0, it probably isn't going to be a useful value).
A duration_stretch of .5 will be twice as fast as 1.0, a duration_stretch of 2.0 will be half the speed of 1.0. A target_mean or target_stddev of 2.0 will double the average pitch frequency or double the variance, while 0.5 will reduce them by half.
You do not have to set or override these settings — if you don’t use them, they will be set to the defaults for the voice. If you want to return them to the defaults after overriding them, simply set them all to 1.0:
我相信这是 OpenEars 的成果。不知道如何在 OpenEars 中进行配置。但是您可以查看此演示,它可以根据您的要求进行自定义。就像您可以设置声音的音高/方差/速度一样。从链接克隆存储库。
如果您可以迁移到此,这将是您的解决方案。如果您对此演示有任何疑问,我一定可以提供帮助,因为我为 TextToSpeech 准备了 1 个带有播放/暂停/停止选项的演示。
希望有帮助。
I believe this is with OpenEars. Don't know about configuring this in OpenEars. But you can have a look at this demo which can be customizable as per your requirement. Like you can set Pitch/Variance/Speed for the voices. Clone the repository from the link.
If you can migrate to this it would be your solution. If you have any doubt in this demo I can surely help as I prepared 1 demo with Play/Pause/Stop options for TextToSpeech.
Hope it helps.