Microsoft Sam、SAPI 替代品
我们有一个应用程序,计划使用 Microsoft 语音 API。现在我们在 Windows XP 上使用 Microsoft Sam 语音对其进行了测试,坦率地说,这听起来很糟糕......几乎不可能听清声音想说什么。
还有其他更好的声音吗?是否有更好的更新或更新版本。是否有其他产品、开源项目等可以作为替代方案?
只是为了澄清 - 它需要某种 API,以便我实际上可以针对它进行编程。
We have a application that we were planing to use Microsoft speech API for. Now we tested it on Windows XP using Microsoft Sam voice and frankly it sound terrible ... It's almost impossible to hear what the voice is trying to say.
Are there other, better voice. Are there any updates or newer versions out there that are better. Are there other product, open source projects etc that can work as an alternative?
Just to clarify - It needs to have some sort of API so I actually can program against it.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(4)
在 Windows 上,我发现的最好的方法是使用语音 API 和来自 AT&T Natural Voices 的语音: https:// nextup.com/attnv.html
然而,如果有的话,它们也非常昂贵。我遇到过一些项目,其使用/商业模式与 AT&T 的想法相去甚远,他们甚至不会出售许可证。
有一个免费软件替代品,Festival:http://festvox.org/,但质量很糟糕。比目前商用系统的音质落后约10年。然而它是免费的。
对我来说效果很好的第三种选择是将一些项目的语音合成部分转移到 OS X。OS X 有一套不错的工具和语音 API 以及一套相当不错的库存语音。当然,缺点是为这些 API 编写的程序只能在 OS X 下运行,而 OS X 只能在 Apple 硬件上运行。
On Windows about the best I have found was using the speech API and voices from AT&T Natural Voices: https://nextup.com/attnv.html
They are however VERY expensive if available at all. I have run into projects where the usage/business model was so far from what AT&T was thinking of that they wouldn't even sell a license.
There is a free software alternative, Festival: http://festvox.org/ , the quality though is horrible. It is about 10 years behind the current sound quality of commercial systems. It is however free.
A third alternative which has worked well for me was to shift the voice synthesis part of a few projects to OS X. OS X has a decent set of tools and speech APIS and a fairly decent set of stock voices. The downside of course is that prorams written for these APIs run only under OS X which runs only on Apple hardware.
AT&T Natural Voices 引擎可产生出色的语音,但它不是免费的
还有 NeoSpeech 也不错 - 也不是免费的
AT&T Natural Voices engine produces great speech but its not free
there is also NeoSpeech which are also good - Not free as well
您没有描述您的许可需求,所以我不知道其中任何一个是否适合这方面,但以下所有内容都是 SAPI 5 兼容语音的来源:
Ivona (http://www.ivona.com/) - 我在 SAPI 项目上使用他们的 Kendra 语音。
AT&T 自然声音 (http://www2.research.att.com/~ttsweb/ tts/)
Loquendo (http://www.loquendo.com/)
Acapela (http://www.acapela-group.com/products/products.asp) 倒
谱(http://www.cepstral.com/)
fonix (http://www.fonixspeech.com/tts.php) - 仅当您喜欢原版说话并拼写。
Nuance RealSpeak(我不确定这个...)
You don't describe your licensing needs, so I don't know if any of these will be suitable in that regard, but all of the following are sources of SAPI 5 compatible voices:
Ivona (http://www.ivona.com/) - I'm using their Kendra voice on a SAPI project.
AT&T Natural Voices (http://www2.research.att.com/~ttsweb/tts/)
Loquendo (http://www.loquendo.com/)
Acapela (http://www.acapela-group.com/products/products.asp)
Cepstral (http://www.cepstral.com/)
fonix (http://www.fonixspeech.com/tts.php) - only if you loved the original Speak & Spell.
Nuance RealSpeak (I'm not sure about this one...)
您可以使用免费且开源的 Festival。默认的 Festival 声音听起来有点像史蒂芬·霍金,但您可以使用其他一些更好的 HTS 声音。例如,尝试在此演示页面上选择 Peter HTS 2011 语音: http:// www.cstr.ed.ac.uk/projects/festival/morevoices.html。我见过的大多数 Festival 的 HTS 声音都不允许用于商业用途,但这个似乎是免费的:http://homepages.inf.ed.ac.uk/jyamagis/software/page54/page54.html
您可以查看此 YouTube 教程:http://www.youtube.com/watch?v=MmcLFJQpv2o< /a>
You can use free and open source Festival. The default Festival voice sounds a little like Stephen Hawking but you can use some other much better HTS voices. For example try selecting Peter HTS 2011 voice on this demo page: http://www.cstr.ed.ac.uk/projects/festival/morevoices.html. Most of HTS voices for Festival that I've seen are not allowed for commercial use however this one seems to be free: http://homepages.inf.ed.ac.uk/jyamagis/software/page54/page54.html
You can check this youtube tutorial: http://www.youtube.com/watch?v=MmcLFJQpv2o