文本转语音(语音生成)和语音转文本(语音识别)API?

发布于 2024-11-15 05:02:54 字数 35 浏览 4 评论 0原文

是否有适用于桌面或浏览器环境的已知 API 的完整列表?

Is there a comprehensive list of known APIs for desktop or browser environments?

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(3

千と千尋 2024-11-22 05:02:54

我将重新整理并更新 用 C 语言、Java 语言或 PHP 语言进行语音识别?。这绝不是全面的,但它可能是您的一个开始


通过几个月来观察这些问题,我发现大多数开发人员的选择都是这样分解的:

Windows 人员 - 使用 .Net 或 Microsoft 的 System.Speech 功能。语音并安装 Microsoft 提供的免费识别器。 Windows 7 包含完整的语音引擎。其他可免费下载。同一引擎有一个 C++ API,称为 SAPI。请参阅 http://msdn.microsoft.com/en-us/magazine/ cc163663.aspx。或 http://msdn.microsoft.com/ en-us/library/ms723627(v=vs.85).aspx。有关适用于 Windows 的 Microsoft 引擎的更多背景信息
System.Speech 之间有什么区别.Recognition 和 Microsoft.Speech.Recognition?

Linux 人员 - Sphinx 似乎有很多追随者。请参阅 http://cmusphinx.sourceforge.net/http://cmusphinx.sourceforge.net/wiki/

商业产品 - NuanceLoquendo, AT&TIBM、其他。每个都为各种语言提供自己的 SDK 和库。

在线服务 - NuanceYapme, ispeech.orgvlingo、其他。 Nuance 改进了他们的开发者计划,现在将让您免费访问他们的服务进行开发。 Yap(我相信)最近 由亚马逊购买,因此我们可能会看到一些变化。

当然这也可能有帮助 - http://en.wikipedia.org/wiki/List_of_speech_recognition_software

有一个 Java 语音 API。请参阅 Java 语音 API 中的 javax.speech.recognition http://java.sun.com/products/java-media/speech/forDevelopers/jsapi-guide/Recognition.html。我相信你还是得找到一个支持这个API的语音引擎。我不认为 Sphinx 完全支持它 - http://cmusphinx。 sourceforge.net/sphinx4/doc/Sphinx4-faq.html#support_jsapi

还有很多其他问题:
需要适用于 Linux 的文本转语音和语音识别工具
pyspeech (python) - Transcribe mp3 files? 谈论 http://code.google.com/p/pyspeech/。您可能还想查看http://code.google.com/p/dragonfly/

I'll rehash and update an answer from Speech recognition in C or Java or PHP?. This is by no means comprehensive, but it might be a start for you


From watching these questions for few months, I've seen most developer choices break down like this:

Windows folks - use the System.Speech features of .Net or Microsoft.Speech and install the free recognizers Microsoft provides. Windows 7 includes a full speech engine. Others are downloadable for free. There is a C++ API to the same engines known as SAPI. See at http://msdn.microsoft.com/en-us/magazine/cc163663.aspx. or http://msdn.microsoft.com/en-us/library/ms723627(v=vs.85).aspx. More background on Microsoft engines for Windows
What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?

Linux folks - Sphinx seems to have a good following. See http://cmusphinx.sourceforge.net/ and http://cmusphinx.sourceforge.net/wiki/

Commercial products - Nuance, Loquendo, AT&T, IBM, others. Each provide their own SDKs and libraries for various languages.

Online service - Nuance, Yapme, ispeech.org, vlingo, others. Nuance has improved their developer program and will now give you free access to their services for development. Yap (I believe) was recently purchased by Amazon, so we may see some changes there.

Of course this may also be helpful - http://en.wikipedia.org/wiki/List_of_speech_recognition_software

There is a Java speech API. See javax.speech.recognition in the Java Speech API http://java.sun.com/products/java-media/speech/forDevelopers/jsapi-guide/Recognition.html. I believe you still have to find a speech engine that supports this API. I don't think Sphinx fully supports it - http://cmusphinx.sourceforge.net/sphinx4/doc/Sphinx4-faq.html#support_jsapi

There are lots of other SO quesitons:
Need text to speech and speech recognition tools for Linux
and pyspeech (python) - Transcribe mp3 files? which talks about http://code.google.com/p/pyspeech/. You may also want to look at http://code.google.com/p/dragonfly/

平生欢 2024-11-22 05:02:54

领先的文本转语音(语音生成)API 供应商是 YAKiToMe! 和 iSpeech。 YAKiToMe!是我使用的一个,因为我最喜欢他们的语音质量,而且价格最便宜(大部分是免费的)。他们支持多种语言的男性和女性使用者。一些语音供应商,如 Acapella、Nuance、Loquendo 和 iVona,拥有不错的语音,但使用起来往往很昂贵。

The leading API vendors of text to speech (voice generation) are YAKiToMe! and iSpeech. YAKiToMe! is the one I use because I like their voice quality the best and they're the least expensive (mostly free). They support male and female speakers in multiple languages. Some of the voice vendors, like Acapella, Nuance, Loquendo and iVona have decent voices but tend to be expensive to use.

她比我温柔 2024-11-22 05:02:54

您可以这样做:
注意:它是来自google的api,所以它只适用于chrome浏览器。

(在此处查看现场演示并下载完整源代码 http://purpledesign.in/blog/?p=33 )

定义一个按钮

<input id="speech" type="text" speech="speech" x-webkit-speech="x-webkit-speech" onspeechchange="processspeech();" onwebkitspeechchange="processspeech();" />

并定义您想要在 javascript 文件中的函数中执行的操作

,如下

   function processspeech()
   {
     var speechtext=$("#speech").val();
     var elem = document.getElementById("test");
     elem.value = speechtext;
     var notification="\"<span style=\"color:#F00; text-transform:uppercase;\">"+  speechtext + "</span>\" <br />*Is this what you said???";
    notify(notification);
}

所示

<textarea> id="test"></textarea>

语音写在文本区域中

Here is how you can do it:
Note: it is an api from google, so it only works in chrome browser.

(See live demo and download full source code here http://purpledesign.in/blog/?p=33)

define a button

<input id="speech" type="text" speech="speech" x-webkit-speech="x-webkit-speech" onspeechchange="processspeech();" onwebkitspeechchange="processspeech();" />

and define what you want to do in a function in your javascript file

Like This

   function processspeech()
   {
     var speechtext=$("#speech").val();
     var elem = document.getElementById("test");
     elem.value = speechtext;
     var notification="\"<span style=\"color:#F00; text-transform:uppercase;\">"+  speechtext + "</span>\" <br />*Is this what you said???";
    notify(notification);
}

Here

<textarea> id="test"></textarea>

The speech is written in the textarea

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文