Server-side speech recognition

Posted on 2024-09-07 03:01:48

Comments (1)

池木 2024-09-14 03:01:48

There are several IVR services that host an entire VoIP session (telephone call) as a complete application, rather than offering individual service transactions "à la carte". If you were to make your program look like a VoIP call, you might be able to get it done with some of these services.

Voxeo published a list of free (and low cost) IVR hosting providers aimed towards developers for limited use. Not surprisingly, all will require registration.

Another possibility would be to make direct inquiries with Vlingo, Twilio, or Tropo, as they might sell you exactly what you need.

UPDATE: July 25, 2012

AT&T has announced availability of a Speech API on . You send it audio – it returns text in XML or JSON data formats. See also developer site.
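
To make the "audio in, text out" pattern concrete, here is a minimal Python sketch of what a client for this kind of service might look like. It is not the AT&T API: the endpoint URL, the bearer-token header, and the JSON/XML response shapes (a single `transcript` field or element) are all assumptions for illustration, so check your provider's documentation for the real ones.

```python
import json
import urllib.request
import xml.etree.ElementTree as ET

# Hypothetical endpoint; substitute the URL from your provider's documentation.
ENDPOINT = "https://speech.example.com/v1/recognize"


def build_request(audio: bytes, token: str,
                  accept: str = "application/json") -> urllib.request.Request:
    """Build a POST request carrying raw WAV audio in the body."""
    return urllib.request.Request(
        ENDPOINT,
        data=audio,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",  # assumed auth scheme
            "Content-Type": "audio/wav",
            "Accept": accept,
        },
    )


def extract_transcript(body: bytes, content_type: str) -> str:
    """Pull the transcript out of a response in the assumed JSON or XML shape."""
    if "json" in content_type:
        # Assumed shape: {"transcript": "..."}
        return json.loads(body)["transcript"]
    if "xml" in content_type:
        # Assumed shape: <result><transcript>...</transcript></result>
        return ET.fromstring(body).findtext("transcript", default="")
    raise ValueError(f"unsupported content type: {content_type}")


def recognize(audio: bytes, token: str) -> str:
    """Send the audio and return the recognized text (performs a network call)."""
    with urllib.request.urlopen(build_request(audio, token)) as resp:
        return extract_transcript(resp.read(), resp.headers.get("Content-Type", ""))
```

Keeping request construction and response parsing separate from the network call means you can swap in a different provider by changing only the endpoint, headers, and the two assumed response shapes.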

UPDATE: August 27, 2012

Another possibility is the Dragon Mobile SDK from Nuance, which is aimed at individual developers looking for an API enabling consumer applications with speech and/or text-to-speech functionality.

UPDATE: September 21, 2012

There seem to be several new providers offering exactly what you are looking for: speech samples in, text out. The following are listed on Programmable Web:

Also note that Loquendo is now part of Nuance.

UPDATE: June 27, 2013

AT&T's Speech API has a few targeted SDKs (Android, iOS, PhoneGap, Titanium, Windows) - some of which are hosted on GitHub. There's even source for a Unity 3D demo.

UPDATE: January 23, 2014

OneTok has reformulated its offerings as an SDK for iOS and Android.

Apparently the Voice Genie product has been so thoroughly digested by Genesys that little trace of it can be found. Given Genesys' positioning towards large enterprises, it is difficult to know if they have any small-volume or commodity offerings.

Plumvoice seems to have expanded their offerings.

As with many before it, Vlingo is now part of Nuance.

(I've tried to update any broken links in the original answer.)

UPDATE: October 31, 2015

Keeping this answer up-to-date is a Sisyphean task.

Voxeo's list of free (and low cost) IVR hosting providers now redirects to the AT&T Speech API, with which, in full disclosure, I now have material involvement. That disqualifies me from linking to pretty much anything without impugning my credibility.

That said, there are many players in the speech/NLP market. Do your due diligence.

UPDATE: April 8, 2016

So now Google is totally upsetting the apple cart.
