Invoking Google Voice Actions from text, e.g. to build my own voice action application
I have an application that uses Google's voice recognition to get the user's voice input as text. This works without problems: I get the text.
Is it possible to send my text string to be parsed by the same logic/engine as Google's Voice Actions and have the Android OS execute the correct action?
Alternatively, can I start an intent that prompts the user for a Google Voice Action? The speech would be executed as such (e.g. "listen to..."), but if no Google Voice Action matches what was said, the result text would be broadcast back to my app to handle as overflow.
Comments (1)
Parsing the returned text and determining what the user really meant (rather than the literal text of what they said) is a complex problem. At the simplest end, you can maintain a table of keywords, search the returned text for keyword matches, and map each keyword to a predetermined meaning. More advanced systems use statistical classification techniques to identify the correct action from a user's transcribed speech. If you want to explore advanced classifiers, you might want to look at Mallet.
Sorry, I don't think Google gives you access to their voice action classifier, just the recognizer.
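The keyword-table approach described above can be sketched roughly as follows. This is only a minimal illustration in Python, not how Google's Voice Actions work. The table `KEYWORD_ACTIONS`, the action names, and the function `match_action` are all hypothetical names made up for this example. The same idea ports directly to Java or Kotlin inside an Android app.

```python
import re

# Hypothetical keyword table: each action name maps to the phrases that trigger it.
KEYWORD_ACTIONS = {
    "play_music": ["listen to", "play"],
    "navigate": ["navigate to", "directions to"],
    "call": ["call", "dial"],
}


def match_action(text):
    """Return (action, remainder) for the earliest keyword found in text.

    If no keyword matches, return (None, normalized_text) so the caller
    can treat the text as overflow.
    """
    # Lowercase and collapse whitespace so matching is case-insensitive.
    normalized = " ".join(text.lower().split())
    best = None  # (match position, action name, text after the keyword)
    for action, keywords in KEYWORD_ACTIONS.items():
        for keyword in keywords:
            # Word boundaries keep "call" from matching inside "recall".
            found = re.search(r"\b" + re.escape(keyword) + r"\b", normalized)
            if found and (best is None or found.start() < best[0]):
                best = (found.start(), action, normalized[found.end():].strip())
    if best is None:
        return None, normalized
    return best[1], best[2]
```

With this table, `match_action("Listen to Miles Davis")` yields the `play_music` action with `miles davis` as the argument, while text with no matching keyword comes back with `None` as the action. Picking the earliest match is one arbitrary design choice; a real app would likely need priorities or a classifier once the table grows.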