使用 MS 语音识别服务器 API 进行转录
众所周知,目前 MS 语音 API (Microsoft.Speech) 的服务器版本不支持加载听写语法。
Q1:有办法解决这个问题吗?
Q2:我想到的想法之一是根据人们就特定主题进行的对话历史(例如聊天室)构建语法。该软件需要进行自然语言转录,但所使用的语言子集可能会受到一定限制。
这有可能相当准确和稳健吗?或者这是你不应该用非指示语法来做的事情?
谢谢
简
As we all know by now loading dictation grammars is not supported in the server version of MS speech API (Microsoft.Speech).
Q1: Is there a way to get around that?
Q2: One of the ideas I thought of would be to build a grammar based on a history of the conversations (chat room for instance) that people had on a particular topic. The software would need to do natural language transcription, but the subset of language used could possibly be somewhat limited.
Does this have any chances of being reasonably accurate and robust? Or is it something you are just not meant to do with non-dication grammars?
Thanks
Jan
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
我从事的一个项目试图通过大量使用命令和命令来模拟听写转录。控制语法。没用;不确定性太高,你会得到随机识别而不是合理的结果。
对不起。
I worked on a project that tried to simulate dictation transcription with a heavy use of command & control grammars. It didn't work; the uncertainty is just too high, and you get random recognitions instead of reasonable results.
Sorry.
我已经很多年没有使用过 SAPI 了,但在我看来,你想要的不是语法,而是一个语言模型。这是一个单词跟随另一个单词的一组概率(我的,这过于简化了......)无论如何,大多数语音引擎都使您能够基于示例文本语料库创建语言模型。文本越多,效果就越好。
I haven't worked with SAPI in a number of years, but it sounds to me like what you want is not a grammar, but a language model. This is a set of probabilities that one word follows another (my, that's an over simplification...) Anyway, most speech engines give you the ability to create a language model based on a corpus of sample text. The more text, the better it will work.