使用 MS 语音识别服务器 API 进行转录

发布于 2024-09-15 13:13:55 字数 230 浏览 13 评论 0原文

众所周知,目前 MS 语音 API (Microsoft.Speech) 的服务器版本不支持加载听写语法。

Q1:有办法解决这个问题吗?

Q2:我想到的想法之一是根据人们就特定主题进行的对话历史(例如聊天室)构建语法。该软件需要进行自然语言转录,但所使用的语言子集可能会受到一定限制。

这有可能相当准确和稳健吗?或者这是你不应该用非指示语法来做的事情?

谢谢

As we all know by now loading dictation grammars is not supported in the server version of MS speech API (Microsoft.Speech).

Q1: Is there a way to get around that?

Q2: One of the ideas I thought of would be to build a grammar based on a history of the conversations (chat room for instance) that people had on a particular topic. The software would need to do natural language transcription, but the subset of language used could possibly be somewhat limited.

Does this have any chances of being reasonably accurate and robust? Or is it something you are just not meant to do with non-dication grammars?

Thanks

Jan

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(2

荭秂 2024-09-22 13:13:55

我从事的一个项目试图通过大量使用命令和命令来模拟听写转录。控制语法。没用;不确定性太高,你会得到随机识别而不是合理的结果。

对不起。

I worked on a project that tried to simulate dictation transcription with a heavy use of command & control grammars. It didn't work; the uncertainty is just too high, and you get random recognitions instead of reasonable results.

Sorry.

帝王念 2024-09-22 13:13:55

我已经很多年没有使用过 SAPI 了,但在我看来,你想要的不是语法,而是一个语言模型。这是一个单词跟随另一个单词的一组概率(我的,这过于简化了......)无论如何,大多数语音引擎都使您能够基于示例文本语料库创建语言模型。文本越多,效果就越好。

I haven't worked with SAPI in a number of years, but it sounds to me like what you want is not a grammar, but a language model. This is a set of probabilities that one word follows another (my, that's an over simplification...) Anyway, most speech engines give you the ability to create a language model based on a corpus of sample text. The more text, the better it will work.

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文