Custom normalization in Windows 7 Speech Recognition
I am using the built-in Speech Recognition in Windows 7 to fill in some forms. It works quite well, but I have a problem. The speech recognizer does some kind of automatic normalization. For example, if I say "twelve dollars and fifty cents" it writes $12.5. That's good.
However, if I say "15 by 3 metres" it writes that phrase verbatim instead of what I would like it to write: "15x3 m".
How can I customize the speech normalization process to do what I need?
Comments (2)
You can disable the built-in normalization and implement your own speech handler; look at this thread here.
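For what it's worth, if you go the "own handler" route, the normalization step itself can be a plain text post-processing pass over whatever the recognizer hands you. Below is a minimal sketch in Python, not anything provided by SAPI: the function name `normalize_dimensions` and the `UNIT_ABBREVIATIONS` table are made up for illustration, and it assumes the recognized text already contains digits (e.g. "15 by 3 metres"), as described in the question. How you get the recognized text into this function is left out.

```python
import re

# Hypothetical unit table (an assumption, not part of any speech API):
# spoken unit word -> abbreviation. Extend as needed.
UNIT_ABBREVIATIONS = {
    "metre": "m", "metres": "m", "meter": "m", "meters": "m",
    "centimetre": "cm", "centimetres": "cm",
    "millimetre": "mm", "millimetres": "mm",
}

# Matches "<number> by <number>", optionally followed by one word
# that may or may not be a known unit.
_DIMENSION = re.compile(
    r"\b(\d+(?:\.\d+)?)\s+by\s+(\d+(?:\.\d+)?)(?:\s+([A-Za-z]+))?\b",
    re.IGNORECASE,
)


def normalize_dimensions(text):
    """Rewrite phrases like '15 by 3 metres' as '15x3 m'."""

    def rewrite(match):
        first, second, word = match.groups()
        compact = f"{first}x{second}"
        if word is None:
            return compact
        # Abbreviate known units; leave any other trailing word untouched.
        unit = UNIT_ABBREVIATIONS.get(word.lower())
        return f"{compact} {unit if unit else word}"

    return _DIMENSION.sub(rewrite, text)


if __name__ == "__main__":
    print(normalize_dimensions("The room is 15 by 3 metres wide"))
```

The same idea extends to other phrases: keep one pattern and one rewrite function per rule, and run the recognized text through each of them before it is written into the form.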
There is definitely no way to do this using the SAPI APIs. (The lexicon APIs are close, but aren't parameterizable.)
The only way I know of to customize the normalization entries is to use the Dictation Resource Kit, but that's overkill for this purpose: you would have to rebuild the entire dictation grammar from scratch (since the DRK doesn't include the sources for the default dictation grammar).