Windows Vista 语音识别引擎采样率
我需要从一组 WAV 文件中识别语音,这些文件未记录在执行识别的计算机上。我知道,如果您从以与引擎用于训练的录音相同的采样率录制的 WAV 文件中识别语音,结果会更好。
但我的问题是:Vista 引擎训练的采样率是多少?我似乎无法在任何地方找到此信息。
而且...您知道有什么方法可以将 WAV 文件从一种采样率转换为另一种采样率(从 C# 开始)吗?
谢谢!
I need to recognize the speech from a set of WAV files that are not recorded on the computer doing the recognition. I know that if you recognize the speech from WAV files that are recorded at the same sampling rate as the recordings that the engine used for training, the results will be better.
But my questions is: what's the sampling rate for which Vista's engine was trained? I cannot seem to find this information anywhere.
And also... do you know any method to convert the WAV files from one sampling rate to another, from C#?
Thanks!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
SAPI 桌面引擎的默认采样率为 11 KHz。桌面引擎可以在任何高于此的采样率下正常工作。此外,如果您使用 SpBindToFile 帮助器。你没有提到你正在使用哪种编程语言,所以我假设是 C++。
The default sampling rate for the SAPI desktop engines is 11 KHz. The desktop engines work well with any sampling rate above that. Also, SAPI will resample the audio for you, if you use the SpBindToFile helper. You didn't mention which programming language you're using, so I assumed C++.