关于 FFT 和音调估计的几个问题
我需要对 FFT 和音调估计进行一些一般性的澄清。
1.) 我读到,FFT 的块大小越大,其精度就越高,尽管我知道这也有一个缺点。这是真的吗?因为我一直在试验,每当我使用 16384 的块大小而不是 8192 或 4096 时,我都会得到更糟糕的结果。有人可以向我解释一下吗?
2.) 最初,我认为从 FFT 获取音调只是获取最高强度的 bin 的简单问题。然而,在在这里发布并阅读了一些问题之后,我认为可能还有更多这样的问题。有人可以建议我如何从 FFT 中获得良好的音高估计吗?
3.)虽然我已经有了一个好主意,但有人可以简单地解释一下自相关器算法的作用吗? (我的想法是,它基本上是一种比较和对比算法,并且选择差异最小的算法)
非常感谢!
I have a couple of clarifications that I need with FFT and Pitch Estimation in general.
1.) I read that the larger your block size for your FFT, the better accuracy it will have, although I know that there is also a downside to this. Is this really true? Because Ive been experimenting and whenever I use a block size of 16384 as opposed to 8192 or 4096, I get worse results. Can someone clarify me about this?
2.) Initially, I believed that getting the pitch from the FFT is only a simple matter of getting the bin with the highest intensity. However, after posting and reading some questions here, I think that there may be more the this. Can someone suggest me on how to get a good pitch estimation from FFT?
3.) Although I already have a good idea, can someone just explain in simple terms what the auto-correlator algorithm does? (My idea is that its basically a compare and contrast algorithm and the one with the lowest difference is the chosen one)
Thanks a lot!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)