梅尔频率倒谱系数如何工作?
我已经准备好了根据麦克风输入实时计算的 FFT 和音调 + 绝对频率。 现在我想计算音色。
我看到了梅尔频率倒谱系数 - MFCC,但我不太理解它。 有人可以给我一些这方面的建议吗?
I allready have FFT and pitch + absolute frequency calculated in real-time from input of microphone.
Now I want to calculate the timbre.
I saw Mel Frequency Cepstrum Coefficients - MFCCs but I didn't understand it very well.
Can someone give me some tips on this..
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
MFCC 结合了人类听觉(对数频率感知,梅尔标度)和物理方面的考虑乐器(这些系统通常具有明确定义的泛音,即和声——这就是 MFCC 的原因使用 FFT 的 FFT),给出乐器音色的简化表示(其中基频和响度被分解)。
人们可以就这个主题写无数页,并且网络上有很多可用的内容,因此一个更具体的问题可以清楚地解释您想知道的内容会很有帮助。计算 MFCC 的算法列在维基百科页面的顶部。
MFCCs combine consideration of aspects of human hearing (logarithmic frequency perception, the mel scale) and physics of musical instruments (these systems often have well defined overtones that are harmonic -- which is why the MFCCs use the FFT of the FFT), to give a simplified representation of the timbre of an instrument (where the fundamental frequency and loudness are factored out).
One could write endless pages on this topic, and there are many available on the web, so a more specific question that explains clearly what you want to know would be helpful. The algorithm for calculating MFCCs is listed at the top of the wikipedia page.