标准化 FFT 数据以适应人类听觉

发布于 2024-07-18 11:31:11 字数 835 浏览 3 评论 0原文

典型的音频 FFT 看起来与此非常相似，大部分操作发生在最左侧

http://www.flight404.com/blog/images/fft.jpg

他将其乘以部分正弦波以得到底部，但是这篇文章对这部分内容并不太具体它。这似乎也是对数据集的一种“足够好”的修改，而不是基于某些属性的修改。我知道人类的听觉更适合较高的频率，因此，大多数音乐都会放大低音和衰减高音，这样两种声音对我们来说具有相对相等的强度。

我的问题是需要对 FFT 进行哪些修改才能补偿这种标准衰减？

for(i = 0; i < fft.length; i++){
     fft[i] = fft[i] * Math.log(i + 1); // does, eh, ok but the high
                                        // end is still not really "loud"
                                        // enough
}

编辑::

http://en.wikipedia.org/wiki/Equal-loudness_contour

我看到这篇文章，我认为这可能是前进的方向，但 FFT 的某些属性可能仍然需要抵消。

原文

The typical FFT for audio looks pretty similar to this, with most of the action happening on the far left side

http://www.flight404.com/blog/images/fft.jpg

He multiplied it by a partial sine wave to get it to the bottom, but the article isn't too specific on this part of it. It also seems like a "good enough" modification of the dataset, rather than one based on some property. I understand that human hearing is better suited to the higher frequencies, thus, most music will have amplified bass and attenuated treble so that both sound to us as being of relatively equal strength.

My question is what modification needs to be done to the FFT to compensate for this standard falloff?

for(i = 0; i < fft.length; i++){
     fft[i] = fft[i] * Math.log(i + 1); // does, eh, ok but the high
                                        // end is still not really "loud"
                                        // enough
}

EDIT ::

http://en.wikipedia.org/wiki/Equal-loudness_contour

I came across this article, I think it might be the direction to head in, but there still might be some property of an FFT that needs to be counteracte.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

○闲身 2024-07-25 11:31:11

首先，您确定要这样做吗？补偿一些事情是有意义的，例如麦克风响应不平坦，但不是人类的感知。人们习惯于听到具有现实世界中声音频谱内容的声音，而不是沿着感知等响度曲线。如果您按照您建议的方式播放已修改的声音，那么听起来会很奇怪。也许有些人喜欢增强低频的音乐，但这是品味问题，而不是心理物理学问题。

或者，您可能出于其他原因进行补偿，例如，考虑到对较低频率的较差敏感度可能会增强压缩算法。这是这个想法吗？

如果您确实想通过等响度曲线进行归一化，则应注意大多数曲线和方程都是以声压级 (SPL) 为单位的。 SPL 是波形幅度平方的对数，因此当您使用 FFT 时，使用其平方（功率谱）可能是最简单的。（或者，当然，您可以通过其他方式进行补偿，例如在上面的等式中乘以 sqrt(log(i+1)) - 假设对数是逆等响度曲线的近似值。）

回复收藏 0 原文