We don’t allow questions seeking recommendations for software libraries, tutorials, tools, books, or other off-site resources. You can edit the question so it can be answered with facts and citations.
Closed 5 years ago.
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
接受
或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
发布评论
评论(7)
您可以使用此类,该类的速度足以进行实时音频分析
警告:此代码似乎源自 此处,并拥有 GPLv2 许可证。
You can use this class, which is fast enough for real time audio analysis
Warning: this code appears to be derived from here, and has a GPLv2 license.
使用该课程:https://www.ee.columbia。 edu/~ronw/code/MEAPsoft/doc/html/FFT_8java-source.html
简短说明:调用 fft() 提供 x 作为幅度数据, 是作为全零数组,函数返回后您的第一个答案将是 a[0]=x[0]^2+y[0]^2。
完整说明:FFT是复数变换,它需要N个复数并产生N个复数。所以 x[0] 是第一个数字的实部,y[0] 是复数部分。该函数进行就地计算,因此当函数返回 x 和 y 时,将具有变换的实数部分和复数部分。
一种典型的用途是计算音频的功率谱。您的音频样本只有实部,复数部分为 0。要计算功率谱,您需要将实数部分和复数部分的平方相加 P[0]=x[0]^2+y[0]^2。
另外,值得注意的是,当傅里叶变换应用于实数时,会产生对称结果 (x[0]==x[x.lenth-1])。 x[x.length/2] 处的数据具有来自频率 f=0Hz 的数据。 x[0]==x[x.length-1] 的数据频率等于采样率(例如,如果采样频率为 44000Hz,则意味着 f[0] 指的是 22kHz)。
完整过程:
比调整根据您的口味固定数量。
数字512定义了采样窗口,我就不解释了。只是避免减少太多。
数字 1024 必须始终是最后一个数字的双倍。
数字 50 定义了更新率。如果您的采样率为每秒 44000 个样本,则更新速率将为:R=44000/1024/50 = 0.85 秒。
Using the class at: https://www.ee.columbia.edu/~ronw/code/MEAPsoft/doc/html/FFT_8java-source.html
Short explanation: call fft() providing x as you amplitude data, y as all-zeros array, after the function returns your first answer will be a[0]=x[0]^2+y[0]^2.
Complete explanation: FFT is complex transform, it takes N complex numbers and produces N complex numbers. So x[0] is the real part of the first number, y[0] is the complex part. This function computes in-place, so when the function returns x and y will have the real and complex parts of the transform.
One typical usage is to calculate the power spectrum of audio. Your audio samples only have real part, you your complex part is 0. To calculate the power spectrum you add the square of the real and complex parts P[0]=x[0]^2+y[0]^2.
Also it's important to notice that the Fourier transform, when applied over real numbers, result in symmetrical result (x[0]==x[x.lenth-1]). The data at x[x.length/2] have the data from frequency f=0Hz. x[0]==x[x.length-1] has the data for a frequency equals to have the sampling rate (eg if you sampling was 44000Hz than it means f[0] refeers to 22kHz).
Full procedure:
Than adjust the fixed number for your taste.
The number 512 defines the sampling window, I won't explain it. Just avoid reducing it too much.
The number 1024 must be always the double of the last number.
The number 50 defines you update rate. If your sampling rate is 44000 samples per second you update rate will be: R=44000/1024/50 = 0.85 seconds.
Kissfft 是一个足够好的库,可以在 Android 上编译。它具有比 FFTW 更通用的许可证(尽管 FFTW 无疑更好)。
您可以在 libgdx https://github.com/libgdx/libgdx/blob/0.9.9/extensions/gdx-audio/src/com/badlogic/gdx/audio/analysis/KissFFT.java
或者如果您想要一个纯基于 Java 的解决方案,请尝试 jTransforms
https://sites.google.com/site/piotrwendykier/software/jtransforms
kissfft is a decent enough library that compiles on android. It has a more versatile license than FFTW (even though FFTW is admittedly better).
You can find an android binding for kissfft in libgdx https://github.com/libgdx/libgdx/blob/0.9.9/extensions/gdx-audio/src/com/badlogic/gdx/audio/analysis/KissFFT.java
Or if you would like a pure Java based solution try jTransforms
https://sites.google.com/site/piotrwendykier/software/jtransforms
使用这个类( EricLarch 的答案源自于此)。
使用说明
此函数用 FFT 输出替换输入数组。
输入
即如果您的输入是
(1+8i, 2+3j, 7-i, -10-3i)
输出
要获得经典的 FFT 图,您需要计算实部和虚部的幅度。
类似于:
另请参阅此 StackOverflow 答案,了解如果您的原始输入是幅度与时间的关系,如何获取频率。
Use this class (the one that EricLarch's answer is derived from).
Usage Notes
This function replaces your inputs arrays with the FFT output.
Input
i.e. if your input is
(1+8i, 2+3j, 7-i, -10-3i)
Output
To get your classic FFT graph, you will want to calculate the magnitude of the real and imaginary parts.
Something like:
Also see this StackOverflow answer for how to get frequencies if your original input was magnitude vs. time.
是的,有
JTransforms
维护在 github 此处 和可作为 Maven 插件此处使用。使用:
但是对于更新的 Gradle 版本,您需要使用类似以下内容的内容:
Yes, there is the
JTransforms
that is maintained on github here and avaiable as a Maven plugin here.Use with:
But with more recent, Gradle versions you need to use something like:
@王杰
您的输出幅度似乎比您链接的线程上给出的答案更好,但这仍然是幅度的平方...复数的幅度
是根据
链接线程中的答案计算的,表明对于纯实数输入,输出
应该使用 a2 或 a 作为输出,因为
b_(i) = a_(i+N/ 2)
表示表中的复杂部分位于第二个输出表的一半。
即实数输入表的输出表的后半部分是实数的共轭...
因此 z = a-ia 给出了一个幅度
,因此值得注意的是缩放因子...
我建议您在书或维基上查找所有这些内容以确保确定。
@J Wang
Your output magnitude seems better than the answer given on the thread you have linked however that is still magnitude squared ... the magnitude of a complex number
is calculated as
the answer in the linked thread suggests that for pure real inputs the outputs
should be using a2 or a for the output because the values for
with
b_(i) = a_(i+N/2)
meaning the complex part in their table is in the secondhalf of the output table.
i.e the second half of the output table for an input table of reals is the conjugate of the real ...
so
z = a-ia
giving a magnitudeso it is worth noting the scaling factors ...
I would recommend looking all this up in a book or on wiki to be sure.
不幸的是,最上面的答案只适用于数组,它的大小是 2 的幂,这是非常有限的。
我使用了 Jtransforms 库,它运行得很好,你可以将它与 Matlab 使用的函数进行比较。
这是我的代码,其中的注释引用了 matlab 如何转换任何信号并获取频率幅度( https://la.mathworks.com/help/matlab/ref/fft.html)
首先,在 build.gradle (应用程序)中添加以下内容
,这是用于转换简单正弦的代码波浪,就像魅力一样
Unfortunately the top answer only works for Array that its size is a power of 2, which is very limiting.
I used the Jtransforms library and it works perfectly, you can compare it to the function used by Matlab.
here is my code with comments referencing how matlab transforms any signal and gets the frequency amplitudes (https://la.mathworks.com/help/matlab/ref/fft.html)
first, add the following in the build.gradle (app)
and here it is the code for for transforming a simple sine wave, works like a charm