Detecting only the sample data related to a specific part of a sound file

Posted 2024-12-25 06:36:40

I want to extract the sample data that belongs to a particular region of a sound clip, for example a single word, so that I end up with a collection of samples for that word only, which I can then pass through an FFT. How can I identify this subset within the bytes of the whole sound file? Because it is a 16-bit sound file (44100 Hz, 15 seconds), I convert the bytes to 2-byte values; some of the resulting data looks like this.
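For reference, the byte-to-sample conversion described above can be sketched as follows. This is a minimal sketch that assumes mono, little-endian, signed 16-bit PCM (the usual WAV layout) with any file header already stripped; the question does not state the channel count or byte order. The helper names `bytes_to_samples` and `time_to_index` are made up for illustration.

```python
import struct

def bytes_to_samples(raw: bytes) -> list:
    """Decode little-endian signed 16-bit PCM bytes into integer samples.

    Assumption: mono audio with no header bytes in `raw`.
    """
    count = len(raw) // 2  # ignore a trailing odd byte, if any
    return list(struct.unpack("<%dh" % count, raw[:count * 2]))

def time_to_index(seconds: float, sample_rate: int = 44100) -> int:
    """Convert a time offset in seconds to a sample index."""
    return int(round(seconds * sample_rate))
```

Under these assumptions a 15-second clip at 44100 Hz holds 661,500 samples, and a word spoken between `t0` and `t1` seconds corresponds to `samples[time_to_index(t0):time_to_index(t1)]`.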

I am aware that this data is in the time domain, and I do not see any obvious pattern in it, such as a run of 0s, that would mark silence. Can I do this in the time domain, or do I have to take the data to the frequency domain, filter out the unnecessary parts, and then apply an inverse FFT to get a collection of data that makes sense? Thanks in advance.
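One time-domain approach that may be worth trying is short-time energy thresholding. Real recordings rarely contain exact zeros during pauses because of background noise, but the RMS level of short frames is usually much lower in pauses than during speech. The sketch below is an illustration under stated assumptions, not a definitive method: the function names, the 20 ms frame length, the threshold of 10% of the loudest frame, and the 10-frame minimum gap are all arbitrary choices that would need tuning for a real recording.

```python
import math

def frame_rms(samples, frame_len):
    """RMS amplitude of each consecutive, non-overlapping frame."""
    rms = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms.append(math.sqrt(sum(s * s for s in frame) / frame_len))
    return rms

def find_active_regions(samples, sample_rate=44100, frame_ms=20,
                        threshold_ratio=0.1, min_gap_frames=10):
    """Return (start, end) sample indices of regions louder than the threshold.

    All parameter defaults are assumptions for illustration only.
    `end` is exclusive, so samples[start:end] is the detected region.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    rms = frame_rms(samples, frame_len)
    if not rms or max(rms) == 0:
        return []
    threshold = threshold_ratio * max(rms)

    regions = []
    start = None   # first frame of the region being tracked
    quiet = 0      # consecutive quiet frames seen inside that region
    for i, value in enumerate(rms):
        if value >= threshold:
            if start is None:
                start = i
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= min_gap_frames:
                end = i - quiet + 1  # frame just after the last loud frame
                regions.append((start * frame_len, end * frame_len))
                start, quiet = None, 0
    if start is not None:
        end = len(rms) - quiet
        regions.append((start * frame_len, end * frame_len))
    return regions
```

Each returned pair can be used to slice the sample list, and that slice can then be passed to the FFT. With this approach the FFT is only needed for analysing the extracted word, not for locating it.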