Audio fingerprinting and normalization

Posted on 2024-12-28 07:31:03

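I've written an application that does audio fingerprinting using the method described here. It basically converts an mp3 to a wav and then stores a bunch of hash codes in a database. I then make a recording with my iPhone, which picks up some noise, compare the hash codes, and get matches as documented in the link. Wow, it's cool!!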

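I'm now recording radio samples using a USB radio receiver. I get the sound data in a byte[] array, generate the hash codes in exactly the same way, and then try to match them. This time it doesn't work.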

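My feeling is that the mp3 has been normalized (had dynamic range compression applied to it) and that this might be the difference. I can't think of any other differences, since both the mp3 and the radio sample are converted to wav format (16-bit).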
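For reference, normalization and dynamic range compression are usually treated as different operations: peak normalization applies one constant gain to the whole clip, while compression changes the gain depending on the signal level. A minimal sketch of peak normalization, assuming the samples are already decoded to 16-bit values; `PeakNormalizer`, `Normalize`, and `targetPeak` are hypothetical names for illustration and are not part of the linked fingerprinting method:

```csharp
using System;

public static class PeakNormalizer
{
    // Scales every sample by the same gain so that the loudest sample
    // reaches targetPeak (hypothetical parameter; full scale by default).
    public static short[] Normalize(short[] samples, int targetPeak = short.MaxValue)
    {
        int peak = 0;
        foreach (short s in samples)
        {
            // Widen to int first: the magnitude of short.MinValue does not fit in a short.
            peak = Math.Max(peak, Math.Abs((int)s));
        }

        var result = new short[samples.Length];
        if (peak == 0)
        {
            return result; // pure silence: nothing to scale
        }

        double gain = (double)targetPeak / peak;
        for (int i = 0; i < samples.Length; i++)
        {
            double scaled = Math.Round(samples[i] * gain);
            // Clamp in case the gain pushes a value past the 16-bit range.
            result[i] = (short)Math.Max(short.MinValue, Math.Min(short.MaxValue, scaled));
        }
        return result;
    }
}
```

Because the gain is constant, the relative levels inside the clip are unchanged; only the overall loudness moves.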

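I guess my question is twofold:

If I compress the radio sample, do you think it will work?
To do this I need to apply a compression function, which means making the soft sounds louder and the loud sounds softer.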
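To make "soft sounds louder and loud sounds softer" concrete, here is a minimal sketch of a static hard-knee compressor with makeup gain applied to one sample at a time. The `threshold`, `ratio`, and `makeupGain` defaults are illustrative assumptions, `SimpleCompressor` is a hypothetical name, and a real compressor would normally follow a smoothed level (attack/release) instead of treating each sample in isolation:

```csharp
using System;

public static class SimpleCompressor
{
    // Compresses one 16-bit sample. Magnitudes above `threshold` (as a fraction
    // of full scale) grow `ratio` times more slowly; `makeupGain` then lifts the
    // whole signal, which is what makes the quiet parts louder.
    // All default values are illustrative assumptions.
    public static short CompressSample(short sample, double threshold = 0.25,
                                       double ratio = 4.0, double makeupGain = 2.0)
    {
        double x = sample / 32768.0;      // map to [-1, 1)
        double magnitude = Math.Abs(x);

        if (magnitude > threshold)
        {
            // Above the threshold only 1/ratio of the excess is kept.
            magnitude = threshold + (magnitude - threshold) / ratio;
        }

        double y = Math.Sign(x) * magnitude * makeupGain;

        // Convert back to 16 bits, clamping to the valid range.
        double scaled = Math.Round(y * 32768.0);
        return (short)Math.Max(short.MinValue, Math.Min(short.MaxValue, scaled));
    }
}
```

With these defaults a quiet sample of 4096 is doubled to 8192, while a louder sample of 16384 only rises to 20480, so the gap between quiet and loud passages shrinks.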

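I've started writing a function that takes a byte array (the wav data in 16-bit format) and loops through it, adjusting the sample values to do the compression, but I'm struggling with this: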

using System;
using System.Collections.Generic;

// byteArray is assumed to be a byte[] of raw 16-bit PCM sample data
List<short> ints = new List<short>();
for (int j = 0; j + 1 < byteArray.Length; j += 2)
{
    // for 16-bit audio, every 2 bytes in the array is one sample;
    // ToInt16 reads both bytes directly starting at offset j
    short sample16 = BitConverter.ToInt16(byteArray, j);

    // at this point change the sample according to the compression needed
    ints.Add(sample16);

    // back again to test it
    byte[] buffer11 = BitConverter.GetBytes(sample16);
}
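One way to finish the loop above is to write each processed sample back into a new byte array of the same length. A minimal sketch, assuming the input is raw 16-bit little-endian PCM with any WAV header already stripped; `PcmProcessor` and `Process` are hypothetical names, and the per-sample function is passed in so that any gain or compression curve can be plugged in:

```csharp
using System;

public static class PcmProcessor
{
    // Applies `transform` to every 16-bit sample in `pcm` and returns the
    // re-encoded bytes. Assumes little-endian sample data without a header.
    public static byte[] Process(byte[] pcm, Func<short, short> transform)
    {
        var output = new byte[pcm.Length];
        for (int j = 0; j + 1 < pcm.Length; j += 2)
        {
            // Assemble the sample explicitly as little-endian so the result
            // does not depend on the byte order of the host machine.
            short sample = unchecked((short)(pcm[j] | (pcm[j + 1] << 8)));
            short processed = transform(sample);

            output[j] = (byte)(processed & 0xFF);            // low byte
            output[j + 1] = (byte)((processed >> 8) & 0xFF); // high byte
        }

        // A trailing odd byte is not a complete sample; copy it through unchanged.
        if (pcm.Length % 2 == 1)
        {
            output[pcm.Length - 1] = pcm[pcm.Length - 1];
        }
        return output;
    }
}
```

For example, `PcmProcessor.Process(bytes, s => (short)(s / 2))` halves the amplitude of every sample, which is an easy way to check the round trip by ear before plugging in a real compression curve.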

