确定声学相似性的方法(但不是指纹识别)
我正在寻找在实践中有效的方法来确定不同歌曲之间的某种声学相似性。
到目前为止,我见过的大多数方法(MFCC 等)似乎实际上只是为了查找相同歌曲(即指纹识别,用于音乐识别而不是推荐)。虽然大多数推荐系统似乎都适用于网络数据(共同收听的歌曲)和标签。
大多数 Mpeg-7 音频描述符似乎也遵循这一思路。另外,它们中的大多数都是在“提取这个和那个”级别上定义的,但似乎没有人真正使用这些功能并使用它们来计算一些歌曲相似度。然而,即使是对相似项目的有效搜索...
http://gjay.sourceforge.net/ 等工具和 http://imms.luminal.org/ 似乎使用了一些简单的频谱分析、文件系统位置、标签,加上用户输入,例如“颜色”和手动分配的评级用户或歌曲被收听和跳过的频率。
那么:对于常见的音乐收藏,哪些音频特征的计算速度相当快,并且可用于生成有趣的播放列表并查找相似的歌曲?理想情况下,我想输入现有的播放列表,并找出一些与该播放列表匹配的歌曲。
所以我真的对声学相似性感兴趣,而不是识别/指纹。实际上,我只想从结果中删除相同的歌曲,因为我不想重复它们。 而且我也不寻找通过嗡嗡声进行查询。我什至没有连接麦克风。
哦,还有我不是在寻找在线服务。首先,我不想将所有数据发送给Apple等,其次我只想从我拥有的歌曲中获得推荐(我现在不想购买额外的音乐,而我还没有探索过)我所有的音乐都还没有转换成mp3……)其次我的音乐品味不是主流;我不希望系统一直推荐玛丽亚·凯莉。
另外,当然,我真的很感兴趣哪些技术有效,哪些技术无效……感谢您对相关文献和方法的任何建议。
I'm looking for methods that work in practise for determining some kind of acoustical similarity between different songs.
Most of the methods I've seen so far (MFCC etc.) seem actually to aim at finding identical songs only (i.e. fingerprinting, for music recognition not recommendation). While most recommendation systems seem to work on network data (co-listened songs) and tags.
Most Mpeg-7 audio descriptors also seem to be along this line. Plus, most of them are defined on the level of "extract this and that" level, but nobody seems to actually make any use of these features and use them for computing some song similarity. Yet even an efficient search of similar items...
Tools such as http://gjay.sourceforge.net/ and http://imms.luminal.org/ seem to use some simple spectral analysis, file system location, tags, plus user input such as the "color" and rating manually assigned by the user or how often the song was listened and skipped.
So: which audio features are reasonably fast to compute for a common music collection, and can be used to generate interesting playlists and find similar songs? Ideally, I'd like to feed in an existing playlist, and get out a number of songs that would match this playlist.
So I'm really interested in accoustic similarity, not so much identification / fingerprinting. Actually, I'd just want to remove identical songs from the result, because I don't want them twice.
And I'm also not looking for query by humming. I don't even have a microphone attached.
Oh, and I'm not looking for an online service. First of all, I don't want to send all my data to Apple etc., secondly I want to get only recommendations from the songs I own (I don't want to buy additional music right now, while I havn't explored all of my music. I havn't even converted all my CDs into mp3 yet ...) and secondly my music taste is not mainstream; I don't want the system to recommend Maria Carey all the time.
Plus of course, I'm really interested in what techniques work well, and which don't... Thank you for any recommendations of relevant literature and methods.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
只有一个应用程序在这方面做得非常好。 MusicIP 混音器。
http://www.spicefly.com/article.php?page=musicip-软件
它已经大约十年没有更新了(即使如此,界面还是有点笨拙),它需要非常旧的 Java 版本,并且不能处理所有文件格式 - 但它过去和现在仍然是跨平台的并且免费。它可以完成您要求的所有操作:为您收藏中的每个 mp3/ogg/flac/m3u 生成声学指纹,将它们保存到歌曲的标签中,并给定一首或多首歌曲,生成与这些歌曲类似的播放列表。它只使用歌曲的声学效果,因此它就像添加一首只有您自己的硬盘上拥有的未发行曲目一样,就像添加一首著名歌曲一样。
我喜欢它,但每次我更新操作系统/购买新计算机时,都需要很长时间才能使其再次运行。
Only one application has ever done this really well. MusicIP mixer.
http://www.spicefly.com/article.php?page=musicip-software
It hasn't been updated for about ten years (and even then the interface was a bit clunky), it requires a very old version of Java, and doesn't work with all file formats - but it was and still is cross-platform and free. It does everything you're asking : generates acoustic fingerprints for every mp3/ogg/flac/m3u in your collection, saves them to a tag on the song, and given one or more songs, generates a playlist similar to those songs. It only uses the acoustics of the songs, so it's just as likely to add an unreleased track which only you have on your own hard drive as a famous song.
I love it, but every time I update my operating system / buy a new computer it takes forever to get it working again.