首页 | 本学科首页   官方微博 | 高级检索  
     

压缩域鲁棒音乐指纹算法研究
引用本文:刘亚多,李伟,李晓强,汪竹蓉,冯瑞.压缩域鲁棒音乐指纹算法研究[J].电子学报,2010,38(5):1172-1176.
作者姓名:刘亚多  李伟  李晓强  汪竹蓉  冯瑞
作者单位:1.复旦大学计算机科学技术学院,上海 200433; 2.上海大学计算机工程与科学学院,上海 200072
基金项目:国家自然科学基金,上海市科技攻关计划,上海市科委重点科技攻关项目 
摘    要:对互联网海量MP3格式音乐数据进行基于内容的有效检索是当前一个重要而又很少涉及的研究方向.本文提出一种基于MDCT频谱熵的压缩域音频指纹算法,对各种常规频域和时间域的音频信号处理失真具有较强的鲁棒性.模拟实验在包含100首不同中文流行歌曲的音乐数据库上进行.对经受各种严重信号处理失真的粒度为5s左右的查询片段,能够取得超过90%的首位正确识别率.

关 键 词:音频指纹  压缩域  鲁棒性  MDCT频谱熵  音乐检索  
收稿时间:2009-6-10
修稿时间:2009-12-25

A Robust Compressed-Domain Music Fingerprinting Technique Based on MDCT Spectral Entropy
LIU Ya-duo,LI Wei,LI Xiao-qiang,WANG Zhu-rong,FENG Rui.A Robust Compressed-Domain Music Fingerprinting Technique Based on MDCT Spectral Entropy[J].Acta Electronica Sinica,2010,38(5):1172-1176.
Authors:LIU Ya-duo  LI Wei  LI Xiao-qiang  WANG Zhu-rong  FENG Rui
Affiliation:1.School of Computer Science and Technology,Fudan University,Shanghai 200433,China;2.School of Computer Engineering and Science,Shanghai University,Shanghai 200072,China
Abstract:With the proliferation of MP3 music,compressed-domain music information retrieval from the Internet has come into being an important and urgent research field.In this paper,we propose a novel compressed-domain audio fingerprinting algorithm based on MDCT spectral entropy.The input MP3 music file is first partially decompressed to obtain MDCT coefficients as intermediate results,whereby we calculate the MDCT spectral entropy through consecutive long windows and come to the final fingerprint sequence by magnitude relationship modeling.Such fingerprint exhibits strong robustness against various frequency-and time-domain audio distortions due to its statistically stable nature.Experimental results show that in our test database which is composed of 100 distinct Chinese pop songs,a 5s music clip is sufficient to identify its original recording in real time,with more than 90% top one precision rate even under various severe audio signal distortions.
Keywords:audio fingerprinting  compressed domain  MDCT spectral entropy  robustness  audio identification
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号