首页 | 本学科首页   官方微博 | 高级检索  
     

基于音乐认知原理的音乐旋律发现技术
引用本文:李海峰,孙佳音,张田,马琳. 基于音乐认知原理的音乐旋律发现技术[J]. 信号处理, 2010, 26(10): 1456-1465
作者姓名:李海峰  孙佳音  张田  马琳
作者单位:哈尔滨工业大学 计算机科学与技术学院
基金项目:国家自然科学基金项目No. 60772076、语言语音教育部-微软重点实验室开放基金资助项目,黑龙江省留学归国基金LC03C10的支持 
摘    要:旋律是音乐主题思想的最重要表现手段,分析音乐的旋律、研究智能旋律分析处理方法是音乐信息检索领域的重要课题之一。本文根据脑神经科学及认知心理学关于人类对音乐感知特点的研究成果,引入听觉显著度(AS)的概念,提出了基于音乐认知理论的模拟人类认知过程的旋律发现技术。在前期处理阶段,针对音乐特性采用常数Q变换(CQT)建立音乐的频谱统计模型,采用贝叶斯理论计算每个半音子带数据分布的听觉显著度特征,利用时序神经网络检测各个时刻的听觉变化,得到旋律分量的候选。在后期处理阶段,我们提出了表达形式接近乐理与认知的旋律流(Melody Stream)的概念,以人对音乐和弦感知结果作为先验知识,进行旋律候选分量的规范化处理。在包含各种乐曲风格的实验音乐数据库上,验证了所提取结果同人类听感的接近程度,根据旋律流来捕捉传统旋律线获得了75%的准确率,主观听感打分对旋律流的接受度超过90%。 

关 键 词:音乐认知   旋律发现   听觉显著度   常数Q变换   旋律流
收稿时间:2009-10-20

A Music Cognition Based Music Melody Detection Approach
LI Hai-feng,SUN Jia-yin,ZHANG Tian,MA Lin. A Music Cognition Based Music Melody Detection Approach[J]. Signal Processing(China), 2010, 26(10): 1456-1465
Authors:LI Hai-feng  SUN Jia-yin  ZHANG Tian  MA Lin
Affiliation:School of Computer Science and Technology, Harbin Institute of Technology
Abstract:As the most important expression of music’s motivation and subject, melody is specially studied in field of Music Information Retrieval (MIR), and researchers have made great efforts to find intelligent information processing and analysis methods for melody estimation and analysis. Based on the achievements in music cognition domain from both neuroscience and cognitive psychology, this paper applies the concept of auditory saliency (AS) and proposes a novel approach for melody detection in polyphonic music through the simulation of human’s musical cognition mechanism and characteristics. Firstly in the preprocessing stage, the constant Q transform (CQT) is applied for spectrum calculation, and spectrum model estimation. The AS feature for each semitone is calculated using Bayesian theory according to the semitone’s spectrum distribution over every frequency band. A special time accumulation artificial neural network is used to simulate the human neural system in order to detect salient features as melody candidate contents. In the post processing stage, a novel musicology and cognition related concept of Melody Stream is introduced to regulate melody candidates according to chord perception results. The results of the proposed melody detection methods and its similarity to human perception are evaluated on a small dataset with hundreds of music pieces that cover a number of typical music styles. Experiment results showed that the performance of the proposed strategy may cover more than 75% of the traditional melody line, and the subjective acceptance is measured to more than 90%. 
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《信号处理》浏览原始摘要信息
点击此处可从《信号处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号