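An endpoint detection algorithm based on frequency-domain characteristics and transition fragment judgment
Cite this article:GUO Yu, ZHANG Erhua, LIU Chi. An endpoint detection algorithm based on frequency-domain characteristics and transition fragment judgment[J]. Journal of Shandong University (Engineering Science), 2016, 46(2): 57-63. DOI: 10.6040/j.issn.1672-3961.2.2015.147
Authors:GUO Yu  ZHANG Erhua  LIU Chi
Affiliation:School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, Jiangsu, China
Abstract:To improve the accuracy of speech endpoint detection and make endpoint detection algorithms more robust in noisy environments, two new endpoint detection parameters are proposed. The critical-band spectral entropy parameter jointly accounts for how the human ear perceives speech and for the differences between the frequency-domain distributions of speech and noise signals; the differenced frequency-domain energy parameter accounts for the difference in frequency-domain energy between speech frames and silent frames. The strengths of the two parameters are combined into a single robust endpoint detection parameter. In addition, to avoid misjudgments caused by relying on a single threshold decision, a transition-segment decision based on feature distribution statistics is added to the detection process. Experimental results show that the proposed algorithm discriminates well between speech frames and silent frames, and that under different noise types at low signal-to-noise ratios its endpoint detection accuracy is higher than that of traditional noise-robust endpoint detection algorithms, with an improvement of more than 5% under non-stationary noise in particular.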

Keywords:transition-segment decision  energy entropy  frequency-domain energy  spectral entropy  endpoint detection  critical band  
Received:2015-05-12

An endpoint detection algorithm based on frequency-domain characteristics and transition fragment judgment
GUO Yu, ZHANG Erhua, LIU Chi. An endpoint detection algorithm based on frequency-domain characteristics and transition fragment judgment[J]. Journal of Shandong University (Engineering Science), 2016, 46(2): 57-63. DOI: 10.6040/j.issn.1672-3961.2.2015.147
Authors:GUO Yu  ZHANG Erhua  LIU Chi
Affiliation:School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, Jiangsu, China
Abstract:To improve the accuracy of speech endpoint detection and the robustness of the detection algorithm in noisy environments, two new endpoint detection parameters were proposed. The spectral entropy based on critical bands took into account both the perceptual characteristics of the human auditory system and the differences between speech and noise signals in their frequency-domain distributions, while the differenced frequency-domain energy parameter captured the difference in frequency-domain energy between speech frames and silence frames. The advantages of the two parameters were combined to form a robust endpoint detection parameter. In addition, to avoid misclassification caused by relying on a single threshold decision, a transition-segment decision based on the statistics of the feature distribution was applied. The experimental results showed that the proposed algorithm discriminated well between speech frames and silence frames and achieved higher accuracy than conventional noise-robust endpoint detection algorithms under different types of noise at low signal-to-noise ratios. In the case of non-stationary noise in particular, the accuracy improved by more than 5%.
Keywords:transition fragment judgment  energy entropy  frequency-domain energy  critical band  endpoint detection  spectrum entropy  
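
The abstract names the two parameters but does not give their formulas, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes approximate Bark-scale band edges, an 8 kHz sample rate, a naive DFT, and a simple clipped subtraction of a noise estimate for the energy term; all function names and constants are hypothetical. How the paper combines the two parameters and performs the transition-segment decision is not stated in the abstract and is not attempted here.

```python
import cmath
import math

# Approximate Bark critical-band edges in Hz. This table is an assumption;
# the abstract does not say which band edges the paper uses.
BARK_EDGES_HZ = [0, 100, 200, 300, 400, 510, 630, 770, 920, 1080, 1270,
                 1480, 1720, 2000, 2320, 2700, 3150, 3700, 4400]


def power_spectrum(frame):
    """Power spectrum (bins 0..N/2) of a real frame via a naive DFT."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def band_energies(power, sample_rate, n_fft):
    """Sum power-spectrum bins into the critical bands listed above."""
    energies = [0.0] * (len(BARK_EDGES_HZ) - 1)
    for k, p in enumerate(power):
        freq = k * sample_rate / n_fft
        for b in range(len(energies)):
            if BARK_EDGES_HZ[b] <= freq < BARK_EDGES_HZ[b + 1]:
                energies[b] += p
                break
    return energies


def band_spectral_entropy(energies, eps=1e-12):
    """Shannon entropy of the normalized band-energy distribution."""
    total = sum(energies) + eps
    probs = [e / total for e in energies]
    return -sum(p * math.log(p + eps) for p in probs)


def energy_difference(energies, noise_energies):
    """Frame energy minus a noise-energy estimate, clipped at zero.
    Hypothetical stand-in for the paper's energy-difference parameter."""
    return max(0.0, sum(energies) - sum(noise_energies))


# Usage: a 1 kHz tone concentrates its energy in one band, while a unit
# impulse has a flat spectrum spread over every band.
SAMPLE_RATE, N = 8000, 256
tone = [math.sin(2 * math.pi * 1000 * t / SAMPLE_RATE) for t in range(N)]
impulse = [1.0] + [0.0] * (N - 1)

tone_bands = band_energies(power_spectrum(tone), SAMPLE_RATE, N)
impulse_bands = band_energies(power_spectrum(impulse), SAMPLE_RATE, N)
tone_entropy = band_spectral_entropy(tone_bands)
impulse_entropy = band_spectral_entropy(impulse_bands)
```

In this toy case the tone frame yields a band entropy close to zero and the flat-spectrum frame a clearly larger one, which illustrates why an entropy measure over band energies can separate spectrally concentrated frames from spectrally flat ones. Real speech and real noise are less clear-cut than these two synthetic frames.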
This article is indexed in CNKI and other databases.