首页 | 本学科首页   官方微博 | 高级检索  
     

利用波束形成和神经网络进行语音增强
引用本文:龚杰,冯海泓,陈友元,方义,牟宏宇.利用波束形成和神经网络进行语音增强[J].声学技术,2020,39(3):323-328.
作者姓名:龚杰  冯海泓  陈友元  方义  牟宏宇
作者单位:中国科学院声学研究所东海研究站, 上海 201815;中国科学院大学, 北京 100049;中国科学院声学研究所东海研究站, 上海 201815;中国科学院声学研究所, 北京 100190;中国科学院大学, 北京 100049
基金项目:国家自然科学基金面上项目(11474309)、上海市生物医药领域科技支撑项目(19441900500)、中国科学院声学研究所青年英才计划项目(QNYC201710)。
摘    要:语音增强在语音信号处理的前端非常重要,直接影响后端语音识别等效果。目前用神经网络进行单通道语音分离对于解决鸡尾酒会问题取得了很大的进步,但是用于复杂混合语音时分离效果仍不令人满意。针对单通道情形下的不足,使用多通道结构形成4个方向的超指向波束,结合神经网络算法实现对于指定方向的目标语音增强。仿真和实验结果表明,该算法相较于超指向波束形成算法和谱减法在多种评价指标上均有了明显的提升。

关 键 词:语音增强  超指向波束形成  神经网络
收稿时间:2019/3/18 0:00:00
修稿时间:2019/5/4 0:00:00

Speech enhancement by beamforming and neural network
GONG Jie,FENG Haihong,CHEN Youyuan,FANG Yi,MOU Hongyu.Speech enhancement by beamforming and neural network[J].Technical Acoustics,2020,39(3):323-328.
Authors:GONG Jie  FENG Haihong  CHEN Youyuan  FANG Yi  MOU Hongyu
Affiliation:Shanghai Acoustics Laboratory, Institute of Acoustics, Chinese Academy of Sciences, Shanghai 201815, China;University of Chinese Academy of Sciences, Beijing 100049, China;Shanghai Acoustics Laboratory, Institute of Acoustics, Chinese Academy of Sciences, Shanghai 201815, China;Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China;University of Chinese Academy of Sciences, Beijing 100049, China
Abstract:Speech enhancement is a very important front-end work in speech signal processing and it directly affects the back-end''s speech recognition effects. At present, the single-channel speech separation using neural networks has made great progress in solving cocktail party problem, but its separation effect on complex mixed speeches is not satisfactory. Aiming at the shortcoming of single channel, the multi-channel structure is adopted to form 4 super-directivity beams. The speech enhancement in a given direction is realized by using the multi-channel structure combined with neural network algorithm. The simulation and experimental results show that the proposed method has the obvious improvement in a variety of evaluation index compared with the super-directivity beamforming algorithm and spectral subtraction algorithm.
Keywords:speech enhancement  super-directivity beamforming  neural networks
本文献已被 CNKI 维普 等数据库收录!
点击此处可从《声学技术》浏览原始摘要信息
点击此处可从《声学技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号