首页 | 本学科首页   官方微博 | 高级检索  
     

基于深度神经网络的语种识别*
引用本文:崔瑞莲,宋彦,蒋兵,戴礼荣.基于深度神经网络的语种识别*[J].模式识别与人工智能,2015,28(12):1093-1099.
作者姓名:崔瑞莲  宋彦  蒋兵  戴礼荣
作者单位:中国科学技术大学 语音及语言信息处理国家工程实验室 合肥230027
基金项目:国家自然科学基金项目(No.61172158)资助
摘    要:语音段的有效表示方法存在易混淆语种和短时语音段识别率较低等问题,为满足不同时长和方言的识别要求,提出基于深度神经网络不同层的有效语音段表示方法.采用含有中间瓶颈层的深层神经网络作为前端特征提取,综合利用该网络的输出层和中间瓶颈层输出结果,得到不同形式的语音段表示并用于语种识别.在美国国家标准技术局语种识别评测2009年和2011年阿拉伯方言数据集上验证了方法的有效性.

关 键 词:语种识别  深度神经网络  语音段表示  深度瓶颈特征  
收稿时间:2014-11-17

Language Identification Based on Deep Neural Network
CUI Rui-Lian,SONG Yan,JIANG Bing,DAI Li-Rong.Language Identification Based on Deep Neural Network[J].Pattern Recognition and Artificial Intelligence,2015,28(12):1093-1099.
Authors:CUI Rui-Lian  SONG Yan  JIANG Bing  DAI Li-Rong
Affiliation:National Engineering Laboratory for Speech and Language Information Processing,
University of Science and Technology of China, Hefei 230027
Abstract:Aiming at the problems of confusable dialects and short-duration utterance in automatic spoken language identification (LID), an improved utterance representation method is proposed based on different layers of deep neural network (DNN). Deep bottleneck network (DBN), a DNN with an internal bottleneck layer, is employed as a front-end feature extractor. Different representations based on output layer and middle bottleneck layer of DBN for LID are obtained and fused. Evaluations on the NIST LRE2009 dataset and NIST LRE2011 Arabic dialect dataset demonstrate that the proposed method based on DBN achieves good performance.
Keywords:Language Identification  Deep Neural Network  Utterance Representation  Deep Bottleneck Feature  
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号