基于DTW-GMM的光纤传感系统声纹识别方法 Voiceprint recognition method of optical fiber sensingsystem based on DTW-GMM期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于DTW-GMM的光纤传感系统声纹识别方法

引用本文：	杨佳沛,王宇,彭广建,白清,刘昕,靳宝全. 基于DTW-GMM的光纤传感系统声纹识别方法[J]. 电子测量与仪器学报, 2024, 38(4): 176-186

作者姓名：	杨佳沛王宇彭广建白清刘昕靳宝全

作者单位：	太原理工大学电子信息与光学工程学院太原030024;1.太原理工大学电子信息与光学工程学院太原030024；2.太原理工大学新型传感器与智能控制教育部重点实验室太原030024；3.山西省交通科技研发有限公司太原030024

基金项目：	山西省重点研发计划项目（202102130501021）、山西省水利科学技术研究与推广项目（2024GM18）、中央引导地方科技发展资金项目（YDZJSX20231B004）、山西省科技创新团队项目（201805D131003）资助

摘要：	为了满足易燃易爆环境的声纹识别需求，设计了直线型萨格奈克干涉光纤声音传感系统，利用维纳滤波算法对语音数据进行了降噪，通过三电平削波法获取了基音周期特征，采用动态时间规整算法筛选了说话人样本，并提取了梅尔频率倒谱系数特征，运用高斯混合模型期望最大化算法开展了声纹识别实验研究，同时探究了光纤声音传感系统的频率响应特性与声纹特征，研究了采集语音幅值对声纹识别结果的影响。实验结果表明，系统可实现300~3 500 Hz频率段的声音信号感知，声音幅值从0.9 V降至0.15 V时最大与次大对数似然值之差由35.5降至10.9，识别结果从成功变为失败。重复性实验表明，在10 km的传感光纤上，距声源2 m位置处，传感系统可对400段时长为3~5 s之间的文本无关语音段实现准确检测，且综合识别准确率为94.75％。本系统有望为易燃易爆环境中的设备故障、应急救援、渗漏监测等领域提供声纹识别的解决方案。
关键词：	光纤传感；萨格奈克干涉；声纹识别；高斯混合模型
Voiceprint recognition method of optical fiber sensingsystem based on DTW-GMM

Yang Jiapei,Wang Yu,Peng Guangjian,Bai Qing,Liu Xin,Jin Baoquan. Voiceprint recognition method of optical fiber sensingsystem based on DTW-GMM[J]. Journal of Electronic Measurement and Instrument, 2024, 38(4): 176-186

Authors:	Yang Jiapei Wang Yu Peng Guangjian Bai Qing Liu Xin Jin Baoquan

Affiliation:	College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China;1.College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China;2.Key Laboratory of Advanced Transducers and Intelligent Control System, Ministry of Education, Taiyuan University ofTechnology, Taiyuan 030024, China; 3.Shanxi Transportation Technology Research & Development Company Limited, Taiyuan 030024, China

Abstract:	In order to meet the demand of voiceprint recognition in flammable and explosive environment. A linear Sagnac interference optical fiber acoustic sensor system has been designed. Speech data was denoised using the Wiener filtering algorithm, and pitch features were extracted through three-level clipping. Speaker samples were screened using dynamic time warping, and Mel-frequency cepstral coefficients were extracted as features. Voiceprint recognition experiments were conducted utilizing the Gaussian mixture model-expectation maximization algorithm, concurrently investigating the frequency response characteristics of the optical fiber acoustic sensor system and their relationship with voiceprint features. The influence of the amplitude of acquired speech on voiceprint recognition outcomes was studied. Experimental results demonstrate that the system can realize the sound signal perception in the frequency range of 300~3 500 Hz. When the sound amplitude decreases from 0.9 to 0.15 V, the difference between the maximum and second-largest log-likelihood values drops from 35.5 to 10.9, the recognition result changed from success to failure. Repetition experiments show that, at a distance of 2 meters from the sound source along a 10-kilometer sensing fiber, the system accurately recognizes 400 speech segments of 3 to 5 seconds duration, unrelated to any specific text, achieving an overall identification accuracy rate of 94.75%. This system holds promise as a solution for voiceprint recognition in applications such as equipment fault diagnosis and emergency response within flammable and explosive environments.

Keywords:	optical fiber sensing Sagnac interference voiceprint recognition Gaussian mixture model

	点击此处可从《电子测量与仪器学报》浏览原始摘要信息
	点击此处可从《电子测量与仪器学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏