首页 | 本学科首页   官方微博 | 高级检索  
     


Approaches to Speaker Detection and Tracking in Conversational Speech
Affiliation:1. Department of Electronics and Communication Engineering, National Institute of Technology, Patna, India;2. Department of Computer Science, University of Crete, Greece;3. Department of Electronics and Communication Engineering, National Institute of Technology, Sikkim, India;4. Department of Electronics and Electrical Engineering, Indian Institute of Technology, Guwahati, India;1. Dept. of Signal Theory, Telematics and Communications, University of Granada, Granada, Spain;2. Dept. of Computer Science, University of Sheffield, Sheffield, UK;1. Digital Media Engineering and Technology Department, German University in Cairo, Egypt;2. Computer and Systems Engineering Department, Ain Shams University, Abbassia, Cairo 11517, Egypt
Abstract:Dunn, Robert B., Reynolds, Douglas A., and Quatieri, Thomas F., Approaches to Speaker Detection and Tracking in Conversational Speech, Digital Signal Processing10(2000), 93–112.Two approaches to detecting and tracking speakers in multispeaker audio are described. Both approaches use an adapted Gaussian mixture model, universal background model (GMM-UBM) speaker detection system as the core speaker recognition engine. In one approach, the individual log-likelihood ratio scores, which are produced on a frame-by-frame basis by the GMM-UBM system, are used to first partition the speech file into speaker homogenous regions and then to create scores for these regions. We refer to this approach as internal segmentation. Another approach uses an external segmentationalgorithm, based on blind clustering, to partition the speech file into speaker homogenous regions. The adapted GMM-UBM system then scores each of these regions as in the single-speaker recognition case. We show that the external segmentation system outperforms the internal segmentation system for both detection and tracking. In addition, we show how different components of the detection and tracking algorithms contribute to the overall system performance.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号