Development of speech corpora for speaker recognition research and evaluation in Indian languages |
| |
Authors: | Hemant A. Patil T. K. Basu |
| |
Affiliation: | (1) Dhirubhai Ambani Institute of Information and Communication Technology (DA-IICT), Gandhinagar, Gujarat, India;(2) Department of Electrical Engineering, Indian Institute of Technology, Kharagpur, 721302, India |
| |
Abstract: | Automatic Speaker Recognition (ASR) refers to the task of identifying a person based on his or her voice with the help of machines. ASR finds its potential applications in telephone based financial transactions, purchase of credit card and in forensic science and social anthropology for the study of different cultures and languages. Results of ASR are highly dependent on database, i.e., the results obtained in ASR are meaningless if recording conditions are not known. In this paper, a methodology and a typical experimental setup used for development of corpora for various tasks in the text-independent speaker identification in different Indian languages, viz., Marathi, Hindi, Urdu and Oriya have been described. Finally, an ASR system is presented to evaluate the corpora. |
| |
Keywords: | Speaker recognition Dialectal zones in Maharashtra and Orissa Data collection Corpus design LP cepstrum Mel cepstrum Polynomial classifier |
本文献已被 SpringerLink 等数据库收录! |
|