Boosted learning in dynamic Bayesian networks for multimodal speaker detection

Authors:	Garg A., Pavlovic V., Rehg J.M.

Affiliation:	IBM Almaden Res. Center, San Jose, CA, USA

Abstract:	Bayesian network models provide an attractive framework for multimodal sensor fusion. They combine an intuitive graphical representation with efficient algorithms for inference and learning. However, the unsupervised nature of standard parameter learning algorithms for Bayesian networks can lead to poor performance in classification tasks. We have developed a supervised learning framework for Bayesian networks, which is based on the AdaBoost algorithm of Schapire and Freund. Our framework covers static and dynamic Bayesian networks with both discrete and continuous states. We have tested our framework in the context of a novel multimodal HCI application: a speech-based command and control interface for a Smart Kiosk. We provide experimental evidence for the utility of our boosted learning approach.
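
The idea of boosting the parameter learning of a Bayesian network can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the simplest static case, a naive Bayes classifier over binary features with labels in {-1, +1}, and applies the standard discrete AdaBoost reweighting. The function names (fit_weighted_nb, predict_nb, adaboost, predict) and the smoothing constants are hypothetical choices made for illustration only.

```python
import math

def fit_weighted_nb(X, y, w):
    """Weighted maximum-likelihood fit of a naive Bayes model (a simple
    static Bayesian network) over binary features, lightly smoothed."""
    n_feat = len(X[0])
    total = sum(w)
    model = {}
    for c in (-1, 1):
        wc = sum(wi for wi, yi in zip(w, y) if yi == c)
        prior = (wc + 1e-9) / (total + 2e-9)
        probs = []  # probs[j] approximates P(x_j = 1 | class c)
        for j in range(n_feat):
            on = sum(wi for wi, xi, yi in zip(w, X, y)
                     if yi == c and xi[j] == 1)
            probs.append((on + 1e-3) / (wc + 2e-3))
        model[c] = (prior, probs)
    return model

def predict_nb(model, x):
    """Return the class (-1 or +1) with the higher log posterior."""
    scores = {}
    for c, (prior, probs) in model.items():
        s = math.log(prior)
        for xj, p in zip(x, probs):
            s += math.log(p if xj == 1 else 1.0 - p)
        scores[c] = s
    return 1 if scores[1] >= scores[-1] else -1

def adaboost(X, y, rounds=10):
    """Discrete AdaBoost with the weighted naive Bayes fit as base learner."""
    n = len(X)
    w = [1.0 / n] * n          # start from uniform example weights
    ensemble = []
    for _ in range(rounds):
        model = fit_weighted_nb(X, y, w)
        preds = [predict_nb(model, x) for x in X]
        err = sum(wi for wi, p, yi in zip(w, preds, y) if p != yi)
        if err >= 0.5:         # base learner no better than chance: stop
            break
        err = max(err, 1e-10)  # avoid division by zero for a perfect fit
        alpha = 0.5 * math.log((1.0 - err) / err)
        ensemble.append((alpha, model))
        if err <= 1e-10:       # perfect fit: reweighting would change nothing
            break
        # Raise the weight of misclassified examples, then renormalize.
        w = [wi * math.exp(-alpha * yi * p) for wi, yi, p in zip(w, y, preds)]
        z = sum(w)
        w = [wi / z for wi in w]
    return ensemble

def predict(ensemble, x):
    """Sign of the alpha-weighted vote of all boosted models."""
    total = sum(alpha * predict_nb(m, x) for alpha, m in ensemble)
    return 1 if total >= 0 else -1
```

Calling adaboost(X, y) on a list of binary feature vectors X and a list of labels y returns a list of (alpha, model) pairs, and predict(ensemble, x) classifies a new vector. The supervised element is the reweighting step: each round refits the network parameters with more emphasis on the examples the previous model misclassified. Extending this to dynamic networks or continuous states, as the abstract describes, would require a different base learner and is not attempted here.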
