首页 | 本学科首页   官方微博 | 高级检索  
     


Data-driven models for timing feedback responses in a Map Task dialogue system
Affiliation:1. IBM India Research Labs, Bangalore, India;2. Department of Electrical Engineering, Indian Institute of Technology (IIT), Madras, Chennai, Tamil Nadu 630036, India;3. Department of Signal Theory, Telematics and Communications, University of Granada, Spain;1. Department of Artificial Intelligence, School of Physics and Artificial Intelligence, University of Veracruz, Sebastián Camacho 5, Centro, Xalapa, Veracruz, 91000, Mexico;2. National Laboratory of Advanced Informatics (LANIA) A.C., Rébsamen 80, Centro, Xalapa, Veracruz, 91000, Mexico;1. Research Center for Information Technology Innovation, Academia Sinica, No. 128, Academia Road, Section 2, Nankang, Taipei 11529, Taiwan;2. National Institute of Information and Communications Technology (NICT), 3-5 Hikaridai, Keihanna Science City 6190289, Japan
Abstract:Traditional dialogue systems use a fixed silence threshold to detect the end of users’ turns. Such a simplistic model can result in system behaviour that is both interruptive and unresponsive, which in turn affects user experience. Various studies have observed that human interlocutors take cues from speaker behaviour, such as prosody, syntax, and gestures, to coordinate smooth exchange of speaking turns. However, little effort has been made towards implementing these models in dialogue systems and verifying how well they model the turn-taking behaviour in human–computer interactions. We present a data-driven approach to building models for online detection of suitable feedback response locations in the user's speech. We first collected human–computer interaction data using a spoken dialogue system that can perform the Map Task with users (albeit using a trick). On this data, we trained various models that use automatically extractable prosodic, contextual and lexico-syntactic features for detecting response locations. Next, we implemented a trained model in the same dialogue system and evaluated it in interactions with users. The subjective and objective measures from the user evaluation confirm that a model trained on speaker behavioural cues offers both smoother turn-transitions and more responsive system behaviour.
Keywords:Spoken dialogue systems  Timing feedback  Turn-taking  User evaluation
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号