REX-J: Japanese referring expression corpus of situated dialogs
Authors: Philipp Spanger, Masaaki Yasuhara, Ryu Iida, Takenobu Tokunaga, Asuka Terai, Naoko Kuriyama
Affiliation: 1. Department of Computer Science, Tokyo Institute of Technology, Tokyo, Japan
2. Global Edge Institute, Tokyo Institute of Technology, Tokyo, Japan
3. Department of Human System Science, Tokyo Institute of Technology, Tokyo, Japan
Abstract: Identifying objects in conversation is a fundamental human capability necessary to achieve efficient collaboration on any real-world task. Hence the deepening of our understanding of human referential behaviour is indispensable for the creation of systems that collaborate with humans in a meaningful way. We present the construction of REX-J, a multi-modal Japanese corpus of referring expressions in situated dialogs, based on the collaborative task of solving the Tangram puzzle. This corpus contains 24 dialogs with over 4 h of recordings and over 1,400 referring expressions. We outline the characteristics of the collected data and point out the important differences from previous corpora. The corpus records extra-linguistic information during the interaction (e.g. the position of pieces, the actions on the pieces) in synchronization with the participants’ utterances. This in turn allows us to discuss the importance of creating a unified model of linguistic and extra-linguistic information from a new perspective. Demonstrating the potential uses of this corpus, we present the analysis of a specific type of referring expression (“action-mentioning expression”) as well as the results of research into the generation of demonstrative pronouns. Furthermore, we discuss some perspectives on potential uses of this corpus as well as our planned future work, underlining how it is a valuable addition to the existing databases in the community for the study and modeling of referring expressions in situated dialog.
This article is indexed in SpringerLink and other databases.