Assessing the importance of audio/video synchronization for simultaneous translation of video sequences |
| |
Authors: | Nicolas Staelens, Jonas De Meulenaere, Lizzy Bleumers, Glenn Van Wallendael, Jan De Cock, Koen Geeraert, Nick Vercammen, Wendy Van den Broeck, Brecht Vermeulen, Rik Van de Walle, Piet Demeester |
| |
Affiliation: | 1. Department of Information Technology, Ghent University, IBBT, Ghent, Belgium; 2. Studies on Media, Information and Telecommunication, Free University of Brussels, IBBT, Brussels, Belgium; 3. Department of Electronics and Information Systems, Ghent University, IBBT, Ghent, Belgium; 4. Televic N.V., Izegem, Belgium |
| |
Abstract: | Lip synchronization is considered a key parameter during interactive communication. In the case of video conferencing and television broadcasting, the differential delay between audio and video should remain below certain thresholds, as recommended by several standardization bodies. However, further research has shown that these thresholds can be relaxed, depending on the targeted application and use case. In this article, we investigate the influence of lip sync on the ability to perform real-time language interpretation during video conferencing. We are also interested in determining proper lip sync visibility thresholds applicable to this use case. To this end, we conducted a subjective experiment with both expert interpreters, who were required to perform a simultaneous translation, and non-experts. Our results show that significant differences are obtained when conducting subjective experiments with expert interpreters. As interpreters are primarily focused on performing the simultaneous translation, their lip sync detectability thresholds are higher than the existing recommended thresholds. As such, the primary focus, the targeted application, and the use case are important factors to consider when selecting proper lip sync acceptability thresholds. |
| |
Keywords: | |
This article is indexed in SpringerLink and other databases. |
|