首页 | 本学科首页   官方微博 | 高级检索  
     


Opening the Knowledge Dam: Speech Recognition for Video Search
Authors:Vered Silber-Varod  Amir Winer  Nitza Geri
Affiliation:The Open University of Israel, Raanana, Israel
Abstract:Automatic Speech Recognition (ASR) may increase access to spoken information captured in videos. ASR is needed, especially for online academic video lectures that gradually replace class lectures and traditional textbooks. This conceptual article examines how technological barriers to ASR in under-resourced languages impair accessibility to video content and demonstrates it with the empirical findings of Hebrew ASR evaluations. We compare ASR with Optical Character Recognition (OCR) as facilitating access to textual and speech content and show their current performance in under-resourced languages. We target ASR of under-resourced languages as the main barrier to searching academic video lectures. We further show that information retrieval technologies, such as smart video players that combine both ASR and OCR capacities, must come to the fore once ASR technologies have matured. Therefore, suggesting that the current state of information retrieval from video lectures in under-resourced languages is equivalent to a knowledge dam.
Keywords:Automatic speech recognition (ASR)  under-resourced languages  academic video lectures  search  optical character recognition (OCR)
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号