A novel mutual nearest neighbor based symmetry for text frame classification in video
Authors: Palaiahnakote Shivakumara, Anjan Dutta
Affiliation: a School of Computing, National University of Singapore, Singapore
b Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, Spain
c Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, India
Abstract: In multimedia retrieval from video, text frame classification is essential for tasks such as text detection, event detection, and event boundary detection. We propose a new text frame classification method that combines wavelet and median moments with k-means clustering to select probable text blocks among 16 equally sized blocks of a video frame. The same feature combination is used with a new Max-Min clustering at the pixel level to choose probable dominant text pixels in the selected probable text blocks. For the probable text pixels, a so-called mutual nearest neighbor based symmetry is explored with a four-quadrant formation centered at the centroid of the probable dominant text pixels to determine whether a block is a true text block. If a frame produces at least one true text block, it is considered a text frame; otherwise, it is a non-text frame. Experimental results on different text and non-text datasets, including two public datasets and our own data, show that the proposed method gives promising results in terms of recall and precision at the block and frame levels. Further, we show how existing text detection methods tend to misclassify non-text frames as text frames, in terms of recall and precision at both the block and frame levels.
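The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It covers only three pieces: dividing a frame into 16 blocks, pairing mutual nearest neighbors between two point sets, and the frame-level decision rule. The 4 x 4 grid layout, the Euclidean distance, and all function names (`split_into_blocks`, `mutual_nearest_pairs`, `is_text_frame`) are assumptions made for illustration; the abstract does not specify them, and the wavelet-median moment features and clustering steps are omitted.

```python
from math import dist


def split_into_blocks(frame, rows=4, cols=4):
    """Split a 2-D list (H x W) into rows * cols equally sized blocks.

    Assumption: a 4 x 4 grid yields the 16 blocks mentioned in the abstract,
    and H and W are divisible by rows and cols respectively.
    Blocks are returned in row-major order.
    """
    h, w = len(frame), len(frame[0])
    bh, bw = h // rows, w // cols
    return [
        [row[c * bw:(c + 1) * bw] for row in frame[r * bh:(r + 1) * bh]]
        for r in range(rows)
        for c in range(cols)
    ]


def mutual_nearest_pairs(a, b):
    """Return index pairs (i, j) where a[i] and b[j] are each other's
    nearest neighbor under Euclidean distance (an assumed metric)."""
    if not a or not b:
        return []
    # For every point in a, the index of its nearest point in b, and vice versa.
    nearest_in_b = [min(range(len(b)), key=lambda j: dist(p, b[j])) for p in a]
    nearest_in_a = [min(range(len(a)), key=lambda i: dist(q, a[i])) for q in b]
    # Keep only the pairs that agree in both directions.
    return [(i, j) for i, j in enumerate(nearest_in_b) if nearest_in_a[j] == i]


def is_text_frame(block_is_text):
    """Frame-level rule from the abstract: a frame is a text frame
    if at least one of its blocks is judged a true text block."""
    return any(block_is_text)
```

In this sketch, `a` and `b` would stand for probable text pixel coordinates taken from two different quadrants around the centroid; how the paper turns such pairs into a symmetry measure is not described in the abstract and is left out here.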
Keywords: Wavelet-median moments; Video image; Mutual nearest neighbor; Frame classification; Text block location
This article is indexed in ScienceDirect and other databases.