Similar Literature
20 similar documents found.
1.
Searching and indexing historical handwritten collections is a very challenging problem. We describe an approach called word spotting, which groups word images into clusters of similar words, using image matching to determine similarity. By annotating “interesting” clusters, an index that links words to the locations where they occur can be built automatically. Image similarities computed using a number of different techniques, including dynamic time warping, are compared. The word similarities are then used for clustering with both K-means and agglomerative clustering techniques. On a subset of the George Washington collection, we show that such a word spotting technique can outperform a Hidden Markov Model word-based recognition technique in terms of word error rate. An erratum to this article can be found at
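The image-matching step in entry 1 rests on dynamic time warping over per-word feature sequences. A minimal Python sketch, assuming a toy per-column ink-count profile as the feature (the paper's actual feature set is richer):

```python
import numpy as np

def column_profile(word_img):
    """Per-column ink count of a binary word image: a toy projection-profile feature."""
    return word_img.sum(axis=0).astype(float)

def dtw_distance(a, b):
    """Dynamic time warping distance between two 1-D feature sequences."""
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three admissible warping moves.
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[n, m]
```

Pairwise DTW distances computed this way can feed directly into K-means or agglomerative clustering as a similarity measure.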

4.
In this paper, we study the effect of taking the user into account in a query-by-example handwritten word spotting framework. Several off-the-shelf query-fusion and relevance-feedback strategies are tested in the handwritten word spotting context. The increase in precision when the user is included in the loop is assessed on two datasets of historical handwritten documents with two baseline word spotting approaches, both based on the bag-of-visual-words model. Finally, we present two alternative ways of presenting the results to the user that may be more attractive and better suited to the user's needs than the classic ranked list.
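Relevance feedback of the kind tested in entry 4 is often implemented as a Rocchio-style query update over bag-of-visual-words histograms. A sketch under that assumption (the weights `alpha`, `beta`, `gamma` are conventional defaults, not values from the paper):

```python
import numpy as np

def rocchio(query, relevant, nonrelevant, alpha=1.0, beta=0.75, gamma=0.15):
    """Rocchio query update: move the query toward marked-relevant results
    and away from marked-nonrelevant ones."""
    q = alpha * np.asarray(query, dtype=float)
    if len(relevant):
        q += beta * np.mean(relevant, axis=0)
    if len(nonrelevant):
        q -= gamma * np.mean(nonrelevant, axis=0)
    # Bag-of-visual-words histograms are non-negative, so clip.
    return np.clip(q, 0.0, None)
```

The updated query vector is then re-scored against the collection, and the loop repeats for as many feedback rounds as the user is willing to give.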

6.
In this paper, we propose a novel technique for word spotting in historical printed documents that combines synthetic data and user feedback. Our aim is to search for keywords typed by the user in a large collection of digitized printed historical documents. The proposed method consists of the following stages: (1) creation of synthetic word images; (2) word segmentation using dynamic parameters; (3) efficient feature extraction for each word image and (4) a retrieval procedure that is optimized by user feedback. Experimental results demonstrate the effectiveness of the proposed approach.

11.
Searching for words of interest in a speech sequence is referred to as keyword spotting (KWS). A myriad of techniques have been proposed over the years for effectively spotting keywords in adults' speech. However, not much work has been reported on KWS for children's speech. Speech data from adult and child speakers differ significantly due to physiological differences between the two groups of speakers. Consequently, the performance of a KWS system trained on adults' speech degrades severely when used by children due to this acoustic mismatch. In this paper, we present our efforts towards improving the performance of keyword spotting systems for children's speech under a limited-data scenario. To this end, we explore prosody modification in order to reduce the acoustic mismatch resulting from differences in pitch and speaking rate. The prosody modification technique explored in this paper is based on glottal closure instant (GCI) events, with GCI locations computed using zero-frequency filtering (ZFF). Further, we present two different ways of applying prosody modification effectively. In the first, prosody modification is applied to the children's test speech prior to decoding in order to improve recognition performance. Alternatively, we also apply prosody modification to the training data from adult speakers; the original and prosody-modified adults' speech are then augmented together before learning the statistical parameters of the KWS system. The experimental evaluations presented in this paper show that both approaches yield significantly improved performance on children's speech. Prosody-modification-based data augmentation also improves performance on adults' speech.
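The speaking-rate side of the prosody modification in entry 11 can be illustrated, very crudely, by resampling. This is a naive stand-in for illustration only: it also shifts pitch, unlike the GCI/ZFF-based technique the paper actually uses, which modifies pitch and duration independently:

```python
import numpy as np

def change_rate(signal, factor):
    """Naive speaking-rate change by linear-interpolation resampling.

    factor > 1 speeds the signal up; factor < 1 slows it down.
    NOTE: this also scales the pitch, which proper GCI-anchored
    prosody modification avoids.
    """
    n = int(len(signal) / factor)
    idx = np.linspace(0, len(signal) - 1, n)
    return np.interp(idx, np.arange(len(signal)), signal)
```

For augmentation, the original adult recordings and several rate- or pitch-modified copies would be pooled before training the KWS system.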

13.
Geoscientific data interpretation is a challenging task that requires the detection and synthesis of complex patterns within data. As a first step towards better understanding this interpretation process, our research focuses on quantitative monitoring of interpreters' brain responses associated with geoscientific target spotting. This paper presents a method that profiles brain responses using electroencephalography (EEG) to detect P300-like responses associated with target spotting in complex geoscientific data. In our experiment, eight interpreters with varying levels of expertise and experience were asked to detect features likely to be copper–gold-rich porphyry systems within magnetic geophysical data. The target features appear against a noisy background and often have incomplete shapes. Magnetic images with and without targets were shown to participants using the “oddball” paradigm. Event-related potentials were obtained by averaging the EEG epochs across multiple trials, and the results show a delayed P3 response to the targets, likely due to the complexity of the task. EEG epochs were classified, and the results show reliable single-trial classification of EEG responses with an average accuracy of 83%. These results demonstrate that P300-like responses can be used to quantify geoscientific target-spotting performance.
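The event-related-potential averaging described in entry 13 is a simple operation: time-locked epochs are averaged across trials so that stimulus-locked components (such as the P3) survive while uncorrelated noise cancels. A minimal sketch, with an optional baseline correction that is a common convention rather than the paper's stated preprocessing:

```python
import numpy as np

def event_related_potential(epochs, baseline_samples=0):
    """Average time-locked EEG epochs (trials x samples) into an ERP.

    Optionally subtracts the mean of the first `baseline_samples`
    pre-stimulus samples of each epoch (simple baseline correction).
    """
    epochs = np.asarray(epochs, dtype=float)
    if baseline_samples:
        epochs = epochs - epochs[:, :baseline_samples].mean(axis=1, keepdims=True)
    return epochs.mean(axis=0)
```

Single-trial classification, by contrast, operates on the individual epochs rather than the average, which is why it is the harder problem the 83% figure refers to.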

14.
Spectral-analysis-based shape descriptors have achieved good results in non-rigid 3D shape matching and have attracted wide attention from researchers. Spectral analysis is an intrinsic shape-analysis method based on the spectral decomposition of the Laplace–Beltrami operator on a manifold. Spectral shape descriptors and spectral distance distribution functions are the two main classes of spectral-analysis shape descriptors; they have different mathematical properties and physical interpretations. Based on these two classes of descriptors, this paper gives a detailed analysis of the methods and their application to shape matching. First, we present a framework for non-rigid 3D shape matching using spectral-analysis-based shape descriptors and introduce the basic ideas and computation of several commonly used spectral shape descriptors and spectral distance distribution functions. We then analyze and compare the strengths, weaknesses, and application scenarios of these descriptors, providing a reference for researchers choosing among them. Finally, experiments compare the robustness, time cost, and non-rigid matching performance of different spectral-analysis-based shape descriptors, with the aim of advancing their practical application.
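A representative spectral shape descriptor in this line of work is the heat kernel signature, HKS(x, t) = Σᵢ exp(−λᵢ t) φᵢ(x)², built from Laplace–Beltrami eigenpairs. A sketch using a graph Laplacian as a discrete stand-in for the Laplace–Beltrami operator (the survey's descriptors are computed on triangle meshes, not plain graphs):

```python
import numpy as np

def heat_kernel_signature(evals, evecs, times):
    """HKS(x, t) = sum_i exp(-lambda_i * t) * phi_i(x)^2.

    evals: (k,) Laplacian eigenvalues; evecs: (n_vertices, k) eigenvectors;
    times: (n_times,) diffusion scales. Returns (n_times, n_vertices).
    """
    evecs_sq = evecs ** 2                      # phi_i(x)^2, per vertex
    decay = np.exp(-np.outer(times, evals))    # exp(-lambda_i * t), per scale
    return decay @ evecs_sq.T
```

Because the signature depends only on the Laplacian spectrum, it is intrinsic: isometric (non-rigid) deformations leave it unchanged, which is exactly the property that makes spectral descriptors attractive for non-rigid matching.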

18.
Due to the large number of spelling variants found in historical texts, standard methods of Information Retrieval (IR) fail to produce satisfactory results on historical document collections. In order to improve recall for search engines, modern words used in queries have to be associated with corresponding historical variants found in the documents. In the literature, the use of (1) special matching procedures and (2) lexica for historical language have been suggested as two alternative ways to solve this problem. In the first part of the paper, we show how the construction of matching procedures and lexica may benefit from each other, leading the way to a combination of both approaches. A tool is presented where matching rules and a historical lexicon are built in an interleaved way based on corpus analysis. In the second part of the paper, we ask if matching procedures alone suffice to lift IR on historical texts to a satisfactory level. Since historical language changes over centuries, it is not simple to obtain an answer. We present experiments where the performance of matching procedures in text collections from four centuries is studied. After classifying missed vocabulary, we measure precision and recall of the matching procedure for each period. Results indicate that for earlier periods, matching procedures alone do not lead to satisfactory results. We then describe experiments where the gain for recall obtained from historical lexica of distinct sizes is estimated.
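Matching procedures of the kind studied in entry 18 typically combine character-level rewrite rules with an edit-distance threshold. A sketch of that pattern; the rule list here is illustrative ('ſ' long s → 's' is a genuine historical normalization, 'vv' → 'w' is a hypothetical example, and the paper's actual rules are learned from corpus analysis:

```python
# Illustrative rewrite rules, NOT the paper's learned rule set.
RULES = [("ſ", "s"), ("vv", "w")]

def normalize(word, rules=RULES):
    """Apply character-level rewrite rules to map a historical variant
    toward its modern form."""
    for old, new in rules:
        word = word.replace(old, new)
    return word

def levenshtein(a, b):
    """Standard edit distance, used to admit residual variation
    that the rules do not cover."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,          # deletion
                           cur[j - 1] + 1,       # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]

def matches(modern_query, historical_word, max_dist=1):
    return levenshtein(modern_query, normalize(historical_word)) <= max_dist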

20.
In this paper, we propose a novel solution for the problem of segmenting macro- and micro-expression frames (or retrieving the expression intervals) in video sequences, which is a prior step for many expression recognition algorithms. The proposed method exploits the non-rigid facial motion that occurs during facial expressions by capturing the optical strain corresponding to the elastic deformation of facial skin tissue. The method is capable of spotting both macro-expressions, which are typically associated with expressed emotions, and rapid micro-expressions, which are typically associated with semi-suppressed macro-expressions. We test our algorithm on several datasets, including a newly released hour-long video with two subjects recorded in a natural setting that includes spontaneous facial expressions. We also report results on a dataset that contains 75 feigned macro-expressions and 37 feigned micro-expressions. We achieve over a 75% true positive rate at a 1% false positive rate for macro-expressions, and a nearly 80% true positive rate for spotting micro-expressions at a 0.3% false positive rate.
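The optical strain in entry 20 is derived from the spatial derivatives of a dense optical-flow field (u, v). A sketch of one common strain-magnitude formulation, ε = √(ε²ₓₓ + ε²ᵧᵧ + 2ε²ₓᵧ), with the flow field itself assumed given (computing it is a separate step the paper builds on):

```python
import numpy as np

def optical_strain(u, v):
    """Strain magnitude of a dense optical-flow field (u, v).

    u, v: 2-D arrays of horizontal/vertical displacement per pixel.
    Uses the infinitesimal strain tensor: e_xx = du/dx, e_yy = dv/dy,
    e_xy = 0.5 * (du/dy + dv/dx).
    """
    ux, uy = np.gradient(u, axis=1), np.gradient(u, axis=0)
    vx, vy = np.gradient(v, axis=1), np.gradient(v, axis=0)
    exy = 0.5 * (uy + vx)
    return np.sqrt(ux**2 + vy**2 + 2.0 * exy**2)
```

Rigid head motion (a constant flow field) produces zero strain, while the elastic skin deformation of an expression produces localized strain peaks, which is what makes this quantity usable for spotting expression intervals.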


Copyright©北京勤云科技发展有限公司  京ICP备09084417号