首页 | 本学科首页   官方微博 | 高级检索  
     


Fuzzy String Matching with a Deep Neural Network
Authors:Daniel Shapiro  Nathalie Japkowicz  Mathieu Lemay  Miodrag Bolic
Affiliation:1. School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, Ontario, Canada;2. Lemay Solutions Consulting Inc, Ottawa, Ontario, Canada;3. Lemay Solutions Consulting Inc, Ottawa, Ontario, Canada
Abstract:A deep learning neural network for character-level text classification is described in this work. The system spots keywords in the text output of an optical character recognition system using memoization and by encoding the text into feature vectors related to letter frequency. Recognizing error messages in a set of generated images, dictionary and spell-check-based approaches achieved 69% to 88% accuracy, while various deep learning approaches achieved 91% to 96% accuracy, and a combination of deep learning with a dictionary achieved 97% accuracy. The contribution of this work to the state of the art is to describe a new approach for character-level deep neural network classification of noisy text.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号