首页 | 本学科首页   官方微博 | 高级检索  
     

中文文本自动校错系统中知识库及其构造方法研究
引用本文:张仰森,曹元大,徐波.中文文本自动校错系统中知识库及其构造方法研究[J].小型微型计算机系统,2004,25(12):2237-2242.
作者姓名:张仰森  曹元大  徐波
作者单位:1. 北京理工大学,计算机科学与工程系,北京,100081;山西大学,计算机科学系,山西,太原,030006
2. 北京理工大学,计算机科学与工程系,北京,100081
3. 中国科学院,自动化所,模式识别国家重点实验室,北京,100080
基金项目:山西省青年科技研究基金项目(2002015)资助.
摘    要:阐述了在中文文本校错系统研究和实现过程中 ,面向文本错误查找与纠错建议产生的语言知识获取及知识库构建的思想及其实现算法 .针对数据稀疏问题探讨了查错知识库的存取技术 ,针对不同错误源 ,重点研究了相似码词典、字驱动双向词典和骨架键词典的构造方法 .基于所构建的知识库而实现的中文文本校错系统 ,其查错的召回率和精确率以及纠错建议的有效率都得到很大的提高

关 键 词:知识获取  查错知识库  纠错知识库  自动校错系统
文章编号:1000-1220(2004)12-2237-06

Research of Knowledge Sets and its Structuring Method for Chinese Text Automatic Error Detection and Correction System
ZHANG Yang-sen ,,CAO Yuan-da ,XU Bo.Research of Knowledge Sets and its Structuring Method for Chinese Text Automatic Error Detection and Correction System[J].Mini-micro Systems,2004,25(12):2237-2242.
Authors:ZHANG Yang-sen      CAO Yuan-da  XU Bo
Affiliation:ZHANG Yang-sen 1,2,3,CAO Yuan-da 1,XU Bo 3 1
Abstract:The thought and its realization algorithm to get the language knowledge that contained in Chinese text and structure automatic error detection and correction needed knowledge sets are interpreted in this paper. Because of the data sparse problem, the accessing technology of error detection knowledge sets is discussed. According to different error sources, the ways of structuring the similar-code dictionary, the two-direction dictionary driven by Chinese character and the skeleton key dictionary, are presented. The Chinese text automatic error detection and correction system based on this knowledge sets have achieved effect, its recall rate and accuracy rate as well as the validity of suggestion of error correction have got great raising.
Keywords:knowledge getting  the knowledge sets of error detection  the knowledge sets of error correction  corpus  automatic error detection and correction system
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号