首页 | 本学科首页   官方微博 | 高级检索  
     

汉语文本中特殊符号串的自动识别技术
引用本文:李宏乔,樊孝忠.汉语文本中特殊符号串的自动识别技术[J].计算机工程,2004,30(12):114-115,180.
作者姓名:李宏乔  樊孝忠
作者单位:北京理工大学计算机科学与技术系,北京,100081
摘    要:提出从组成形式和上下文语境两个方面来自动识别汉语文本中的各种特殊符号串。其组成形式用包含约束式的上下文无关文法来描述,改进的LR分析方法进行形式识别;上下文语境采用基于知网概念的特征向量来表达,向量间的欧式距离表示语境间的相似度。实践证明该技术方案是相当有效的。

关 键 词:特殊符号串  约束式  上下文语境  特征向量
文章编号:1000-3428(2004)12-0114-02

Technique of Special Strings Automatic Recognition in Chinese Texts
LI Hongqiao,FAN Xiaozhong.Technique of Special Strings Automatic Recognition in Chinese Texts[J].Computer Engineering,2004,30(12):114-115,180.
Authors:LI Hongqiao  FAN Xiaozhong
Abstract:This paper puts forward two ways: the formalization and context to recognize the special strings in Chinese texts. Context free grammar extended by constrained formula and an improved LR parser are adopted to formalize and recognize the special strings. The context is represented by the eigenvector based on Hownet and their similarity is measured by Euclid distance between their eigenvectors. This technique is proved to be quite effective in practice.
Keywords:Special strings  Constrained formula  Context  Eigenvector  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号