首页 | 本学科首页   官方微博 | 高级检索  
     

依存分析和HMM相结合的信息抽取方法
引用本文:袁璐,蒙祖强,许珂.依存分析和HMM相结合的信息抽取方法[J].计算机工程与应用,2012,48(9):138-140.
作者姓名:袁璐  蒙祖强  许珂
作者单位:广西大学计算机与电子信息学院,南宁,530004
基金项目:国家自然科学基金(No.61063032);广西教育厅科研基金项目(No.201012MS010).
摘    要:信息抽取是文本信息处理的一个重要环节,当前的信息抽取研究工作大多针对半结构化的文本。针对自由文本,提出一种依存分析和HMM相结合的文本信息抽取算法,该算法在运用依存分析对句子进行浅层句法分析的基础上制定相应规则,形成输入序列,结合HMM易于建立、适应性好、抽取精度较高的优势,实现自由文本的信息抽取。实验结果表明,新的算法在召回率、准确率和正确率指标上均有良好的性能,说明了算法的有效性,为文本信息的抽取提供了新思路。

关 键 词:信息抽取  自由文本  隐马尔可夫模型  依存分析

Method of text information extraction based on dependency parsing and HMM
YUAN Lu , MENG Zuqiang , XU Ke.Method of text information extraction based on dependency parsing and HMM[J].Computer Engineering and Applications,2012,48(9):138-140.
Authors:YUAN Lu  MENG Zuqiang  XU Ke
Affiliation:College of Computer and Electronic Information, Guangxi University, Nanning 530004, China
Abstract:Information extraction is an important part of text information processing. The current information extraction researches mostly focus on semi-structured text. It proposes a novel text information extraction algorithm based on the combination of dependency parsing and HMM. The algorithm formulates appropriate rules based on applying dependency parsing to shallow syntactic analysis of sentences, forming the input sequence of HMM to achieve free text information extraction combining the advantage of easily building, good adaptability and high extraction accuracy of HMM. Experimental results show that the new algorithm has very good performance on recall rate, accuracy and correct rate.
Keywords:information extraction  free text  Hidden Markov Model(HMM)  dependency parsing
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号