
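Text Information Extraction Based on the Second-Order Hidden Markov Model (基于二阶隐马尔可夫模型的文本信息抽取)
Cite this article: ZHOU Shun-xian, LIN Ya-ping, WANG Yao-nan, YI Ye-qing. Text Information Extraction Based on the Second-Order Hidden Markov Model[J]. Acta Electronica Sinica (电子学报), 2007, 35(11): 2226-2231.
Authors: ZHOU Shun-xian (周顺先)  LIN Ya-ping (林亚平)  WANG Yao-nan (王耀南)  YI Ye-qing (易叶青)
Affiliations: 1. College of Computer and Communication, Hunan University, Changsha, Hunan 410082; 2. College of Electrical and Information Engineering, Hunan University, Changsha, Hunan 410082
Funding: National High Technology Research and Development Program of China (863 Program); Key Natural Science Foundation of Hunan Province
Abstract: The hidden Markov model is one of the important methods for text information extraction. The first-order hidden Markov model assumes that the state transition probability and the observation output probability depend only on the model's current state, which lowers the precision of information extraction to some extent. The second-order hidden Markov model takes into account how these probabilities relate to the model's historical states, and so is better able to recognize incorrect information. This paper proposes a text information extraction algorithm based on the second-order hidden Markov model and analyzes the effectiveness of the second-order model for text information extraction. Simulation experiments show that the new algorithm achieves higher extraction precision than the algorithm based on the first-order hidden Markov model.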
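
The contrast between the two model orders can be written out as joint probabilities of a state sequence and an observation sequence. The block below is one common formulation, not necessarily the exact one used in the paper. The symbols are assumptions introduced here for illustration: \pi for initial state probabilities, a for transition probabilities, and b for output probabilities.

```latex
% First-order HMM: transitions depend on the previous state only,
% outputs depend on the current state only.
P(Q, O) = \pi_{q_1}\, b_{q_1}(o_1) \prod_{t=2}^{T} a_{q_{t-1} q_t}\, b_{q_t}(o_t)

% Second-order HMM (assumed form): transitions depend on the two previous
% states, outputs depend on the current and the previous state.
P(Q, O) = \pi_{q_1}\, b_{q_1}(o_1)\, a_{q_1 q_2}\, b_{q_1 q_2}(o_2)
          \prod_{t=3}^{T} a_{q_{t-2} q_{t-1} q_t}\, b_{q_{t-1} q_t}(o_t)
```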

Keywords: text information extraction  first-order hidden Markov model  second-order hidden Markov model  precision
Article ID: 0372-2112(2007)11-2226-06
Received: 2007-02-05
Revised: 2007-06-05

Text Information Extraction Based on the Second-Order Hidden Markov Model
ZHOU Shun-xian, LIN Ya-ping, WANG Yao-nan, YI Ye-qing. Text Information Extraction Based on the Second-Order Hidden Markov Model[J]. Acta Electronica Sinica, 2007, 35(11): 2226-2231.
Authors:ZHOU Shun-xian  LIN Ya-ping  WANG Yao-nan  YI Ye-qing
Affiliation: 1. College of Computer and Communication, Hunan University, Changsha, Hunan 410082, China; 2. College of Electrical and Information Engineering, Hunan University, Changsha, Hunan 410082, China
Abstract: The hidden Markov model is one of the important approaches to text information extraction. The first-order hidden Markov model assumes that the state transition probability and the observation output probability depend only on the current state of the model, which reduces the precision of information extraction to some extent. The second-order hidden Markov model takes into account the relationship between these probabilities and the model's historical states, and is therefore better able to recognize incorrect information. An algorithm for text information extraction based on the second-order hidden Markov model is proposed, and the validity of the second-order hidden Markov model in information extraction is analyzed. Simulation experiments show that the new algorithm achieves higher precision than the algorithm based on the first-order hidden Markov model.
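
To make the role of the historical state concrete, here is a minimal sketch of Viterbi decoding for a second-order hidden Markov model. It is an illustration under stated assumptions, not the algorithm from the paper. The function name `viterbi_second_order` and the parameter layout (`a1`, `a2`, `b1`, `b2`) are hypothetical, and the output probability is assumed to depend on the previous and current state. In an extraction setting, the states would correspond to field labels and the observations to tokens.

```python
import math


def _log(p):
    """Logarithm that maps a zero probability to -inf instead of raising."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi_second_order(obs, pi, a1, a2, b1, b2):
    """Most likely state path under an assumed second-order HMM.

    obs : list of observation indices (length >= 1)
    pi  : pi[j]        = P(q_1 = j)
    a1  : a1[i][j]     = P(q_2 = j | q_1 = i), used only for the second step
    a2  : a2[i][j][k]  = P(q_t = k | q_{t-2} = i, q_{t-1} = j)
    b1  : b1[j][o]     = P(o_1 = o | q_1 = j)
    b2  : b2[i][j][o]  = P(o_t = o | q_{t-1} = i, q_t = j)
    """
    n = len(pi)
    states = range(n)
    first = [_log(pi[j]) + _log(b1[j][obs[0]]) for j in states]
    if len(obs) == 1:
        return [max(states, key=lambda j: first[j])]

    # delta[i][j]: best log-score of any path ending in the state pair (i, j).
    delta = [[first[i] + _log(a1[i][j]) + _log(b2[i][j][obs[1]])
              for j in states] for i in states]
    back = []  # one table of back-pointers per time step from the third on

    for o in obs[2:]:
        new_delta = [[0.0] * n for _ in states]
        ptr = [[0] * n for _ in states]
        for j in states:
            for k in states:
                # Choose the best state i two steps back for the pair (j, k).
                best_i = max(states,
                             key=lambda i: delta[i][j] + _log(a2[i][j][k]))
                ptr[j][k] = best_i
                new_delta[j][k] = (delta[best_i][j] + _log(a2[best_i][j][k])
                                   + _log(b2[j][k][o]))
        back.append(ptr)
        delta = new_delta

    # Backtrack from the best final state pair.
    j, k = max(((j, k) for j in states for k in states),
               key=lambda jk: delta[jk[0]][jk[1]])
    path = [k, j]
    for ptr in reversed(back):
        i = ptr[j][k]
        path.append(i)
        j, k = i, j
    path.reverse()
    return path


# Toy model with two states and uninformative outputs, so the decoded path
# is determined entirely by the second-order transitions.
pi = [1.0, 0.0]
a1 = [[0.0, 1.0], [1.0, 0.0]]
a2 = [[[0.0, 1.0], [0.0, 1.0]],   # after (0,0) -> 1, after (0,1) -> 1
      [[1.0, 0.0], [1.0, 0.0]]]   # after (1,0) -> 0, after (1,1) -> 0
b1 = [[0.5, 0.5], [0.5, 0.5]]
b2 = [[[0.5, 0.5], [0.5, 0.5]], [[0.5, 0.5], [0.5, 0.5]]]

print(viterbi_second_order([0, 0, 0, 0, 0], pi, a1, a2, b1, b2))
# prints [0, 1, 1, 0, 0]
```

In the toy model the next state depends on the state two steps back: the pair (0, 1) is followed by 1, while (1, 1) is followed by 0. A first-order transition table, which sees only the last state, cannot express that distinction.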
Keywords: text information extraction  first-order hidden Markov model  second-order hidden Markov model  precision
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.