
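An Internet-Based Military Exercise Information Extraction System
Cite this article: Li Yuejin, Zhao Jing, Lin Hongfei. An Internet-Based Military Exercise Information Extraction System[J]. Computer Engineering and Applications, 2006, 42(14): 214-218.
Authors: Li Yuejin, Zhao Jing, Lin Hongfei
Affiliation: Department of Computer, Dalian University of Technology, Dalian 116024
Abstract: This paper discusses the basic methods of information extraction from Web documents, and designs and implements SBIES, an Internet-based military exercise information extraction system. The system uses a machine learning algorithm for wrappers to obtain web page extraction rules, applies chunk analysis based on a maximum entropy model for partial parsing, and uses pattern matching to extract information automatically. The information repository is organized by combining a database with XML, and the extracted information can be presented and queried on the Web. System tests show that SBIES achieves relatively high extraction recall and precision.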

Keywords: information extraction, maximum entropy model, chunk analysis, pattern matching
Article ID: 1002-8331-(2006)14-0214-05
Received: 2005-12
Revised: 2005-12

Sham Battle Information Extraction System Based on Internet
Li Yuejin, Zhao Jing, Lin Hongfei. Sham Battle Information Extraction System Based on Internet[J]. Computer Engineering and Applications, 2006, 42(14): 214-218.
Authors: Li Yuejin, Zhao Jing, Lin Hongfei
Affiliation: Department of Computer, Dalian University of Technology, Dalian 116024
Abstract: Information extraction plays an important role in knowledge acquisition and information services. This paper briefly discusses the key techniques for information extraction and presents the design and implementation of a Sham Battle Information Extraction System (SBIES). The system constructs wrappers automatically using machine learning algorithms, applies a maximum entropy model to Chinese chunk parsing, and uses a set of extraction patterns to extract specific information and relationships from relevant HTML documents. It also combines an XML representation with a database to organize the extracted information, which supports Web-based presentation and querying. Tests of SBIES show relatively high recall and precision.
Keywords: information extraction, maximum entropy model, chunk analysis, pattern matching
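
The abstract names pattern matching, XML organization of extracted records, and recall/precision evaluation without giving details. The following is only a minimal sketch of those three ideas, not the SBIES implementation: the regular expression, the field names (`participants`, `kind`, `location`, `date`), the `<exercise>` element, and the sample sentence are all hypothetical, and a regular expression stands in for the paper's extraction patterns over chunk-parsed Chinese text.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical extraction pattern (an assumption, not taken from the paper):
# "<participants> held a/an <kind> exercise in <location> on <YYYY-MM-DD>"
PATTERN = re.compile(
    r"(?P<participants>.+?) held an? (?P<kind>[\w-]+) exercise "
    r"in (?P<location>.+?) on (?P<date>\d{4}-\d{2}-\d{2})"
)


def extract(sentence):
    """Return a dict of extracted fields, or None if the pattern does not match."""
    match = PATTERN.search(sentence)
    return match.groupdict() if match else None


def to_xml(record):
    """Serialize one extracted record as an <exercise> XML element string."""
    root = ET.Element("exercise")
    for field, value in record.items():
        ET.SubElement(root, field).text = value
    return ET.tostring(root, encoding="unicode")


def precision_recall(extracted, gold):
    """Compute (precision, recall) of extracted items against a gold set."""
    extracted, gold = set(extracted), set(gold)
    correct = len(extracted & gold)
    precision = correct / len(extracted) if extracted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall


if __name__ == "__main__":
    text = ("Country A and Country B held a joint exercise "
            "in the Yellow Sea on 2005-08-18")
    record = extract(text)
    if record is not None:
        print(to_xml(record))
```

Precision here is the share of extracted items that are correct, and recall is the share of gold-standard items that were extracted, which are the standard definitions of the two measures the abstract reports.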
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.