
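XML document latent information extraction algorithm based on D-S evidence theory
Cite this article: CHEN Hua-cheng, DU Xue-hui, CHEN Xing-yuan, XIA Chun-tao. XML document latent information extraction algorithm based on D-S evidence theory [J]. Application Research of Computers, 2013, 30(4): 1187-1190.
Authors: CHEN Hua-cheng, DU Xue-hui, CHEN Xing-yuan, XIA Chun-tao
Affiliation: Institute of Electronic Technology, PLA Information Engineering University, Zhengzhou 450004
Funding: National "973" Program of China (2011CB311801); Henan Province Science and Technology Innovation Talent Program (114200510001)
Abstract: Traditional XML document retrieval relies mainly on keyword matching and ignores both the semantics of the keywords and the latent information implied by combinations of information. To address this, an algorithm for extracting latent information from XML documents based on D-S evidence theory is proposed. The algorithm introduces an ontology to define the semantic relationships between concepts and the ways in which information can be combined, proposes a retrieval model based on D-S evidence theory together with a method for computing indicator weights, and uses the plausibility function to design a dynamic threshold. This effectively removes the uncertainty that arises during semantic matching and solves the problem of extracting the latent information in information combinations. The algorithm is also applied to detecting sensitive personal and enterprise information in the e-government domain, and experiments show that it achieves higher precision and recall than traditional methods.

Keywords: D-S evidence theory; Extensible Markup Language (XML); latent information; ontology; dynamic threshold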

XML document latent information extraction algorithm based on D-S evidence theory
CHEN Hua-cheng, DU Xue-hui, CHEN Xing-yuan, XIA Chun-tao. XML document latent information extraction algorithm based on D-S evidence theory [J]. Application Research of Computers, 2013, 30(4): 1187-1190.
Authors: CHEN Hua-cheng, DU Xue-hui, CHEN Xing-yuan, XIA Chun-tao
Affiliation: Institute of Electronic Technology, PLA Information Engineering University, Zhengzhou 450004, China
Abstract: Traditional XML document retrieval methods rely mainly on keyword matching, which ignores keyword semantics and the latent information contained in combinations of information. This paper proposes an algorithm for extracting latent information from XML documents based on D-S evidence theory. It first uses an ontology to define the semantic relationships between concepts and the ways in which information can be combined. It then proposes a retrieval model based on D-S evidence theory, presents a method for computing evidence weights, and designs a dynamic threshold using the plausibility function. The algorithm addresses the uncertainty in semantic matching and the retrieval of latent information. It is also applied to the detection of sensitive personal and enterprise information in the e-government domain. Experiments show that the proposed algorithm achieves higher precision and recall than traditional methods.
Keywords: D-S evidence theory; XML; latent information; ontology; dynamic threshold
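
The abstract names D-S (Dempster-Shafer) evidence theory and the plausibility function but gives no formulas. As background only, the following minimal sketch shows the standard Dempster rule of combination together with the belief and plausibility functions. It is not the authors' retrieval model, weighting scheme, or dynamic threshold, none of which are specified in the abstract. The frame of discernment (`sensitive`, `benign`), the function names, and all mass values are illustrative assumptions.

```python
from itertools import product


def combine(m1, m2):
    """Combine two mass functions with Dempster's rule.

    Each mass function maps a frozenset (a subset of the frame of
    discernment) to a mass; the masses of one function sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (a, wa), (b, wb) in product(m1.items(), m2.items()):
        inter = a & b
        if inter:
            combined[inter] = combined.get(inter, 0.0) + wa * wb
        else:
            # Mass assigned to contradictory pairs is discarded.
            conflict += wa * wb
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict")
    norm = 1.0 - conflict
    return {s: w / norm for s, w in combined.items()}


def belief(m, hypothesis):
    """Sum of masses of all focal sets contained in the hypothesis."""
    return sum(w for s, w in m.items() if s <= hypothesis)


def plausibility(m, hypothesis):
    """Sum of masses of all focal sets that intersect the hypothesis."""
    return sum(w for s, w in m.items() if s & hypothesis)


# Hypothetical frame: an XML fragment is either sensitive or benign.
S = frozenset({"sensitive"})
B = frozenset({"benign"})
T = S | B  # the whole frame, i.e. "unknown"

# Two made-up pieces of evidence (illustrative values only).
m1 = {S: 0.6, T: 0.4}
m2 = {S: 0.5, B: 0.2, T: 0.3}

m = combine(m1, m2)
bel_s = belief(m, S)
pl_s = plausibility(m, S)
print(f"Bel(sensitive) = {bel_s:.4f}, Pl(sensitive) = {pl_s:.4f}")
```

The interval between belief and plausibility bounds the support for a hypothesis, which is the kind of quantity a plausibility-based threshold could be compared against; how the paper actually derives its threshold is not stated in the abstract.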
This article is indexed in CNKI and other databases.