首页 | 本学科首页   官方微博 | 高级检索  
     


Log integration on large scale for global networking monitoring
Authors:Jia-jia Miao   Quan-yuan Wu  Yan Jia
Affiliation:[1]School of Computer, National University of Defense Technology, Changsha 410073, China [2]Institute of Command Automation, PLA University of Science and Technology, Nanjing 210007, China
Abstract:Supposing that the overall situation is dug out from the distributed monitoring nodes, there should be two critical obstacles, heterogenous schema and instance, to integrating heterogeneous data from different monitoring sensors. To tackle the challenge of heterogenous schema, an instance-based approach for schema mapping, named instance-based machine-learning (IML) approach was described. And to solve the problem of heterogenous instance, a novel approach, called statistic-based clustering (SBC) approach, which utilized clustering and statistics technologies to match large scale sources holistically, was also proposed. These two algorithms utilized the machine-leaning and clustering technology to improve the accuracy. Experimental analysis shows that the IML approach is more precise than SBC approach, reaching at least precision of 81% and recall rate of 82%. Simulation studies further show that SBC can tackle large scale sources holistically with 85% recall rate when there are 38 data sources.
Keywords:machine-learning  clustering  data integration  schema matching  instance matching
本文献已被 维普 万方数据 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号