首页 | 本学科首页   官方微博 | 高级检索  
     

基于属性约简和相对熵的离群点检测算法
引用本文:胡云,李慧,施珺,蔡虹.基于属性约简和相对熵的离群点检测算法[J].山东大学学报(工学版),2011,41(6):31-36.
作者姓名:胡云  李慧  施珺  蔡虹
作者单位:1.淮海工学院计算机工程学院, 江苏 连云港 222000; 2.南京大学计算机科学与技术系, 江苏 南京 210000
基金项目:江苏省自然科学基金资助项目(BK2008190)
摘    要:本研究结合信息熵与粗糙集理论中的属性约简技术,提出了一种新颖的离群点检测算法。这种方法通过在更小的属性子空间去获得相同或相近的离群数据集,使对离群数据的分析更加集中于较小的目标域。该算法对原属性空间进行划分,通过分析计算将具有最大相对熵与负相对势的对象集合判定为离群点集合。为了验证算法的有效性,还在通用数据集上进行了测试,理论分析和实验结果表明该离群点检测算法是有效可行的。

关 键 词:属性简约  相对熵  离群点检测  
收稿时间:2011-04-15

An outlier detection algorithm based on attribute reduction and relative entropy
HU Yun,LI Hui,SHI Jun,CAI Hong.An outlier detection algorithm based on attribute reduction and relative entropy[J].Journal of Shandong University of Technology,2011,41(6):31-36.
Authors:HU Yun  LI Hui  SHI Jun  CAI Hong
Affiliation:1. School of Computer Engineering, Huaihai Institute of Technology, Lianyungang 222000, China;2. Department of Computer Science and Technology, Nanjing University, Nanjing 210000, China
Abstract:A new outlier detection algorithm combining a rough set and information entropy technology was proposed. This approach could obtain similar outlier sets by means of searching in an attributes subspace, which could lead the analysis of outlier detection to focus better on narrow and specific object fields. This algorithm divided the original attribute space into several segments, which filtered out those subjects with largest relative entropy negative relative cardinality as the outliers. To prove this algorithm’s effectiveness, experiments on a real world dataset were conducted. Theoretical analysis and experimental results showed that this method of outlier detection was efficient and effective.
Keywords:attribute deduction  relative entropy  outlier detection
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《山东大学学报(工学版)》浏览原始摘要信息
点击此处可从《山东大学学报(工学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号