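Outlier Detection Based on Kernel Function-Principal Component Dimension Reduction
Cite as: 徐雪松, 刘耀宗, 赵学龙, 张宏, 刘凤玉. Outlier Detection Based on Kernel Function-Principal Component Dimension Reduction [J]. 计算机工程 (Computer Engineering), 2008, 34(8): 82-84
Authors: 徐雪松  刘耀宗  赵学龙  张宏  刘凤玉
Affiliation: School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing 210094 (all five authors)
Abstract: To improve the efficiency of outlier mining on high-dimensional data sets, this paper analyzes traditional outlier mining algorithms and proposes an outlier detection algorithm. The algorithm transforms a nonlinear problem into a linear problem in a high-dimensional feature space, reduces the dimension using kernel function-principal components, and scans the projection components of each data object one by one to decide whether the data point is an outlier. It applies to outlier detection in both linearly separable and linearly inseparable data sets. Experiments demonstrate the superiority of the algorithm.

Keywords: dimension reduction  kernel function  principal component
Article ID: 1000-3428(2008)08-0082-03
Revision date: 2007-05-03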

Outliers Detection Based on Kernel Function-Principle Component Dimension Reduction
XU Xue-song,LIU Yao-zong,ZHAO Xue-long,ZHANG Hong,LIU Feng-yu. Outliers Detection Based on Kernel Function-Principle Component Dimension Reduction[J]. Computer Engineering, 2008, 34(8): 82-84
Authors:XU Xue-song  LIU Yao-zong  ZHAO Xue-long  ZHANG Hong  LIU Feng-yu
Affiliation: Department of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing 210094
Abstract: Dimension reduction can improve the efficiency of outlier mining on high-dimensional data sets. This paper analyzes classical outlier mining algorithms and proposes a novel outlier detection algorithm. The algorithm transforms a nonlinear problem into a linear problem in a high-dimensional feature space and applies a kernel function with a principal component transformation to reduce the data dimension. It then examines the projection components of each data object one by one to decide whether the object is an outlier. The algorithm applies to outlier detection in both linearly separable and linearly inseparable data sets. Experimental results indicate the superiority of the algorithm.
Keywords:dimension reduction  kernel function  principal component
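
The abstract describes the pipeline only at a high level, so the following is a minimal Python sketch of the general idea rather than the authors' implementation. It assumes an RBF kernel, keeps the top principal components of the centered kernel matrix, and flags a point when any of its projection components has a large robust z-score (median/MAD). The function names, the kernel choice, `gamma`, and the threshold of 3.5 are all illustrative assumptions that do not come from the paper.

```python
import numpy as np


def rbf_kernel(X, gamma):
    """Pairwise RBF kernel matrix (kernel choice is an assumption)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.exp(-gamma * np.maximum(d2, 0.0))


def kpca_project(X, n_components=2, gamma=0.5):
    """Project the rows of X onto the leading kernel principal components."""
    n = X.shape[0]
    K = rbf_kernel(X, gamma)
    # Center the kernel matrix in feature space.
    one = np.full((n, n), 1.0 / n)
    Kc = K - one @ K - K @ one + one @ K @ one
    # eigh returns eigenvalues in ascending order; keep the largest ones.
    vals, vecs = np.linalg.eigh(Kc)
    idx = np.argsort(vals)[::-1][:n_components]
    vals, vecs = vals[idx], vecs[:, idx]
    # Projection of training point i on component j is sqrt(lambda_j) * v_ij.
    return vecs * np.sqrt(np.maximum(vals, 0.0))


def flag_outliers(Z, threshold=3.5):
    """Scan each projection component; flag points with a large robust z-score.

    The median/MAD rule and the threshold are assumptions for illustration.
    """
    med = np.median(Z, axis=0)
    mad = np.median(np.abs(Z - med), axis=0)
    mad = np.where(mad > 0, mad, 1e-12)  # guard against a zero MAD
    per_component = 0.6745 * np.abs(Z - med) / mad
    scores = per_component.max(axis=1)   # worst component per point
    return scores, scores > threshold


# Usage: a tight cluster plus one distant point appended as the last row.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.1, size=(50, 2)), [[5.0, 5.0]]])
Z = kpca_project(X, n_components=2, gamma=0.5)
scores, flags = flag_outliers(Z)
print("flagged indices:", np.flatnonzero(flags))
```

The per-component scan mirrors the abstract's "one by one" examination of projection components; other decision rules (for example, a distance threshold in the reduced space) would fit the same outline.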