首页 | 本学科首页   官方微博 | 高级检索  
     

基于局部线性加权的离群点检测方法
引用本文:徐雪松,宋东明,张谞,张宏,刘凤玉.基于局部线性加权的离群点检测方法[J].计算机科学,2008,35(5):154-157.
作者姓名:徐雪松  宋东明  张谞  张宏  刘凤玉
作者单位:南京理工大学计算机科学与技术学院,南京,210094
摘    要:为了提高高维数据集合离群数据挖掘效率,在分析了传统的离群数据挖掘算法优点和缺点的基础上,提出了一种基于局部线性加权的离群点检测算法.该算法利用LLE算法的思想寻找样本数据的内在嵌入分布,并通过距离公式和离群点权值判别式进行权值数据判定,根据权值的大小标识出数据集中的离群点.仿真实验的结果表明了该方法能够有效地发现高维数据集中的离群点.与此同时,该算法具有参数估计简单、参数影响不大等优点.该算法为离群点检测问题的机器学习提供了一条新的途径.

关 键 词:局部线性嵌入  高维数据  非线性降维  离群数据

Research of Detection of Outliers Based on Locally Linear Weighted Value
XU Xue-song,SONG Dong-ming,ZHANG Xu,ZHANG Hong,LIU Feng-yu.Research of Detection of Outliers Based on Locally Linear Weighted Value[J].Computer Science,2008,35(5):154-157.
Authors:XU Xue-song  SONG Dong-ming  ZHANG Xu  ZHANG Hong  LIU Feng-yu
Affiliation:XU Xue-song SONG Dong-ming ZHANG Xu ZHANG Hong LIU Feng-yu (Department of Computer Science , Technology,Nanjing University of Science , Technology,Nanjing 210094,China)
Abstract:The data dimension reduction is the main method that can enhance the outliers mining efficiency based on higher-dimension data set.The research of detection of outliers based on locally linear weighted value is proposed after analyzing the advantages and disadvantages of the classical outlier mining algorithm in the paper.With the idea of Lo- cal Linear Embedding,the algorithm tries to find distribution of internal embedding of the samples,and determines value-weighted by combining distance formula with dis...
Keywords:Loeally linear embedding  High dimensional data  Nonlinear dimensionality reduction  Outliers  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号