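Distance-Based Outlier Data Mining Algorithms and Their Application
Cite as: Zhao Zemao, He Kunjin, Hu Youjin. Distance-based outlier data mining algorithms and their application [J]. Computer Applications and Software, 2005, 22(9): 105-107
Authors: Zhao Zemao, He Kunjin, Hu Youjin
Affiliation: College of Computer and Information Engineering, Hohai University, Changzhou, Jiangsu 213022, China (all three authors)
Abstract: A quantitative definition of distance-based outlier data is given, and a distance-based, multi-indicator outlier data mining algorithm is proposed. The algorithm is suited to data analysis in general very large databases. It is analyzed using student examination scores as an example, from which outlier data can be mined dynamically. As a special case, the single-indicator outlier data mining algorithm is applied to the Web server log files of a campus network, and a frequency analysis chart of Internet users is given.

Keywords: outlier data mining; Web log; student scores; Internet usage behavior patterns
Received: 2004-06-09
Revised: 2004-06-09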

ALGORITHMS FOR MINING DISTANCE-BASED OUTLIER AND ITS APPLICATION
Zhao Zemao, He Kunjin, Hu Youjin. ALGORITHMS FOR MINING DISTANCE-BASED OUTLIER AND ITS APPLICATION[J]. Computer Applications and Software, 2005, 22(9): 105-107
Authors: Zhao Zemao, He Kunjin, Hu Youjin
Abstract: A quantitative definition of distance-based outlier data is presented, and a multi-criterion algorithm for mining distance-based outliers is proposed. The algorithm is suited to data analysis in large databases; applied to student examination scores as an example, it mines outliers dynamically. As a special case, the single-criterion algorithm is applied to the Web server log files of a campus network, and a frequency analysis chart of Internet users, with outliers marked, is presented.
Keywords: outlier data mining; Web log; student scores; Internet usage behavior patterns
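
The abstract does not spell out the paper's quantitative definition, so the following is only a minimal sketch of one common distance-based outlier rule, not the authors' algorithm: a record is flagged when too few of the other records lie within a given Euclidean radius. The function name `distance_based_outliers`, the parameters `radius` and `min_fraction`, and the sample scores are all hypothetical and chosen for illustration.

```python
import math


def distance_based_outliers(points, radius, min_fraction):
    """Flag records that have too few neighbours within `radius`.

    Assumption (not taken from the paper): a record is an outlier when
    fewer than `min_fraction` of the other records lie within `radius`
    of it, using Euclidean distance over all indicators.
    """
    n = len(points)
    flags = []
    for i, p in enumerate(points):
        # Count the other records that fall inside the radius around p.
        neighbours = sum(
            1
            for j, q in enumerate(points)
            if j != i and math.dist(p, q) <= radius
        )
        flags.append(neighbours < min_fraction * (n - 1))
    return flags


# Hypothetical multi-indicator data: three exam scores per student.
scores = [(78, 82, 75), (80, 79, 77), (76, 85, 74), (81, 80, 79), (30, 95, 20)]
flags = distance_based_outliers(scores, radius=15.0, min_fraction=0.5)
print(flags)
```

The single-indicator case mentioned in the abstract corresponds to passing one-element tuples, for example per-user request counts taken from a Web log. This brute-force version compares every pair of records, so it is quadratic in the number of records and would need indexing or sampling before being used on a very large database.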
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.