首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于哈希链表的高效概念漂移连续属性处理算法
引用本文:王涛,李舟军,颜跃进.一种基于哈希链表的高效概念漂移连续属性处理算法[J].计算机工程与科学,2008,30(8):65-68.
作者姓名:王涛  李舟军  颜跃进
作者单位:1. 国防科技大学计算机学院,湖南,长沙,410073
2. 北京航空航天大学计算机学院,北京,100083
摘    要:本文重点研究了数据流挖掘中存在概念漂移情形的连续属性处理算法。数据流是一种增量、在线、实时的数据模型。VFDT是数据流挖掘中数据呈稳态分布情形下最成功的算法之一;CVFDT是有效解决数据流挖掘中概念漂移问题的算法之一。基于CVFDT,本文提出了有效地解决数据流挖掘中存在概念漂移情形的连续属性处理问题的扩展哈希表算法HashCVFDT。该算法在属性值插入、查找和删除时具有哈希表的快速性,而在选取每个连续属性的最优化划分节点时解决了哈希表不能有序输出的缺点。

关 键 词:数据流挖掘  CVFDT连续属性  概念漂移  扩展哈希表

An Efficient Continuous-Valued Attribute Handling Algorithm for Mining Concept-Drifting Data Streams Based on the Extended Hash Table
WANG Tao,LI Zhou-jun,YAN Yue-jin.An Efficient Continuous-Valued Attribute Handling Algorithm for Mining Concept-Drifting Data Streams Based on the Extended Hash Table[J].Computer Engineering & Science,2008,30(8):65-68.
Authors:WANG Tao  LI Zhou-jun  YAN Yue-jin
Abstract:This paper focuses on continuous-valued attribute handling for mining concept-drifting data streams.Data stream is an incremental,online and real-time model.VFDT is one of the most successful algorithms in data stream mining when data take on a state of stable distribution;CVFDT is one of the effective algorithms for resolving the problem of concept drifting in data stream mining.Based on CVFDT,the paper proposes an efficient continuous-valued attribute handling method named Hash CVFDT for mining concept-drifting data streams based on the extended hash table.The algorithm is as fast as the hash table in attribute inserting,seeking and deleting,and solves the flaws of the hash table which cannot output.Sequently when selecting the optimally partitioned nodes of each continuous-valued attribute.
Keywords:data streaming  CVFDT:continuous-valued attribute  concept drifting  extended hash table
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号