首页 | 本学科首页   官方微博 | 高级检索  
     

非规则流中高维数据流典型相关性分析并行计算方法
引用本文:周勇,卢晓伟,程春田.非规则流中高维数据流典型相关性分析并行计算方法[J].软件学报,2012,23(5):1053-1072.
作者姓名:周勇  卢晓伟  程春田
作者单位:1. 大连理工大学软件学院,辽宁大连,116620
2. 大连理工大学水利学院,辽宁大连,116624
基金项目:国家杰出青年基金(51025934)
摘    要:为了满足在计算资源受限的环境下高维数据流处理的实时性要求,提出一种方法——基于GPU(graphic processing unit)的非规则流中高维数据流的处理模型和具体的可行架构,并分析设计了相关的并行算法.该六层模型是将GPU处理数据的高宽带性能结合进滑动窗口中数据流的分析,进而在该框架下基于统一计算设备架构(compute unified device architecture,简称CUDA),使用数据立方模型以及降维约简技术并行分析了多条高维数据流的典型相关性.理论分析和实验结果均表明,该并行处理方法能够在线精确地识别同步滑动窗口模式下高维数据流之间的相关性.相对于纯CPU方法,该方法具有显著的速度优势,很好地满足了高维数据流的实时性需求,可以作为通用的分析方法广泛应用于数据流挖掘领域.

关 键 词:图形处理器  高维数据流  典型相关性  统一计算设备架构  降维约简技术
收稿时间:2010/4/26 0:00:00
修稿时间:2010/12/9 0:00:00

Parallel Computing Method of Canonical Correlation Analysis for High-Dimensional Data Streams in Irregular Streams
ZHOU Yong,LU Xiao-Wei and CHENG Chun-Tian.Parallel Computing Method of Canonical Correlation Analysis for High-Dimensional Data Streams in Irregular Streams[J].Journal of Software,2012,23(5):1053-1072.
Authors:ZHOU Yong  LU Xiao-Wei and CHENG Chun-Tian
Affiliation:1(School of Software,Dalian University of Technology,Dalian 116620,China)2(School of Hydraulic Engineering,Dalian University of Technology,Dalian 116024,China)
Abstract:This paper addresses an approach that uses GPU(graphic processing unit)-based processing architecture model and its parallel algorithm for high-dimensional data streams over the irregular streams in order to satisfy the real-time requirement under the resource-constraints.This six layers model combines the GPU high wide-band property of data processing with analysis data stream in a sliding window.Next,canonical correlation analysis is carried out between two high-dimensional data streams,by a data cube pattern,and a dimensionality-reduction method in this framework based on compute unified device architecture(CUDA).The theoretical analysis and experimental results show that the parallel processing method can detect correlations on high dimension data streams,online,accurately in the synchronous sliding window mode.According to the pure CPU method,this technique has significant speed advantage and conducts the real-time requirement of high-dimensional data stream very well.It provides a common strategy for the applied field of data stream mining.
Keywords:graphic processing unit(GPU)  high-dimensional data stream  canonical correlation  compute unified device architecture  dimensionality-reduction technique
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号