首页 | 本学科首页   官方微博 | 高级检索  
     

基于分布式架构的时间序列局部相似检测算法
引用本文:林炀,江育娥,林劼. 基于分布式架构的时间序列局部相似检测算法[J]. 计算机应用, 2016, 36(12): 3285-3291. DOI: 10.11772/j.issn.1001-9081.2016.12.3285
作者姓名:林炀  江育娥  林劼
作者单位:福建师范大学 软件学院, 福州 350108
基金项目:国家自然科学基金资助项目(61472082);福建省自然科学基金资助项目(2014J01220)。
摘    要:基于动态时间规整算法思想的CrossMatch算法可以用来解决序列间的部分相似问题,但是由于算法时间空间复杂度过高,需要消耗大量的计算资源,因此无法应用于长序列之间的计算。针对以上问题,提出了一个基于分布式平台上的时间序列局部相似性检测算法。将CrossMatch算法实现在了分布式框架上,解决了计算资源不足的问题。首先需要对序列进行切分,分别放置在不同的节点上;其次,各节点分别处理各自序列的相似部分;最后,通过对结果进行汇总并拼接,找出序列间的局部相似。实验结果表明,该算法在准确性上和CrossMatch相近,在时间上也有提升。改进后的分布式算法不仅解决了单机无法处理的长序列计算问题,而且可以通过增加并行计算节点数提高运行速度。

关 键 词:动态时间规整  MapReduce  时间序列  局部相似性  并行化  
收稿时间:2016-06-22
修稿时间:2016-08-25

Local similarity detection algorithm for time series based on distributed architecture
LIN Yang,JIANG Yu'e,LIN Jie. Local similarity detection algorithm for time series based on distributed architecture[J]. Journal of Computer Applications, 2016, 36(12): 3285-3291. DOI: 10.11772/j.issn.1001-9081.2016.12.3285
Authors:LIN Yang  JIANG Yu'e  LIN Jie
Affiliation:Faculty of Software, Fujian Normal University, Fuzhou Fujian 350108, China
Abstract:The CrossMatch algorithm based on the idea of Dynamic Time Warping (DTW) can be used to solve the problems of local similarity between time series. However, due to the high complexity of time and space, large amounts of computing resources are required. Thus, it is almost impossible to be used for long sequences. To solve the above mentioned problems, a new algorithm for local similarity detection based on distributed platform was proposed. The proposed algorithm was a distributed solution for CrossMatch. The problem of insufficient computing resources including time and space requirement was solved. Firstly, the series should be splited and distributed on several nodes. Secondly, the local similarity of every node's own series was dealt with. Finally, the results would be merged and assembled in order to find the local similarity of series. The experimental results show that the accuracy between the proposed algorithm and the CrossMatch algorithm is similar, and the proposed algorithm uses less time. The improved distributed algorithm can not only solve the computation problem of long sequence of time series which can not be processed by a single machine, but also improve the running speed by increasing the number of parallel computing nodes.
Keywords:Dynamic Time Warping (DTW)   MapReduce   time series   local similarity   parallelization
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号