首页 | 本学科首页   官方微博 | 高级检索  
     


A clustering algorithm for multiple data streams based on spectral component similarity
Authors:Ling Chen  Ling-Jun Zou  Li Tu
Affiliation:1. Department of Mathematics, Imperial College London, London SW7 2AZ, United Kingdom;2. Heilbronn Institute for Mathematical Research, University of Bristol, Bristol BS8 9AG, United Kingdom;1. Govt. Engineering College, Department of Computer Science, Painavu, Idukki, Kerala, India;2. National Institute of Interdisciplinary Science and Technology, Trivandrum, Kerala, India;3. Cochin University of Science and Technology, Cochin, Kerala, India
Abstract:We propose a new algorithm to cluster multiple and parallel data streams using spectral component similarity analysis, a new similarity metric. This new algorithm can effectively cluster data streams that show similar behaviour to each other but with unknown time delays. The algorithm performs auto-regressive modelling to measure the lag correlation between the data streams and uses it as the distance metric for clustering. The algorithm uses a sliding window model to continuously report the most recent clustering results and to dynamically adjust the number of clusters. Our experimental results on real and synthetic datasets show that our algorithm has better clustering quality, efficiency, and stability than other existing methods.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号