Pattern-based time-series subsequence clustering using radial distribution functions |
| |
Authors: | Anne M Denton, Christopher A Besemann, Dietmar H Dorr |
| |
Affiliation: | (1) Department of Computer Science and Operations Research, North Dakota State University, Fargo, ND 58105-5164, USA |
| |
Abstract: | Clustering of time series subsequence data commonly produces results that are not specific to the data set. This paper introduces
a clustering algorithm that creates clusters exclusively from those subsequences that occur more frequently in a data set
than would be expected by random chance. As such, it partially brings a pattern mining perspective to clustering. When subsequences
are labeled based on such clusters, some may remain without a label. In fact, if the clustering was done on an unrelated
time series, the subsequences are expected to receive no label. We show that pattern-based clusters are indeed
specific to the data set for 7 out of 10 real-world data sets we tested, and for window lengths up to 128 time points. While kernel-density-based
clustering can be used to find clusters with similar properties for window sizes of 8–16 time points, its performance degrades
rapidly as the window size increases.
|
| |
Keywords: | Density-based clustering; Time series subsequence clustering; Clustering noisy data; Noise elimination; Time series labeling |
This article is indexed in SpringerLink and other databases. |
|