首页 | 本学科首页   官方微博 | 高级检索  
     

基于符号化聚合近似的时间序列相似性复合度量方法
引用本文:刘芬,郭躬德.基于符号化聚合近似的时间序列相似性复合度量方法[J].计算机应用,2013,33(1):192-198.
作者姓名:刘芬  郭躬德
作者单位:1. 福建师范大学 数学与计算机科学学院, 福州 350007 2. 福建师范大学 网络安全与密码技术福建省高校重点实验室, 福州 350007
基金项目:国家自然科学基金资助项目(61070062,61175123);福建高校产学合作科技重大项目(2010H6007)
摘    要:基于关键点的符号化聚合近似(SAX)改进算法(KP_SAX)在SAX的基础上利用关键点对时间序列进行点距离度量,能更有效地计算时间序列的相似性,但对时间序列的模式信息体现不足,仍不能合理地度量时间序列的相似性。针对SAX与KP_SAX存在的缺陷,提出了一种基于SAX的时间序列相似性复合度量方法。综合了点距离和模式距离两种度量,先利用关键点将分段累积近似(PAA)法平均分段进一步细分成各个子分段;再用一个包含此两种距离信息的三元组表示每个子分段;最后利用定义的复合距离度量公式计算时间序列间的相似性,计算结果能更有效地反映时间序列间的差异。实验结果显示,改进方法的时间效率比KP_SAX算法仅降低了0.96%,而在时间序列区分度性能上优于KP_SAX算法和SAX算法。

关 键 词:时间序列  符号化聚合近似  相似性  模式距离  复合度量  
收稿时间:2012-07-29
修稿时间:2012-09-07

Composite metric method for time series similarity measurement based on symbolic aggregate approximation
LIU Fen,GUO Gongde.Composite metric method for time series similarity measurement based on symbolic aggregate approximation[J].journal of Computer Applications,2013,33(1):192-198.
Authors:LIU Fen  GUO Gongde
Affiliation:1. Key Laboratory of Network Security and Cryptology, Fujian Normal University, Fuzhou Fujian 350007, China
2. School of Mathematics and Computer Science, Fujian Normal University, Fuzhou Fujian 350007,China
Abstract:Key point-based Symbolic Aggregate approximation (SAX) improving algorithm (KP_SAX) uses key points to measure point distance of time series based on SAX, which can measure the similarity of time series more effectively. However, it is too short of information about the patterns of time series to measure the similarity of time series reasonably. To overcome the defects, a composite metric method of time series similarity measurement based on SAX was proposed. The method synthesized both point distance measurement and pattern distance measurement. First, key points were used to further subdivide the Piecewise Aggregate Approximation (PAA) segments into several sub-segments, and then a triple including the information about the two kinds of distance measurement was used to represent each sub-segment. Finally a composite metric formula was used to measure the similarity between two time series. The calculation results can reflect the difference between two time series more effectively. The experimental results show that the proposed method is only 0.96% lower than KP_SAX algorithm in time efficiency. However, it is superior to the KP_SAX algorithm and the traditional SAX algorithm in differentiating between two time series.
Keywords:time series                                                                                                                          Symbolic Aggregate approximation (SAX)                                                                                                                          similarity                                                                                                                          pattern distance                                                                                                                          composite metric
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号