首页 | 本学科首页   官方微博 | 高级检索  
     

基于短序列分组和拼接策略的子序列快速查询算法
引用本文:范纯龙,王靖云,滕一平,丁国辉.基于短序列分组和拼接策略的子序列快速查询算法[J].计算机应用研究,2020,37(6):1702-1706,1749.
作者姓名:范纯龙  王靖云  滕一平  丁国辉
作者单位:沈阳航空航天大学 计算机学院 辽宁省大规模分布式系统实验室,沈阳 110136;沈阳航空航天大学 计算机学院 辽宁省大规模分布式系统实验室,沈阳 110136;沈阳航空航天大学 计算机学院 辽宁省大规模分布式系统实验室,沈阳 110136;沈阳航空航天大学 计算机学院 辽宁省大规模分布式系统实验室,沈阳 110136
摘    要:子序列查询技术在金融、商业、医疗等领域均有重要应用,但因DTW(dynamic time warping)等相似性比对算法的时间复杂度较高,子序列长度对检索时间影响很大,限制了数据集上长子序列检索的效率。针对这一问题提出一种子序列快速查询算法。首先对数据集中特定长度下所有子序列进行分组并标记出代表性子序列;然后在查询时将查询序列切分成定长的小段序列,并用DTW算法确定与小段序列相似的代表子序列候选集;最后对候选集进行序列拼接,获取到查询结果序列。实验表明新算法效率较典型算法提高约10倍。

关 键 词:序列数据查询  动态时间规整  子序列  序列分组
收稿时间:2018/11/12 0:00:00
修稿时间:2020/4/16 0:00:00

Fast subsequence query algorithm based on short sequence grouping and assembling strategy
Fan Chunlong,Wang Jingyun,Teng Yiping and Ding Guohui.Fast subsequence query algorithm based on short sequence grouping and assembling strategy[J].Application Research of Computers,2020,37(6):1702-1706,1749.
Authors:Fan Chunlong  Wang Jingyun  Teng Yiping and Ding Guohui
Affiliation:Large-scale distributed system laboratory in Liaoning Province,Shenyang Aerospace University,,,
Abstract:Subsequence query technique has important applications in several fields such as finance, commerce and healthcare. However, due to the high time complexity of similarity comparison algorithms such as dynamic time warping(DTW), the length of subsequence has a great influence on the retrieval time, which limits the efficiency of long subsequence retrieval on data sets. This paper presented a fast subsequence query algorithm based on short sequence grouping and assembling strategy. This algorithm first separated all subsequences which were in a given length in the data set into groups and marked out the representative subsequence for each group. Then it cut the query sequence into small query sequences with a fixed length during the query processing, and usied DTW algorithm to compute the candidate sets of subsequences. The representative subsequence had a high similarity with the small query sequences. Finally, it assembled all the sequences in the candidate sets to derive the query result sequences. Experiments show that the efficiency of the new algorithm is about 10 times higher than that of the typical algorithm.
Keywords:sequential data query  dynamic time warping  subsequence  sequence grouping
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号