首页 | 本学科首页   官方微博 | 高级检索  
     

REPS:一种高效的容错并行概率流Skyline查询方法
引用本文:张卫华,李小勇,马俊,余杰.REPS:一种高效的容错并行概率流Skyline查询方法[J].计算机科学,2015,42(8):225-230, 264.
作者姓名:张卫华  李小勇  马俊  余杰
作者单位:国防科技大学计算机学院 长沙410073,国防科技大学计算机学院 长沙410073,国防科技大学计算机学院 长沙410073,国防科技大学计算机学院 长沙410073
基金项目:本文受国家自然科学基金项目(61303191,5),国家重点基础研究发展规划(973)项目(2011CB302601)资助
摘    要:概率数据流的并行Skyline查询作为当前大数据分析的一个重要方面,在诸多实际应用中发挥着重要作用。针对并行概率流Skyline查询过程中因发生故障而导致查询结果不准确和查询中断等问题,提出了一种基于复制的容错并行Skyline查询方法REPS。该方法选择参与并行处理的计算节点作为副本节点,并采用层次-循环式数据副本放置策略,选择优先级高的副本恢复数据来保证数据恢复的高效性;同时将故障检测、丢失数据恢复和查询过程恢复贯穿于整个查询更新过程中,以减少容错处理的额外通信和计算开销,并实现快速的容错并行查询。实验结果表明,REPS方法不仅在无故障发生和单个节点失效时具有较高的查询处理效率,而且对于多节点失效情形,仍然能够保持较高的查询处理速率且满足查询需求。

关 键 词:概率Skyline  容错查询  数据流  并行查询  大数据

REPS:An Efficient Fault-tolerant Approach for Parallel Skyline Queries over Probabilistic Data Streams
ZHANG Wei-hu,LI Xiao-yong,MA Jun and YU Jie.REPS:An Efficient Fault-tolerant Approach for Parallel Skyline Queries over Probabilistic Data Streams[J].Computer Science,2015,42(8):225-230, 264.
Authors:ZHANG Wei-hu  LI Xiao-yong  MA Jun and YU Jie
Affiliation:College of Computer,National University of Defense Technology,Changsha 410073,China,College of Computer,National University of Defense Technology,Changsha 410073,China,College of Computer,National University of Defense Technology,Changsha 410073,China and College of Computer,National University of Defense Technology,Changsha 410073,China
Abstract:The parallel Skyline query over probabilistic data streams,as an important aspect of big data analysis,plays an important role in various applications.To deal with the problem that the query results may be incorrect and interrupted,due to various faults occurred in the process of parallel Skyline queries over probabilistic streams,a replication-based fault-tolerant distributed parallel Skyline query scheme named REPS was proposed.Specifically,the compute nodes are also regarded as the replication nodes,and a layer-alternation strategy for replica placement is proposed in REPS.Thus,the lost data can be recovered efficiently by selecting the replicas with high priority.Moreover,the processes of fault detection,data recovery and query recovery are throughout the whole query updating process,in order to reduce the communication and computation overhead and achieve rapid fault-tolerant parallel query processing.Extensive experimental results demonstrate that REPS method not only has high efficiency when no failure occurs or a single node fails,but also can maintain a high processing rate and meet the query requirement even when multiple failures occur.
Keywords:Probabilistic Skyline  Fault-tolerant queries  Data streams  Parallel queries  Big data
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号