首页 | 本学科首页   官方微博 | 高级检索  
     

集群数据库系统的日志复制和故障恢复
引用本文:王嘉豪,蔡鹏,钱卫宁,周傲英.集群数据库系统的日志复制和故障恢复[J].软件学报,2017,28(3):476-489.
作者姓名:王嘉豪  蔡鹏  钱卫宁  周傲英
作者单位:华东师范大学数据科学与工程学院, 上海 200062,华东师范大学数据科学与工程学院, 上海 200062;桂林电子科技大学广西可信软件重点实验室, 广西桂林 541004,华东师范大学数据科学与工程学院, 上海 200062,华东师范大学数据科学与工程学院, 上海 200062
基金项目:国家863项目课题(2015AA015307);国家自然科学基金(61332006,61432006,61672232);广西可信软件重点实验室研究课题(kx201602)
摘    要:互联网、社交、购物、金融等各类应用直接面临海量用户的高并发访问,传统的单点数据库逐渐成为这些应用系统的瓶颈,而众多互联网应用能够良好运行的主要原因是使用了基于集群环境的数据管理系统作支撑。与传统数据库系统相比,基于集群环境的数据库系统具有更好的扩展性和可用性,而日志复制是保证这些特性的核心组件。传统的主备架构的日志复制在异常情况下对未决事务日志处理不佳,导致数据副本之间存在不一致的风险,另外,分布式系统领域的一致性算法缺乏对事务一致性的处理,而且在选主时存在活锁、多主和频繁选主的问题,无法直接适用于事务日志复制。本文提出了一种集群环境下的事务日志复制策略和恢复机制,能够有效处理未提交日志,提供了强弱两种读一致性,并且提出了一种轻量级的选主算法,可以避免以上的选主问题。本文在开源OceanBase分布式数据库系统中实现了上述机制,并使用基准测试工具对系统进行测试,通过一系列实验验证了系统的扩展性和可用性。

关 键 词:日志复制  故障恢复  一致性  可用性  数据库系统
收稿时间:2016/7/30 0:00:00
修稿时间:2016/9/14 0:00:00

Log Replication and Recovery in Cluster-Based Database System
WANG Jia-Hao,CAI Peng,QIAN Wei-Ning and ZHOU Ao-Ying.Log Replication and Recovery in Cluster-Based Database System[J].Journal of Software,2017,28(3):476-489.
Authors:WANG Jia-Hao  CAI Peng  QIAN Wei-Ning and ZHOU Ao-Ying
Affiliation:School of Computer Science and Software Engineering, East China Normal University, Shanghai 200062, China,School of Computer Science and Software Engineering, East China Normal University, Shanghai 200062, China;GuangxiKeyLaboratoryofTrustedSoftware, Guilin University of Electronic Technology, Guilin 541004, China,School of Computer Science and Software Engineering, East China Normal University, Shanghai 200062, China and School of Computer Science and Software Engineering, East China Normal University, Shanghai 200062, China
Abstract:Many applications such as social networking,online shopping and online finance may receive highly concurrent data access from massive Internet users.In this scenario,traditional single node database systems gradually become the bottleneck of the system,and the main reason for many successful Internet applications is the use of cluster-based data management systems.Compared with traditional database systems,cluster-based distributed database systems have better scalability and availability,and log replication is one of the core components to build these features.Master-Slave based log replication cannot handle the uncertain logs while failure occurs,resulting in the risk of inconsistency among different copies.Consensus algorithms cannot be directly applied to the database system due to the lack of transaction consistency model,and there are also some troubles in leader election like live lock,as well as double master and continually election problem.In this paper,we introduce a log replication strategy and corresponding recovery technique for cluster environments,which can effectively process the uncertain logs and provide two read consistency options,i.e.strong and weak consistency,and proposed a lightweight master election algorithm to avoid the above master election issues.Finally,we implement our algorithms in the OceanBase distributed database system,and use benchmark tool to test it,experiments show that the algorithm can improve the scalability and availability efficiently.
Keywords:log replication  recovery  consistency  availability  database system
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号