首页 | 本学科首页   官方微博 | 高级检索  
     

基于数据库日志的变化数据捕获研究
引用本文:邹先霞,贾维嘉,潘久辉.基于数据库日志的变化数据捕获研究[J].小型微型计算机系统,2012,33(3):531-536.
作者姓名:邹先霞  贾维嘉  潘久辉
作者单位:1. 中南大学信息科学与工程学院,长沙410083;暨南大学计算机系,广州510632
2. 中南大学信息科学与工程学院,长沙410083;香港城市大学计算机系,香港九龙
3. 暨南大学计算机系,广州,510632
基金项目:中央高校基本科研业务费专项资金资助;广东省科技攻关项目(2010B010600026)资助;广东省产学研项目(2007B090400095)资助;教育部科学技术重点项目(210257)资助
摘    要:变化数据捕获方法是数据集成基础设施的战略组成部分,不断推动ETL、EAI等技术的发展.许多数据库厂商都提供了自己的CDC(Change data capture)产品,但只限于针对本身的数据库系统,价格也比较昂贵.虽然通过扫描数据库日志文件可以捕获变化数据,但大多数数据库系统都不提供日志文件的内部格式而只是提供日志访问的程序接口,如Oracle,SQL Server和DB2等.这些提供的接口有的访问活动日志,有的访问稳定日志,有的访问归档日志,因此很难保证读取日志文件的可靠性.现有的研究主要是如何利用程序应用接口读取日志文件,忽略了对可靠性的分析.本文针对读取不同类型的日志文件的可靠性条件进行了分析,提出了可靠读取规则及读取算法;并提出了从日志文件中有效抽取变化数据算法,实验证明了可靠性分析模型.

关 键 词:变化数据捕获  数据库日志  日志扫描  可靠性

Research of Log-based Change Data Capture
ZOU Xian-xia , JIA Wei-jia , PAN Jiu-hui.Research of Log-based Change Data Capture[J].Mini-micro Systems,2012,33(3):531-536.
Authors:ZOU Xian-xia  JIA Wei-jia  PAN Jiu-hui
Affiliation:1(School of Information Science and Engineering,Central South University,Changsha 410083,China) 2(Department of Computer Science,City University of Hong Kong,Kow loon,Hong Kong,SAR China) 3(Department of Computer Science,Jinan University,Guangzhou 510632,China)
Abstract:CDC(Change data capture) is a strategic component in the data integration infrastructure and compliments other technologies such ETL(Extract-Transform-Load) and EAI(Enterprise Application Integration).Many companies have home-grown CDC solutions that are typically limited in scope and are costly and difficult to maintain.By scanning the transactions log files one can capture the changes,but most database systems do not provide internal format of their log files.However few databases like Oracle,SQL Server and DB2 provide a programmed library to interface with their transaction log files.Some of these API(Application Program Interface) can read active redo logs,some read redo logs,and some read archive logs.In some cases it is difficult to ensure the reliability of reading log files.Existing researches are how to read log records by using API but ignore the analysis of reliability.We considered the reliability conditions of reading different log files and propose the algorithm of reading log records.And the algorithm is proposed to efficiently extract change data.The experiment shows the analysis model.
Keywords:change data capture  database log  log scanning  reliability
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号