首页 | 本学科首页   官方微博 | 高级检索  
     

基于HBase的交通流数据实时存储系统
引用本文:陆婷,房俊,乔彦克.基于HBase的交通流数据实时存储系统[J].计算机应用,2015,35(1):103-107.
作者姓名:陆婷  房俊  乔彦克
作者单位:北方工业大学 云计算研究中心, 北京100041
基金项目:北京市自然科学基金重点项目(4131001);北京市属高等学校创新团队建设与教师职业发展规划项目(IDHT20130502);北大方正集团有限公司数字出版技术国家重点实验室开放课题;北方工业大学科研启动基金资助项目
摘    要:交通流数据具有多来源、高速率、体量大等特征,传统数据存储方法和系统暴露出扩展性弱和存储实时性低等问题.针对上述问题,设计并实现了一套基于HBase交通流数据实时存储系统.该系统采用分布式存储架构,通过前端的预处理操作对数据进行规范化整理,利用多源缓冲区结构对不同类型的流数据进行队列划分,并结合一致性哈希算法、多线程技术、行键优化设计等策略将数据并行存储到HBase集群服务器中.实验结果表明:该系统与基于Oracle的实时存储系统相比,其存储性能提升了3~5倍;与原生的HBase方法相比,其存储性能提升了2~3倍,并且具有良好的扩展性能.

关 键 词:流数据  多源缓冲区  数据切分  一致性哈希算法  实时存储  HBase
收稿时间:2014-07-18
修稿时间:2014-09-09

HBase-based real-time storage system for traffic stream data
LU Ting , FANG Jun , QIAO Yanke.HBase-based real-time storage system for traffic stream data[J].journal of Computer Applications,2015,35(1):103-107.
Authors:LU Ting  FANG Jun  QIAO Yanke
Affiliation:Research Center for Cloud Computing, North China University of Technology, Beijing 100041, China
Abstract:Traffic stream data has characteristics of multi-source, high speed and large volume, etc. When dealing with these data, the traditional methods and systems of data storage have exposed the problems of weak scalability and low real-time storage. To address these problems, this work designed and implemented a HBase-based real-time storage system for traffic streaming data. The system adopted the distributed storage architecture, standardized data through front-end preprocessing, divided different kinds of streaming data into different queues by using multi-source cache structure, and combined the consistent Hash algorithm, multi-thread and row-key optimization strategy to write data into HBase cluster in parallel. The experimental results demonstrate that, compared with the real-time storage system based on Oracle, the storage performance of the system has 3-5 times increment. When compared with the original HBase, it has 2-3 times increment of storage performance and it also has good scalability.
Keywords:streaming data  multi-source buffer  data sharding  consistent Hash algorithm  real-time storage  HBase
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号