首页 | 本学科首页   官方微博 | 高级检索  
     

分片计数布隆过滤器及其在Hbase二级索引的应用
引用本文:黄璨,方旭昇,张朝泉.分片计数布隆过滤器及其在Hbase二级索引的应用[J].计算机系统应用,2016,25(3):119-123.
作者姓名:黄璨  方旭昇  张朝泉
作者单位:南京航空航天大学 经济与管理学院, 南京 211106,南京航空航天大学 经济与管理学院, 南京 211106,南京航空航天大学 经济与管理学院, 南京 211106
摘    要:针对Hadoop Database(Hbase)仅支持主索引结构,即通过主键和主键的range来检索数据的问题,提出利用Counting Bloom Filter的新变体建立二级索引来支持非主键数据的检索.分析了已有的Counting Bloom Filter(CBF)技术,针对CBF溢出概率高的问题,提出一种新的Split Counting Bloom Filter(SCBF)技术,SCBF将标准CBF分成多个相互独立的区域,由这多个区域共同存储元素的fingerprint.实验结果表明,与标准CBF相比,SCBF降低了溢出概率,充分提高了过滤器的性能,可以很好地用来建立Hbase二级索引.

关 键 词:Hbase  二级索引  非主键数据  计数布隆过滤器  分片计数布隆过滤器
收稿时间:2015/6/24 0:00:00
修稿时间:9/6/2015 12:00:00 AM

Split Counting Bloom Filter and its Application in Hbase Secondary Index
HUANG Can,FANG Xu-Sheng and ZHANG Chao-Quan.Split Counting Bloom Filter and its Application in Hbase Secondary Index[J].Computer Systems& Applications,2016,25(3):119-123.
Authors:HUANG Can  FANG Xu-Sheng and ZHANG Chao-Quan
Affiliation:School of Economics and Management, Nanjing University of Aeronautics & Astronautics, Nanjing 211106, China,School of Economics and Management, Nanjing University of Aeronautics & Astronautics, Nanjing 211106, China and School of Economics and Management, Nanjing University of Aeronautics & Astronautics, Nanjing 211106, China
Abstract:A new variant of Counting Bloom Filter was set up to build Hbase secondary index to support the retrieval of non-primary key data, which solved the problem that Hbase only supported the main index structure and retrieve data through the primary key and the primary key range. The new variant, Split Counting Bloom Filter(SCBF), was proposed according to the high overflow probability problem of Counting Bloom Filter(CBF) after analyzing existing CBF technology. SCBF divided standard CBF into multiple independent regions, which stored elements' fingerprint by all these areas. Comparing SCBF with CBF, the experimental result shows that, SCBF contributes to much lower overflow probability, which improves the performance of filter, and can be used to build the Hbase secondary index.
Keywords:Hbase  secondary index  non-primary key  data counting bloom filter  split counting bloom filter
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号