首页 | 本学科首页   官方微博 | 高级检索  
     

拆分型Bloom Filter
引用本文:肖明忠,代亚非,李晓明.拆分型Bloom Filter[J].电子学报,2004,32(2):241-245.
作者姓名:肖明忠  代亚非  李晓明
作者单位:北京大学计算机科学技术系,北京 100871
基金项目:国家重点基础研究发展计划(973计划),国家高技术研究发展计划(863计划)
摘    要:Bloom Filter对数据集合采用一个位串表示并能有效支持集合元素的哈希查找操作.在对Bloom Filter及其改进型进行综述性分析研究并探讨它们的实用性之后,本文提出了使用位矩阵表示数据集合的拆分型Bloom Filter并对其作了分析比较研究,以允许集合元素不断增加的分布式系统应用模型为例,证明它能缓解增长问题并能有效节省全局的集合表示空间需求量.

关 键 词:Bloom  Filters  哈希查找  分布式系统  
文章编号:0372-2112(2004)02-0241-05
收稿时间:2003-02-17

Split Bloom Filter
XIAO Ming zhong,DAI Ya fei,LI Xiao ming.Split Bloom Filter[J].Acta Electronica Sinica,2004,32(2):241-245.
Authors:XIAO Ming zhong  DAI Ya fei  LI Xiao ming
Affiliation:Department of Computer Science & Technology,Peking University,Beijing 100871,China
Abstract:A Bloom Filter is a simple space-efficient randomized data structure for representing a set in order to support membership queries,which uses an m-bit array to represent a data set and queries by hashing.The representation is the payoff for allowing a small rate of false positives in membership queries;that is,queries might wrongly regard an element as member of the set.However,for many applications,especially large-scale data set systems,the space savings and the locate time constantly outweigh this drawback when the probability of an error is sufficiently low and can suffer from by the application.The paper firstly surveys Bloom Filter and its variants in detail,and gives mathematical analysis behind them about space/time/error rate tradeoffs in order to explain their practicability.Then,we present a new kind of Bloom Filter-Split Bloom Filter,which uses a s×m-bit matrix to represent a set,and give analysis in detail as the formers.In distributed systems,each network node owns a data set,which is reasonable that some nodes are large number of data while the large number of nodes are a bit of data,if all nodes uses same parameters of algorithm,then it will give rise to a memory space wholly wasted.In addition,if the number of the elements of a node date set increases continually,the error rate will increasingly make the representation nonsensically.We prove that the Split Bloom Filter can efficiently solve or weaken the two problems.
Keywords:Bloom Filters
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号