首页 | 本学科首页   官方微博 | 高级检索  
     

基于多级相关图的大规模词典完美哈希函数构造算法
引用本文:李海涛.基于多级相关图的大规模词典完美哈希函数构造算法[J].计算机工程与科学,2010,32(12):128.
作者姓名:李海涛
摘    要:在哈希函数中,如果两个不同的单词被映射到同一个槽,那么我们称为冲突。当哈希函数存在冲突时,将降低词典查找的速度。由于完美哈希函数完全避免了冲突,因此在许多对查找性能要求较高的应用中广泛使用。本文就此提出了一种基于多级相关图的大规模词典完美哈希函数的构造算法。词典单词的每个字符(首字母除外)都用两个平滑函数平滑为两个字符,构建平滑后词典对应的多级相关图,多级相关图的结点度都比较小,而且分布比较均匀,因此更容易生成完美哈希函数。实验表明:基于多级相关图的哈希函数构造算法适用于大规模词典,填充因子接近1,同时工作空间比已有算法都要小。

关 键 词:完美哈希函数  多极相关图  大规模词典  平滑

A Perfect Hash Function for Large-Scale Dictionaries Based on Multi-Stage Dependency Graphs
LI Hai-tao.A Perfect Hash Function for Large-Scale Dictionaries Based on Multi-Stage Dependency Graphs[J].Computer Engineering & Science,2010,32(12):128.
Authors:LI Hai-tao
Abstract:In the hash function,if there are two different words are mapped into the same slot,we call it a conflict.When the hash function appears in the conflict,it slows the dictionary's finding speed.Since the perfect hash function completely avoids the conflict,it is used popularly in the application of high function demands.So a perfect hash function based on multi-stage dependency graphs is presented in this paper.Each character(except for the first one) of a key is smoothed into two smoothed characters using two different smoothing functions.A multi-stage dependency graph is constructed.The degree of the vertices in the multi-stage dependency graph of the smoothed key-set is much more evenly distributed,so that a perfect hash function is easier to obtain.The perfect hash function is promising to give solution to large key-sets.The load factor remains still high even for huge key-sets over large character-sets.Experiments indicate that the load factor is competitive,and meanwhile its working space is less than that of the existing hash functions.
Keywords:perfect hash function  multi-stage dependency graph  lager-scale dictionary  smoothing techniques
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号