基于QPSO-LSSVM的数据库相似重复记录检测算法 |
| |
引用本文: | 梁雪 任剑锋 景丽. 基于QPSO-LSSVM的数据库相似重复记录检测算法[J]. 计算机科学, 2012, 39(11): 157-159 |
| |
作者姓名: | 梁雪 任剑锋 景丽 |
| |
作者单位: | (河南财经政法大学计算机与信息工程学院 郑州 450002) |
| |
摘 要: | 针对大规模数据库的相似重复记录的检测问题,提出了一种量子群优化算法(QPSO)与最小二乘支持向量机(LSSVM)相结合的相似重复记录检测方法(QPSC}LSSVM)。首先计算记录字段的相似度值;然后利用QPSO对LSSVM参数进行优化,构建相似重复记录检测模型;最后通过具体数据集进行仿真测试实验。仿真结果表明,QPSCLLSSVM不仅提高了重复记录检测准确率,而且提高了检测效率,是一种有效的相似重复记录检测算法。
|
关 键 词: | 量子粒子群优化算法,最小二乘支持向量机,相似重复记录,检测 |
Approximate Duplicate Record Detection Algorithm Based on PSO and LSSVM |
| |
Abstract: | Approximately duplicate record detection algorithm was proposed based on quantum swarm algorithm (QP-SO) and least squares support vector machine (LSSVM) to solve the large-scale database approximation duplicate re-cord detection problem. Firstly, the record field similarity values are calculated, and then the LSSVM parameters are op-timized,by QPSO to construction the approximately duplicate records detection model, finally simulation experimentsarc carried out on the data set. hhe simulation results show that QPSC}I_SSVM not only improves the accuracy of theduplicate record detection but also improves the detection efficiency,and it is an effective approximate duplicate record,Detection algorithm. |
| |
Keywords: | Quantum particle swarm optimization Least square support vector machines Approximately duplicate record Detection |
|
| 点击此处可从《计算机科学》浏览原始摘要信息 |
|
点击此处可从《计算机科学》下载全文 |