首页 | 本学科首页   官方微博 | 高级检索  
     

一种新的重名消解算法在保险领域中的应用研究*
引用本文:姚宇峰. 一种新的重名消解算法在保险领域中的应用研究*[J]. 计算机应用研究, 2012, 29(3): 994-997
作者姓名:姚宇峰
作者单位:常熟理工学院计算机科学与工程学院,江苏苏州,215500
基金项目:常熟理工学院青年教师科研启动基金资助项目(QZ0912)
摘    要:研究客户重名消解问题。针对以往重名消解方法如文本聚类的方法需考虑大量无用词汇并需要人工设定阈值以及类别数量,而基于信息抽取的人物相关属性相似度方法对于人物信息的抽取具有依赖性,提出了一种改进的重名消解算法。该算法首先对具有相同标志的客户进行属性匹配,合并匹配成功的标志;然后进行链接分析,对客户合作网的结构进行分析,将具有相同标志并与同一个代理人实体合作的客户归为一个客户实体,并把具有相同合作对的信息加以分析合并;最后通过原子团簇分析法进行聚类分析。仿真实验结果表明,所提改进算法对中文字符串的匹配处理进行了优化,执行效率高,适合于以大量数据为特征的保险领域的重名消解。

关 键 词:重名消解  数据挖掘  保险领域  实体

Application of data mining in customer name disambiguation of insurance field
YAO Yu-feng. Application of data mining in customer name disambiguation of insurance field[J]. Application Research of Computers, 2012, 29(3): 994-997
Authors:YAO Yu-feng
Affiliation:YAO Yu-feng(College of Computer Science & Engineering,Changshu Institute of Technology,Suzhou Jiangsu 215500,China)
Abstract:This paper researched the solution to customer name disambiguation of the field of insurance.Aiming at the former name disambiguation methods such as text clustering method need to be considered in a lot of useless words,manually set the threshold,and gave he numbers of type,and the method of character-related properties of similarity based on information extraction depends on the character information,proposed a new name disambiguation method.Firstly,applied the same attribute matching,merging the identity of a successful match and then used link analysis,analyzed structural analysis of customers network,the entities had the same identity and classified cooperate with the same policy to a customer entity,merged the same cooperating information.Finally,analyzed cluster analysis cluster.Experiment results show that the proposed method can optimize the chinese text string matching process and have the high implementation efficiency,especially suitable for large amounts of data to the insurance sector is characterized by digestion of the same name.
Keywords:name disambiguation  data mining  insurance field  entity
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号