基于网页文本依存特征的人名消歧 Name Disambiguation Based on Dependency Feature in Web Page Text期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于网页文本依存特征的人名消歧

引用本文：	杨欣欣,李培峰,朱巧明. 基于网页文本依存特征的人名消歧[J]. 计算机工程, 2012, 38(19): 133-136

作者姓名：	杨欣欣李培峰朱巧明

作者单位：	苏州大学计算机科学与技术学院,江苏苏州215006;江苏省计算机信息处理技术重点实验室,江苏苏州215006

基金项目：	国家自然科学基金资助项目（60970056,61070123,61003155）; 江苏省自然科学基金资助项目（BK2008160）; 高等学校博士学科点专项基金资助项目（20093201110006）; 模式识别国家重点实验室开放课题基金资助项目

摘要：	研究互联网中的人名消歧问题.抽取与网页文本中人名关键字实体相关的依存特征及命名实体等辅助特征,利用二层聚类算法,根据依存特征将可信度高的文档聚类,使用辅助特征将剩余文档加到现有聚类结果中,由此实现人名消歧.实验结果证明,该方法消歧效果优于其他人名消歧方法.
关键词：	人名歧义依存特征人名消歧命名实体聚类
收稿时间：	2011-12-30
Name Disambiguation Based on Dependency Feature in Web Page Text

YANG Xin-xin , LI Pei-feng , ZHU Qiao-ming. Name Disambiguation Based on Dependency Feature in Web Page Text[J]. Computer Engineering, 2012, 38(19): 133-136

Authors:	YANG Xin-xin LI Pei-feng ZHU Qiao-ming

Affiliation:	1.School of Computer Science ＆ Technology,Soochow University,Suzhou 215006,China;2.Jiangsu Provincial Key Lab of Computer Information Processing Technology,Suzhou 215006,China)

Abstract:	This paper works on the common ambiguity problem on Internet.The following is the proposed method： extract the dependency features which are related to the key name entities in the Web page text,while extract supporting features such as named entity extraction;cluster these features by a two-step cluster algorithm which clusters the documents with high reliability in the first stage and then merges the other documents to the existing clustering results.Experimental result shows that the proposed disambiguation system has better performance than common methods.

Keywords:	name ambiguity dependency feature name disambiguation named entity clustering
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《计算机工程》浏览原始摘要信息
	点击此处可从《计算机工程》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏