基于中文维基百科链接结构与分类体系的语义相关度计算 Computing Semantic Relatedness Using Chinese Wikipedia Links and Taxonomy期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于中文维基百科链接结构与分类体系的语义相关度计算

引用本文：	汪祥,贾焰,周斌,丁兆云,梁政.基于中文维基百科链接结构与分类体系的语义相关度计算[J].小型微型计算机系统,2011,32(11).

作者姓名：	汪祥贾焰周斌丁兆云梁政

作者单位：	国防科学技术大学计算机学院,长沙,410073

基金项目：	国家八六三高技术研究发展计划项目(2011AA010702,2010AA012505)资助; 国家自然科学基金项目(60933005,60873204)资助

摘要：	自然语言词汇的语义相关度的计算需要获取大量的背景知识,而维基百科是当前规模最大的百科全书,其不仅是一个规模巨大的语料库,而且还是一个包含了大量人类背景知识和语义关系的知识库,研究表明,其是进行语义计算的理想资源,本文提出了一种将维基百科的链接结构和分类体系相结合计算中文词汇语义相关度的算法,算法只利用了维基百科的链接结构和分类体系,无需进行复杂的文本处理,计算所需的开销较小.在多个人工评测的数据集上的实验结果显示,获得了比单独使用链接结构或分类体系的算法更好的效果,在最好的情况下,Spearman相关系数提高了30.96％.
关键词：	语义相关度语义相关性语义相似性维基百科
Computing Semantic Relatedness Using Chinese Wikipedia Links and Taxonomy

WANG Xiang , JIA Yan , ZHOU Bin , DING Zhao-yun , LIANG Zheng.Computing Semantic Relatedness Using Chinese Wikipedia Links and Taxonomy[J].Mini-micro Systems,2011,32(11).

Authors:	WANG Xiang JIA Yan ZHOU Bin DING Zhao-yun LIANG Zheng

Affiliation:	WANG Xiang,JIA Yan,ZHOU Bin,DING Zhao-yun,LIANG Zheng(School of Computer Science,National University of Defense Technology,Changsha 410073,China)

Abstract:	Any attempt to compute semantics relatedness of natural language words needs a lot of background knowledge.Studies have shown that Wikipedia,which is the largest encyclopedia and could be used not only as a corpus but also a knowledge base with rich semantic information,is the ideal resource for semantic computation.In this paper,a new algorithm based on Wikipedia links and taxonomy is proposed to compute semantic relatedness of words.Since the algorithm uses only Wikipedia link structure and taxonomy,there...

Keywords:	semantic relatedness semantic relevance semantic similarity wikipedia
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏