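Sentence similarity measurement based on the classification of conveyed information
Cite this article: LI Lin, ZHOU Yi-min. Sentence similarity measurement based on the classification of conveyed information[J]. Computer Engineering and Applications, 2009, 45(31): 15-17.
Authors: LI Lin, ZHOU Yi-min
Affiliation: School of Computer Science and Engineering, Beihang University, Beijing 100191, China
Funding: National Basic Research Program of China (973 Program)
Abstract: This paper proposes a method for computing the similarity between English sentences. The method is based on the information a sentence conveys: the objects it describes, the properties of those objects, and their actions. The two sentences to be compared are first chunked, and these three kinds of information are extracted from the chunks. A semantic vector method is then used to compute the similarity of the two sentences in each of the three aspects separately. Finally, the three similarities are combined into an overall sentence similarity, with the optimal combination parameters obtained by training. Experiments show that, compared with existing sentence similarity methods, the proposed method follows the way people judge sentence similarity more closely and achieves higher accuracy.

Keywords: sentence similarity; word semantic similarity; chunking; semantic vector
Received: 2009-8-20
Revised: 2009-9-11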

Sentence similarity measurement based on information category it contains
LI Lin, ZHOU Yi-min. Sentence similarity measurement based on information category it contains[J]. Computer Engineering and Applications, 2009, 45(31): 15-17.
Authors: LI Lin, ZHOU Yi-min
Affiliation: School of Computer Science and Engineering, Beihang University, Beijing 100191, China
Abstract: A method is proposed for measuring the similarity between English sentences. It is based on the information a sentence delivers: objects, properties, and actions. The two sentences being compared are chunked, and this information is extracted from the chunks. The similarities between the objects, properties, and actions of the two sentences are then calculated with a semantic vector method. Finally, the overall sentence similarity is defined as a combination of these three similarities, with the combination parameters obtained by training. Experiments show that the proposed method brings sentence similarity comparison closer to the way people understand the meanings of sentences and achieves higher accuracy than existing methods.
Keywords: sentence similarity; word semantic similarity; chunking; semantic vector
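
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and everything beyond the abstract is an assumption made for illustration: chunking and extraction are taken as already done, so each sentence arrives as a dict of word lists; `TOY_SIM` and `word_sim` are a hypothetical stand-in for the paper's word semantic similarity measure; the per-aspect similarity is the cosine of two semantic vectors built over the joint word set; the combination is a weighted sum whose weights sum to 1; and "training" is a plain grid search against human scores assumed to lie in [0, 1].

```python
from math import sqrt

ASPECTS = ("objects", "properties", "actions")

# Hypothetical toy word-similarity table (assumption). The abstract does not
# specify the lexical measure; a real system would plug one in here.
TOY_SIM = {
    frozenset(("car", "automobile")): 0.9,
    frozenset(("fast", "quick")): 0.8,
}


def word_sim(a, b):
    """Similarity of two words in [0, 1]; identical words score 1.0."""
    if a == b:
        return 1.0
    return TOY_SIM.get(frozenset((a, b)), 0.0)


def semantic_vector(words, joint):
    """For each word of the joint set, the best match found in `words`."""
    return [max((word_sim(t, w) for w in words), default=0.0) for t in joint]


def aspect_sim(words1, words2):
    """Cosine similarity of the two semantic vectors for one aspect."""
    joint = sorted(set(words1) | set(words2))
    v1 = semantic_vector(words1, joint)
    v2 = semantic_vector(words2, joint)
    dot = sum(x * y for x, y in zip(v1, v2))
    n1 = sqrt(sum(x * x for x in v1))
    n2 = sqrt(sum(y * y for y in v2))
    # An aspect that is empty in either sentence contributes 0 (assumption).
    return dot / (n1 * n2) if n1 and n2 else 0.0


def sentence_sim(info1, info2, weights):
    """Weighted sum of the object, property and action similarities."""
    return sum(w * aspect_sim(info1[k], info2[k])
               for k, w in zip(ASPECTS, weights))


def fit_weights(pairs, human_scores, steps=10):
    """Grid search for weights (summing to 1) with least squared error."""
    best_err, best_w = None, None
    for i in range(steps + 1):
        for j in range(steps + 1 - i):
            w = (i / steps, j / steps, (steps - i - j) / steps)
            err = sum((sentence_sim(a, b, w) - s) ** 2
                      for (a, b), s in zip(pairs, human_scores))
            if best_err is None or err < best_err:
                best_err, best_w = err, w
    return best_w


s1 = {"objects": ["car"], "properties": ["fast"], "actions": ["drive"]}
s2 = {"objects": ["automobile"], "properties": ["quick"], "actions": ["drive"]}
score = sentence_sim(s1, s2, (0.5, 0.25, 0.25))
```

Keeping the three aspects separate until the final weighted sum is what lets the trained weights express how much each kind of information matters to human judges; the grid search here is only the simplest way to fit them.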
This article is indexed in databases including VIP and Wanfang Data.