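Text Similarity Computing Based on Attribute Theory
Cite this article: PAN Qian-Hong (潘谦红), WANG Ju (王炬), SHI Zhong-zhi (史忠植). Text Similarity Computing Based on Attribute Theory[J]. 计算机学报 (Chinese Journal of Computers), 1999, 22(6): 651-655
Authors: PAN Qian-Hong (潘谦红), WANG Ju (王炬), SHI Zhong-zhi (史忠植)
Affiliation: Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080
Funding: National 863 High-Tech Research and Development Program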
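Abstract (translated from the Chinese): Taking attribute theory as its theoretical basis, this paper analyzes the relationship between text attributes and the attribute barycenter coordinate model and establishes a text attribute barycenter coordinate model. Text vectors and query vectors are represented in the attribute coordinate system, a matching criterion between the vectors is determined, and the matching distance is computed, which yields a formula for the matching similarity between a text and a query. The model effectively describes the relationship between text attributes and query attributes.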

Keywords: information retrieval; artificial intelligence; attribute theory
Manuscript revised: 1997-12-12

TEXT SIMILARITY COMPUTING BASED ON ATTRIBUTE THEORY
PAN Qian-Hong, WANG Ju, SHI Zhong-zhi. TEXT SIMILARITY COMPUTING BASED ON ATTRIBUTE THEORY[J]. Chinese Journal of Computers, 1999, 22(6): 651-655
Authors: PAN Qian-Hong, WANG Ju, SHI Zhong-zhi
Abstract: In information retrieval (IR), users typically first submit keywords describing what they want to find. The system converts the keywords into an internal format and matches them against the document database, and the matching documents are returned as the results relevant to the users' interests. Several IR models exist, such as the reverse document model, the vector space model, the generalized vector space model, and the latent semantic model. Based on attribute theory, this paper analyzes the relationship between textual attributes and the attribute barycenter coordinate model and establishes the text attribute barycenter coordinate model. Within this coordinate system, a text vector and a query vector can both be represented. After a matching criterion is determined and the distance between the vectors is computed, a formula for the similarity between texts and queries is derived.
Keywords: information retrieval; artificial intelligence; attribute theory
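
The abstract does not reproduce the paper's similarity formula, so the following is only an illustrative sketch of the general pipeline it describes: represent a text and a query in a common attribute coordinate system, compute a distance between them, and turn that distance into a similarity. Everything here is an assumption made for illustration and is not the authors' method: the function names `to_barycentric` and `similarity`, the use of normalized non-negative term weights as barycentric coordinates, and the choice of Euclidean distance.

```python
from math import sqrt


def to_barycentric(weights):
    """Scale attribute weights so they sum to 1 (a point on the simplex).

    Assumption: weights are non-negative, e.g. raw term frequencies.
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("at least one attribute weight must be positive")
    return {attr: w / total for attr, w in weights.items()}


def similarity(text_weights, query_weights):
    """Hypothetical similarity in [0, 1]: 1 minus a normalized distance.

    This is NOT the formula from the paper; it only illustrates the
    'distance between vectors -> similarity' step named in the abstract.
    """
    t = to_barycentric(text_weights)
    q = to_barycentric(query_weights)
    attrs = set(t) | set(q)
    dist = sqrt(sum((t.get(a, 0.0) - q.get(a, 0.0)) ** 2 for a in attrs))
    # Two points on the simplex are at most sqrt(2) apart
    # (the distance between two distinct vertices).
    return 1.0 - dist / sqrt(2)


if __name__ == "__main__":
    text = {"retrieval": 3, "attribute": 1}
    query = {"retrieval": 1}
    print(similarity(text, query))
```

Under these assumptions, a text and a query with identical attribute proportions score 1, and a text and a query that share no attributes score 0.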
This article is indexed by CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.