Similar literature
 20 similar documents found; search took 31 ms
1.
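Semantic similarity computation for concepts and documents   Total citations: 1, self-citations: 0, citations by others: 1
Ontologies are introduced as background knowledge for computing the similarity between concepts and between documents. A graph model represents the concepts in an ontology and the semantic relations among them, and is used to expand a concept or a document into a semantic fuzzy set; similarity is then computed between fuzzy sets. Document similarity is built on top of concept similarity. The concept similarity computation introduces a semantic similarity matrix and a fuzzy similarity method based on mutual information theory.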

2.
In this paper, we propose a novel approach to tag recommendation based on a topic ontology. The approach generates tag suggestions for blogs. We construct the topic ontology by enriching the set of categories in an existing small ontology, the Open Directory Project. To construct it, a set of topics and their associated semantic relationships is identified automatically from corpus-based external knowledge resources such as Wikipedia and WordNet. The construction has two stages: concept acquisition and semantic relation extraction. In the first stage, a topic-mapping algorithm acquires concepts from Wikipedia, and a semantic similarity clustering algorithm computes a semantic similarity measure to group similar concepts. In the second stage, a semantic relation extraction algorithm derives semantic relations between the extracted topics from the lexical patterns between synsets in WordNet. A software prototype implements the topic ontology construction process. The Jena API framework is used to organize the extracted semantic concepts and their relationships as a Web Ontology Language representation, and the Protégé tool is used to visualize the automatically constructed topic ontology. Using the constructed topic ontology, we generate and suggest the most suitable tags for a new resource. Combining the topic ontology with a spreading activation algorithm supports efficient recommendation of the most popular tags for a specific resource. The spreading activation algorithm assigns interest scores to the existing extracted blog content and tags. The weight of each tag is computed from the activation score, which is determined by the similarity between the topics in the constructed topic ontology and the content of the existing blogs. The tags with the highest activation scores are recommended to users. Finally, we evaluate our tag recommendation approach experimentally on a large set of real-world data sets. The experiments compare our topic ontology with spreading activation approach against the existing AutoTag mechanism, and we discuss the improvement in precision and recall of recommended tags on the Delicious and BibSonomy data sets. The experiments show that tag recommendation using the topic ontology results in folksonomy enrichment. We thus report the results of an experiment meant to improve the performance and quality of tag recommendation.
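The abstract above does not give the details of its spreading activation step, so the following is only a minimal sketch of the general technique, not the authors' algorithm. The graph layout, the `decay` factor, the number of `steps`, and the function names `spread_activation` and `recommend_tags` are all assumptions made for illustration.

```python
from collections import defaultdict


def spread_activation(graph, seeds, decay=0.5, steps=2):
    """Propagate activation from seed nodes along weighted edges.

    graph: dict mapping a node to a list of (neighbor, edge_weight) pairs.
    seeds: dict mapping a node to its initial activation.
    Both structures are hypothetical; the paper's data model is not given.
    """
    activation = defaultdict(float, seeds)
    frontier = dict(seeds)
    for _ in range(steps):
        next_frontier = defaultdict(float)
        for node, value in frontier.items():
            for neighbor, weight in graph.get(node, []):
                # Each hop attenuates the signal by the edge weight and decay.
                next_frontier[neighbor] += value * weight * decay
        for node, value in next_frontier.items():
            activation[node] += value
        frontier = next_frontier
    return dict(activation)


def recommend_tags(activation, tags, k=2):
    """Return the k candidate tags with the highest activation score."""
    return sorted(tags, key=lambda t: activation.get(t, 0.0), reverse=True)[:k]
```

In this sketch the seed nodes would be topics matched in a blog post, and candidate tags are ranked purely by accumulated activation; a real system would also need a stopping threshold and normalization.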

3.
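A method is proposed for computing the semantic similarity between words and ontology concepts. It uses edit distance and Wikipedia to assess similarity from both the syntactic and the semantic side. Guided by a domain ontology, the method is applied to semantic annotation to establish mappings between words and ontology concepts. A knowledge base is built up during annotation to improve the algorithm's performance, and experimental results indicate that the method is effective.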
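The syntactic side of such a measure can be illustrated with the standard Levenshtein edit distance. This is a sketch under the assumption that string similarity is normalized by the longer string's length; the abstract does not state the normalization it uses, and the Wikipedia-based semantic side is omitted here. The function names are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a rolling one-row table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def string_similarity(a: str, b: str) -> float:
    """Map edit distance to [0, 1]; 1.0 means identical strings."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest
```

For example, `edit_distance("kitten", "sitting")` is 3, so the normalized similarity is 1 - 3/7.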

4.
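Ontology mapping method based on attribute structure   Total citations: 1, self-citations: 1, citations by others: 0
An ontology mapping method based on the attribute structure of ontology concepts is proposed. Descriptive features of ontology attributes are extracted from multiple perspectives, and the attribute mappings between the ontologies are processed jointly to form an attribute structure tree shared by the two ontologies. The attribute structure tree is used to find the hierarchical relations among concept attributes and thereby measure the semantic similarity between ontology concepts. Experimental results show that attribute structure is one of the factors affecting ontology mapping and that the method is effective.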

5.
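Existing semantic Web service matching algorithms do not take into account the multiple kinds of relations between ontology concepts, so concept semantics are not fully reflected and matching performance suffers. A semantic distance is defined using these multiple relations between ontology concepts, and a method for computing the semantic similarity between concepts is derived from it. On this basis, a Web service matching algorithm based on semantic similarity is proposed, which uses the semantic similarity between ontology concepts to reflect the degree of match between Web services. Finally, comparative experiments verify the feasibility and effectiveness of the algorithm.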

6.
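An ontology-based method for computing sentence similarity   Total citations: 2, self-citations: 0, citations by others: 2
刘宏哲 (Liu Hongzhe), 《计算机科学》 (Computer Science), 2013, 40(1): 251-256
A sentence similarity computation method based on a tree-structured ontology is proposed. Using a semantic index built between ontology concepts and the keywords in sentences, direct and indirect semantic links between sentences and the ontology are constructed; from these, semantic vectors describing the sentences are extracted and used to compute the semantic similarity between sentences. The method was validated on the Microsoft Research Paraphrase Corpus (MSRP). The results show that, compared with related methods, it achieves better precision and recall when the additional information used is incomplete.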

7.
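To address the shortcomings of traditional semantic annotation methods in describing relations between words or sentence constituents, this paper proposes an algorithm for annotating semantic relations in unstructured text based on ontology and dependency syntax. Working sentence by sentence, the algorithm combines POS (part of speech), a semantic dictionary, linguistic features, and other factors to identify the semantic relations among the words in a sentence, and uses the dependency relations between words to compose them semantically, thereby annotating lexical semantic relations. An evaluation algorithm is designed around features from the annotation process, such as semantic matching degree and semantic richness, to measure the correctness of the annotation results. Experimental results show that the annotation algorithm achieves high accuracy, especially on large-scale corpora.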

8.
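An improved ontology semantic similarity computation and its application   Total citations: 5, self-citations: 1, citations by others: 5
Word similarity is an important topic in knowledge representation and information retrieval. It is generally computed from statistics over large corpora. Ontologies offer a new opportunity for computing similarity between words. Using the ISA relations in the ontology structure, a method is proposed for computing the similarity between concepts within an ontology. Experimental results show that the method makes full use of the characteristics of the ontology to compute the similarity between related concepts. Using a simple ontology, the paper describes how to compute similarity between concepts and how this is applied in an intelligent retrieval system.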
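The abstract does not give its similarity formula. As an illustration of how ISA relations alone can yield a concept similarity, the sketch below uses the well-known Wu-Palmer measure, which is swapped in here as a stand-in and is not claimed to be the paper's method. The toy taxonomy `PARENT` and all function names are assumptions.

```python
# Hypothetical ISA hierarchy: each concept maps to its single parent.
PARENT = {
    "animal": None,
    "mammal": "animal",
    "bird": "animal",
    "dog": "mammal",
    "cat": "mammal",
}


def ancestors(concept, parent):
    """Path from the concept up to the root, concept itself included."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = parent[concept]
    return path


def depth(concept, parent):
    """Depth counted in nodes, so the root has depth 1."""
    return len(ancestors(concept, parent))


def wu_palmer(a, b, parent):
    """Wu-Palmer similarity: 2 * depth(lcs) / (depth(a) + depth(b))."""
    up_from_b = set(ancestors(b, parent))
    # Walking upward from a, the first shared node is the deepest common one.
    lcs = next(c for c in ancestors(a, parent) if c in up_from_b)
    return 2 * depth(lcs, parent) / (depth(a, parent) + depth(b, parent))
```

On this toy hierarchy, "dog" and "cat" share the parent "mammal" and score 4/6, while "dog" and "bird" meet only at the root and score 2/5.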

9.

Text summarization presents several challenges, such as accounting for semantic relationships among words and dealing with redundancy and information diversity. To overcome these problems, we propose a new graph-based Arabic summarization system that combines statistical and semantic analysis. The approach uses the hierarchical structure and relations of an ontology to provide a more accurate similarity measurement between terms and thereby improve the quality of the summary. The method is based on a two-dimensional graph model that makes use of statistical and semantic similarities. The statistical similarity is based on the content overlap between two sentences, while the semantic similarity is computed from semantic information extracted from a lexical database, which lets the system apply reasoning by measuring the semantic distance between real human concepts. The weighted ranking algorithm PageRank is run on the graph to produce a significance score for every sentence in the document. Each sentence's score is then refined by adding other statistical features. In addition, we address redundancy and information diversity by using an adapted version of the Maximal Marginal Relevance method. Experimental results on EASC and our own datasets show the effectiveness of the proposed approach over existing summarization systems.
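The redundancy step can be sketched with the standard Maximal Marginal Relevance selection rule. The abstract mentions an adapted version whose details are not given, so this is the textbook form under stated assumptions: `scores` stands in for the per-sentence ranking scores, `similarity` is a precomputed sentence-by-sentence matrix, and `lam` is a hypothetical trade-off parameter.

```python
def mmr_select(scores, similarity, k, lam=0.7):
    """Greedily pick k sentence indices balancing relevance and novelty.

    scores: list of relevance scores, one per sentence.
    similarity: square matrix, similarity[i][j] in [0, 1].
    lam: 1.0 ranks by relevance only; lower values penalize redundancy more.
    """
    selected = []
    candidates = set(range(len(scores)))
    while candidates and len(selected) < k:
        def mmr(i):
            # Redundancy is the closest match among already chosen sentences.
            redundancy = max((similarity[i][j] for j in selected), default=0.0)
            return lam * scores[i] - (1 - lam) * redundancy
        # sorted() makes tie-breaking deterministic (lowest index wins).
        best = max(sorted(candidates), key=mmr)
        selected.append(best)
        candidates.remove(best)
    return selected
```

With two near-duplicate high-scoring sentences, a low `lam` selects only one of them and then moves on to a less similar sentence.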


10.
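Computing the semantic similarity of concepts is an important research topic in natural language processing and related fields, and approaches based on semantic distance are the main method. After analyzing the drawbacks of existing algorithms, a concept semantic distance computation method based on partitioning a domain ontology into groups is proposed. Rules for computing concept semantic distance across multiple concept groups are given first, followed by methods for computing distance within a group and between groups. Forward and backward semantic distances are introduced to handle the asymmetry of semantic similarity for concept pairs in a hypernym-hyponym relation, and relation weights are assigned dynamically according to the position of the concept nodes to handle other, non-hierarchical binary relations. Experiments show that the method is effective and has clear advantages over other typical methods of the same kind.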

11.
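As the number of ontologies grows, ontology heterogeneity has become the main obstacle to interoperation between ontologies and hinders the sharing of ontology information; ontology mapping is the best way to resolve it. The key to ontology mapping is computing concept similarity, but current computation models consider a fairly narrow set of factors. A comprehensive semantic similarity computation method that combines distance-based semantic similarity with attribute-based semantic similarity is proposed. Experiments show that the method can improve the precision of the computed results.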
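One simple way to combine a distance-based score with an attribute-based score is a weighted sum. The sketch below is an assumption-laden illustration, not the paper's model: the inverse path-length formula, the Jaccard overlap for attributes, the weight `alpha`, and all function names are hypothetical choices.

```python
def distance_similarity(path_length: int) -> float:
    """Shorter paths in the ontology graph give higher similarity."""
    return 1.0 / (1.0 + path_length)


def attribute_similarity(attrs_a: set, attrs_b: set) -> float:
    """Jaccard overlap of the two concepts' attribute sets."""
    union = attrs_a | attrs_b
    if not union:
        return 0.0
    return len(attrs_a & attrs_b) / len(union)


def combined_similarity(path_length, attrs_a, attrs_b, alpha=0.5):
    """Weighted sum; alpha is an assumed weight for the distance component."""
    return (alpha * distance_similarity(path_length)
            + (1 - alpha) * attribute_similarity(attrs_a, attrs_b))
```

Setting `alpha` closer to 1 trusts the ontology structure more, while a lower value lets shared attributes dominate.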

12.
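Concept similarity computation in domain ontologies   Total citations: 11, self-citations: 1, citations by others: 11
With the wide application of ontologies in information retrieval, artificial intelligence, and other fields, ontology-oriented concept similarity computation has become a major focus of ontology research. Current work on concept similarity in domain ontologies mainly relies on the hypernym-hyponym relations between concepts, which does not fully reflect the semantic information of the concepts. The algorithm proposed in this paper divides concept similarity computation into two layers. The first is the initial semantic similarity layer, which mainly uses the distance between concepts to compute their initial similarity. The second is the non-hierarchical relation similarity layer, which builds on the initial similarity to compute the similarity that concepts exhibit through relations other than hypernymy and hyponymy. A final combined computation yields the actual similarity of the concepts in the domain ontology. Experiments show that the method makes full use of the semantic information of the concepts in the ontology and produces reasonable results.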

13.
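Algorithm for semantic similarity between concepts based on ontology structure   Total citations: 2, self-citations: 0, citations by others: 2
Based on the structural characteristics of ontology models, the computation of ontology concept similarity is analyzed in terms of the width, depth, and density between the model's concepts, and these are merged into a single structural factor. Together with other factors that affect similarity, such as semantic overlap and semantic distance, an algorithm for computing the semantic similarity between concepts based on ontology structure is proposed. By building an ontology model and analyzing experiments on it, the influence of each structural factor on the semantic similarity of ontology concepts is summarized.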

14.
15.
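姚佳岷 (Yao Jiamin), 杨思春 (Yang Sichun), 《计算机应用》 (Journal of Computer Applications), 2013, 33(6): 1579-1586
Ontology mapping can effectively resolve ontology heterogeneity in the Semantic Web, and its core is computing the similarity of ontology concepts. Because existing concept similarity computations have low accuracy and precision, an improved concept similarity computation model is proposed. First, the partial order relations among ontology features are used to build a formal context and a concept lattice. Then, at the structural level, the set of meet-irreducible elements between concepts is obtained, and the similarity between concepts is computed by quantifying the semantic relations among the elements of that set. Examples and analysis show that the improved model gives a clear gain in F-Score.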

16.
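Image semantic annotation and retrieval based on a personalized ontology   Total citations: 1, self-citations: 0, citations by others: 1
To address the difficulty current image retrieval systems have with semantic retrieval, a new ontology-centered model for image semantic annotation and retrieval is proposed. A personalized ontology is built to describe image semantics; image semantic features based on concept sets are then extracted, and a similarity measure is designed using the "Is-A" relations in the ontology, which ultimately enables retrieval with semantic expansion. The difficulties lie in evolving a top-level ontology into a personalized ontology and in measuring semantic similarity on the basis of concept sets and "Is-A" relations. A preliminary implementation and related experiments show that the model reaches a retrieval accuracy of 88.6%, clearly higher than traditional keyword-based image retrieval and retrieval based on a general-purpose ontology, and that it achieves intelligent image retrieval.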

17.
Ontology reuse is recommended as a key factor in developing cost-effective, high-quality ontologies because it reduces development costs by avoiding the rebuilding of existing ontologies. Selecting the desired ontology from existing ontologies is essential for ontology reuse. Until now, much research on ontology selection has focused on lexical-level support. In these cases, however, it is almost impossible to find an ontology that includes all the concepts matched by the search terms at the semantic level. Finding an ontology that meets users' needs requires a new ontology selection and ranking mechanism based on semantic similarity matching. We propose an ontology selection and ranking model consisting of selection standards and metrics based on better semantic matching capabilities. The proposed model has two features that distinguish it from previous models. First, it enhances the ontology selection and ranking method practically and effectively by enabling semantic matching of taxonomy or relational linkage between concepts. Second, it identifies what measures should be used to rank ontologies in the given context and what weight should be assigned to each selection measure.

18.
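To address the extraction of non-taxonomic relations for Chinese ontologies, a non-taxonomic relation extraction method based on semantic dependency analysis is proposed. Using ideas from semantic role labeling and dependency grammar analysis, the semantic dependency structure of text sentences is obtained, and the verb frames that carry semantic dependency relations are extracted. By computing semantic similarity, the non-taxonomic relations between the concepts in the verb frames, along with the relation names, are identified. Experimental results show that the method can effectively extract non-taxonomic relations and semantically annotate them.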

19.
20.
Most ontology mapping methods rely on either the names or the structure of the ontology elements. Name-based methods often use only the similarity between individual elements to predict the semantic relations between two ontologies, while structure-based methods measure the mapping between two ontologies through the structural relations between the elements. Neither strategy alone gives ideal results. To address this, the work presented in this paper proposes an ontology mapping approach that combines element names and structure. It uses linguistic and distance-based approaches to generate a variable-weight semantic graph. On this graph, the similarity of element names and structure is calculated through iterative computation, and the similarity values are adjusted continually as the iteration proceeds. The approach avoids the problem of single methods that cannot use all of the ontology's information and therefore provides a better mapping result. Our implementation and experimental results are provided to demonstrate the effectiveness of the mapping approach in making full use of the ontology's information.
