Concept vector for semantic similarity and relatedness based on WordNet structure |
| |
Authors: | Hongzhe LiuAuthor Vitae Hong BaoAuthor Vitae De XuAuthor Vitae |
| |
Affiliation: | a Beijing Jiaotong University, Beijing, China b Beijing Union University, Beijing Key Laboratory of Information Service Engineering, Beijing, China |
| |
Abstract: | We define WordNet based hierarchy concept tree (HCT) and hierarchy concept graph (HCG), HCT contains hyponym/hypernym kind of relation in WordNet while HCG has more meronym/holonym kind of edges than in HCT, and present an advanced concept vector model for generalizing standard representations of concept similarity in terms of WordNet-based HCT. In this model, each concept node in the hierarchical tree has ancestor and descendent concept nodes composing its relevancy nodes, thus a concept node is represented as a concept vector according to its relevancy nodes’ local density and the similarity of the two concepts is obtained by computing the cosine similarity of their vectors. In addition, the model is adjustable in terms of multiple descendent concept nodes. This paper also provides a method by which this concept vector may be applied with regard to HCG into HCT. With this model, semantic similarity and relatedness are computed based on HCT and HCG. The model contains structural information inherent to and hidden in the HCT and HCG. Our experiments showed that this model compares favorably to others and is flexible in that it can make comparisons between any two concepts in a WordNet-like structure without relying on any additional dictionary or corpus information. |
| |
Keywords: | Concept similarity Concept relatedness Concept vector model Hierarchical concept tree Hierarchical concept graph WordNet |
本文献已被 ScienceDirect 等数据库收录! |
|