A new similarity computing method based on concept similarity in Chinese text processing |
| |
Authors: | Jing Peng, DongQing Yang, ShiWei Tang, TengJiao Wang, Jun Gao |
| |
Affiliation: | [1] School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China; [2] Department of Science and Technology, Chengdu Municipal Public Security Bureau, Chengdu 610017, China |
| |
Abstract: | The paper proposes a new text similarity computing method based on concept similarity for Chinese text processing. The method first converts a text into a word vector space model and then decomposes each word into a set of concepts. It obtains the similarity between words by computing inner products between their concepts, and finally computes the similarity of texts from the similarity of their words. The contributions of the paper are: 1) a new formula for computing the similarity between words; 2) a new text similarity computing method based on word similarity; 3) a successful application of the method to similarity computing for Web news; and 4) validation of the method through extensive experiments. |
| |
Keywords: | concept similarity; similarity computing; vector space; inner product space |
This article is indexed in 维普 (VIP), SpringerLink, and other databases.
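
The pipeline outlined in the abstract (words, then concepts, then inner products, then text similarity) can be illustrated with a minimal sketch. The abstract does not state the paper's formulas, so everything below is an assumption made for illustration: the `CONCEPTS` lexicon and its weights are invented, word similarity is taken to be the cosine of two concept vectors, and text similarity is taken to be a soft cosine over term frequencies. Inputs are assumed to be already segmented into words. This is not the authors' method, only one plausible instance of the general idea.

```python
import math
from collections import Counter

# Hypothetical concept lexicon: each word maps to weighted concepts.
# The entries and weights are invented for illustration only.
CONCEPTS = {
    "car": {"vehicle": 1.0, "machine": 0.5},
    "truck": {"vehicle": 1.0, "machine": 0.5, "cargo": 0.5},
    "apple": {"fruit": 1.0, "food": 0.5},
    "banana": {"fruit": 1.0, "food": 0.5},
}


def inner(u, v):
    """Inner product of two sparse concept vectors stored as dicts."""
    return sum(weight * v.get(concept, 0.0) for concept, weight in u.items())


def word_similarity(a, b):
    """Assumed word similarity: cosine of the two words' concept vectors."""
    if a == b:
        return 1.0
    u, v = CONCEPTS.get(a), CONCEPTS.get(b)
    if not u or not v:
        return 0.0  # a word missing from the lexicon matches only itself
    return inner(u, v) / math.sqrt(inner(u, u) * inner(v, v))


def text_similarity(words_a, words_b):
    """Assumed text similarity: soft cosine over term-frequency vectors,
    with word_similarity supplying the weight for each pair of words."""
    freq_a, freq_b = Counter(words_a), Counter(words_b)

    def bilinear(x, y):
        return sum(
            count_x * count_y * word_similarity(word_x, word_y)
            for word_x, count_x in x.items()
            for word_y, count_y in y.items()
        )

    denom = math.sqrt(bilinear(freq_a, freq_a) * bilinear(freq_b, freq_b))
    return bilinear(freq_a, freq_b) / denom if denom else 0.0
```

With this sketch, two texts that share no words can still score above zero when their words share concepts, for example `text_similarity(["car"], ["truck"])`, whereas a plain bag-of-words cosine would give zero for the same pair.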