Domain ontology graph model and its application in Chinese text classification |
| |
Authors: | James N. K. Liu Yu-lin He Edward H. Y. Lim Xi-zhao Wang |
| |
Affiliation: | 1. Department of Computing, The Hong Kong Polytechnic University, Kowloon, Hong Kong 2. College of Mathematics and Computer Science, Hebei University, Baoding, 071002, China
|
| |
Abstract: | This paper proposes an ontology learning method which is used to generate a graphical ontology structure called ontology graph. The ontology graph defines the ontology and knowledge conceptualization model, and the ontology learning process defines the method of semiautomatic learning and generates ontology graphs from Chinese texts of different domains, the so-called domain ontology graph (DOG). Meanwhile, we also define two other ontological operations—document ontology graph generation and ontology graph-based text classification, which can be carried out with the generated DOG. This research focuses on Chinese text data, and furthermore, we conduct two experiments: the DOG generation and ontology graph-based text classification, with Chinese texts as the experimental data. The first experiment generates ten DOGs as the ontology graph instances to represent ten different domains of knowledge. The generated DOGs are then further used for the second experiment to provide performance evaluation. The ontology graph-based approach is able to achieve high text classification accuracy (with 92.3 % in f-measure) over other text classification approaches (such as 86.8 % in f-measure for tf–idf approach). The better performance in the comparative experiments reveals that the proposed ontology graph knowledge model, the ontology learning and generation process, and the ontological operations are feasible and effective. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|