
Text Classification Based on Domain Ontology
Authors: Huazhen Gu, Kuanjiu Zhou
Affiliation: Systems Engineering Institute, Dalian University of Technology, Dalian 116024, China
Abstract: With the rapid growth of information and knowledge, automatic classification of text documents is becoming a focus of knowledge management. A critical capability of knowledge management systems is classifying text documents into categories that are meaningful to users. This paper proposes a text topic classification model based on domain ontology and the Vector Space Model. The feature vectors (called eigenvectors by the authors) that serve as input to the vector space model are constructed from the concepts and hierarchical structure of the ontology, which also supplies the domain knowledge. However, a limited-vocabulary problem arises when mapping keywords to their corresponding ontology concepts. A synonym lexicon is used to extend the ontology and compress the feature vector, which addresses the problem that, in traditional methods, these vectors are too large and complex to compute. Finally, combining the support of each concept, a top-down method that follows the ontology structure completes the topic classification. An experimental system was implemented and the model was applied to it. Test results show that the model is feasible.
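
The pipeline outlined in the abstract (keyword-to-concept mapping through a synonym lexicon, concept support, top-down descent through the ontology) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy ontology `PARENT`, the lexicon `SYNONYMS`, and the function names are hypothetical, and raw counts stand in for whatever term weighting the paper's vector space model actually uses. It is not the authors' implementation.

```python
from collections import Counter

# Hypothetical toy ontology: child concept -> parent concept (root maps to None).
PARENT = {
    "sports": None,
    "ball_games": "sports",
    "athletics": "sports",
    "football": "ball_games",
    "basketball": "ball_games",
    "sprint": "athletics",
}

# Hypothetical synonym lexicon: surface keyword -> ontology concept.
SYNONYMS = {
    "soccer": "football",
    "football": "football",
    "hoops": "basketball",
    "basketball": "basketball",
    "dash": "sprint",
    "sprint": "sprint",
}


def concept_vector(tokens):
    """Map keywords to concepts; synonyms collapse onto a single dimension."""
    return Counter(SYNONYMS[t] for t in tokens if t in SYNONYMS)


def support(vector):
    """Add each concept's count to the concept itself and all its ancestors."""
    total = Counter()
    for concept, count in vector.items():
        node = concept
        while node is not None:
            total[node] += count
            node = PARENT[node]
    return total


def classify(tokens, root="sports"):
    """Descend from the root, following the best-supported child at each level."""
    total = support(concept_vector(tokens))
    node = root
    while True:
        children = [c for c, p in PARENT.items() if p == node and total[c] > 0]
        if not children:
            return node  # no supported child: stop at the current topic
        node = max(children, key=lambda c: total[c])


print(classify("soccer football hoops dash".split()))
```

In this sketch the synonym lexicon shrinks the vector because "soccer" and "football" share one dimension, and the descent only compares siblings at each level, so the full concept space is never scored at once.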

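Keywords (translated from the Chinese record): hierarchical classification; classification methods; data processing; program design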

Citation: Huazhen Gu, Kuanjiu Zhou. Text Classification Based on Domain Ontology[J]. Journal of Communication and Computer, 2006, 3(5): 29-32.
Keywords: Domain Ontology; Hierarchical Classification; Vector Space Model
This article is indexed in VIP (维普) and other databases.