DocOnto: An Ontology-Based Text Classifier
Cite this article: YANG Xi-quan, SUN Na, ZHANG Ye, KONG De-ran. DocOnto: An ontology-based text classifier[J]. Journal of Computer Applications, 2008, 28(Z2).
Authors: YANG Xi-quan (1), SUN Na (1), ZHANG Ye (1, 2), KONG De-ran (1)
Affiliations: 1. School of Computer Science, Northeast Normal University, Changchun, Jilin 130117, China
2. School of Business, Bohai University, Jinzhou, Liaoning 121013, China
Funding: Supported by the National Natural Science Foundation of China
Abstract: A tea domain ontology was constructed on the Protege platform based on concept class attributes, and DocOnto, a text classifier built on this ontology, was implemented. In the experiments, tea, wine, and pizza documents were classified with DocOnto, and the results were compared with those of a naive Bayes classifier. The comparison shows that, at a comparable overall precision, DocOnto effectively improves recall and achieves a higher F1 score.

Keywords: text classifier; domain ontology; naive Bayes classifier
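
The abstract's conclusion rests on how F1 combines precision and recall: F1 is their harmonic mean, so raising recall while precision stays the same necessarily raises F1. The following minimal Python sketch illustrates that relationship. The function name `precision_recall_f1` and all counts are hypothetical, chosen only for illustration; they are not the paper's experimental data.

```python
def precision_recall_f1(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Return (precision, recall, F1) from true-positive, false-positive
    and false-negative counts. Zero denominators yield 0.0."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical counts (NOT from the paper): both classifiers reach the
# same precision, but the second one retrieves more relevant documents.
baseline = precision_recall_f1(tp=60, fp=15, fn=40)
improved = precision_recall_f1(tp=80, fp=20, fn=20)

print("baseline P/R/F1:", [round(x, 3) for x in baseline])
print("improved P/R/F1:", [round(x, 3) for x in improved])
```

With these made-up counts, both classifiers have a precision of 0.8, while recall rises from 0.6 to 0.8, and F1 rises with it.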
This article is indexed in CNKI, Wanfang Data, and other databases.