首页 | 本学科首页   官方微博 | 高级检索  
     


A Comparison of Word- and Sense-Based Text Categorization Using Several Classification Algorithms
Authors:Athanasios Kehagias  Vassilios Petridis  Vassilis G. Kaburlasos  Pavlina Fragkou
Affiliation:(1) Department of Math., Phys. and Comp. Sciences, Division of Mathematics, Aristotle University of Thessaloniki (AUTh), GR-54124 Thessaloniki, Greece;(2) Department of Electrical and Computer Engineering, Division of Electronics and Computer Engineering, Aristotle University of Thessaloniki (AUTh), GR-54124 Thessaloniki, Greece;(3) Department of Industrial Informatics, Division of Software Systems, Technological Educational Institute of Kavala, GR-65404 Kavala, Greece
Abstract:Most of the text categorization algorithms in the literature represent documents as collections of words. An alternative which has not been sufficiently explored is the use of word meanings, also known as senses. In this paper, using several algorithms, we compare the categorization accuracy of classifiers based on words to that of classifiers based on senses. The document collection on which this comparison takes place is a subset of the annotated Brown Corpus semantic concordance. A series of experiments indicates that the use of senses does not result in any significant categorization improvement.
Keywords:text categorization  word senses  information retrieval  FLNMAP with voting
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号