首页 | 本学科首页   官方微博 | 高级检索  
     

A Fuzzy Approach to Classification of Text Documents
引用本文:刘惟一,宋宁. A Fuzzy Approach to Classification of Text Documents[J]. 计算机科学技术学报, 2003, 18(5): 0-0. DOI: 10.1007/BF02947124
作者姓名:刘惟一  宋宁
作者单位:[1]DepartmentofComputerScience,YunnanUniversity,Kunming650091,P.R.China [2]DepartmentofMetallurgy,KunmingUniversityofScienceandTechnology,Kunming650093,P.R.China
基金项目:This work is supported by the National Natural Science Foundation of China (Grant No.60263006), the Foundation of the Key Laboratory of Intelligent Information Processing, Institute of Computing Technology Chinese Academy of Sciences (Grant No.IIP2002-2)
摘    要:This paper discusses the classification problems of text documents. Based on the concept of the proximity degree, the set of words is partitioned into some equivalence classes.Particularly, the concepts of the semantic field and association degree are given in this paper.Based on the above concepts, this paper presents a fuzzy classification approach for document categorization. Furthermore, applying the concept of the entropy of information, the approaches to select key words from the set of words covering the classification of documents and to construct the hierarchical structure of key words are obtained.

关 键 词:文本文件分类 模糊逼近 关联语义学 接近度 联想度

A fuzzy approach to classification of text documents
Liu WeiYi,Song Ning. A fuzzy approach to classification of text documents[J]. Journal of Computer Science and Technology, 2003, 18(5): 0-0. DOI: 10.1007/BF02947124
Authors:Liu WeiYi  Song Ning
Affiliation:(1) Department of Computer Science, Yunnan University, 650091 Kunning, P.R. China;(2) The Key Laboratory of Intelligent Information Processing, Institute of Computer Technology, The Chinese Academy of Sciences, 100080 Beijing, P.R. China;(3) Department of Metallurgy, Kunming University of Science and Technology, 650093 Kunming, P.R. China
Abstract:This paper discusses the classification problems of text documents. Based on the concept of the proximity degree, the set of words is partitioned into some equivalence classes. Particularly, the concepts of the semantic field and association degree are given in this paper. Based on the above concepts, this paper presents a fuzzy classification approach for document categorization. Furthermore, applying the concept of the entropy of information, the approaches to select key words from the set of words covering the classification of documents and to construct the hierarchical structure of key words are obtained.
Keywords:text document classification   fuzzy approach   semantic association
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录!
点击此处可从《计算机科学技术学报》浏览原始摘要信息
点击此处可从《计算机科学技术学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号