
Text Classification Algorithm Study Based on Rough Set Theory

Cite this article:	LIN Xun, LI Zhi-shu, ZHOU Yong. Text Classification Algorithm Study Based on Rough Set Theory[J]. Computer Science, 2011, 38(11): 239-240, 263.

Authors:	LIN Xun, LI Zhi-shu, ZHOU Yong

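Affiliations:	1. School of Economic Information Engineering, Southwestern University of Finance and Economics, Chengdu 610071; School of Computer, Sichuan University, Chengdu 610064  2. School of Computer, Sichuan University, Chengdu 610064  3. Huaxing Vocational and Technical College, Chengdu 610071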

Funding:	Supported by the National Natural Science Foundation of China (60803106)

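Abstract:	Text classification is an important research area in Chinese information processing. Assigning each text to one or more categories improves the efficiency of text retrieval and storage. Rough set theory is a classification method that requires no prior information. Each text is segmented into words and stop words are filtered out; the remaining words serve as feature terms, and the text is represented with the vector space model. The text collection is thereby converted into an information system without decision attributes, and attribute reduction, the core of rough set theory, is used to classify the texts. Experiments show that the method improves both precision and recall.
Keywords:	Text classification; Rough set; Reduction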
Text Classification Algorithm Study Based on Rough Set Theory

LIN Xun, LI Zhi-shu, ZHOU Yong. Text Classification Algorithm Study Based on Rough Set Theory[J]. Computer Science, 2011, 38(11): 239-240, 263.

Authors:	LIN Xun, LI Zhi-shu, ZHOU Yong

Affiliation:	LIN Xun (1, 2), LI Zhi-shu (2), ZHOU Yong (3). 1. School of Economic Information Engineering, Southwestern University of Finance and Economics (SWUFE), Chengdu 610074, China; 2. School of Computer, Sichuan University (SCU), Chengdu 610064, China; 3. Huaxing Vocational and Technical College, Chengdu 610071, China

Abstract:	The text dataset is transformed into an information system without decision attributes, and attribute reduction, the core of rough set theory, is applied to classify the texts. Experiments show that the method improves both precision and recall; furthermore, it requires no a priori information.

Keywords:	Text classification; Rough set; Reduction
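
The pipeline summarized in the abstract (filter stop words, represent each text as a term vector, treat the collection as an information system without decision attributes, then reduce the attributes) can be illustrated with a minimal Python sketch. The abstract does not give the authors' actual algorithm, so everything here is an assumption: the toy corpus, the stop-word list, the binary term weighting, and the backward-elimination reduct search are all hypothetical stand-ins, and word segmentation is assumed to have been done already.

```python
from collections import defaultdict

# Hypothetical stop-word list and pre-segmented toy corpus (not from the paper).
STOP_WORDS = {"the", "a", "of"}
DOCS = [
    ["the", "stock", "price"],
    ["stock", "price", "of"],
    ["football", "goal"],
    ["a", "football", "goal"],
    ["movie", "actor"],
    ["the", "movie", "actor"],
]

def build_information_system(docs):
    """Return (terms, rows); rows[i][j] is 1 if terms[j] occurs in docs[i]."""
    filtered = [{w for w in doc if w not in STOP_WORDS} for doc in docs]
    terms = sorted(set().union(*filtered))
    rows = [tuple(1 if t in words else 0 for t in terms) for words in filtered]
    return terms, rows

def partition(rows, attrs):
    """Equivalence classes of the indiscernibility relation over attrs."""
    classes = defaultdict(list)
    for i, row in enumerate(rows):
        classes[tuple(row[a] for a in attrs)].append(i)
    return sorted(classes.values())

def reduce_attributes(rows):
    """Backward elimination: drop an attribute if the partition is unchanged."""
    attrs = list(range(len(rows[0])))
    full = partition(rows, attrs)
    for a in list(attrs):
        trial = [b for b in attrs if b != a]
        if partition(rows, trial) == full:
            attrs = trial
    return attrs

terms, rows = build_information_system(DOCS)
reduct = reduce_attributes(rows)
reduct_terms = [terms[a] for a in reduct]
classes = partition(rows, reduct)

print(reduct_terms)  # ['movie', 'stock']
print(classes)       # [[0, 1], [2, 3], [4, 5]]
```

In this sketch two terms are enough to keep the three groups of texts apart, which is the sense in which attribute reduction shrinks the feature set without losing discernibility. A reduct is generally not unique, and the one found depends on the order in which attributes are tried.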
This article is indexed in CNKI, Wanfang Data, and other databases.