
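Applying Rough Sets Based Method for Text Categorization to Integrated Environment of Network Technology Resource Application
Cite this article: HOU Fan, ZHOU Mingquan, GENG Guohua, LI Jie. Applying Rough Sets Based Method for Text Categorization to Integrated Environment of Network Technology Resource Application [J]. Computer Applications and Software, 2009, 26(3).
Authors: HOU Fan, ZHOU Mingquan, GENG Guohua, LI Jie
Affiliations: 1. School of Information Science and Technology, Northwest University, Xi'an, Shaanxi 710127
2. School of Information Science and Technology, Beijing Normal University, Beijing 100875
Funding: National Science and Technology Infrastructure Platform Construction Project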
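Abstract: The information gathered by the integrated environment of network technology resource application is varied and complex, which makes it difficult for users to browse and retrieve. All gathered information is first converted into vectors. The TF-IDF weighting function is then modified to better fit the actual conditions of this project. Next, rough set theory is applied for attribute reduction, producing the final decision table used to classify the science and technology information. The results show that the proposed classification system is considerably more efficient than traditional manual classification.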
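The vectorization and weighting steps can be illustrated with a minimal sketch. It uses the standard TF-IDF formula only; the abstract does not state how the authors modified the weighting function, so that modification is not reproduced here. The function name `tf_idf` and the sample documents are assumptions made for illustration.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Standard TF-IDF (not the paper's modified variant).

    docs: list of token lists. Returns one {term: weight} dict per document.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    vectors = []
    for tokens in docs:
        counts = Counter(tokens)
        total = len(tokens)
        # weight = relative term frequency * log(N / document frequency)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


# Hypothetical, already-tokenized documents.
docs = [
    ["rough", "set", "text", "classification"],
    ["text", "retrieval", "network"],
    ["rough", "set", "attribute", "reduction"],
]
vectors = tf_idf(docs)
```

Terms that occur in fewer documents receive a larger weight, and a term that occurs in every document receives a weight of zero.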

Keywords: text categorization; weighting function; rough sets; attribute reduction

APPLYING ROUGH SETS BASED METHOD FOR TEXT CATEGORIZATION TO INTEGRATED ENVIRONMENT OF NETWORK TECHNOLOGY RESOURCE APPLICATION
HOU Fan, ZHOU Mingquan, GENG Guohua, LI Jie. APPLYING ROUGH SETS BASED METHOD FOR TEXT CATEGORIZATION TO INTEGRATED ENVIRONMENT OF NETWORK TECHNOLOGY RESOURCE APPLICATION [J]. Computer Applications and Software, 2009, 26(3).
Authors: HOU Fan, ZHOU Mingquan, GENG Guohua, LI Jie
Affiliation: School of Information Science and Technology, Northwest University, Xi'an 710127, Shaanxi, China; School of Information Science and Technology, Beijing Normal University, Beijing 100875, China
Abstract: The information collected by the Integrated Environment of Network Technology Resource Application is too complex for users to browse and retrieve easily. This paper first converts all the collected information into vectors, then improves the TF-IDF weighting function so that it better fits the actual conditions of the project. Rough set theory is then used to reduce the attributes, and the final decision table is generated to classify the science and technology information. According to th...
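As an illustration of rough-set attribute reduction over a decision table, the sketch below uses a greedy QuickReduct-style heuristic based on the dependency degree (the fraction of objects in the positive region). This is a common textbook heuristic and not necessarily the reduction algorithm the authors used. The table, the attribute names, and the binary discretization of term weights are all assumptions.

```python
from collections import defaultdict


def dependency(rows, attrs, decision):
    """Fraction of rows whose equivalence class under `attrs` has one decision value."""
    classes = defaultdict(list)
    for row in rows:
        classes[tuple(row[a] for a in attrs)].append(row[decision])
    # Positive region: classes in which every row shares the same decision.
    consistent = sum(len(d) for d in classes.values() if len(set(d)) == 1)
    return consistent / len(rows)


def quick_reduct(rows, attrs, decision):
    """Greedily add attributes until the full table's dependency degree is reached."""
    full = dependency(rows, attrs, decision)
    reduct = []
    while dependency(rows, reduct, decision) < full:
        best = max(
            (a for a in attrs if a not in reduct),
            key=lambda a: dependency(rows, reduct + [a], decision),
        )
        reduct.append(best)
    return reduct


# Hypothetical decision table: 1 means the term's weight exceeds some threshold.
rows = [
    {"rough": 1, "network": 0, "text": 1, "label": "theory"},
    {"rough": 1, "network": 1, "text": 1, "label": "theory"},
    {"rough": 0, "network": 1, "text": 1, "label": "web"},
    {"rough": 0, "network": 0, "text": 1, "label": "web"},
]
reduct = quick_reduct(rows, ["rough", "network", "text"], "label")
```

In this toy table the attribute `rough` alone separates the two labels, so the other two attributes are dropped. A greedy search of this kind is not guaranteed to find a smallest possible reduct.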
Keywords: Text categorization; Weighting formula; Rough sets; Attribute reduction
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.