首页 | 本学科首页   官方微博 | 高级检索  
     

文本自动分类中的词权重与分类算法
引用本文:刁倩,王永成,何骥,张惠惠.文本自动分类中的词权重与分类算法[J].中文信息学报,2000,14(3):25-29.
作者姓名:刁倩  王永成  何骥  张惠惠
作者单位:1.上海交通大学电脑应用技术研究所2.上海交通大学图书馆
摘    要:本文详细阐述了自动分类中的词与文献的相关权重的经典计算方法IDF(Inverse Document Frequency) ,进一步总结了两种典型的分类算法——Bayes判别准则与向量空间模型(VSM) ,并提出结合词权重和分类算法进行分类的具体公式以及相关实验结果。

关 键 词:自动分类  IDF  Bayes判别准则  向量空间模型(VSM)  
修稿时间:1999年8月20日

Term Weighting and Classification Algorithms
Diao Qian Wang Yongcheng Zhang Huihui He Ji Institute of Computer Technology,Shanghai Jiao Tong University Shanghai Bao Zhaolong Library of Shanghai Jiao Tong University Shanghai.Term Weighting and Classification Algorithms[J].Journal of Chinese Information Processing,2000,14(3):25-29.
Authors:Diao Qian Wang Yongcheng Zhang Huihui He Ji Institute of Computer Technology  Shanghai Jiao Tong University Shanghai Bao Zhaolong Library of Shanghai Jiao Tong University Shanghai
Affiliation:1.Institute of Computer Technology , Shanghai Jiao Tong University2.Bao Zhaolong Library of Shanghai Jiao Tong University
Abstract:In this paper ,a classical term weighting method — IDF (Inverse Document Frequency) is discussed detailedly. The two important classification algorithms — Bayes Judge Rule and VSM (Vector Space Model) are summarized. Furthermore , the way of how to combine term weighting methods with two classification algorithm is also provided in the paper.
Keywords:Automatic classification  IDF  Bayes judge rule  VSM  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号