首页 | 本学科首页   官方微博 | 高级检索  
     

TFIDF算法研究综述
引用本文:施聪莺,徐朝军,杨晓江.TFIDF算法研究综述[J].计算机应用,2009,29(Z1).
作者姓名:施聪莺  徐朝军  杨晓江
作者单位:南京师范大学,教育技术系,南京,210097
摘    要:文本分类中特征项权重的赋予对于分类效果有较大的影响,TFIDF算法是权重计算的重要算法之一.在ú回顾TFIDF算法发展历史的基础上,考察了其固有缺陷,总结诸多学者对其的改进方法,并对TFIDF算法新的应用领域进行了概括,并通过实验验证相关改进算法,为读者更好地应用TFIDF算法提供参考.

关 键 词:文本分类

Study of TFIDF algorithm
SHI Cong-ying,XU Chao-jun,YANG Xiao-Jiang.Study of TFIDF algorithm[J].journal of Computer Applications,2009,29(Z1).
Authors:SHI Cong-ying  XU Chao-jun  YANG Xiao-Jiang
Affiliation:Department of Educational Technology;Nanjing Normal University;Nanjing Jiangsu 210097;China
Abstract:In text categorization,the weight of term has great impact on the classification results.Term Frequency and Inverse Documentation Frequency(TFIDF) is one of the key algorithms of term weighting.This paper reviewed the development of the TFIDF algorithm,studied its inherent defects,and summarized some scholars' improvements to it.Meanwhile,the survey generalized its new application fields.To verify their effects on the classification results,the author carried out some experiments on the ameliorative algorit...
Keywords:TFIDF  VSM
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号