首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进贝叶斯模型的中文邮件分类算法
引用本文:王宁,张建忠,何云,申庆永,徐敬东.基于改进贝叶斯模型的中文邮件分类算法[J].计算机工程与应用,2006,42(31):97-100,113.
作者姓名:王宁  张建忠  何云  申庆永  徐敬东
作者单位:南开大学,计算机科学与技术系,天津,300071
摘    要:通过分析常见的贝叶斯分类方法和实现模型,提出了一种适用于中文邮件的分类算法——基于混合模型的最小风险贝叶斯方法。混合模型将二项独立模型和多项式模型相结合,提高邮件分类的查全率,同时,在此基础上应用最小风险贝叶斯方法,进一步提高准确率。实验表明,应用改进的方法可以得到更准确的邮件分类效果。

关 键 词:邮件分类  中文分词  最小风险  混合模型  贝叶斯
文章编号:1002-8331(2006)31-0097-04
收稿时间:2006-04
修稿时间:2006-04

Algorithm of Chinese Mail Classification Based on Improved Bayesian Model
WANG Ning,ZHANG Jian-zhong,HE Yun,SHEN Qing-yong,XU Jing-dong.Algorithm of Chinese Mail Classification Based on Improved Bayesian Model[J].Computer Engineering and Applications,2006,42(31):97-100,113.
Authors:WANG Ning  ZHANG Jian-zhong  HE Yun  SHEN Qing-yong  XU Jing-dong
Affiliation:Department of Computer Science and Technology,Nankai University,Tianjin 300071,China
Abstract:With studying some popular methods and models for Bayesian approach,one kind of text classificatory algorithm the paper proposed a new algorithm which was fit for Chinese mails,risk minimization Bayes based on hybird model.The hybird model unified Binary Independence Model and Muhinomial Model,improved the recall of mail filter,in the meanwhile,using the risk minimization Bayes on hybird model,improved the precision.The result of experiments demonstrates that the new algorithm gains better performance in mail classification.
Keywords:mail classification  Chinese word segmentation  minimum risk  hybrid model  Bayes
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号