李欣倩,杨 哲,任 佳.基于互信息与层次聚类双重特征选择的改进朴素贝叶斯算法[J].测控技术,2022,41(2):36-40
基于互信息与层次聚类双重特征选择的改进朴素贝叶斯算法
Improved Naive Bayes Algorithm Based on Dual Feature Selection of Mutual Information and Hierarchical Clustering
  
DOI:10.19708/j.ckjs.2022.02.005
中文关键词:  朴素贝叶斯  双重特征选择  互信息  层次聚类
英文关键词:Naive Bayes  dual feature selection  mutual information  hierarchical clustering
基金项目:浙江省自然科学基金项目(LY17F030024);浙江理工大学“521人才培养计划”项目(11130132521508)
作者单位
李欣倩 浙江理工大学 机械与自动控制学院 
杨 哲 浙江理工大学 机械与自动控制学院 
任 佳 浙江理工大学 机械与自动控制学院 
摘要点击次数: 527
全文下载次数: 367
中文摘要:
      根据朴素贝叶斯算法的特征条件独立假设,提出一种基于互信息和层次聚类双重特征选择的改进朴素贝叶斯算法。通过互信息方法剔除不相关的特征,然后依据欧氏距离将删减后的特征进行分层聚类,通过粒子群算法得到聚类簇的数量,最后将每个聚类簇中与类别互信息最高的特征合并为特征子集,并由朴素贝叶斯算法得到分类准确率。根据实验结果可知,该算法可以有效减少特征之间的相关性,提升算法的分类性能。
英文摘要:
      According to the assumption of feature condition independence of the naive Bayes algorithm,an improved naive Bayes algorithm based on dual feature selection of mutual information and hierarchical clustering is proposed.The irrelevant features are eliminated by the mutual information method,and the deleted features are hierarchically clustered according to the Euclidean distance,and the number of clusters is obtained by the particle swarm optimization algorithm.Finally,the feature with the highest mutual information with the category in each cluster is merged into feature subsets,and the classification accuracy is obtained by the Naive Bayes algorithm.According to the experimental results,the algorithm can effectively reduce the correlation between features and improve the classification performance of the algorithm.
查看全文  查看/发表评论  下载PDF阅读器
关闭