首页 | 本学科首页   官方微博 | 高级检索  
     

基于稳定性的基因表达数据聚类算法选择
引用本文:易辉,宋晓峰.基于稳定性的基因表达数据聚类算法选择[J].计算机与应用化学,2008,25(5).
作者姓名:易辉  宋晓峰
作者单位:南京航空航天大学生物医学工程系,江苏,南京,210016
摘    要:聚类是一种常用的基因表达数据处理手段,然而它又是主观的,如何选择符合数据内在分布的聚类算法成为目前急待解决的问题.根据经验,当选择最佳簇数k后,采用合理的聚类算法对目标数据重复聚类时,结果稳定性较好.因此提出一种基于稳定性的聚类算法选择.该方法将聚类结果的簇间分离度、簇内紧致度和聚类结果稳定性三者结合起来.在验证和应用三组数据时发现,比传统的评估方法,基于稳定性的聚类算法选择更客观、更可靠.

关 键 词:基因表达数据  聚类  稳定性  评估  选择

A stability-based clustering selection for gene expression data
Yi Hui,Song Xiaofeng.A stability-based clustering selection for gene expression data[J].Computers and Applied Chemistry,2008,25(5).
Authors:Yi Hui  Song Xiaofeng
Abstract:Clustering algorithm has become a typical tool for gene expression data analysis recently.However,it is subjective.Algo- rithm Selection is an urge problem it facing.Finding that with the correct number of clusters,the results of most suitable clustering al- gorithms are stable while repeated,a stability-based selection method was proposed. It combines compactness of clusters,separation between clusters and stability of clustering results.Three data sets were used to test the ability of this method,and it performed better with a more objective process than traditional methods.
Keywords:gene expression data  clustering  stability  validation  selection
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号