首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于粗糙k均值的双聚类算法
引用本文:胡云,苗夺谦,王睿智,陈敏.一种基于粗糙k均值的双聚类算法[J].计算机科学,2007,34(11):174-177.
作者姓名:胡云  苗夺谦  王睿智  陈敏
作者单位:同济大学计算机科学与技术系,上海,201804
基金项目:国家自然科学基金 , 高等学校博士学科点专项科研项目
摘    要:双聚类算法是为了发现基因表达数据矩阵中局部相似性而提出的新聚类方法。本文根据Cheng和Church的打分理论采用自底向上的策略,首先用粗糙k均值算法生成初始的基因数据块,再对这些数据块添加行和列,生成初始的双聚类。然后,删除初始的双聚类中一致性波动不好的行和列,从而得到最终的双聚类。实验表明,该算法能够高效地生成具有共表达水平的双聚类,更能找到一致波动水平很高的双聚类。

关 键 词:粗糙集  k均值聚类  双聚类分析  基因表达数据

A Biclustering Algorithm Based on Rough K-means
HU Yun,MIAO Duo-Qian,WANG Rui-Zhi,CHEN Min.A Biclustering Algorithm Based on Rough K-means[J].Computer Science,2007,34(11):174-177.
Authors:HU Yun  MIAO Duo-Qian  WANG Rui-Zhi  CHEN Min
Affiliation:HU Yun,MIAO Duo-Qian,WANG Rui-Zhi,CHEN Min (Department of Computer Science and Engineering of Tongii University,Shanghai
Abstract:Biclustering is a new branch of clustering which is proposed to discover the local homogenous pattern. Based on the score scheme which is proposed by Cheng and Church, this paper uses bottom-up strategy, firstly finds the gene data submatrix using Rough K-means algorithm, then adds some conditions and genes to the block, finally deletes some conditions and genes from the block if necessary, the final results are the biclusters. The experiment shows that this algorithm can efficiently find the co-regulation patterns from the exist gene expression data, especially those highly homogenous patterns.
Keywords:Rough set  K-means clustering  Biclustering  Gene expression data
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号