首页 | 本学科首页   官方微博 | 高级检索  
     

基因表达数据的聚类分析研究进展
引用本文:岳峰,孙亮,王宽全,王永吉,左旺孟.基因表达数据的聚类分析研究进展[J].自动化学报,2008,34(2):113-120.
作者姓名:岳峰  孙亮  王宽全  王永吉  左旺孟
作者单位:1.哈尔滨工业大学计算机学院生物信息技术研究中心 哈尔滨 150001
基金项目:国家高技术研究发展计划(863计划)(2006AA012308),国家自然科学基金(60373053,60571025)资助~~
摘    要:基因表达数据的爆炸性增长迫切需求自动、有效的数据分析工具. 目前聚类分析已成为分析基因表达数据获取生物学信息的有力工具. 为了更好地挖掘基因表达数据, 近年来提出了许多改进的传统聚类算法和新聚类算法. 本文首先简单介绍了基因表达数据的获取和表示, 之后系统地介绍了近年来应用在基因表达数据分析中的聚类算法. 根据聚类目标的不同将算法分为基于基因的聚类、基于样本的聚类和两路聚类, 并对每类算法介绍了其生物学的含义及其难点, 详细讨论了各种算法的基本原理及优缺点. 最后总结了当前的基因表达数据的聚类分析方法,并对发展趋势作了进一步的展望.

关 键 词:DNA微阵列    基因表达数据    聚类分析
文章编号:10.3724/SP.J.1004.2008.00113
收稿时间:2006-11-1
修稿时间:2006年11月1日

State-of-the-art of Cluster Analysis of Gene Expression Data
YUE Feng,SUN Liang,WANG Kuan-Quan,WANG Yong-Ji,ZUO Wang-Meng.State-of-the-art of Cluster Analysis of Gene Expression Data[J].Acta Automatica Sinica,2008,34(2):113-120.
Authors:YUE Feng  SUN Liang  WANG Kuan-Quan  WANG Yong-Ji  ZUO Wang-Meng
Affiliation:1.Biocomputing Research Center, School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001;2.Laboratory for Internet Software Technologies, Institute ofSoftware, Chinese Academy of Sciences, Beijing 100080
Abstract:The flood of gene expression data provided by the DNA microarray technology has driven the development of automated analysis techniques and tools.Cluster analysis is an effective and practical method to mine the huge amount of gene expression data to gain important genetic and biological information.Many improved conventional clustering algorithms as well as new clustering algorithms have been proposed recently to process the gene expression data.This survey first introduces how to produce and represent the gene expression data,and then discusses the state-of-the-art cluster algorithms applied to gene expression data.According to the goals of clustering,clustering algorithms are divided into three categories:gene-based clustering,sample-based clustering,and biclustering.Basic biological principles and challenges for each category are presented.For each category,the basic principle is discussed in detail as well as its advantages and drawbacks.This paper concludes with a summarization in this field and a discussion of future trends.
Keywords:DNA microarray  gene expression data  cluster analysis
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号