
Bicluster algorithm on discrete time-series gene expression data
Citation:XU Tao, SHANG Xue-qun, YANG Mi-jing, WANG Miao. Bicluster algorithm on discrete time-series gene expression data[J]. Application Research of Computers (计算机应用研究), 2013, 30(12): 3551-3556.
Authors:XU Tao  SHANG Xue-qun  YANG Mi-jing  WANG Miao
Affiliation:Department of Computer Software and Theory, School of Computer Science & Technology, Northwestern Polytechnical University, Xi'an 710129, China
Funding:National "973" Program of China (2012CB316203); National Natural Science Foundation of China (61272121)
Abstract:Most biclustering algorithms currently applied to gene expression data operate on real-valued data, which makes them susceptible to noise, and they rarely consider the temporal order of the samples. This paper proposes DTCB, an efficient algorithm that mines maximal co-expression biclusters over contiguous time points from discretized time-series gene expression data. DTCB uses a new discretization method and defines three co-expression relations between genes in the discrete dataset. To improve mining efficiency, it applies pruning and output strategies that allow it to mine all maximal co-expression biclusters in one run without generating candidate sets. Experiments show that DTCB is efficient and robust, and that its results are statistically and biologically meaningful.

Keywords:time-series gene expression data  bicluster  co-expression  time-continuous  discretization
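
The abstract does not specify DTCB's discretization method, its three co-expression relations, or its pruning strategy, so the following is only a minimal sketch of the general idea of a time-continuous bicluster, not the authors' algorithm. It assumes a simple up/down/no-change discretization with a hypothetical `threshold`, treats two genes as co-expressed only when their symbol strings are identical over a window, and enumerates windows by brute force (which does generate candidates, unlike DTCB as described). The names `discretize` and `contiguous_biclusters` are illustrative.

```python
from collections import defaultdict


def discretize(row, threshold=0.5):
    """Encode each change between consecutive time points as U, D, or N.

    Assumption: a fixed threshold on the raw difference; the paper's own
    discretization method is not described in the abstract.
    """
    symbols = []
    for prev, curr in zip(row, row[1:]):
        delta = curr - prev
        if delta > threshold:
            symbols.append("U")   # up-regulated
        elif delta < -threshold:
            symbols.append("D")   # down-regulated
        else:
            symbols.append("N")   # no significant change
    return "".join(symbols)


def contiguous_biclusters(patterns, min_genes=2, min_len=2):
    """Return maximal (genes, start, end, pattern) tuples.

    `patterns` maps gene name -> symbol string (all the same length).
    A bicluster is a gene set sharing an identical symbol substring on the
    contiguous transition window [start, end). Brute-force enumeration.
    """
    length = len(next(iter(patterns.values())))
    found = []
    for start in range(length):
        for end in range(start + min_len, length + 1):
            groups = defaultdict(list)
            for gene, pattern in patterns.items():
                groups[pattern[start:end]].append(gene)
            for pattern, genes in groups.items():
                if len(genes) >= min_genes:
                    found.append((frozenset(genes), start, end, pattern))

    # Keep only biclusters not contained in another one that has at least
    # the same genes and at least the same window.
    maximal = []
    for genes, start, end, pattern in found:
        dominated = any(
            genes <= g2 and s2 <= start and end <= e2
            and (genes, start, end) != (g2, s2, e2)
            for g2, s2, e2, _ in found
        )
        if not dominated:
            maximal.append((sorted(genes), start, end, pattern))
    return maximal


# Made-up toy data: three genes measured at five time points.
expression = {
    "g1": [1.0, 2.0, 3.0, 2.0, 2.1],
    "g2": [0.0, 1.2, 2.5, 1.0, 1.1],
    "g3": [5.0, 4.0, 5.0, 4.0, 5.2],
}
patterns = {gene: discretize(values) for gene, values in expression.items()}
result = contiguous_biclusters(patterns)
```

On this toy input, `g1` and `g2` share the pattern `UUDN` across all four transitions, while all three genes agree only on the middle window `UD`. Window indices refer to transitions, so the window `[start, end)` covers time points `start` through `end`.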