首页 | 本学科首页   官方微博 | 高级检索  
     

基于模块度优化的蛋白质网络集团探测与分析
引用本文:梅娟,纪志成. 基于模块度优化的蛋白质网络集团探测与分析[J]. 计算机与应用化学, 2012, 29(5): 591-596
作者姓名:梅娟  纪志成
作者单位:1. 江南大学电气自动化研究所,江苏,无锡,214000;无锡城市学院电子信息工程系,江苏,无锡,214000
2. 江南大学电气自动化研究所,江苏,无锡,214000
基金项目:江苏省博士后科研资助计划,无锡城市学院院级重点课题
摘    要:探测蛋白质相互作用网络中的功能模块对于理解生物系统的组织和功能具有重要的意义。目前,普遍的做法是将蛋白质相互作用网络表示成一个图,利用各种图聚类算法来挖掘功能模块。本文采用了基于模块度优化的图聚类算法来探测蛋白质相互作用网络中的集团,从具有2617个节点11855个相互作用的酵母蛋白相互作用网络中探测出68个集团。对于得到的集团,首先从拓扑结构的角度验证其的确是内部连接稠密的子图,然后分析了MIPS数据库中ComplexCat提供的已知的蛋白质复合体与这些集团的重叠情况,发现很多蛋白质复合体完全包含在某些集团中,最后使用超几何聚集分布的P值来分析一个集团对某个特定功能的富集程度,并根据最小的P值对应的功能来注释该集团的主要功能,发现集团中大部分的蛋白质具有相同的功能。研究结果表明,该方法探测的集团具有重要的生物学功能意义。

关 键 词:蛋白质相互作用网络  复合体  功能模块  模块度  图聚类

Detecting and analyzing of communities in protein-protein interaction network based on modularity
Mei Juan , Ji Zhicheng. Detecting and analyzing of communities in protein-protein interaction network based on modularity[J]. Computers and Applied Chemistry, 2012, 29(5): 591-596
Authors:Mei Juan    Ji Zhicheng
Affiliation:1* (1. Institute of Electrical Automation, Jiangnan University, Wuxi, 214000, Jiangsu, China) (2. Department of Electronic and Information Technology, Wuxi City College of Vocational Technology, Wuxi, 214000, Jiangsu, China)
Abstract:Detecting functional modules in protein-protein interaction (PPI) networks is very important to understand the organization and function of the biological system. At present, a common method of revealing functional modules in PPI networks is graph clustering where PPI networks are modeled as a graph in which vertices represent proteins and edges represent interactions. Here, a modularity-based method was used to find communities in PPI networks. Using this method, 68 communities were detected from a network involving 11 855 interactions among 2 617 proteins in yeast. For the communities outputted by the method, we firstly assessed the validity from the topology perspective and found that they are densely connected local subgraphs. Then, we matched known protein complexes annotated by ComplexCat database in MIPS against these communities and found that known protein complexes are largely contained in them in their entirety. At last, we used hypergeometric distribution P-value to measure whether a community is enriched with proteins from a particular category more than would be expected by chance. We assigned each community the main function with the lowest P-value in all categories and found that most proteins in the same community have the same function. Tests show that communities revealed are with significant biological functions.
Keywords:protein-protein interaction networks  complexes  functional modules  modularity  graph clustering
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号