首页 | 本学科首页   官方微博 | 高级检索  
     

PPI网络的改进谱聚类算法
引用本文:吴 爽,雷秀娟,郭 玲. PPI网络的改进谱聚类算法[J]. 计算机应用研究, 2012, 29(7): 2442-2446
作者姓名:吴 爽  雷秀娟  郭 玲
作者单位:1. 陕西师范大学计算机科学学院,西安,710062
2. 陕西师范大学生命科学学院,西安,710062
基金项目:国家自然科学基金资助项目(61100164、61173190); 陕西省自然科学基础研究计划资助项目(2010JQ8034); 中央高校基本科研业务费专项资金资助项目(GK200902016); 陕西师范大学研究生创新基金资助项目(2011CXS030)
摘    要:蛋白质相互作用(PPI)网络是生物信息学的一个新的研究领域。近年来谱聚类算法在未知蛋白质的功能预测方面发挥了重要作用,但是它要求事先确定聚类数目,为此提出了一种基于边的得分搜索的谱聚类算法。该算法采用谱聚类方法对数据进行预处理,并通过构造蛋白质节点之间的边的得分矩阵找到数据样本之间的相关性,同时融入粒子群算法来确定边的得分的最佳选择阈值,最后用广度优先遍历结点的方法得到聚类结果。算法在PPI网络数据集上进行了测试,结果表明该算法不但可以自动确定聚类数目,而且聚类结果的正确率和F-measure值都得到了提高。

关 键 词:谱聚类算法  粒子群优化算法  蛋白质相互作用网络

Clustering PPI networks based on improved spectral clustering method
WU Shuang,LEI Xiu-juan,GUO Ling. Clustering PPI networks based on improved spectral clustering method[J]. Application Research of Computers, 2012, 29(7): 2442-2446
Authors:WU Shuang  LEI Xiu-juan  GUO Ling
Affiliation:a. School of Computer Science, b. School of Life Science, Shaanxi Normal University, Xi'an 710062, China
Abstract:Protein-protein interaction PPI network is a new research field in the bioinformatics. Recently spectral clustering algorithm has played an important role in the field of predicting the function of unknown proteins. However, the cluster number must be predefined. With regard to this problem, this paper proposed a spectral clustering algorithm combining with edge-based scoring searching method. Firstly, the algorithm preprocessed the PPI data via spectral clustering, then constructed the scoring matrix of edges connecting protein nodes with each other to find the relationship of dataset, and adopted particle swarm optimization algorithm to determine optimal threshold of the score of edge. Finally, it obtained the clustering results by means of breadth first traversing the protein nodes. Tested this algorithm on the PPI dataset, and the results prove that the algorithm can not only automatically determine the cluster number, but also improve both the precision value and F-measure value.
Keywords:spectral clustering algorithm   particle swarm optimization algorithm   protein-protein interaction network
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号