首页 | 本学科首页   官方微博 | 高级检索  
     

蛋白-核酸相互作用的支持向量机预测模型
引用本文:袁友浪,刘亮,钮冰,陆文聪,蔡煜东.蛋白-核酸相互作用的支持向量机预测模型[J].计算机与应用化学,2010,27(2).
作者姓名:袁友浪  刘亮  钮冰  陆文聪  蔡煜东
作者单位:1. 上海大学理学院化学系,上海,200444
2. 上海大学系统生物研究所,上海,200444
基金项目:上海市重点学科建设项目资助 
摘    要:与核酸作用的蛋白质在基因功能许多方面扮演着极其重要的角色,预测蛋白质是否与核酸作用在生物信息学领域受到广泛关注。本文用氨基酸组成、氨基酸物化特性和蛋白质结构等信息作为特征参数,通过支持向量机方法预测了与核酸作用的蛋白质。分别取与rRNA,RNA和DNA作用的3个蛋白质数据集,用SVM训练,筛选最优核函数,优化核函数参数,建立分类判别模型,并用于预测蛋白质是否与核酸作用。结果表明:即使对同源相似性低于40%的蛋白质,通过用10-crossvalidation(交叉验证)方法测试上述3个数据集都分别有93.75%、83.41%、81.85%的预测正确率。用外部测试集测试所得模型分别有93.8%、84.2%、81.9%的预测正确率。在此基础上,我们建立了1个预测蛋白质与核酸是否作用的网上在线软件系统。网址是:http://chemdata.shu.edu.cn/protein_na。

关 键 词:蛋白质  核酸  支持向量机  10折交叉验证  预测模型

Prediction of nucleic acid-binding proteins using support vector machines
Yuan Youlang,Liu Liang,Niu Bing,Lu Wencong,Cai Yudong.Prediction of nucleic acid-binding proteins using support vector machines[J].Computers and Applied Chemistry,2010,27(2).
Authors:Yuan Youlang  Liu Liang  Niu Bing  Lu Wencong  Cai Yudong
Abstract:In this work,we integrated SVMs,protein sequence amino acid composition,and associated physicochemical properties into the study of nucleic-acid-binding proteins prediction.We developed the binary classification for rRNA-,RNA-,DNA-binding proteins that play an important role in the control of many cell processes.Each SVM model can be used to predict whether a protein belongs to rRNA-,RNA-,or DNA-hinding protein class.10-crossvalidation was performed on the protein data sets in which the sequences identity was~40%.Test results show that the accuracies of SVM models for rRNA-,RNA-,DNA-binding proteins are 93.75%,83.41%,81.85%,respectively.The predictions were also performed on the test data set.The results agree well with the prior knowledge of those proteins and show the effectiveness of physicochemical properties of sequence in the protein function prediction.On the basis of our work,an online server for predicting the nucleic acid-binding proteins using SVM was available on http://chemdata,shu.edu.cn/protein_na.
Keywords:protein  nucleic acid  SVMs  10-crossvalidation  prediction model
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号