首页 | 本学科首页   官方微博 | 高级检索  
     


Predicting protein secondary structure using a mixed-modal SVM method in a compound pyramid model
Authors:Bingru Yang  Qu Wu  Zhou Ying  Haifeng Sui
Affiliation:1. Laboratoire SUBATECH, UMR 6457 CNRS-IN2P3/Ecole des Mines de Nantes/PRES UNAM, 4 rue A. Kastler, 44307 Nantes Cedex, France;2. Radiochemistry Lab, School of Nuclear Science & Technology, Lanzhou University, Lanzhou 730000, China;3. Institute of Plasma Physics, Chinese Academy of Sciences, P.O. Box 1126, Hefei, Anhui 230031 PR China;4. ANDRA, Research and Development Division, 1/7 rue Jean Monnet, 92298 Châtenay-Malabry cedex, France
Abstract:Accurate protein secondary structure prediction plays an important role in direct tertiary structure modeling, and can also significantly improve sequence analysis and sequence-structure threading for structure and function determination. Hence improving the accuracy of secondary structure prediction is essential for future developments throughout the field of protein research.In this article, we propose a mixed-modal support vector machine (SVM) method for predicting protein secondary structure. Using the evolutionary information contained in the physicochemical properties of each amino acid and a position-specific scoring matrix generated by a PSI-BLAST multiple sequence alignment as input for a mixed-modal SVM, secondary structure can be predicted at significantly increased accuracy. Using a Knowledge Discovery Theory based on the Inner Cognitive Mechanism (KDTICM) method, we have proposed a compound pyramid model, which is composed of three layers of intelligent interface that integrate a mixed-modal SVM (MMS) module, a modified Knowledge Discovery in Databases (KDD1) process, a mixed-modal back propagation neural network (MMBP) module and so on.Testing against data sets of non-redundant protein sequences returned values for the Q3 accuracy measure that ranged from 84.0% to 85.6%,while values for the SOV99 segment overlap measure ranged from 79.8% to 80.6%. When compared using a blind test dataset from the CASP8 meeting against currently available secondary structure prediction methods, our new approach shows superior accuracy.Availability: http://www.kdd.ustb.edu.cn/protein_Web/.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号