首页 | 本学科首页   官方微博 | 高级检索  
     


Prediction of lysine propionylation sites using biased SVM and incorporating four different sequence features into Chou’s PseAAC
Affiliation:1. College of Science, Shenyang Aerospace University, 110136, People’s Republic of China;2. College of Information and Communication Engineering, Dalian Minzu University, 116600, People’s Republic of China;1. Department of Chemical Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, United States;2. Department of Sustainable Biomaterials, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, United States;1. Université de Lyon, Université Lyon 1, CNRS, UMR5223, Ingénierie des Matériaux Polymères, 15 Boulevard A. Latarjet, 69622 Villeurbanne, France;2. Université de Cergy-Pontoise, Laboratoire de Physicochimie des Polymères et des Interfaces (LPPI–EA 2528)–I-Mat-5, mail Gay-Lussac, 95031 Cergy-Pontoise, France;3. Université Grenoble Alpes, CNRS/CEA-INAC-SPrAM, 38000 Grenoble, France;1. Department of Chemistry, Faculty of Science and Mathematics, Universiti Pendidikan Sultan Idris, 35900 Tanjong Malim, Perak, Malaysia;2. Nanotechnology Research Centre, Faculty of Science and Mathematics, Universiti Pendidikan Sultan Idris, 35900 Tanjong Malim, Perak, Malaysia;3. Pusat Pengajian Sains Kimia, Universiti Sains Malaysia, 11800 USM, Pulau Pinang, Malaysia;1. Macromolecules and Interfaces Institute, Virginia Tech, Blacksburg, VA 24061, United States;2. Department of Sustainable Biomaterials, Virginia Tech, Blacksburg, VA 24061, United States;1. Unidad de Investigacion Medica en Bioquimica, Centro Medico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico;2. Facultad de Estudios Superiores Zaragoza, Universidad Nacional Autónoma de México, Mexico City, Mexico
Abstract:Lysine propionylation is an important and common protein acylation modification in both prokaryotes and eukaryotes. To better understand the molecular mechanism of propionylation, it is important to identify propionylated substrates and their corresponding propionylation sites accurately. In this study, a novel bioinformatics tool named PropPred is developed to predict propionylation sites by using multiple feature extraction and biased support vector machine. On the one hand, various features are incorporated, including amino acid composition, amino acid factors, binary encoding, and the composition of k-spaced amino acid pairs. And the F-score feature method and the incremental feature selection algorithm are adopted to remove the redundant features. On the other hand, the biased support vector machine algorithm is used to handle the imbalanced problem in propionylation sites training dataset. As illustrated by 10-fold cross-validation, the performance of PropPred achieves a satisfactory performance with a Sensitivity of 70.03%, a Specificity of 75.61%, an accuracy of 75.02% and a Matthew’s correlation coefficient of 0.3085. Feature analysis shows that some amino acid factors play the most important roles in the prediction of propionylation sites. These analysis and prediction results might provide some clues for understanding the molecular mechanisms of propionylation. A user-friendly web-server for PropPred is established at 123.206.31.171/PropPred/.
Keywords:Post-translational modification  Propionylation  Feature extraction  Incremental feature selection  Biased support vector machine
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号