首页 | 本学科首页   官方微博 | 高级检索  
     


An ensemble of support vector machines for predicting virulent proteins
Authors:Loris Nanni  Alessandra Lumini
Affiliation:1. Railway System Engineering, University of Science and Technology, Uiwang-si, Republic of Korea;2. Transportation Environmental Research Team, Korea Railroad Research Institute, Uiwang-si 437-757, Republic of Korea;3. School of Urban and Environmental Engineering, Ulsan National Institute of Science and Technology, Ulsan 44919, Republic of Korea;4. Department of Environmental Engineering, Seoul National University of Science and Technology, Seoul01800, Republic of Korea;1. UNESCO Chair “Appropriate Technologies for Human Development”, Departamento de Geodinámica, Estratigrafía y Paleontología. Facultad de Ciencias Geológicas, C/ José Antonio Novais 2, Universidad Complutense de Madrid, Ciudad Universitaria, 28040 Madrid, Spain;2. Corporación Autónoma Regional del Valle del Cauca, Carrera 56 #11-36, Santiago de Cali 760036, Colombia;1. Department of Public Health, University of North Carolina at Charlotte, Charlotte, NC;2. Department of Sociology, University of North Carolina at Charlotte, Charlotte, NC;3. Department of Health and Wellness, University of North Carolina at Asheville, Asheville, NC;4. Department of Community and Family Medicine and Global Health, Duke University, Durham, NC
Abstract:It is important to develop a reliable system for predicting bacterial virulent proteins for finding novel drug/vaccine and for understanding virulence mechanisms in pathogens.In this work we have proposed a bacterial virulent protein prediction method based on an ensemble of classifiers where the features are extracted directly from the amino acid sequence of a given protein. It is well known in the literature that the features extracted from the evolutionary information of a given protein are better than the features extracted from the amino acid sequence. Our method tries to fill the gap between the amino acid sequence based approaches and the evolutionary information based approaches.An extensive evaluation according to a blind testing protocol, where the parameters of the system are calculated using the training set and the system is validated in three different independent datasets, has demonstrated the validity of the proposed method.
Keywords:Virulent proteins  Machine learning  Ensemble of classifiers  Support vector machines
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号