首页 | 本学科首页   官方微博 | 高级检索  
     


A novel ant colony optimization based single path hierarchical classification algorithm for predicting gene ontology
Affiliation:1. National University of Computer and Emerging Sciences, Computer Science Department, Islamabad, Pakistan;2. Al Imam Mohammad Ibn Saud Islamic University (IMSIU),College of Computer and Information Sciences, Riyadh, Saudi Arabia;1. Faculty of Information Engineering, China University of Geosciences, Wuhan 430074, China;2. National Engineering Research Center of Geographic Information System, China University of Geosciences, Wuhan 430074, China;3. Department of Mathematics, Hangzhou Dianzi University, Hangzhou 310018, China
Abstract:There exist numerous state of the art classification algorithms that are designed to handle the data with nominal or binary class labels. Unfortunately, less attention is given to the genre of classification problems where the classes are organized as a structured hierarchy; such as protein function prediction (target area in this work), test scores, gene ontology, web page categorization, text categorization etc. The structured hierarchy is usually represented as a tree or a directed acyclic graph (DAG) where there exist IS-A relationship among the class labels. Class labels at upper level of the hierarchy are more abstract and easy to predict whereas class labels at deeper level are most specific and challenging for correct prediction. It is helpful to consider this class hierarchy for designing a hypothesis that can handle the tradeoff between prediction accuracy and prediction specificity. In this paper, a novel ant colony optimization (ACO) based single path hierarchical classification algorithm is proposed that incorporates the given class hierarchy during its learning phase. The algorithm produces IF–THEN ordered rule list and thus offer comprehensible classification model. Detailed discussion on the architecture and design of the proposed technique is provided which is followed by the empirical evaluation on six ion-channels data sets (related to protein function prediction) and two publicly available data sets. The performance of the algorithm is encouraging as compared to the existing methods based on the statistically significant Student's t-test (keeping in view, prediction accuracy and specificity) and thus confirm the promising ability of the proposed technique for hierarchical classification task.
Keywords:Hierarchical classification  Ant colony optimization  Bio-informatics data sets with gene ontology  Protein function prediction  Correlation based IF–THEN rule list
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号