

Using hidden Markov models to predict DNA-binding proteins with sequence and structure information
Authors: Yi-Yu Hsu, Wei-Jhih Chen, Shu-Hui Chen, Hung-Yu Kao
Affiliation: 1. Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan, ROC
2. Institute of Medical Informatics, National Cheng Kung University, Tainan, Taiwan, ROC
3. Department of Chemistry, National Cheng Kung University, Tainan, Taiwan, ROC
Abstract: In the post-genome era, protein domain structures are being published rapidly, but they have not been studied comprehensively. Recent research uses protein–DNA interactions to interpret protein domain structures and thereby understand cell function. Several machine-learning methods have been applied to this problem; however, they do not efficiently translate the tertiary-structure characteristics of proteins into features suitable for predicting DNA-binding proteins. In this work, a novel machine-learning approach based on hidden Markov models identifies the characteristics of DNA-binding proteins from their amino acid sequences and tertiary structures. After features are extracted from DNA-binding proteins, a support vector machine classifier predicts general DNA-binding proteins with an accuracy of 88.45% under five-fold cross-validation. Furthermore, we construct a response-element-specific classifier for predicting response-element-specific DNA-binding proteins; it achieves a precision of 96.57% with a recall of 88.83% on average. To verify the predictions, we used DNA-binding proteins from MCF-7 that are likely to bind estrogen response elements (ERE), and the results show that our methods can be applied in practice.
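For readers unfamiliar with the evaluation terms used in the abstract, the following is a minimal sketch of how five-fold cross-validation splits and precision/recall are typically computed. It is not the authors' code: the function names `five_fold_indices` and `precision_recall` are hypothetical, the split is contiguous and unshuffled for simplicity, and the labels in the usage lines are made-up toy values, not data from the paper.

```python
from typing import List, Sequence, Tuple


def five_fold_indices(n_samples: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Return k (train, test) index pairs; each sample is tested exactly once.

    Illustrative helper (an assumption, not from the paper). A real study
    would usually shuffle and stratify by class before splitting.
    """
    folds = []
    base, extra = divmod(n_samples, k)
    start = 0
    for i in range(k):
        # The first `extra` folds get one additional sample.
        size = base + (1 if i < extra else 0)
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds


def precision_recall(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Precision = TP / (TP + FP); recall = TP / (TP + FN). Label 1 = DNA-binding."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy usage with invented labels (not results from the paper).
folds = five_fold_indices(12)
toy_precision, toy_recall = precision_recall([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])
```

In such a setup, a classifier would be trained on each `train` split and scored on the matching `test` split, with the reported figure being the average over the five folds.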
This article is indexed in SpringerLink and other databases.