首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Peptide-MHC binding is an important prerequisite event and has immediate consequences to immune response. Those peptides binding to MHC molecules can activate the T-cell immunity, and they are useful for understanding the immune mechanism and developing vaccines for diseases. Accurate prediction of the binding between peptides and MHC-II molecules has long been a challenge in bioinformatics. Recently, instead of differentiating peptides as binder or non-binder, researchers are more interested in making predictions directly on peptide binding affinities. In this paper, we investigate the use of relevance vector machine to quantitatively predict the binding affinities between MHC-II molecules and peptides. In our scheme, a new encoding scheme is used to generate the input vectors, and then by using relevance vector machine we develop the prediction models on the basis of binding cores, which are recognized in an iterative self-consistent way. When applied to three MHC-II molecules DRB1*0101, DRB1*0401 and DRB1*1501, our method produces consistently better performance than several popular quantitative methods, in terms of cross-validated squared error, cross-validated correlation coefficient, and area under ROC curve. All evidences indicate that our method is an effective tool for MHC-II binding affinity prediction.  相似文献   

2.
Peptide vaccination for cancer immunotherapy requires identification of peptide epitopes derived from antigenic proteins associated with the tumor. Such peptides can bind to MHC proteins (MHC molecules) on the tumor-cell surface, with the potential to initiate a host immune response against the tumor. Computer prediction of peptide epitopes can be based on known motifs for peptide sequences that bind to a certain MHC molecule, on algorithms using experimental data as a training set, or on structure-based approaches. We have developed an algorithm, which we refer to as PePSSI, for flexible structural prediction of peptide binding to MHC molecules. Here, we have applied this algorithm to identify peptide epitopes (of nine amino acids, the common length) from the sequence of the cancer-testis antigen KU-CT-1, based on the potential of these peptides to bind to the human MHC molecule HLA-A2. We compared the PePSSI predictions with those of other algorithms and found that several peptides predicted to be strong HLA-A2 binders by PePSSI were similarly predicted by another structure-based algorithm, PREDEP. The results show how structure-based prediction can identify potential peptide epitopes without known binding motifs and suggest that side chain orientation in binding peptides may be obtained using PePSSI.  相似文献   

3.
4.
5.
Strategies for selecting informative data points for training prediction algorithms are important, particularly when data points are difficult and costly to obtain. A Query by Committee (QBC) training strategy for selecting new data points uses the disagreement between a committee of different algorithms to suggest new data points, which most rationally complement existing data, that is, they are the most informative data points. In order to evaluate this QBC approach on a real-world problem, we compared strategies for selecting new data points. We trained neural network algorithms to obtain methods to predict the binding affinity of peptides binding to the MHC class I molecule, HLA-A2. We show that the QBC strategy leads to a higher performance than a baseline strategy where new data points are selected at random from a pool of available data. Most peptides bind HLA-A2 with a low affinity, and as expected using a strategy of selecting peptides that are predicted to have high binding affinities also lead to more accurate predictors than the base line strategy. The QBC value is shown to correlate with the measured binding affinity. This demonstrates that the different predictors can easily learn if a peptide will fail to bind, but often conflict in predicting if a peptide binds. Using a carefully constructed computational setup, we demonstrate that selecting peptides with a high QBC performs better than low QBC peptides independently from binding affinity. When predictors are trained on a very limited set of data they cannot be expected to disagree in a meaningful way and we find a data limit below which the QBC strategy fails. Finally, it should be noted that data selection strategies similar to those used here might be of use in other settings in which generation of more data is a costly process.  相似文献   

6.
With its implications for vaccine discovery, the accurate prediction of T cell epitopes is one of the key aspirations of computational vaccinology. We have developed a robust multivariate statistical method, based on partial least squares, for the quantitative prediction of peptide binding to major histocompatibility complexes (MHC), the principal checkpoint on the antigen presentation pathway. As a service to the immunobiology community, we have made a Perl implementation of the method available via a World Wide Web server. We call this server MHCPred. Access to the server is freely available from the URL: http://www.jenner.ac.uk/MHCPred. We have exemplified our method with a model for peptides binding to the common human MHC molecule HLA-B*3501.  相似文献   

7.
MHC II类分子结合肽的预测对于免疫研究和疫苗设计非常重要,然而其结合肽长度的可变性等原因使其预测变得极为困难,提出了一种基于广义选择性神经网络集成的MHC II分子结合肽预测算法,该算法是一种双层集成模型。第一层是用微分进化算法去生成初始神经网络集成池,第二层是从初始神经网络集成池中选择部分组成最终的神经网络集成。实验结果表明广义选择性神经网络集成比传统的选择性神经网络有更好的泛化性能。  相似文献   

8.
9.
Peptides that induce and recall T-cell responses are called T-cell epitopes. T-cell epitopes may be useful in a subunit vaccine against malaria. Computer models that simulate peptide binding to MHC are useful for selecting candidate T-cell epitopes since they minimize the number of experiments required for their identification. We applied a combination of computational and immunological strategies to select candidate T-cell epitopes. A total of 86 experimental binding assays were performed in three rounds of identification of HLA-A11 binding peptides from the six preerythrocytic malaria antigens. Thirty-six peptides were experimentally confirmed as binders. We show that the cyclical refinement of the ANN models results in a significant improvement of the efficiency of identifying potential T-cell epitopes.  相似文献   

10.
曾安  潘丹  郑启伦  彭宏 《计算机科学》2007,34(6):226-230
T细胞表位预测技术对于减少实验合成重叠肽、研究病原体与机体作用的免疫机制以及深入理解T细胞介导的免疫特异性均有重要意义。为增强T细胞表位预测模型的可理解性,本文在通过肽的预处理构建出存储等长肽段的决策表之后,设计出了一种基于粗集的T细胞表位预测方法。该方法由基于信息熵的属性约简完备算法和基于锚点知识的属性值顺序约简改进算法共同组成。基于HLA-DR4(B10401)编码的MHCII类分子结合肽的实验数据表明,在预测精度与传统神经网络方法大致相当的基础上,本文方法可以提取出用于帮助专家理解MHC分子与抗原肽结合机理的产生式规则。  相似文献   

11.
用于T细胞表位预测的分类器集成方法*   总被引:1,自引:1,他引:0  
T细胞表位预测技术对于减少实验合成重叠肽,理解T细胞介导的免疫特异性和研制亚单位多肽及基因疫苗均有重要意义.为弥补已有基于机器学习方法的T细胞表位预测模型的可理解性的不足并进一步提高模型的预测精度,首先通过肽的预处理构建出了存储等长肽段的决策表,而后提出了基于粗糙集的分类器集成算法.该算法不但综合利用了基于信息熵的属性约简完备算法和其他属性约简算法的优势,而且将T细胞表位预测领域中的锚点知识融入到了属性值约简过程中.最后利用该算法来预测MHC Ⅱ类分子HLA-DR4(B1·0401)的结合肽,首次提取出了预测精度高且能帮助专家理解MHC分子与抗原肽的结合机理的产生式规则,为下一步的分子建模工作奠定了基础.  相似文献   

12.
The insulin-like growth factor-1 receptor (IGF-1R) plays a key role in proliferation, growth, differentiation, and development of several human malignancies including breast and pancreatic adenocarcinoma. IGF-1R targeted immunotherapeutic approaches are particularly attractive, as they may potentially elicit even stronger antitumor responses than traditional targeted approaches. Cancer peptide vaccines can produce immunologic responses against cancer cells by triggering helper T cell (Th) or cytotoxic T cells (CTL) in association with Major Histocompatibility Complex (MHC) class I or II molecules on the cell surface of antigen presenting cells. In our previous study, we set a technique based on molecular docking in order to find the best MHC class I and II binder peptides using GOLD. In the present work, molecular docking analyses on a library consisting of 30 peptides mimicking discontinuous epitopes from IGF-1R extracellular domain identified peptides 249 and 86, as the best MHC binder peptides to both MHC class I and II molecules. The receptors most often targeted by peptide 249 are HLA-DR4, HLA-DR3 and HLA-DR2 and those most often targeted by peptide 86 are HLA-DR4, HLA-DP2 and HLA-DR3. These findings, based on bioinformatics analyses, can be conducted in further experimental analyses in cancer therapy and vaccine design.  相似文献   

13.
The non-covalent interaction between single-walled carbon nanotube and surfactant peptides makes them soluble in biological media to be used in nano-medicine, drug delivery and gene therapy. Pervious study has shown that two important parameters in binding peptides into nanotubes are hydrophobic effect and the number of aromatic amino acids. Ten surfactant peptides with the length of eight residue, including Lys, Trp, Tyr, Phe and Val, were designed to investigate the important parameters in binding peptides to a (6, 6) carbon nanotube. 500 ns MD simulation was performed for free surfactant peptides in water or near to a nanotube. Our results have indicated that the binding affinity of peptides to nanotube increases with the increase of aromatic residue content. Also, among aromatic residues, the peptides containing Trp residues have higher binding affinity to nanotube compared to the peptides with Phe or Tyr residue. Steric hindrance between bulky aromatic residues in peptide sequence has negative influence in binding peptide to nanotube, and in designing a surfactant peptide, the number and distance of aromatic residue and polarity of them should be taken into account. Our results also show that in docking peptides to nanotube, full-flexible docking leads to incorrect results.  相似文献   

14.
One-shot learning of object categories   总被引:6,自引:0,他引:6  
Learning visual models of object categories notoriously requires hundreds or thousands of training examples. We show that it is possible to learn much information about a category from just one, or a handful, of images. The key insight is that, rather than learning from scratch, one can take advantage of knowledge coming from previously learned categories, no matter how different these categories might be. We explore a Bayesian implementation of this idea. Object categories are represented by probabilistic models. Prior knowledge is represented as a probability density function on the parameters of these models. The posterior model for an object category is obtained by updating the prior in the light of one or more observations. We test a simple implementation of our algorithm on a database of 101 diverse object categories. We compare category models learned by an implementation of our Bayesian approach to models learned from by maximum likelihood (ML) and maximum a posteriori (MAP) methods. We find that on a database of more than 100 categories, the Bayesian approach produces informative models when the number of training examples is too small for other methods to operate successfully.  相似文献   

15.
Relational models are the most common representation of structured data, and acyclic database theory is important in relational databases. In this paper, we propose the method for constructing the Bayesian network structure from dependencies implied in multiple relational schemas. Based on the acyclic database theory and its relationships with probabilistic networks, we are to construct the Bayesian network structure starting from implied independence information instead of mining database instances. We first give the method to find the maximum harmoniousness subset for the multi-valued dependencies on an acyclic schema, and thus the most information of conditional independencies can be retained. Further, aiming at multi-relational environments, we discuss the properties of join graphs of multiple 3NF database schemas, and thus the dependencies between separate relational schemas can be obtained. In addition, on the given cyclic join dependency, the transformation from cyclic to acyclic database schemas is proposed by virtue of finding a minimal acyclic augmentation. An applied example shows that our proposed methods are feasible.  相似文献   

16.
Many database search methods have been developed for peptide identification throughout a large peptide data set. Most of these approaches attempt to build a decision function that allows the identification of an experimental spectrum. This function is built either starting from similarity measures for the database peptides to identify the most similar one to a given spectrum, or by applying useful learning techniques considering the database itself as a training data. In this paper, we propose a peptide identification method based on a similarity measure for peptide-spectrum matches. Our method takes into account peak intensity distribution and applies it in a probabilistic scoring model to rank peptide matches. The main goal of our approach is to highlight the relationship between peak intensities and peptide cleavage positions on the one hand and to show its impact on peptide identification on the other hand. To evaluate our method, a set of experiments have been undertaken into two high mass spectrum accuracy data sets. The obtained results show the effectiveness of our proposed approach.  相似文献   

17.
在软件开发初期及时识别出软件存在的缺陷,可以帮助项目管理团队及时优化开发测试资源分配,以便对可能含有缺陷的软件进行严格的质量保证活动,这对于软件的高质量交付有着重要的作用,因此,软件缺陷预测成为软件工程领域内一个研究热点。虽然人们已经使用多种机器学习算法建立了缺陷预测模型,但还没有对这些模型的贝叶斯方法进行研究。提出了无信息先验和信息先验的贝叶斯Logistic回归方法来建立缺陷预测模型,并对贝叶斯Logistic回归的优势以及先验信息在贝叶斯Logistic回归中的作用进行了研究。最后,在PROMISE数据集上与其他已有缺陷预测方法(LR、NB、RF、SVM)进行了比较研究,结果表明:贝叶斯Logistic回归方法可以取得很好的预测性能。  相似文献   

18.
Bayesian max-margin models have shown superiority in various practical applications, such as text categorization, collaborative prediction, social network link prediction and crowdsourcing, and they conjoin the flexibility of Bayesian modeling and predictive strengths of max-margin learning. However, Monte Carlo sampling for these models still remains challenging, especially for applications that involve large-scale datasets. In this paper, we present the stochastic subgradient Hamiltonian Monte Carlo (HMC) methods, which are easy to implement and computationally efficient. We show the approximate detailed balance property of subgradient HMC which reveals a natural and validated generalization of the ordinary HMC. Furthermore, we investigate the variants that use stochastic subsampling and thermostats for better scalability and mixing. Using stochastic subgradient Markov Chain Monte Carlo (MCMC), we efficiently solve the posterior inference task of various Bayesian max-margin models and extensive experimental results demonstrate the effectiveness of our approach.  相似文献   

19.
Sparse on-line gaussian processes   总被引:7,自引:0,他引:7  
We develop an approach for sparse representations of gaussian process (GP) models (which are Bayesian types of kernel machines) in order to overcome their limitations for large data sets. The method is based on a combination of a Bayesian on-line algorithm, together with a sequential construction of a relevant subsample of the data that fully specifies the prediction of the GP model. By using an appealing parameterization and projection techniques in a reproducing kernel Hilbert space, recursions for the effective parameters and a sparse gaussian approximation of the posterior process are obtained. This allows for both a propagation of predictions and Bayesian error measures. The significance and robustness of our approach are demonstrated on a variety of experiments.  相似文献   

20.
Consideration of binding competitiveness of a drug candidate against natural ligands and other drugs that bind to the same receptor site may facilitate the rational development of a candidate into a potent drug. A strategy that can be applied to computer-aided drug design is to evaluate ligand-receptor interaction energy or other scoring functions of a designed drug with that of the relevant ligands known to bind to the same binding site. As a tool to facilitate such a strategy, a database of ligand-receptor interaction energy is developed from known ligand-receptor 3D structural entries in the Protein Databank (PDB). The Energy is computed based on a molecular mechanics force field that has been used in the prediction of therapeutic and toxicity targets of drugs. This database also contains information about ligand function and other properties and it can be accessed at http://xin.cz3.nus.edu.sg/group/CLiBE.asp. The computed energy components may facilitate the probing of the mode of action and other profiles of binding. A number of computed energies of some PDB ligand-receptor complexes in this database are studied and compared to experimental binding affinity. A certain degree of correlation between the computed energy and experimental binding affinity is found, which suggests that the computed energy may be useful in facilitating a qualitative analysis of drug binding competitiveness.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号