首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
烃类物质闪点定量构效的关系   总被引:1,自引:0,他引:1  
应用CODESSA软件计算300种烃类物质的分子结构描述符,用启发式回归(HM)、最佳多元线性回归(B-MLR)法,以筛选出来的分子描述符建立线性回归模型.用B-MLR法所选4个描述符作为支持向量机(SVM)的输入,建立非线性模型.预测结果表明:所建模型稳健,泛化能力强,预测误差小.非线性模型(R2=0.9884,RMSE=8.7570)的性能优于线性回归模型(HM:R2=0.9815,RMSE=11.0653;B-MLR:R2=0.9814,RMSE=11.1041),预测的效果令人满意.  相似文献   

2.
3.
以2D-autocorrelation描述符为结构参数,采用PSO和逐步回归的方法进行变量筛选,再结合SVM等机器学习算法对28种苯丙烯盐类化合物对EBV-EA病毒的抑制性活性进行定量构效关系(QSAR)研究.研究结果表明,PSO-v-SVM模型具有最优的模型稳健性和预测效果.由PSO选入的构成该模型的5个2D-autocorrelation描述符为ATS5v,ATS6e,ATS8e,ATS3p,GATS5p;该模型对训练集的拟合和留一法交叉验证结果的相关系数R~2和q_(cv)~2分别为0.986和0.930,对测试集预测结果的相关系数R~2_(ext)达0.955.对5个变量的理化意义的分析表明,极化率、Van der Waals体积和电负性对苯丙烯盐类化合物的抑制性活性影响分别约占57.13%、15.90%和26.97%.  相似文献   

4.
5.
6.
Peptide-MHC binding is an important prerequisite event and has immediate consequences to immune response. Those peptides binding to MHC molecules can activate the T-cell immunity, and they are useful for understanding the immune mechanism and developing vaccines for diseases. Accurate prediction of the binding between peptides and MHC-II molecules has long been a challenge in bioinformatics. Recently, instead of differentiating peptides as binder or non-binder, researchers are more interested in making predictions directly on peptide binding affinities. In this paper, we investigate the use of relevance vector machine to quantitatively predict the binding affinities between MHC-II molecules and peptides. In our scheme, a new encoding scheme is used to generate the input vectors, and then by using relevance vector machine we develop the prediction models on the basis of binding cores, which are recognized in an iterative self-consistent way. When applied to three MHC-II molecules DRB1*0101, DRB1*0401 and DRB1*1501, our method produces consistently better performance than several popular quantitative methods, in terms of cross-validated squared error, cross-validated correlation coefficient, and area under ROC curve. All evidences indicate that our method is an effective tool for MHC-II binding affinity prediction.  相似文献   

7.
基于遗传算法的支持向量机预测含能材料密度的研究   总被引:4,自引:2,他引:2  
基于遗传算法(genetic algorithm,GA)的变量筛选和支持向量机(support vector machine,SVM),提出了一种改进的定量结构-性质相关(quantitative structure detonation relationship,QSPR)建模方法——遗传-支持向量机(GA-SVM),并用其建立含能材料的定量结构-爆轰性能关系(QSDR)模型,此外还应用标准SVM方法建立了QSDR模型,并用这2种模型进行呋咱系含能化合物密度的预测,随机选取85%化合物作为训练集,用来建立模型,其余化合物作为测试集来测试模型的预测能力。预测结果的交互检验的相关系数平方分别为0.9887和0.9885,平均相对误差分别为1.16%和2.12%,表明了2种建模方法的有效性。通过对2种模型的预测能力进行比较,GA-SVM方法建立的QSDR模型能更好地预测呋咱系含能化合物的密度,更利于实际应用。  相似文献   

8.
Peptide-major histocompatibility complex (MHC) binding is an important prerequisite event and has immediate consequences to immune response. Those peptides binding to MHC molecules can activate the T-cell immunity, and they are useful for understanding the immune mechanism and developing vaccines for diseases. Recently, researchers are interested in making prediction about binding affinity instead of differentiating the peptides as binder or non-binder. In this paper, we use sparse Bayesian regression algorithm proposed by Tipping [M.E. Tipping, Sparse Bayesian learning and the relevance vector machine. J. Mach. Learn. Res. (2001)] to derive position-specific scoring matrices from allele-related peptides, and develop the models allowing for the prediction of MHC-II binding affinity. We explore the peptide length and peptide flanking residue length's impact on binding affinity, and incorporate these factors into our models to enhance prediction performance. When applied to the datasets from AntiJen database and IEDB database, our method produces better performances than several popular quantitative methods.  相似文献   

9.
10.
用文献设定的结构参数和本文设定的结构参数,分别与由HyperChem7.5Student Evaluation计算得到的量化参数作为自变量构成2组数据,以逐步回归,遗传算法-偏最小二乘法(GA-PLS)和遗传算法-支持向量机(GA-SVM)等算法就黄酮类化合物对PTKs抑制性进行QSAR研究。用各算法模型处理数据,由本文设定的结构参数构成的数据集获得的预测结果更好,表明采用取代基团类型和取代位置结合的编码参数包含的信息更为丰富,对物质性质的描述更加合理。在各种算法中, GA-SVM模型均具有最佳预测效果,该算法对2组数据作留一法预测处理得到的相关系数R和PTKs抑制性实验值与预测值的平均绝对误差MAE分别为0.7595,0.2871和0.7864,0.2883。研究还表明,GA-PLS和GA-SVM联用算法的预测效果远高于单独使用的PLS和SVM算法;由逐步回归建立的MLR模型对2组数据进行计算处理,尽管拟合时相关系数R分别达到0.8136和0.8250,但作留一法交互验证时却下降到0.7113和0.7354,明显低于GA-PLS和GA-SVM联用算法。  相似文献   

11.
12.
对125个磺胺类碳酸酐酶Ⅱ抑制剂的生物活性进行了预测研究。利用ADRIANA.Code软件计算得到了化合物的一系列2D和3D结构描述符,从中选用了12个描述符进行建模。分别用数学随机划分的方法和Kohonen自组织神经网络的方法把数据集划分成两组不同的训练集和测试集。对于这两组不同的训练集和测试集,分别利用多元线性回归(MLR)和支持向量机(SVM)的方法进行建模,共得到4个模型。其中SVM得到的2个模型,训练集的相关系数在0.92以上,测试集预测的相关系数都在0.90以上。所有模型可进一步用于碳酸酐酶Ⅱ抑制剂的虚拟筛选。  相似文献   

13.
Diagnosis of potential faults concealed inside power transformers is the key of ensuring stable electrical power supply to consumers. Support vector machine (SVM) is a new machine learning method based on the statistical learning theory, which is a powerful tool for solving the problem with small sampling, nonlinearity and high dimension. The selection of SVM parameters has an important influence on the classification accuracy of SVM. However, it is very difficult to select appropriate SVM parameters. In this study, support vector machine with genetic algorithm (SVMG) is applied to fault diagnosis of a power transformer, in which genetic algorithm (GA) is used to select appropriate free parameters of SVM. The experimental data from several electric power companies in China are used to illustrate the performance of the proposed SVMG model. The experimental results indicate that the SVMG method can achieve higher diagnostic accuracy than IEC three ratios, normal SVM classifier and artificial neural network.  相似文献   

14.
Due to the economic significance of bankruptcy prediction of companies for financial institutions, investors and governments, many quantitative methods have been used to develop effective prediction models. Support vector machine (SVM), a powerful classification method, has been used for this task; however, the performance of SVM is sensitive to model form, parameter setting and features selection. In this study, a new approach based on direct search and features ranking technology is proposed to optimise features selection and parameter setting for 1-norm and least-squares SVM models for bankruptcy prediction. This approach is also compared to the SVM models with parameter optimisation and features selection by the popular genetic algorithm technique. The experimental results on a data set with 2010 instances show that the proposed models are good alternatives for bankruptcy prediction.  相似文献   

15.
16.
17.
QSAR研究中,判断模型预测能力至关重要。长期以来,模型的预测能力是使用留一法或留k法等内部验证来确定,但在2004年形成的OECD规则中,已明确规定必须使用外部验证集去评价模型的预测能力。为了研究内部验证和外部验证与模型预测能力之间的关系,本文以45种睾酮和二氢睾酮衍生物以及37种萘锟酯衍生物为研究对象,以E-Dragon计算的分子描述符作为自变量,在增n减l算法选择变量的基础上,采用SVM算法对同种物质的不同活性以及不同物质的不同活性建立QSAR模型,研究QSAR/QSPR建模时的不同验证方式与模型预测能力的关系。研究结果表明,模型的预测能力与内部验证结果的好坏无必然联系,而结合外部验证的检验结果则是判断模型预测能力的可靠依据。  相似文献   

18.
应用多元线性回归、人工神经网络、支持向量机3种方法,对加入聚乙二醇、十二烷基苯磺酸钠、石油磺酸盐和部分水解聚丙烯酰胺四种处理剂的蒙脱土悬浮液的电动电位进行预测。在模型训练中,分别采用了神经网络集成和非启发式参数优化来提高人工神经网络和支持向量机模型的泛化能力。检验结果表明,参数优化的支持向量机模型预测精度最高,其平均误差率为3.88%,最大误差率为7.55%。  相似文献   

19.
支持向量机(SVM)是一类新型机器学习方法,首先简要介绍了SVM的基本原理,进而分析了该方法应用于异常检测,最后对基于支持向量机(SVM)的异常检测在路由器中的实现方法进行了简要论述和仿真实验。  相似文献   

20.
支持向量机是一种基于统计学习理论的新颖的机器学习方法,该方法已经广泛用于解决分类与回归问题。标准的支持向量机算法需要解一个二次规划问题,当训练样本较多时,其运算速度一般很慢。为了提高运算速度,介绍了一种基于线性规划的支持向量回归算法,并由此提出几种新的回归模型,同时将它们应用到混沌时间序列预测中,并比较了它们的预测性能。在实际应用中,可以根据具体情况灵活地选择所需模型。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号