首页 | 本学科首页   官方微博 | 高级检索  
     


Machine-Learning-Based Genome-Wide Association Studies for Uncovering QTL Underlying Soybean Yield and Its Components
Authors:Mohsen Yoosefzadeh-Najafabadi  Milad Eskandari  Sepideh Torabi  Davoud Torkamaneh  Dan Tulpan  Istvan Rajcan
Affiliation:1.Department of Plant Agriculture, University of Guelph, Guelph, ON N1G 2W1, Canada; (M.Y.-N.); (S.T.); (I.R.);2.Département de Phytologie, Université Laval, Québec City, QC G1V 0A6, Canada;3.Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1, Canada;
Abstract:A genome-wide association study (GWAS) is currently one of the most recommended approaches for discovering marker-trait associations (MTAs) for complex traits in plant species. Insufficient statistical power is a limiting factor, especially in narrow genetic basis species, that conventional GWAS methods are suffering from. Using sophisticated mathematical methods such as machine learning (ML) algorithms may address this issue and advance the implication of this valuable genetic method in applied plant-breeding programs. In this study, we evaluated the potential use of two ML algorithms, support-vector machine (SVR) and random forest (RF), in a GWAS and compared them with two conventional methods of mixed linear models (MLM) and fixed and random model circulating probability unification (FarmCPU), for identifying MTAs for soybean-yield components. In this study, important soybean-yield component traits, including the number of reproductive nodes (RNP), non-reproductive nodes (NRNP), total nodes (NP), and total pods (PP) per plant along with yield and maturity, were assessed using a panel of 227 soybean genotypes evaluated at two locations over two years (four environments). Using the SVR-mediated GWAS method, we were able to discover MTAs colocalized with previously reported quantitative trait loci (QTL) with potential causal effects on the target traits, supported by the functional annotation of candidate gene analyses. This study demonstrated the potential benefit of using sophisticated mathematical approaches, such as SVR, in a GWAS to complement conventional GWAS methods for identifying MTAs that can improve the efficiency of genomic-based soybean-breeding programs.
Keywords:data-driven models  FarmCPU  genome-wide association study  MLM  QTL  soybean breeding  support-vector machine
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号