1
|
Vitale P, Montesinos-López O, Gerard G, Velu G, Tadesse Z, Montesinos-López A, Dreisigacker S, Pacheco A, Toledo F, Saint Pierre C, Pérez-Rodríguez P, Gardner K, Crespo-Herrera L, Crossa J. Improving wheat grain yield genomic prediction accuracy using historical data. G3 (BETHESDA, MD.) 2025; 15:jkaf038. [PMID: 40056458 PMCID: PMC12005153 DOI: 10.1093/g3journal/jkaf038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2024] [Revised: 02/08/2025] [Accepted: 02/12/2025] [Indexed: 03/10/2025]
Abstract
Genomic selection is an essential tool to improve genetic gain in wheat breeding. This study aimed to enhance prediction accuracy for grain yield across various selection environments using CIMMYT's (International Maize and Wheat Improvement Center) historical dataset. Ten years of grain yield data from 6 selection environments were analyzed, with the populations of 5 years (2018-2023) as the validation population and earlier years (back to 2013-2014) as the training population. Generally, we observed that as the number of training years increased, the prediction accuracy tended to improve or stabilize. For instance, in the late heat stress selection environment (beds late heat stress), prediction accuracy increased from 0.11 (1 training year) to 0.23 (5 years), stabilizing at 0.26. Similar trends were observed in the intermediate drought selection environment (beds with 2 irrigations), with prediction accuracy rising from 0.12 (1 year) to 0.21 (4 years) but minimal improvement beyond that. Conversely, some selection environments, such as flat 5 irrigations (flat optimal environment), did not significantly increase, with the prediction accuracy fluctuating around 0.09-0.14 regardless of the number of training years used. Additionally, average genetic diversity within the training population and the validation population influenced prediction accuracy. Indeed, a negative correlation between prediction accuracy and the genetic distance was observed. This highlights the need to balance genetic diversity to enhance the predictive power of genomic selection models. These findings exhibit the benefits of using an extended historical dataset while considering genetic diversity to maximize prediction accuracy in genomic selection strategies for wheat breeding, ultimately supporting the development of high-yielding varieties.
Collapse
Affiliation(s)
- Paolo Vitale
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | | | - Guillermo Gerard
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | - Govindan Velu
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | - Zerihun Tadesse
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | - Abelardo Montesinos-López
- Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, Guadalajara 44430, Jalisco, Mexico
| | - Susanne Dreisigacker
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | - Angela Pacheco
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | - Fernando Toledo
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | - Carolina Saint Pierre
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | | | - Keith Gardner
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | - Leonardo Crespo-Herrera
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
| | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera México-Veracruz, El Batan, Edo. de México 5623, Mexico
- Colegio de Postgraduados, Montecillo, Edo. de México 56231, Mexico
| |
Collapse
|
2
|
Stadler A, Müller WG, Futschik A. A comparison of design algorithms for choosing the training population in genomic models. Front Genet 2025; 15:1462855. [PMID: 40018340 PMCID: PMC11865072 DOI: 10.3389/fgene.2024.1462855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 12/20/2024] [Indexed: 03/01/2025] Open
Abstract
In contemporary breeding programs, typically genomic best linear unbiased prediction (gBLUP) models are employed to drive decisions on artificial selection. Experiments are performed to obtain responses on the units in the breeding program. Due to restrictions on the size of the experiment, an efficient experimental design must usually be found in order to optimize the training population. Classical exchange-type algorithms from optimal design theory can be employed for this purpose. This article suggests several variants for the gBLUP model and compares them to brute-force approaches from the genomics literature for various design criteria. Particular emphasis is placed on evaluating the computational runtime of algorithms along with their respective efficiencies over different sample sizes. We find that adapting classical algorithms from optimal design of experiments can help to decrease runtime, while maintaining efficiency.
Collapse
Affiliation(s)
- Alexandra Stadler
- Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
| | - Werner G. Müller
- Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
| | - Andreas Futschik
- Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
| |
Collapse
|
3
|
Li W, Zhang M, Fan J, Yang Z, Peng J, Zhang J, Lan Y, Chai M. Analysis of the genetic basis of fiber-related traits and flowering time in upland cotton using machine learning. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2025; 138:36. [PMID: 39853381 DOI: 10.1007/s00122-025-04821-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Accepted: 01/11/2025] [Indexed: 01/26/2025]
Abstract
Cotton is an important crop for fiber production, but the genetic basis underlying key agronomic traits, such as fiber quality and flowering days, remains complex. While machine learning (ML) has shown great potential in uncovering the genetic architecture of complex traits in other crops, its application in cotton has been limited. Here, we applied five machine learning models-AdaBoost, Gradient Boosting Regressor, LightGBM, Random Forest, and XGBoost-to identify loci associated with fiber quality and flowering days in cotton. We compared two SNP dataset down-sampling methods for model training and found that selecting SNPs with an Fscale value greater than 0 outperformed randomly selected SNPs in terms of model accuracy. We further performed machine learning quantitative trait loci (mlQTLs) analysis for 13 traits related to fiber quality and flowering days. These mlQTLs were then compared to those identified through genome-wide association studies (GWAS), revealing that the machine learning approach not only confirmed known loci but also identified novel QTLs. Additionally, we evaluated the effect of population size on model accuracy and found that larger population sizes resulted in better predictive performance. Finally, we proposed candidate genes for the identified mlQTLs, including two argonaute 5 proteins, Gh_A09G104100 and Gh_A09G104400, for the FL3/FS2 locus, as well as GhFLA17 and Syntaxin-121 (Gh_D09G143700) for the FSD09_2/FED09_2 locus. Our findings demonstrate the efficacy of machine learning in enhancing the identification of genetic loci in cotton, providing valuable insights for improving cotton breeding strategies.
Collapse
Affiliation(s)
- Weinan Li
- Nanfan Research Institute, Chinese Academy of Agricultural Sciences, Sanya, 572024, Hainan, China
- College of Electronic Engineering (College of Artificial Intelligence), South China Agricultural University, Guangzhou, 510642, Guangdong, China
| | - Mingjun Zhang
- State Key Laboratory of Cotton Bio-Breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Jingchao Fan
- Agricultural Information Institute of Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Zhaoen Yang
- State Key Laboratory of Cotton Bio-Breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Jun Peng
- Nanfan Research Institute, Chinese Academy of Agricultural Sciences, Sanya, 572024, Hainan, China
- State Key Laboratory of Cotton Bio-Breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Jianhua Zhang
- Nanfan Research Institute, Chinese Academy of Agricultural Sciences, Sanya, 572024, Hainan, China.
- Agricultural Information Institute of Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| | - Yubin Lan
- College of Electronic Engineering (College of Artificial Intelligence), South China Agricultural University, Guangzhou, 510642, Guangdong, China.
| | - Mao Chai
- State Key Laboratory of Cotton Bio-Breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China.
| |
Collapse
|
4
|
Chen YR, Frei UK, Lübberstedt T. Genomic estimated selection criteria and parental contributions in parent selection increase genetic gain of maternal haploid inducers in maize. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:248. [PMID: 39369351 DOI: 10.1007/s00122-024-04744-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 09/13/2024] [Indexed: 10/07/2024]
Abstract
KEY MESSAGE Parental combinations determined by genomic estimated usefulness and parental contributions of the lines in bridging population can enhance the genetic gain of traits of interest in maternal haploid inducer breeding. Parent selection in crosses aligns well with the quantitative trait performance in the progenies. We herein take advantage of estimated genetic values (EGV) and usefulness criteria (UC) of bi-parental combinations by genomic prediction (GP) to compare the empirical performance of doubled haploid inducer (DHI) progenies of eight elite inducers crosses in a half-diallel. We used parental contribution and discovery of superiors from elite-by-historical bridging populations to enhance genetic gain for long-term selection. In this empirical study, the narrow-sense heritabilities of four traits of interest (Days to flowering, DTF; haploid induction rate, HIR; plant height, PHT; Total primary branch length, PBL) in DHI population were 0.81, 0.71, 0.45 and 0.46, respectively. The genomic estimated EGV_Mid/Mean and EGV/UC_Inferior was significantly correlated with the sample mean of progenies and inferiors in four traits in the breeding and bridging population. EGV/UC_Superior were significantly correlated with the mean of superiors in DTF, PHT, and PBL in breeding and bridging populations. The genomic estimated parent contributions in DH progenies of bridging populations enabled discovery of favorable genome region from historical inducers to improve the genetic gain of HIR for long-term selection.
Collapse
Affiliation(s)
- Yu-Ru Chen
- Department of Agronomy, Iowa State University, Ames, IA, 50011-1051, USA
- Crop Science Division, Ministry of Agriculture, Taiwan Agricultural Research Institute, Taichung, Taiwan
| | - Ursula K Frei
- Department of Agronomy, Iowa State University, Ames, IA, 50011-1051, USA
| | - Thomas Lübberstedt
- Department of Agronomy, Iowa State University, Ames, IA, 50011-1051, USA.
| |
Collapse
|
5
|
Peixoto MA, Coelho IF, Leach KA, Lübberstedt T, Bhering LL, Resende MFR. Use of simulation to optimize a sweet corn breeding program: implementing genomic selection and doubled haploid technology. G3 (BETHESDA, MD.) 2024; 14:jkae128. [PMID: 38869242 PMCID: PMC11304600 DOI: 10.1093/g3journal/jkae128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 04/06/2024] [Accepted: 05/21/2024] [Indexed: 06/14/2024]
Abstract
Genomic selection and doubled haploids hold significant potential to enhance genetic gains and shorten breeding cycles across various crops. Here, we utilized stochastic simulations to investigate the best strategies for optimize a sweet corn breeding program. We assessed the effects of incorporating varying proportions of old and new parents into the crossing block (3:1, 1:1, 1:3, and 0:1 ratio, representing different degrees of parental substitution), as well as the implementation of genomic selection in two distinct pipelines: one calibrated using the phenotypes of testcross parents (GSTC scenario) and another using F1 individuals (GSF1). Additionally, we examined scenarios with doubled haploids, both with (DH) and without (DHGS) genomic selection. Across 20 years of simulated breeding, we evaluated scenarios considering traits with varying heritabilities, the presence or absence of genotype-by-environment effects, and two program sizes (50 vs 200 crosses per generation). We also assessed parameters such as parental genetic mean, average genetic variance, hybrid mean, and implementation costs for each scenario. Results indicated that within a conventional selection program, a 1:3 parental substitution ratio (replacing 75% of parents each generation with new lines) yielded the highest performance. Furthermore, the GSTC model outperformed the GSF1 model in enhancing genetic gain. The DHGS model emerged as the most effective, reducing cycle time from 5 to 4 years and enhancing hybrid gains despite increased costs. In conclusion, our findings strongly advocate for the integration of genomic selection and doubled haploids into sweet corn breeding programs, offering accelerated genetic gains and efficiency improvements.
Collapse
Affiliation(s)
- Marco Antônio Peixoto
- Laboratório de Biometria, Universidade Federal de Viçosa, Viçosa, Minas Gerais 36570-900, Brazil
- Sweet Corn Breeding and Genomics Lab, University of Florida, Gainesville, FL 32611, USA
| | - Igor Ferreira Coelho
- Laboratório de Biometria, Universidade Federal de Viçosa, Viçosa, Minas Gerais 36570-900, Brazil
- Sweet Corn Breeding and Genomics Lab, University of Florida, Gainesville, FL 32611, USA
| | - Kristen A Leach
- Sweet Corn Breeding and Genomics Lab, University of Florida, Gainesville, FL 32611, USA
| | | | - Leonardo Lopes Bhering
- Laboratório de Biometria, Universidade Federal de Viçosa, Viçosa, Minas Gerais 36570-900, Brazil
| | - Márcio F R Resende
- Sweet Corn Breeding and Genomics Lab, University of Florida, Gainesville, FL 32611, USA
| |
Collapse
|
6
|
Alemu A, Åstrand J, Montesinos-López OA, Isidro Y Sánchez J, Fernández-Gónzalez J, Tadesse W, Vetukuri RR, Carlsson AS, Ceplitis A, Crossa J, Ortiz R, Chawade A. Genomic selection in plant breeding: Key factors shaping two decades of progress. MOLECULAR PLANT 2024; 17:552-578. [PMID: 38475993 DOI: 10.1016/j.molp.2024.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/22/2024] [Accepted: 03/08/2024] [Indexed: 03/14/2024]
Abstract
Genomic selection, the application of genomic prediction (GP) models to select candidate individuals, has significantly advanced in the past two decades, effectively accelerating genetic gains in plant breeding. This article provides a holistic overview of key factors that have influenced GP in plant breeding during this period. We delved into the pivotal roles of training population size and genetic diversity, and their relationship with the breeding population, in determining GP accuracy. Special emphasis was placed on optimizing training population size. We explored its benefits and the associated diminishing returns beyond an optimum size. This was done while considering the balance between resource allocation and maximizing prediction accuracy through current optimization algorithms. The density and distribution of single-nucleotide polymorphisms, level of linkage disequilibrium, genetic complexity, trait heritability, statistical machine-learning methods, and non-additive effects are the other vital factors. Using wheat, maize, and potato as examples, we summarize the effect of these factors on the accuracy of GP for various traits. The search for high accuracy in GP-theoretically reaching one when using the Pearson's correlation as a metric-is an active research area as yet far from optimal for various traits. We hypothesize that with ultra-high sizes of genotypic and phenotypic datasets, effective training population optimization methods and support from other omics approaches (transcriptomics, metabolomics and proteomics) coupled with deep-learning algorithms could overcome the boundaries of current limitations to achieve the highest possible prediction accuracy, making genomic selection an effective tool in plant breeding.
Collapse
Affiliation(s)
- Admas Alemu
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| | - Johanna Åstrand
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden; Lantmännen Lantbruk, Svalöv, Sweden
| | | | - Julio Isidro Y Sánchez
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
| | - Javier Fernández-Gónzalez
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
| | - Wuletaw Tadesse
- International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat, Morocco
| | - Ramesh R Vetukuri
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | - Anders S Carlsson
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco, México 52640, Mexico
| | - Rodomiro Ortiz
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| | - Aakash Chawade
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| |
Collapse
|
7
|
Chiaravallotti I, Lin J, Arief V, Jahufer Z, Osorno JM, McClean P, Jarquin D, Hoyos-Villegas V. Simulations of multiple breeding strategy scenarios in common bean for assessing genomic selection accuracy and model updating. THE PLANT GENOME 2024; 17:e20388. [PMID: 38317595 DOI: 10.1002/tpg2.20388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 07/24/2023] [Accepted: 08/20/2023] [Indexed: 02/07/2024]
Abstract
The aim of this study was to evaluate the accuracy of the ridge regression best linear unbiased prediction model across different traits, parent population sizes, and breeding strategies when estimating breeding values in common bean (Phaseolus vulgaris). Genomic selection was implemented to make selections within a breeding cycle and compared across five different breeding strategies (single seed descent, mass selection, pedigree method, modified pedigree method, and bulk breeding) following 10 breeding cycles. The model was trained on a simulated population of recombinant inbreds genotyped for 1010 single nucleotide polymorphism markers including 38 known quantitative trait loci identified in the literature. These QTL included 11 for seed yield, eight for white mold disease incidence, and 19 for days to flowering. Simulation results revealed that realized accuracies fluctuate depending on the factors investigated: trait genetic architecture, breeding strategy, and the number of initial parents used to begin the first breeding cycle. Trait architecture and breeding strategy appeared to have a larger impact on accuracy than the initial number of parents. Generally, maximum accuracies (in terms of the correlation between true and estimated breeding value) were consistently achieved under a mass selection strategy, pedigree method, and single seed descent method depending on the simulation parameters being tested. This study also investigated model updating, which involves retraining the prediction model with a new set of genotypes and phenotypes that have a closer relation to the population being tested. While it has been repeatedly shown that model updating generally improves prediction accuracy, it benefited some breeding strategies more than others. For low heritability traits (e.g., yield), conventional phenotype-based selection methods showed consistent rates of genetic gain, but genetic gain under genomic selection reached a plateau after fewer cycles. This plateauing is likely a cause of faster fixation of alleles and a diminishing of genetic variance when selections are made based on estimated breeding value as opposed to phenotype.
Collapse
Affiliation(s)
| | - Jennifer Lin
- Department of Plant Science, McGill University, Montreal, Quebec, Canada
| | - Vivi Arief
- School of Agriculture and Food Sustainability Faculty of Science, University of Queensland, Brisbane, Australia
| | - Zulfi Jahufer
- School of Agriculture and Food Sustainability Faculty of Science, University of Queensland, Brisbane, Australia
| | - Juan M Osorno
- Department of Plant Sciences, North Dakota State University, Fargo, North Dakota, USA
| | - Phil McClean
- Department of Plant Sciences, North Dakota State University, Fargo, North Dakota, USA
| | - Diego Jarquin
- Agronomy Department, University of Florida, Gainesville, Florida, USA
| | | |
Collapse
|
8
|
Martins FB, Aono AH, Moraes ADCL, Ferreira RCU, Vilela MDM, Pessoa-Filho M, Rodrigues-Motta M, Simeão RM, de Souza AP. Genome-wide family prediction unveils molecular mechanisms underlying the regulation of agronomic traits in Urochloa ruziziensis. FRONTIERS IN PLANT SCIENCE 2023; 14:1303417. [PMID: 38148869 PMCID: PMC10749977 DOI: 10.3389/fpls.2023.1303417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 11/15/2023] [Indexed: 12/28/2023]
Abstract
Tropical forage grasses, particularly those belonging to the Urochloa genus, play a crucial role in cattle production and serve as the main food source for animals in tropical and subtropical regions. The majority of these species are apomictic and tetraploid, highlighting the significance of U. ruziziensis, a sexual diploid species that can be tetraploidized for use in interspecific crosses with apomictic species. As a means to support breeding programs, our study investigates the feasibility of genome-wide family prediction in U. ruziziensis families to predict agronomic traits. Fifty half-sibling families were assessed for green matter yield, dry matter yield, regrowth capacity, leaf dry matter, and stem dry matter across different clippings established in contrasting seasons with varying available water capacity. Genotyping was performed using a genotyping-by-sequencing approach based on DNA samples from family pools. In addition to conventional genomic prediction methods, machine learning and feature selection algorithms were employed to reduce the necessary number of markers for prediction and enhance predictive accuracy across phenotypes. To explore the regulation of agronomic traits, our study evaluated the significance of selected markers for prediction using a tree-based approach, potentially linking these regions to quantitative trait loci (QTLs). In a multiomic approach, genes from the species transcriptome were mapped and correlated to those markers. A gene coexpression network was modeled with gene expression estimates from a diverse set of U. ruziziensis genotypes, enabling a comprehensive investigation of molecular mechanisms associated with these regions. The heritabilities of the evaluated traits ranged from 0.44 to 0.92. A total of 28,106 filtered SNPs were used to predict phenotypic measurements, achieving a mean predictive ability of 0.762. By employing feature selection techniques, we could reduce the dimensionality of SNP datasets, revealing potential genotype-phenotype associations. The functional annotation of genes near these markers revealed associations with auxin transport and biosynthesis of lignin, flavonol, and folic acid. Further exploration with the gene coexpression network uncovered associations with DNA metabolism, stress response, and circadian rhythm. These genes and regions represent important targets for expanding our understanding of the metabolic regulation of agronomic traits and offer valuable insights applicable to species breeding. Our work represents an innovative contribution to molecular breeding techniques for tropical forages, presenting a viable marker-assisted breeding approach and identifying target regions for future molecular studies on these agronomic traits.
Collapse
Affiliation(s)
- Felipe Bitencourt Martins
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Alexandre Hild Aono
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Aline da Costa Lima Moraes
- Department of Plant Biology, Biology Institute, University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | | | | | - Marco Pessoa-Filho
- Embrapa Cerrados, Brazilian Agricultural Research Corporation, Brasília, Brazil
| | | | - Rosangela Maria Simeão
- Embrapa Gado de Corte, Brazilian Agricultural Research Corporation, Campo Grande, Mato Grosso, Brazil
| | - Anete Pereira de Souza
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
- Department of Plant Biology, Biology Institute, University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| |
Collapse
|
9
|
Brzozowski LJ, Campbell MT, Hu H, Yao L, Caffe M, Gutiérrez LA, Smith KP, Sorrells ME, Gore MA, Jannink JL. Genomic prediction of seed nutritional traits in biparental families of oat (Avena sativa). THE PLANT GENOME 2023; 16:e20370. [PMID: 37539632 DOI: 10.1002/tpg2.20370] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 05/01/2023] [Accepted: 06/20/2023] [Indexed: 08/05/2023]
Abstract
Selection for more nutritious crop plants is an important goal of plant breeding to improve food quality and contribute to human health outcomes. While there are efforts to integrate genomic prediction to accelerate breeding progress, an ongoing challenge is identifying strategies to improve accuracy when predicting within biparental populations in breeding programs. We tested multiple genomic prediction methods for 12 seed fatty acid content traits in oat (Avena sativa L.), as unsaturated fatty acids are a key nutritional trait in oat. Using two well-characterized oat germplasm panels and other biparental families as training populations, we predicted family mean and individual values within families. Genomic prediction of family mean exceeded a mean accuracy of 0.40 and 0.80 using an unrelated and related germplasm panel, respectively, where the related germplasm panel outperformed prediction based on phenotypic means (0.54). Within family prediction accuracy was more variable: training on the related germplasm had higher accuracy than the unrelated panel (0.14-0.16 and 0.05-0.07, respectively), but variability between families was not easily predicted by parent relatedness, segregation of a locus detected by a genome-wide association study in the panel, or other characteristics. When using other families as training populations, prediction accuracies were comparable to the related germplasm panel (0.11-0.23), and families that had half-sib families in the training set had higher prediction accuracy than those that did not. Overall, this work provides an example of genomic prediction of family means and within biparental families for an important nutritional trait and suggests that using related germplasm panels as training populations can be effective.
Collapse
Affiliation(s)
- Lauren J Brzozowski
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, New York, USA
- USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, New York, USA
| | - Malachy T Campbell
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, New York, USA
| | - Haixiao Hu
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, New York, USA
| | - Linxing Yao
- Analytical Resources Core-Bioanalysis and Omics, Colorado State University, Fort Collins, Colorado, USA
| | - Melanie Caffe
- Department of Agronomy, Horticulture & Plant Science, South Dakota State University, Brookings, South Dakota, USA
| | - Lucı A Gutiérrez
- Department of Agronomy, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Kevin P Smith
- Department of Agronomy & Plant Genetics, University of Minnesota, Saint Paul, Minnesota, USA
| | - Mark E Sorrells
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, New York, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, New York, USA
| | - Jean-Luc Jannink
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, New York, USA
- USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, New York, USA
| |
Collapse
|
10
|
de Verdal H, Baertschi C, Frouin J, Quintero C, Ospina Y, Alvarez MF, Cao TV, Bartholomé J, Grenier C. Optimization of Multi-Generation Multi-location Genomic Prediction Models for Recurrent Genomic Selection in an Upland Rice Population. RICE (NEW YORK, N.Y.) 2023; 16:43. [PMID: 37758969 PMCID: PMC10533757 DOI: 10.1186/s12284-023-00661-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 09/19/2023] [Indexed: 09/29/2023]
Abstract
Genomic selection is a worthy breeding method to improve genetic gain in recurrent selection breeding schemes. The integration of multi-generation and multi-location information could significantly improve genomic prediction models in the context of shuttle breeding. The Cirad-CIAT upland rice breeding program applies recurrent genomic selection and seeks to optimize the scheme to increase genetic gain while reducing phenotyping efforts. We used a synthetic population (PCT27) of which S0 plants were all genotyped and advanced by selfing and bulk seed harvest to the S0:2, S0:3, and S0:4 generations. The PCT27 was then divided into two sets. The S0:2 and S0:3 progenies for PCT27A and the S0:4 progenies for PCT27B were phenotyped in two locations: Santa Rosa the target selection location, within the upland rice growing area, and Palmira, the surrogate location, far from the upland rice growing area but easier for experimentation. While the calibration used either one of the two sets phenotyped in one or two locations, the validation population was only the PCT27B phenotyped in Santa Rosa. Five scenarios of genomic prediction and 24 models were performed and compared. Training the prediction model with the PCT27B phenotyped in Santa Rosa resulted in predictive abilities ranging from 0.19 for grain zinc concentration to 0.30 for grain yield. Expanding the training set with the inclusion of the PCT27A resulted in greater predictive abilities for all traits but grain yield, with increases from 5% for plant height to 61% for grain zinc concentration. Models with the PCT27B phenotyped in two locations resulted in higher prediction accuracy when the models assumed no genotype-by-environment (G × E) interaction for flowering (0.38) and grain zinc concentration (0.27). For plant height, the model assuming a single G × E variance provided higher accuracy (0.28). The gain in predictive ability for grain yield was the greatest (0.25) when environment-specific variance deviation effect for G × E was considered. While the best scenario was specific to each trait, the results indicated that the gain in predictive ability provided by the multi-location and multi-generation calibration was low. Yet, this approach could lead to increased selection intensity, acceleration of the breeding cycle, and a sizable economic advantage for the program.
Collapse
Affiliation(s)
- Hugues de Verdal
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France.
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France.
| | - Cédric Baertschi
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
| | - Julien Frouin
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
| | - Constanza Quintero
- Alliance Bioversity-CIAT, A.A.6713, Km 17 Recta Palmira Cali, Cali, Colombia
| | - Yolima Ospina
- Alliance Bioversity-CIAT, A.A.6713, Km 17 Recta Palmira Cali, Cali, Colombia
| | | | - Tuong-Vi Cao
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
| | - Jérôme Bartholomé
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
- Alliance Bioversity-CIAT, A.A.6713, Km 17 Recta Palmira Cali, Cali, Colombia
| | - Cécile Grenier
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France.
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France.
- Alliance Bioversity-CIAT, A.A.6713, Km 17 Recta Palmira Cali, Cali, Colombia.
| |
Collapse
|
11
|
Winn ZJ, Lyerly JH, Brown-Guedira G, Murphy JP, Mason RE. Utilization of a publicly available diversity panel in genomic prediction of Fusarium head blight resistance traits in wheat. THE PLANT GENOME 2023; 16:e20353. [PMID: 37194437 DOI: 10.1002/tpg2.20353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 04/25/2023] [Accepted: 04/27/2023] [Indexed: 05/18/2023]
Abstract
Fusarium head blight (FHB) is an economically and environmentally concerning disease of wheat (Triticum aestivum L). A two-pronged approach of marker-assisted selection coupled with genomic selection has been suggested when breeding for FHB resistance. A historical dataset comprised of entries in the Southern Uniform Winter Wheat Scab Nursery (SUWWSN) from 2011 to 2021 was partitioned and used in genomic prediction. Two traits were curated from 2011 to 2021 in the SUWWSN: percent Fusarium damaged kernels (FDK) and deoxynivalenol (DON) content. Heritability was estimated for each trait-by-environment combination. A consistent set of check lines was drawn from each year in the SUWWSN, and k-means clustering was performed across environments to assign environments into clusters. Two clusters were identified as FDK and three for DON. Cross-validation on SUWWSN data from 2011 to 2019 indicated no outperforming training population in comparison to the combined dataset. Forward validation for FDK on the SUWWSN 2020 and 2021 data indicated a predictive accuracyr ≈ 0.58 $r \approx 0.58$ andr ≈ 0.53 $r \approx 0.53$ , respectively. Forward validation for DON indicated a predictive accuracy ofr ≈ 0.57 $r \approx 0.57$ andr ≈ 0.45 $r \approx 0.45$ , respectively. Forward validation using environments in cluster one for FDK indicated a predictive accuracy ofr ≈ 0.65 $r \approx 0.65$ andr ≈ 0.60 $r \approx 0.60$ , respectively. Forward validation using environments in cluster one for DON indicated a predictive accuracy ofr ≈ 0.67 $r \approx 0.67$ andr ≈ 0.60 $r \approx 0.60$ , respectively. These results indicated that selecting environments based on check performance may produce higher forward prediction accuracies. This work may be used as a model for utilizing public resources for genomic prediction of FHB resistance traits across public wheat breeding programs.
Collapse
Affiliation(s)
- Zachary J Winn
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, North Carolina, USA
- Department of Crop and Soil Sciences, Colorado State University, Fort Collins, Colorado, USA
| | - Jeanette H Lyerly
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Gina Brown-Guedira
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, North Carolina, USA
- USDA-ARS, Raleigh, North Carolina, USA
| | - Joseph P Murphy
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Richard Esten Mason
- Department of Crop and Soil Sciences, Colorado State University, Fort Collins, Colorado, USA
| |
Collapse
|
12
|
Ficht A, Konkin DJ, Cram D, Sidebottom C, Tan Y, Pozniak C, Rajcan I. Genomic selection for agronomic traits in a winter wheat breeding program. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:38. [PMID: 36897431 DOI: 10.1007/s00122-023-04294-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 12/19/2022] [Indexed: 06/18/2023]
Abstract
rAMP-seq based genomic selection for agronomic traits has been shown to be a useful tool for winter wheat breeding programs by increasing the rate of genetic gain. Genomic selection (GS) is an effective strategy to employ in a breeding program that focuses on optimizing quantitative traits, which results in the ability for breeders to select the best genotypes. GS was incorporated into a breeding program to determine the potential for implementation on an annual basis, with emphasis on selecting optimal parents and decreasing the time and costs associated with phenotyping large numbers of genotypes. The design options for applying repeat amplification sequencing (rAMP-seq) in bread wheat were explored, and a low-cost single primer pair strategy was implemented. A total of 1870 winter wheat genotypes were phenotyped and genotyped using rAMP-seq. The optimization of training to testing population size showed that the 70:30 ratio provided the most consistent prediction accuracy. Three GS models were tested, rrBLUP, RKHS and feed-forward neural networks using the University of Guelph Winter Wheat Breeding Program (UGWWBP) and Elite-UGWWBP populations. The models performed equally well for both populations and did not differ in prediction accuracy (r) for most agronomic traits, with the exception of yield, where RKHS performed the best with an r = 0.34 and 0.39 for each population, respectively. The ability to operate a breeding program where multiple selection strategies, including GS, are utilized will lead to higher efficiency in the program and ultimately lead to a higher rate of genetic gain.
Collapse
Affiliation(s)
- Alexandra Ficht
- Department of Plant Agriculture, University of Guelph, Crop Science Building, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada
| | - David J Konkin
- Aquatic and Crop Resource Development Research Centre, National Research Council of Canada, Saskatoon, Canada
| | - Dustin Cram
- Aquatic and Crop Resource Development Research Centre, National Research Council of Canada, Saskatoon, Canada
| | - Christine Sidebottom
- Aquatic and Crop Resource Development Research Centre, National Research Council of Canada, Saskatoon, Canada
| | - Yifang Tan
- Aquatic and Crop Resource Development Research Centre, National Research Council of Canada, Saskatoon, Canada
| | - Curtis Pozniak
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, Room 2E64, Agriculture Building, 51 Campus Drive, Saskatoon, SK, S7N 5A8, Canada
| | - Istvan Rajcan
- Department of Plant Agriculture, University of Guelph, Crop Science Building, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada.
| |
Collapse
|
13
|
Rooney TE, Kunze KH, Sorrells ME. Genome-wide marker effect heterogeneity is associated with a large effect dormancy locus in winter malting barley. THE PLANT GENOME 2022; 15:e20247. [PMID: 35971877 DOI: 10.1002/tpg2.20247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/20/2022] [Indexed: 06/15/2023]
Abstract
Prediction of trait values in plant breeding populations typically relies on assumptions about marker effect homogeneity across populations. Evidence is presented for winter malting barley (Hordeum vulgare L.) germination traits that a single, causative, large-effect gene in the Seed dormancy 1 region on Chromosome 5H, HvAlaAT1 (Qsd1), leads to heterogeneous estimated marker effects genome wide between groups of otherwise related individuals carrying different Qsd1 alleles. This led to reduced prediction accuracy across alleles when a model was trained either on individuals carrying both alleles or one allele. Several genomic prediction models were tested to increase prediction accuracy within the Qsd1 allele groups. Small gains (5-12%) in prediction accuracy were realized using structured genomic best linear unbiased predictor models when information about the Qsd1 allele was used to stratify the population. We concluded that a single large-effect locus can lead to heterogeneous marker effects in the same breeding family. Variance partitioning based on large-effect loci can be used to inform best practices in designing genomic prediction models; however, there are likely few cases for which it may be practical to do this. For malting barley, if germination traits are highly associated with malting quality traits, then similar steps should be considered for malting quality trait prediction.
Collapse
Affiliation(s)
- Travis E Rooney
- Plant Breeding and Genetics Section, School of Integrative Plant Sciences, Cornell Univ., Ithaca, NY, 14853, USA
| | - Karl H Kunze
- Plant Breeding and Genetics Section, School of Integrative Plant Sciences, Cornell Univ., Ithaca, NY, 14853, USA
| | - Mark E Sorrells
- Plant Breeding and Genetics Section, School of Integrative Plant Sciences, Cornell Univ., Ithaca, NY, 14853, USA
| |
Collapse
|
14
|
Mbo Nkoulou LF, Ngalle HB, Cros D, Adje COA, Fassinou NVH, Bell J, Achigan-Dako EG. Perspective for genomic-enabled prediction against black sigatoka disease and drought stress in polyploid species. FRONTIERS IN PLANT SCIENCE 2022; 13:953133. [PMID: 36388523 PMCID: PMC9650417 DOI: 10.3389/fpls.2022.953133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 09/28/2022] [Indexed: 06/16/2023]
Abstract
Genomic selection (GS) in plant breeding is explored as a promising tool to solve the problems related to the biotic and abiotic threats. Polyploid plants like bananas (Musa spp.) face the problem of drought and black sigatoka disease (BSD) that restrict their production. The conventional plant breeding is experiencing difficulties, particularly phenotyping costs and long generation interval. To overcome these difficulties, GS in plant breeding is explored as an alternative with a great potential for reducing costs and time in selection process. So far, GS does not have the same success in polyploid plants as with diploid plants because of the complexity of their genome. In this review, we present the main constraints to the application of GS in polyploid plants and the prospects for overcoming these constraints. Particular emphasis is placed on breeding for BSD and drought-two major threats to banana production-used in this review as a model of polyploid plant. It emerges that the difficulty in obtaining markers of good quality in polyploids is the first challenge of GS on polyploid plants, because the main tools used were developed for diploid species. In addition to that, there is a big challenge of mastering genetic interactions such as dominance and epistasis effects as well as the genotype by environment interaction, which are very common in polyploid plants. To get around these challenges, we have presented bioinformatics tools, as well as artificial intelligence approaches, including machine learning. Furthermore, a scheme for applying GS to banana for BSD and drought has been proposed. This review is of paramount impact for breeding programs that seek to reduce the selection cycle of polyploids despite the complexity of their genome.
Collapse
Affiliation(s)
- Luther Fort Mbo Nkoulou
- Genetics, Biotechnology, and Seed Science Unit (GBioS), Department of Plant Sciences, Faculty of Agronomic Sciences, University of Abomey Calavi, Cotonou, Benin
- Unit of Genetics and Plant Breeding (UGAP), Department of Plant Biology, Faculty of Sciences, University of Yaoundé 1, Yaoundé, Cameroon
- Institute of Agricultural Research for Development, Centre de Recherche Agricole de Mbalmayo (CRAM), Mbalmayo, Cameroon
| | - Hermine Bille Ngalle
- Unit of Genetics and Plant Breeding (UGAP), Department of Plant Biology, Faculty of Sciences, University of Yaoundé 1, Yaoundé, Cameroon
| | - David Cros
- Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), Unité Mixte de Recherche (UMR) Amélioration Génétique et Adaptation des Plantes méditerranéennes et tropicales (AGAP) Institut, Montpellier, France
- Unité Mixte de Recherche (UMR) Amélioration Génétique et Adaptation des Plantes méditerranéennes et tropicales (AGAP) Institut, University of Montpellier, Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), Institut Agro, Montpellier, France
| | - Charlotte O. A. Adje
- Genetics, Biotechnology, and Seed Science Unit (GBioS), Department of Plant Sciences, Faculty of Agronomic Sciences, University of Abomey Calavi, Cotonou, Benin
| | - Nicodeme V. H. Fassinou
- Genetics, Biotechnology, and Seed Science Unit (GBioS), Department of Plant Sciences, Faculty of Agronomic Sciences, University of Abomey Calavi, Cotonou, Benin
| | - Joseph Bell
- Unit of Genetics and Plant Breeding (UGAP), Department of Plant Biology, Faculty of Sciences, University of Yaoundé 1, Yaoundé, Cameroon
| | - Enoch G. Achigan-Dako
- Genetics, Biotechnology, and Seed Science Unit (GBioS), Department of Plant Sciences, Faculty of Agronomic Sciences, University of Abomey Calavi, Cotonou, Benin
| |
Collapse
|
15
|
A divide-and-conquer approach for genomic prediction in rubber tree using machine learning. Sci Rep 2022; 12:18023. [PMID: 36289298 PMCID: PMC9605989 DOI: 10.1038/s41598-022-20416-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 09/13/2022] [Indexed: 01/20/2023] Open
Abstract
Rubber tree (Hevea brasiliensis) is the main feedstock for commercial rubber; however, its long vegetative cycle has hindered the development of more productive varieties via breeding programs. With the availability of H. brasiliensis genomic data, several linkage maps with associated quantitative trait loci have been constructed and suggested as a tool for marker-assisted selection. Nonetheless, novel genomic strategies are still needed, and genomic selection (GS) may facilitate rubber tree breeding programs aimed at reducing the required cycles for performance assessment. Even though such a methodology has already been shown to be a promising tool for rubber tree breeding, increased model predictive capabilities and practical application are still needed. Here, we developed a novel machine learning-based approach for predicting rubber tree stem circumference based on molecular markers. Through a divide-and-conquer strategy, we propose a neural network prediction system with two stages: (1) subpopulation prediction and (2) phenotype estimation. This approach yielded higher accuracies than traditional statistical models in a single-environment scenario. By delivering large accuracy improvements, our methodology represents a powerful tool for use in Hevea GS strategies. Therefore, the incorporation of machine learning techniques into rubber tree GS represents an opportunity to build more robust models and optimize Hevea breeding programs.
Collapse
|
16
|
Meena MR, Appunu C, Arun Kumar R, Manimekalai R, Vasantha S, Krishnappa G, Kumar R, Pandey SK, Hemaprabha G. Recent Advances in Sugarcane Genomics, Physiology, and Phenomics for Superior Agronomic Traits. Front Genet 2022; 13:854936. [PMID: 35991570 PMCID: PMC9382102 DOI: 10.3389/fgene.2022.854936] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 05/26/2022] [Indexed: 11/13/2022] Open
Abstract
Advances in sugarcane breeding have contributed significantly to improvements in agronomic traits and crop yield. However, the growing global demand for sugar and biofuel in the context of climate change requires further improvements in cane and sugar yields. Attempts to achieve the desired rates of genetic gain in sugarcane by conventional breeding means are difficult as many agronomic traits are genetically complex and polygenic, with each gene exerting small effects. Unlike those of many other crops, the sugarcane genome is highly heterozygous due to its autopolyploid nature, which further hinders the development of a comprehensive genetic map. Despite these limitations, many superior agronomic traits/genes for higher cane yield, sugar production, and disease/pest resistance have been identified through the mapping of quantitative trait loci, genome-wide association studies, and transcriptome approaches. Improvements in traits controlled by one or two loci are relatively easy to achieve; however, this is not the case for traits governed by many genes. Many desirable phenotypic traits are controlled by quantitative trait nucleotides (QTNs) with small and variable effects. Assembling these desired QTNs by conventional breeding methods is time consuming and inefficient due to genetic drift. However, recent developments in genomics selection (GS) have allowed sugarcane researchers to select and accumulate desirable alleles imparting superior traits as GS is based on genomic estimated breeding values, which substantially increases the selection efficiency and genetic gain in sugarcane breeding programs. Next-generation sequencing techniques coupled with genome-editing technologies have provided new vistas in harnessing the sugarcane genome to look for desirable agronomic traits such as erect canopy, leaf angle, prolonged greening, high biomass, deep root system, and the non-flowering nature of the crop. Many desirable cane-yielding traits, such as single cane weight, numbers of tillers, numbers of millable canes, as well as cane quality traits, such as sucrose and sugar yield, have been explored using these recent biotechnological tools. This review will focus on the recent advances in sugarcane genomics related to genetic gain and the identification of favorable alleles for superior agronomic traits for further utilization in sugarcane breeding programs.
Collapse
Affiliation(s)
- Mintu Ram Meena
- Regional Centre, ICAR-Sugarcane Breeding Institute, Karnal, India
- *Correspondence: Mintu Ram Meena, ; Chinnaswamy Appunu,
| | - Chinnaswamy Appunu
- ICAR-Sugarcane Breeding Institute, Coimbatore, India
- *Correspondence: Mintu Ram Meena, ; Chinnaswamy Appunu,
| | - R. Arun Kumar
- ICAR-Sugarcane Breeding Institute, Coimbatore, India
| | | | - S. Vasantha
- ICAR-Sugarcane Breeding Institute, Coimbatore, India
| | | | - Ravinder Kumar
- Regional Centre, ICAR-Sugarcane Breeding Institute, Karnal, India
| | - S. K. Pandey
- Regional Centre, ICAR-Sugarcane Breeding Institute, Karnal, India
| | - G. Hemaprabha
- ICAR-Sugarcane Breeding Institute, Coimbatore, India
| |
Collapse
|
17
|
Li Z, Liu S, Conaty W, Zhu QH, Moncuquet P, Stiller W, Wilson I. Genomic prediction of cotton fibre quality and yield traits using Bayesian regression methods. Heredity (Edinb) 2022; 129:103-112. [PMID: 35523950 PMCID: PMC9338257 DOI: 10.1038/s41437-022-00537-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 04/05/2022] [Accepted: 04/07/2022] [Indexed: 01/26/2023] Open
Abstract
Genomic selection or genomic prediction (GP) has increasingly become an important molecular breeding technology for crop improvement. GP aims to utilise genome-wide marker data to predict genomic breeding value for traits of economic importance. Though GP studies have been widely conducted in various crop species such as wheat and maize, its application in cotton, an essential renewable textile fibre crop, is still significantly underdeveloped. We aim to develop a new GP-based breeding system that can improve the efficiency of our cotton breeding program. This article presents a GP study on cotton fibre quality and yield traits using 1385 breeding lines from the Commonwealth Scientific and Industrial Research Organisation (CSIRO, Australia) cotton breeding program which were genotyped using a high-density SNP chip that generated 12,296 informative SNPs. The aim of this study was twofold: (1) to identify the models and data sources (i.e. genomic and pedigree) that produce the highest prediction accuracies; and (2) to assess the effectiveness of GP as a selection tool in the CSIRO cotton breeding program. The prediction analyses were conducted under various scenarios using different Bayesian predictive models. Results highlighted that the model combining genomic and pedigree information resulted in the best cross validated prediction accuracies: 0.76 for fibre length, 0.65 for fibre strength, and 0.64 for lint yield. Overall, this work represents the largest scale genomic selection studies based on cotton breeding trial data. Prediction accuracies reported in our study indicate the potential of GP as a breeding tool for cotton. The study highlighted the importance of incorporating pedigree and environmental factors in GP models to optimise the prediction performance.
Collapse
Affiliation(s)
- Zitong Li
- CSIRO Agriculture & Food, GPO Box 1600, Canberra, ACT, 2601, Australia.
| | - Shiming Liu
- CSIRO Agriculture & Food, Locked Bag 59, Narrabri, NSW, 2390, Australia
| | - Warren Conaty
- CSIRO Agriculture & Food, Locked Bag 59, Narrabri, NSW, 2390, Australia
| | - Qian-Hao Zhu
- CSIRO Agriculture & Food, GPO Box 1600, Canberra, ACT, 2601, Australia
| | | | - Warwick Stiller
- CSIRO Agriculture & Food, Locked Bag 59, Narrabri, NSW, 2390, Australia
| | - Iain Wilson
- CSIRO Agriculture & Food, GPO Box 1600, Canberra, ACT, 2601, Australia
| |
Collapse
|
18
|
A joint learning approach for genomic prediction in polyploid grasses. Sci Rep 2022; 12:12499. [PMID: 35864135 PMCID: PMC9304331 DOI: 10.1038/s41598-022-16417-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 07/11/2022] [Indexed: 12/20/2022] Open
Abstract
Poaceae, among the most abundant plant families, includes many economically important polyploid species, such as forage grasses and sugarcane (Saccharum spp.). These species have elevated genomic complexities and limited genetic resources, hindering the application of marker-assisted selection strategies. Currently, the most promising approach for increasing genetic gains in plant breeding is genomic selection. However, due to the polyploidy nature of these polyploid species, more accurate models for incorporating genomic selection into breeding schemes are needed. This study aims to develop a machine learning method by using a joint learning approach to predict complex traits from genotypic data. Biparental populations of sugarcane and two species of forage grasses (Urochloa decumbens, Megathyrsus maximus) were genotyped, and several quantitative traits were measured. High-quality markers were used to predict several traits in different cross-validation scenarios. By combining classification and regression strategies, we developed a predictive system with promising results. Compared with traditional genomic prediction methods, the proposed strategy achieved accuracy improvements exceeding 50%. Our results suggest that the developed methodology could be implemented in breeding programs, helping reduce breeding cycles and increase genetic gains.
Collapse
|
19
|
Building a Calibration Set for Genomic Prediction, Characteristics to Be Considered, and Optimization Approaches. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2467:77-112. [PMID: 35451773 DOI: 10.1007/978-1-0716-2205-6_3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The efficiency of genomic selection strongly depends on the prediction accuracy of the genetic merit of candidates. Numerous papers have shown that the composition of the calibration set is a key contributor to prediction accuracy. A poorly defined calibration set can result in low accuracies, whereas an optimized one can considerably increase accuracy compared to random sampling, for a same size. Alternatively, optimizing the calibration set can be a way of decreasing the costs of phenotyping by enabling similar levels of accuracy compared to random sampling but with fewer phenotypic units. We present here the different factors that have to be considered when designing a calibration set, and review the different criteria proposed in the literature. We classified these criteria into two groups: model-free criteria based on relatedness, and criteria derived from the linear mixed model. We introduce criteria targeting specific prediction objectives including the prediction of highly diverse panels, biparental families, or hybrids. We also review different ways of updating the calibration set, and different procedures for optimizing phenotyping experimental designs.
Collapse
|
20
|
Li Y, Ruperao P, Batley J, Edwards D, Martin W, Hobson K, Sutton T. Genomic prediction of preliminary yield trials in chickpea: Effect of functional annotation of SNPs and environment. THE PLANT GENOME 2022; 15:e20166. [PMID: 34786880 DOI: 10.1002/tpg2.20166] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 09/14/2021] [Indexed: 06/13/2023]
Abstract
Achieving yield potential in chickpea (Cicer arietinum L.) is limited by many constraints that include biotic and abiotic stresses. Combining next-generation sequencing technology with advanced statistical modeling has the potential to increase genetic gain efficiently. Whole genome resequencing data was obtained from 315 advanced chickpea breeding lines from the Australian chickpea breeding program resulting in more than 298,000 single nucleotide polymorphisms (SNPs) discovered. Analysis of population structure revealed a distinct group of breeding lines with many alleles that are absent from recently released Australian cultivars. Genome-wide association studies (GWAS) using these Australian breeding lines identified 20 SNPs significantly associated with grain yield in multiple field environments. A reduced level of nucleotide diversity and extended linkage disequilibrium suggested that some regions in these chickpea genomes may have been through selective breeding for yield or other traits. A large introgression segment that introduced from C. echinospermum for phytophthora root rot resistance was identified on chromosome 6, yet it also has unintended consequences of reducing yield due to linkage drag. We further investigated the effect of genotype by environment interaction on genomic prediction of yield. We found that the training set had better prediction accuracy when phenotyped under conditions relevant to the targeted environments. We also investigated the effect of SNP functional annotation on prediction accuracy using different subsets of SNPs based on their genomic locations: regulatory regions, exome, and alternative splice sites. Compared with the whole SNP dataset, a subset of SNPs did not significantly decrease prediction accuracy for grain yield despite consisting of a smaller number of SNPs.
Collapse
Affiliation(s)
- Yongle Li
- School of Agriculture, Food and Wine, The Univ. of Adelaide, Adelaide, SA, 5064, Australia
| | - Pradeep Ruperao
- Statistics, Bioinformatics and Data Management, ICRISAT, Hyderabad, 502324, India
| | - Jacqueline Batley
- School of Biological Sciences, The Univ. of Western Australia, Perth, WA, 6001, Australia
| | - David Edwards
- School of Biological Sciences, The Univ. of Western Australia, Perth, WA, 6001, Australia
| | - William Martin
- Dep. of Agriculture and Fisheries, Warwick, Qld, 4370, Australia
| | - Kristy Hobson
- NSW Dep. of Primary Industries, Tamworth, NSW, 2340, Australia
| | - Tim Sutton
- School of Agriculture, Food and Wine, The Univ. of Adelaide, Adelaide, SA, 5064, Australia
- South Australian Research and Development Institute, Adelaide, SA, 5064, Australia
| |
Collapse
|
21
|
Abstract
Traditional tree improvement is cumbersome and costly. Our main objective was to assess the extent to which genomic data can currently accelerate and improve decision making in this field. We used diameter at breast height (DBH) and wood density (WD) data for 4430 tree genotypes and single-nucleotide polymorphism (SNP) data for 2446 tree genotypes. Pedigree reconstruction was performed using a combination of maximum likelihood parentage assignment and matching based on identity-by-state (IBS) similarity. In addition, we used best linear unbiased prediction (BLUP) methods to predict phenotypes using SNP markers (GBLUP), recorded pedigree information (ABLUP), and single-step “blended” BLUP (HBLUP) combining SNP and pedigree information. We substantially improved the accuracy of pedigree records, resolving the inconsistent parental information of 506 tree genotypes. This led to substantially increased predictive ability (i.e., by up to 87%) in HBLUP analyses compared to a baseline from ABLUP. Genomic prediction was possible across populations and within previously untested families with moderately large training populations (N = 800–1200 tree genotypes) and using as few as 2000–5000 SNP markers. HBLUP was generally more effective than traditional ABLUP approaches, particularly after dealing appropriately with pedigree uncertainties. Our study provides evidence that genome-wide marker data can significantly enhance tree improvement. The operational implementation of genomic selection has started in radiata pine breeding in New Zealand, but further reductions in DNA extraction and genotyping costs may be required to realise the full potential of this approach.
Collapse
|
22
|
Brainard SH, Ellison SL, Simon PW, Dawson JC, Goldman IL. Genetic characterization of carrot root shape and size using genome-wide association analysis and genomic-estimated breeding values. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:605-622. [PMID: 34782932 PMCID: PMC8866378 DOI: 10.1007/s00122-021-03988-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Accepted: 10/27/2021] [Indexed: 06/13/2023]
Abstract
The principal phenotypic determinants of market class in carrot-the size and shape of the root-are under primarily additive, but also highly polygenic, genetic control. The size and shape of carrot roots are the primary determinants not only of yield, but also market class. These quantitative phenotypes have historically been challenging to objectively evaluate, and thus subjective visual assessment of market class remains the primary method by which selection for these traits is performed. However, advancements in digital image analysis have recently made possible the high-throughput quantification of size and shape attributes. It is therefore now feasible to utilize modern methods of genetic analysis to investigate the genetic control of root morphology. To this end, this study utilized both genome wide association analysis (GWAS) and genomic-estimated breeding values (GEBVs) and demonstrated that the components of market class are highly polygenic traits, likely under the influence of many small effect QTL. Relatively large proportions of additive genetic variance for many of the component phenotypes support high predictive ability of GEBVs; average prediction ability across underlying market class traits was 0.67. GWAS identified multiple QTL for four of the phenotypes which compose market class: length, aspect ratio, maximum width, and root fill, a previously uncharacterized trait which represents the size-independent portion of carrot root shape. By combining digital image analysis with GWAS and GEBVs, this study represents a novel advance in our understanding of the genetic control of market class in carrot. The immediate practical utility and viability of genomic selection for carrot market class is also described, and concrete guidelines for the design of training populations are provided.
Collapse
Affiliation(s)
- Scott H Brainard
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| | - Shelby L Ellison
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Philipp W Simon
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Vegetable Crops Research Unit, US Department of Agriculture-Agricultural Research Service, Madison, WI, 53706, USA
| | - Julie C Dawson
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Irwin L Goldman
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, 53706, USA
| |
Collapse
|
23
|
Bartholomé J, Prakash PT, Cobb JN. Genomic Prediction: Progress and Perspectives for Rice Improvement. Methods Mol Biol 2022; 2467:569-617. [PMID: 35451791 DOI: 10.1007/978-1-0716-2205-6_21] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Genomic prediction can be a powerful tool to achieve greater rates of genetic gain for quantitative traits if thoroughly integrated into a breeding strategy. In rice as in other crops, the interest in genomic prediction is very strong with a number of studies addressing multiple aspects of its use, ranging from the more conceptual to the more practical. In this chapter, we review the literature on rice (Oryza sativa) and summarize important considerations for the integration of genomic prediction in breeding programs. The irrigated breeding program at the International Rice Research Institute is used as a concrete example on which we provide data and R scripts to reproduce the analysis but also to highlight practical challenges regarding the use of predictions. The adage "To someone with a hammer, everything looks like a nail" describes a common psychological pitfall that sometimes plagues the integration and application of new technologies to a discipline. We have designed this chapter to help rice breeders avoid that pitfall and appreciate the benefits and limitations of applying genomic prediction, as it is not always the best approach nor the first step to increasing the rate of genetic gain in every context.
Collapse
Affiliation(s)
- Jérôme Bartholomé
- CIRAD, UMR AGAP Institut, Montpellier, France.
- AGAP Institut, Univ Montpellier, CIRAD, INRAE, Montpellier SupAgro, Montpellier, France.
- Rice Breeding Platform, International Rice Research Institute, Manila, Philippines.
| | | | | |
Collapse
|
24
|
Howard R, Jarquin D, Crossa J. Overview of Genomic Prediction Methods and the Associated Assumptions on the Variance of Marker Effect, and on the Architecture of the Target Trait. Methods Mol Biol 2022; 2467:139-156. [PMID: 35451775 DOI: 10.1007/978-1-0716-2205-6_5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Genomic selection (GS) is a methodology that revolutionized the process of breeding improved genetic materials in plant and animal breeding programs. It uses predicted genomic values of the potential of untested/unobserved genotypes as surrogates of phenotypes during the selection process. Such that the predicted genomic values are obtained using exclusively the marker profiles of the untested genotypes, and these potentially can be used by breeders for screening the genotypes to be advanced in the breeding pipeline, to identify potential parents for next improvement cycles, or to find optimal crosses for targeting genotypes among others. Conceptually, GS initially requires a set of genotypes with both molecular marker information and phenotypic data for model calibration and then the performance of untested genotypes is predicted using their marker profiles only. Hence, it is expected that breeders would look at these values in order to conduct selections. Even though the concept of GS seems trivial, due to the high dimensional nature of the data delivered from modern sequencing technologies where the number of molecular markers (p) excess by far the number of data points available for model fitting (n; p ≫ n) a complete renovated set of prediction models was needed to cope with this challenge. In this chapter, we provide a conceptual framework for comparing statistical models to overcome the "large p, small n problem." Given the very large diversity of GS models only the most popular are presented here; mainly we focused on linear regression-based models and nonparametric models that predict the genetic estimated breeding values (GEBV) in a single environment considering a single trait only, mainly in the context of plant breeding.
Collapse
Affiliation(s)
- Réka Howard
- University of Nebraska-Lincoln, Lincoln, NE, USA.
| | | | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| |
Collapse
|
25
|
Tilhou NW, Casler MD. Subsampling and DNA pooling can increase gains through genomic selection in switchgrass. THE PLANT GENOME 2021; 14:e20149. [PMID: 34626166 DOI: 10.1002/tpg2.20149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 07/22/2021] [Indexed: 06/13/2023]
Abstract
Genomic selection (GS) can accelerate breeding cycles in perennial crops such as the bioenergy grass switchgrass (Panicum virgatum L.). The sequencing costs of GS can be reduced by pooling DNA samples in the training population (TP), only sequencing TP phenotypic outliers, or pooling candidate population (CP) samples. These strategies were simulated for two traits (spring vigor and anthesis date) in three breeding populations. Sequencing only the outlier 50% of the TP phenotype distribution resulted in a penalty of <5% of the predictive ability, measured using cross-validation. Predictive ability also decreased when sequencing progressively fewer TP DNA pools, but TPs constructed from only two phenotypically contrasting DNA samples retained a mean of >80% predictive ability relative to individual TP sequencing. Novel group testing methods allowed greater than one CP individual to be screened per sequenced DNA sample but resulted in a predictive ability penalty. To determine the impact of reduced sequencing, genetic gain was calculated for seven GS scenarios with variable sequencing budgets. Reduced TP sequencing and most CP pooling methods were superior to individual sequence-based GS when sequencing resources were restricted (2,000 DNA samples per 5-yr cycle). Only one scenario was superior to individual sequencing when sequencing budgets were large (8,000 DNA samples per 5-yr cycle). This study highlights multiple routes for reduced sequencing costs in GS.
Collapse
Affiliation(s)
- Neal Wepking Tilhou
- Department of Agronomy, University of Wisconsin, 1575 Linden Dr, Madison, WI, 53706, USA
| | - Michael D Casler
- U.S. Dairy Forage Research Center, USDA-ARS, 1925 Linden Dr, Madison, WI, 53706-1108, USA
| |
Collapse
|
26
|
Rio S, Gallego-Sánchez L, Montilla-Bascón G, Canales FJ, Isidro Y Sánchez J, Prats E. Genomic prediction and training set optimization in a structured Mediterranean oat population. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:3595-3609. [PMID: 34341832 DOI: 10.1007/s00122-021-03916-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 07/13/2021] [Indexed: 05/22/2023]
Abstract
The strong genetic structure observed in Mediterranean oats affects the predictive ability of genomic prediction as well as the performance of training set optimization methods. In this study, we investigated the efficiency of genomic prediction and training set optimization in a highly structured population of cultivars and landraces of cultivated oat (Avena sativa) from the Mediterranean basin, including white (subsp. sativa) and red (subsp. byzantina) oats, genotyped using genotype-by-sequencing markers and evaluated for agronomic traits in Southern Spain. For most traits, the predictive abilities were moderate to high with little differences between models, except for biomass for which Bayes-B showed a substantial gain compared to other models. The consistency between the structure of the training population and the population to be predicted was key to the predictive ability of genomic predictions. The predictive ability of inter-subspecies predictions was indeed much lower than that of intra-subspecies predictions for all traits. Regarding training set optimization, the linear mixed model optimization criteria (prediction error variance (PEVmean) and coefficient of determination (CDmean)) performed better than the heuristic approach "partitioning around medoids," even under high population structure. The superiority of CDmean and PEVmean could be explained by their ability to adapt the representation of each genetic group according to those represented in the population to be predicted. These results represent an important step towards the implementation of genomic prediction in oat breeding programs and address important issues faced by the genomic prediction community regarding population structure and training set optimization.
Collapse
Affiliation(s)
- Simon Rio
- Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Universidad Politécnica de Madrid (UPM), Campus de Montegancedo-UPM, 28223, Pozuelo de Alarcón, Madrid, Spain.
| | - Luis Gallego-Sánchez
- Institute for Sustainable Agriculture, Spanish Research Council (CSIC), Córdoba, Spain
| | | | - Francisco J Canales
- Institute for Sustainable Agriculture, Spanish Research Council (CSIC), Córdoba, Spain
| | - Julio Isidro Y Sánchez
- Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Universidad Politécnica de Madrid (UPM), Campus de Montegancedo-UPM, 28223, Pozuelo de Alarcón, Madrid, Spain
| | - Elena Prats
- Institute for Sustainable Agriculture, Spanish Research Council (CSIC), Córdoba, Spain
| |
Collapse
|
27
|
Mahadevaiah C, Appunu C, Aitken K, Suresha GS, Vignesh P, Mahadeva Swamy HK, Valarmathi R, Hemaprabha G, Alagarasan G, Ram B. Genomic Selection in Sugarcane: Current Status and Future Prospects. FRONTIERS IN PLANT SCIENCE 2021; 12:708233. [PMID: 34646284 PMCID: PMC8502939 DOI: 10.3389/fpls.2021.708233] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 08/24/2021] [Indexed: 05/18/2023]
Abstract
Sugarcane is a C4 and agro-industry-based crop with a high potential for biomass production. It serves as raw material for the production of sugar, ethanol, and electricity. Modern sugarcane varieties are derived from the interspecific and intergeneric hybridization between Saccharum officinarum, Saccharum spontaneum, and other wild relatives. Sugarcane breeding programmes are broadly categorized into germplasm collection and characterization, pre-breeding and genetic base-broadening, and varietal development programmes. The varietal identification through the classic breeding programme requires a minimum of 12-14 years. The precise phenotyping in sugarcane is extremely tedious due to the high propensity of lodging and suckering owing to the influence of environmental factors and crop management practices. This kind of phenotyping requires data from both plant crop and ratoon experiments conducted over locations and seasons. In this review, we explored the feasibility of genomic selection schemes for various breeding programmes in sugarcane. The genetic diversity analysis using genome-wide markers helps in the formation of core set germplasm representing the total genomic diversity present in the Saccharum gene bank. The genome-wide association studies and genomic prediction in the Saccharum gene bank are helpful to identify the complete genomic resources for cane yield, commercial cane sugar, tolerances to biotic and abiotic stresses, and other agronomic traits. The implementation of genomic selection in pre-breeding, genetic base-broadening programmes assist in precise introgression of specific genes and recurrent selection schemes enhance the higher frequency of favorable alleles in the population with a considerable reduction in breeding cycles and population size. The integration of environmental covariates and genomic prediction in multi-environment trials assists in the prediction of varietal performance for different agro-climatic zones. This review also directed its focus on enhancing the genetic gain over time, cost, and resource allocation at various stages of breeding programmes.
Collapse
Affiliation(s)
| | - Chinnaswamy Appunu
- Division of Crop Improvement, ICAR-Sugarcane Breeding Institute, Coimbatore, India
| | - Karen Aitken
- CSIRO (Commonwealth Scientific and Industrial Research Organization), St. Lucia, QLD, Australia
| | | | - Palanisamy Vignesh
- Division of Crop Improvement, ICAR-Sugarcane Breeding Institute, Coimbatore, India
| | | | | | - Govind Hemaprabha
- Division of Crop Improvement, ICAR-Sugarcane Breeding Institute, Coimbatore, India
| | - Ganesh Alagarasan
- Division of Crop Improvement, ICAR-Sugarcane Breeding Institute, Coimbatore, India
| | - Bakshi Ram
- Division of Crop Improvement, ICAR-Sugarcane Breeding Institute, Coimbatore, India
| |
Collapse
|
28
|
Isidro y Sánchez J, Akdemir D. Training Set Optimization for Sparse Phenotyping in Genomic Selection: A Conceptual Overview. FRONTIERS IN PLANT SCIENCE 2021; 12:715910. [PMID: 34589099 PMCID: PMC8475495 DOI: 10.3389/fpls.2021.715910] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 08/10/2021] [Indexed: 06/13/2023]
Abstract
Genomic selection (GS) is becoming an essential tool in breeding programs due to its role in increasing genetic gain per unit time. The design of the training set (TRS) in GS is one of the key steps in the implementation of GS in plant and animal breeding programs mainly because (i) TRS optimization is critical for the efficiency and effectiveness of GS, (ii) breeders test genotypes in multi-year and multi-location trials to select the best-performing ones. In this framework, TRS optimization can help to decrease the number of genotypes to be tested and, therefore, reduce phenotyping cost and time, and (iii) we can obtain better prediction accuracies from optimally selected TRS than an arbitrary TRS. Here, we concentrate the efforts on reviewing the lessons learned from TRS optimization studies and their impact on crop breeding and discuss important features for the success of TRS optimization under different scenarios. In this article, we review the lessons learned from training population optimization in plants and the major challenges associated with the optimization of GS including population size, the relationship between training and test set (TS), update of TRS, and the use of different packages and algorithms for TRS implementation in GS. Finally, we describe general guidelines to improving the rate of genetic improvement by maximizing the use of the TRS optimization in the GS framework.
Collapse
Affiliation(s)
- Julio Isidro y Sánchez
- Centro de Biotecnologia y Genómica de Plantas, Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria, Universidad Politécnica de Madrid, Campus de Montegancedo, Madrid, Spain
| | - Deniz Akdemir
- Animal and Crop Science Division, Agriculture and Food Science Centre, University College Dublin, Dublin, Ireland
| |
Collapse
|
29
|
Esuma W, Ozimati A, Kulakow P, Gore MA, Wolfe MD, Nuwamanya E, Egesi C, Kawuki RS. Effectiveness of genomic selection for improving provitamin A carotenoid content and associated traits in cassava. G3 (BETHESDA, MD.) 2021; 11:jkab160. [PMID: 33963852 PMCID: PMC8496257 DOI: 10.1093/g3journal/jkab160] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 04/26/2021] [Indexed: 11/14/2022]
Abstract
Global efforts are underway to develop cassava with enhanced levels of provitamin A carotenoids to sustainably meet increasing demands for food and nutrition where the crop is a major staple. Herein, we tested the effectiveness of genomic selection (GS) for rapid improvement of cassava for total carotenoids content and associated traits. We evaluated 632 clones from Uganda's provitamin A cassava breeding pipeline and 648 West African introductions. At harvest, each clone was assessed for level of total carotenoids, dry matter content, and resistance to cassava brown streak disease (CBSD). All clones were genotyped with diversity array technology and imputed to a set of 23,431 single nucleotide polymorphic markers. We assessed predictive ability of four genomic prediction methods in scenarios of cross-validation, across population prediction, and inclusion of quantitative trait loci markers. Cross-validations produced the highest mean prediction ability for total carotenoids content (0.52) and the lowest for CBSD resistance (0.20), with G-BLUP outperforming other models tested. Across population, predictions showed low ability of Ugandan population to predict the performance of West African clones, with the highest predictive ability recorded for total carotenoids content (0.34) and the lowest for CBSD resistance (0.12) using G-BLUP. By incorporating chromosome 1 markers associated with carotenoids content as independent kernel in the G-BLUP model of a cross-validation scenario, prediction ability slightly improved from 0.52 to 0.58. These results reinforce ongoing efforts aimed at integrating GS into cassava breeding and demonstrate the utility of this tool for rapid genetic improvement.
Collapse
Affiliation(s)
- Williams Esuma
- National Crops Resources Research Institute, Kampala, Uganda
| | - Alfred Ozimati
- National Crops Resources Research Institute, Kampala, Uganda
| | - Peter Kulakow
- International Institute for Tropical Agriculture, Ibadan, Nigeria
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Marnin D Wolfe
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | | | - Chiedozie Egesi
- International Institute for Tropical Agriculture, Ibadan, Nigeria
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Robert S Kawuki
- National Crops Resources Research Institute, Kampala, Uganda
| |
Collapse
|
30
|
Alkemade JA, Messmer MM, Arncken C, Leska A, Annicchiarico P, Nazzicari N, Książkiewicz M, Voegele RT, Finckh MR, Hohmann P. A High-Throughput Phenotyping Tool to Identify Field-Relevant Anthracnose Resistance in White Lupin. PLANT DISEASE 2021; 105:1719-1727. [PMID: 33337235 DOI: 10.1094/pdis-07-20-1531-re] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
The seed- and air-borne pathogen Colletotrichum lupini, the causal agent of lupin anthracnose, is the most important disease in white lupin (Lupinus albus) worldwide and can cause total yield loss. The aims of this study were to establish a reliable high-throughput phenotyping tool to identify anthracnose resistance in white lupin germplasm and to evaluate a genomic prediction model, accounting for previously reported resistance quantitative trait loci, on a set of independent lupin genotypes. Phenotyping under controlled conditions, performing stem inoculation on seedlings, showed to be applicable for high throughput, and its disease score strongly correlated with field plot disease assessments (r = 0.95, P < 0.0001) and yield (r = -0.64, P = 0.035). Traditional one-row field disease phenotyping showed no significant correlation with field plot disease assessments (r = 0.31, P = 0.34) and yield (r = -0.45, P = 0.17). Genomically predicted resistance values showed no correlation with values observed under controlled or field conditions, and the parental lines of the recombinant inbred line population used for constructing the prediction model exhibited a resistance pattern opposite to that displayed in the original (Australian) environment used for model construction. Differing environmental conditions, inoculation procedures, or population structure may account for this result. Phenotyping a diverse set of 40 white lupin accessions under controlled conditions revealed eight accessions with improved resistance to anthracnose. The standardized area under the disease progress curves (sAUDPC) ranged from 2.1 to 2.8, compared with the susceptible reference accession with a sAUDPC of 3.85. These accessions can be incorporated into white lupin breeding programs. In conclusion, our data support stem inoculation-based disease phenotyping under controlled conditions as a time-effective approach to identify field-relevant resistance, which can now be applied to further identify sources of resistance and their underlying genetics.
Collapse
Affiliation(s)
- Joris A Alkemade
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| | - Monika M Messmer
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| | - Christine Arncken
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| | - Agata Leska
- Getreidezüchtung Peter Kunz (gzpk), Feldbach, Switzerland
| | | | - Nelson Nazzicari
- CREA, Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | | | - Ralf T Voegele
- Institute of Phytomedicine, University of Hohenheim, Stuttgart, Germany
| | - Maria R Finckh
- Department of Ecological Plant Protection, University of Kassel, Witzenhausen, Germany
| | - Pierre Hohmann
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| |
Collapse
|
31
|
Puglisi D, Delbono S, Visioni A, Ozkan H, Kara İ, Casas AM, Igartua E, Valè G, Piero ARL, Cattivelli L, Tondelli A, Fricano A. Genomic Prediction of Grain Yield in a Barley MAGIC Population Modeling Genotype per Environment Interaction. FRONTIERS IN PLANT SCIENCE 2021; 12:664148. [PMID: 34108982 PMCID: PMC8183822 DOI: 10.3389/fpls.2021.664148] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 04/26/2021] [Indexed: 06/12/2023]
Abstract
Multi-parent Advanced Generation Inter-crosses (MAGIC) lines have mosaic genomes that are generated shuffling the genetic material of the founder parents following pre-defined crossing schemes. In cereal crops, these experimental populations have been extensively used to investigate the genetic bases of several traits and dissect the genetic bases of epistasis. In plants, genomic prediction models are usually fitted using either diverse panels of mostly unrelated accessions or individuals of biparental families and several empirical analyses have been conducted to evaluate the predictive ability of models fitted to these populations using different traits. In this paper, we constructed, genotyped and evaluated a barley MAGIC population of 352 individuals developed with a diverse set of eight founder parents showing contrasting phenotypes for grain yield. We combined phenotypic and genotypic information of this MAGIC population to fit several genomic prediction models which were cross-validated to conduct empirical analyses aimed at examining the predictive ability of these models varying the sizes of training populations. Moreover, several methods to optimize the composition of the training population were also applied to this MAGIC population and cross-validated to estimate the resulting predictive ability. Finally, extensive phenotypic data generated in field trials organized across an ample range of water regimes and climatic conditions in the Mediterranean were used to fit and cross-validate multi-environment genomic prediction models including G×E interaction, using both genomic best linear unbiased prediction and reproducing kernel Hilbert space along with a non-linear Gaussian Kernel. Overall, our empirical analyses showed that genomic prediction models trained with a limited number of MAGIC lines can be used to predict grain yield with values of predictive ability that vary from 0.25 to 0.60 and that beyond QTL mapping and analysis of epistatic effects, MAGIC population might be used to successfully fit genomic prediction models. We concluded that for grain yield, the single-environment genomic prediction models examined in this study are equivalent in terms of predictive ability while, in general, multi-environment models that explicitly split marker effects in main and environmental-specific effects outperform simpler multi-environment models.
Collapse
Affiliation(s)
- Damiano Puglisi
- Dipartimento di Agricoltura, Alimentazione e Ambiente (Di3A), Università di Catania, Catania, Italy
| | - Stefano Delbono
- Council for Agricultural Research and Economics–Research Centre for Genomics and Bioinformatics, Fiorenzuola d’Arda, Italy
| | - Andrea Visioni
- Biodiversity and Crop Improvement Program, International Center for Agricultural Research in the Dry Areas, Avenue Hafiane Cherkaoui, Rabat, Morocco
| | - Hakan Ozkan
- Department of Field Crops, Faculty of Agriculture, University of Cukurova, Adana, Turkey
| | - İbrahim Kara
- Bahri Dagdas International Agricultural Research Institute, Konya, Turkey
| | - Ana M. Casas
- Aula Dei Experimental Station (EEAD-CSIC), Spanish Research Council, Zaragoza, Spain
| | - Ernesto Igartua
- Aula Dei Experimental Station (EEAD-CSIC), Spanish Research Council, Zaragoza, Spain
| | - Giampiero Valè
- DiSIT, Dipartimento di Scienze e Innovazione Tecnologica, Università del Piemonte Orientale, Vercelli, Italy
| | - Angela Roberta Lo Piero
- Dipartimento di Agricoltura, Alimentazione e Ambiente (Di3A), Università di Catania, Catania, Italy
| | - Luigi Cattivelli
- Council for Agricultural Research and Economics–Research Centre for Genomics and Bioinformatics, Fiorenzuola d’Arda, Italy
| | - Alessandro Tondelli
- Council for Agricultural Research and Economics–Research Centre for Genomics and Bioinformatics, Fiorenzuola d’Arda, Italy
| | - Agostino Fricano
- Council for Agricultural Research and Economics–Research Centre for Genomics and Bioinformatics, Fiorenzuola d’Arda, Italy
| |
Collapse
|
32
|
Characterization of effects of genetic variants via genome-scale metabolic modelling. Cell Mol Life Sci 2021; 78:5123-5138. [PMID: 33950314 PMCID: PMC8254712 DOI: 10.1007/s00018-021-03844-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Revised: 04/15/2021] [Accepted: 04/23/2021] [Indexed: 12/19/2022]
Abstract
Genome-scale metabolic networks for model plants and crops in combination with approaches from the constraint-based modelling framework have been used to predict metabolic traits and design metabolic engineering strategies for their manipulation. With the advances in technologies to generate large-scale genotyping data from natural diversity panels and other populations, genome-wide association and genomic selection have emerged as statistical approaches to determine genetic variants associated with and predictive of traits. Here, we review recent advances in constraint-based approaches that integrate genetic variants in genome-scale metabolic models to characterize their effects on reaction fluxes. Since some of these approaches have been applied in organisms other than plants, we provide a critical assessment of their applicability particularly in crops. In addition, we further dissect the inferred effects of genetic variants with respect to reaction rate constants, abundances of enzymes, and concentrations of metabolites, as main determinants of reaction fluxes and relate them with their combined effects on complex traits, like growth. Through this systematic review, we also provide a roadmap for future research to increase the predictive power of statistical approaches by coupling them with mechanistic models of metabolism.
Collapse
|
33
|
Gonçalves MTV, Morota G, Costa PMDA, Vidigal PMP, Barbosa MHP, Peternelli LA. Near-infrared spectroscopy outperforms genomics for predicting sugarcane feedstock quality traits. PLoS One 2021; 16:e0236853. [PMID: 33661948 PMCID: PMC7932073 DOI: 10.1371/journal.pone.0236853] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 01/20/2021] [Indexed: 11/19/2022] Open
Abstract
The main objectives of this study were to evaluate the prediction performance of genomic and near-infrared spectroscopy (NIR) data and whether the integration of genomic and NIR predictor variables can increase the prediction accuracy of two feedstock quality traits (fiber and sucrose content) in a sugarcane population (Saccharum spp.). The following three modeling strategies were compared: M1 (genome-based prediction), M2 (NIR-based prediction), and M3 (integration of genomics and NIR wavenumbers). Data were collected from a commercial population comprised of three hundred and eighty-five individuals, genotyped for single nucleotide polymorphisms and screened using NIR spectroscopy. We compared partial least squares (PLS) and BayesB regression methods to estimate marker and wavenumber effects. In order to assess model performance, we employed random sub-sampling cross-validation to calculate the mean Pearson correlation coefficient between observed and predicted values. Our results showed that models fitted using BayesB were more predictive than PLS models. We found that NIR (M2) provided the highest prediction accuracy, whereas genomics (M1) presented the lowest predictive ability, regardless of the measured traits and regression methods used. The integration of predictors derived from NIR spectroscopy and genomics into a single model (M3) did not significantly improve the prediction accuracy for the two traits evaluated. These findings suggest that NIR-based prediction can be an effective strategy for predicting the genetic merit of sugarcane clones.
Collapse
Affiliation(s)
| | - Gota Morota
- Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States of America
| | | | | | | | | |
Collapse
|
34
|
A look-ahead Monte Carlo simulation method for improving parental selection in trait introgression. Sci Rep 2021; 11:3918. [PMID: 33594238 PMCID: PMC7887201 DOI: 10.1038/s41598-021-83634-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 01/14/2021] [Indexed: 11/08/2022] Open
Abstract
Multiple trait introgression is the process by which multiple desirable traits are converted from a donor to a recipient cultivar through backcrossing and selfing. The goal of this procedure is to recover all the attributes of the recipient cultivar, with the addition of the specified desirable traits. A crucial step in this process is the selection of parents to form new crosses. In this study, we propose a new selection approach that estimates the genetic distribution of the progeny of backcrosses after multiple generations using information of recombination events. Our objective is to select the most promising individuals for further backcrossing or selfing. To demonstrate the effectiveness of the proposed method, a case study has been conducted using maize data where our method is compared with state-of-the-art approaches. Simulation results suggest that the proposed method, look-ahead Monte Carlo, achieves higher probability of success than existing approaches. Our proposed selection method can assist breeders to efficiently design trait introgression projects.
Collapse
|
35
|
Beche E, Gillman JD, Song Q, Nelson R, Beissinger T, Decker J, Shannon G, Scaboo AM. Genomic prediction using training population design in interspecific soybean populations. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2021; 41:15. [PMID: 37309481 PMCID: PMC10236090 DOI: 10.1007/s11032-021-01203-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Accepted: 01/11/2021] [Indexed: 06/14/2023]
Abstract
Agronomically important traits generally have complex genetic architecture, where many genes have a small and largely additive effect. Genomic prediction has been demonstrated to increase genetic gain and efficiency in plant breeding programs beyond marker-assisted selection and phenotypic selection. The objective of this study was to evaluate the impact of allelic origin, marker density, training population size, and cross-validation schemes on the accuracy of genomic prediction models in an interspecific soybean nested association mapping (NAM) panel. Three cross-validation schemes were used: (a) Within-Family (WF): training population and predictions are made exclusively within each family; (b) Across All families (AF): all the individuals from the three families were randomly assigned to either the training or validation set; (c) Leave one Family out (LFO): each family is predicted using a training set that contains the other two families. Predictive abilities increased with training population size up to 350 individuals, but no significant gains were noted beyond 250 individuals in the training population. The number of markers had a limited impact on the observed predictive ability across traits; increasing markers used in the model above 1000 revealed no significant increases in prediction accuracy. Predictive abilities for AF were not significantly different from the WF method, and predictive abilities across populations for the WF method had a range of 0.58 to 0.70 for maturity, protein, meal, and oil. Our results also showed encouraging prediction accuracies for grain yield (0.58-0.69) using the WF method. Partitioning genomic prediction between G. max and G. soja alleles revealed useful information to select material with a larger allele contribution from both parents and could accelerate allele introgression from exotic germplasm into the elite soybean gene pool. Supplementary Information The online version contains supplementary material available at 10.1007/s11032-021-01203-6.
Collapse
Affiliation(s)
- Eduardo Beche
- Division of Plant Science, University of Missouri, Columbia, MO USA
| | | | - Qijian Song
- Soybean Genomics and Improvement Laboratory, USDA-ARS, Beltsville, MD USA
| | - Randall Nelson
- Department of Crop Sciences, University of Illinois, and USDA-Agricultural Research Service (retired), 1101 W. Peabody Dr., Urbana, IL 61801 USA
| | - Tim Beissinger
- Division of Plant Breeding Methodology, Department of Crop Sciences, Georg-August-Universität, Göttingen, Germany
| | - Jared Decker
- Division of Animal Science, University of Missouri, Columbia, MO USA
| | - Grover Shannon
- Division of Plant Science, University of Missouri, Columbia, MO USA
| | - Andrew M. Scaboo
- Division of Plant Science, University of Missouri, Columbia, MO USA
| |
Collapse
|
36
|
Heslot N, Feoktistov V. Optimization of Selective Phenotyping and Population Design for Genomic Prediction. JOURNAL OF AGRICULTURAL, BIOLOGICAL AND ENVIRONMENTAL STATISTICS 2020. [DOI: 10.1007/s13253-020-00415-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
37
|
Bhatta M, Gutierrez L, Cammarota L, Cardozo F, Germán S, Gómez-Guerrero B, Pardo MF, Lanaro V, Sayas M, Castro AJ. Multi-trait Genomic Prediction Model Increased the Predictive Ability for Agronomic and Malting Quality Traits in Barley ( Hordeum vulgare L.). G3 (BETHESDA, MD.) 2020; 10:1113-1124. [PMID: 31974097 PMCID: PMC7056970 DOI: 10.1534/g3.119.400968] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 01/22/2020] [Indexed: 12/20/2022]
Abstract
Plant breeders regularly evaluate multiple traits across multiple environments, which opens an avenue for using multiple traits in genomic prediction models. We assessed the potential of multi-trait (MT) genomic prediction model through evaluating several strategies of incorporating multiple traits (eight agronomic and malting quality traits) into the prediction models with two cross-validation schemes (CV1, predicting new lines with genotypic information only and CV2, predicting partially phenotyped lines using both genotypic and phenotypic information from correlated traits) in barley. The predictive ability was similar for single (ST-CV1) and multi-trait (MT-CV1) models to predict new lines. However, the predictive ability for agronomic traits was considerably increased when partially phenotyped lines (MT-CV2) were used. The predictive ability for grain yield using the MT-CV2 model with other agronomic traits resulted in 57% and 61% higher predictive ability than ST-CV1 and MT-CV1 models, respectively. Therefore, complex traits such as grain yield are better predicted when correlated traits are used. Similarly, a considerable increase in the predictive ability of malting quality traits was observed when correlated traits were used. The predictive ability for grain protein content using the MT-CV2 model with both agronomic and malting traits resulted in a 76% higher predictive ability than ST-CV1 and MT-CV1 models. Additionally, the higher predictive ability for new environments was obtained for all traits using the MT-CV2 model compared to the MT-CV1 model. This study showed the potential of improving the genomic prediction of complex traits by incorporating the information from multiple traits (cost-friendly and easy to measure traits) collected throughout breeding programs which could assist in speeding up breeding cycles.
Collapse
Affiliation(s)
- Madhav Bhatta
- Department of Agronomy, University of Wisconsin-Madison, 1575 Linden Dr., WI, 53706
| | - Lucia Gutierrez
- Department of Agronomy, University of Wisconsin-Madison, 1575 Linden Dr., WI, 53706,
| | - Lorena Cammarota
- Department of plant production, Facultad de Agronomía, Universidad de la República, Ruta 3, Km363, Paysandú 60000, Uruguay
- Maltería Uruguay S.A. Ruta 55, Km26, Ombúes de Lavalle, Uruguay
| | | | - Silvia Germán
- Instituto Nacional de Investigación Agropuecuaria, Estación Experimental La Estanzuela, Ruta 50, Km11, Colonia, Uruguay
| | | | | | - Valeria Lanaro
- Latitud, LATU Foundation, Av Italia 6201, Montevideo 11500, Uruguay, and
| | - Mercedes Sayas
- Maltería Oriental S.A., Camino Abrevadero 5525, Montevideo 12400, Uruguay
| | - Ariel J Castro
- Department of plant production, Facultad de Agronomía, Universidad de la República, Ruta 3, Km363, Paysandú 60000, Uruguay
| |
Collapse
|