1
|
Sleper J, Tapia R, Lee S, Whitaker V. Within-family genomic selection in strawberry: Optimization of marker density, trial design, and training set composition. THE PLANT GENOME 2025; 18:e20550. [PMID: 39789751 PMCID: PMC11718141 DOI: 10.1002/tpg2.20550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Revised: 10/16/2024] [Accepted: 12/06/2024] [Indexed: 01/12/2025]
Abstract
Genomic selection is a widely used quantitative method of determining the genetic value of an individual from genomic information and phenotypic data. In this study, we used a large, multi-year training population of 3248 individuals from the University of Florida strawberry (Fragaria × ananassa Duchesne) breeding program. We coupled this training population with a test population of 1460 individuals derived from 20 biparental families. Using these two populations, we tested different genomic selection methods of predicting each family separately for the purpose of within-family selection of seedlings for multiple yield-related traits in strawberry. The methodology we considered were comprised of 11 different marker densities, 10 different training set sizes, four different training set composition techniques, and one to five clonal replications for each individual in the training population. We demonstrated that prediction accuracy varied among the 20 biparental families from 0.05 to 0.63 for the three traits investigated. We also showed that a medium-density genotyping strategy (1500-1650 single nucleotide polymorphisms) could be 95%-97% as effective as a high-density genotyping platform and that imputation to the more dense platform always improved accuracy. Training set composition techniques had no discernible effect on prediction accuracy. However, increasing training set size improved prediction accuracy, and accuracy did not plateau even when training sets exceeded 3000 individuals. Finally, we showed that the number of clonal replicates in field trials could be reduced by 80% without any negative effects on genomic selection accuracy.
Collapse
Affiliation(s)
- Joshua Sleper
- Plant Breeding Graduate Program, Horticultural Sciences DepartmentUniversity of Florida, IFAS Gulf Coast Research and Education CenterWimaumaFloridaUSA
| | - Ronald Tapia
- Horticultural Sciences Department, IFAS Citrus Research and Education CenterUniversity of FloridaLake AlfredFloridaUSA
| | - Seonghee Lee
- Plant Breeding Graduate Program, Horticultural Sciences DepartmentUniversity of Florida, IFAS Gulf Coast Research and Education CenterWimaumaFloridaUSA
| | - Vance Whitaker
- Plant Breeding Graduate Program, Horticultural Sciences DepartmentUniversity of Florida, IFAS Gulf Coast Research and Education CenterWimaumaFloridaUSA
| |
Collapse
|
2
|
Åstrand J, Odilbekov F, Vetukuri R, Ceplitis A, Chawade A. Leveraging genomic prediction to surpass current yield gains in spring barley. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:260. [PMID: 39503804 PMCID: PMC11541387 DOI: 10.1007/s00122-024-04763-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 10/12/2024] [Indexed: 11/09/2024]
Abstract
KEY MESSAGE Genetic gain in Nordic spring barley varieties was estimated to 1.07% per year. Additionally, genomic predictive ability for yield was 0.61 in a population of breeding lines. Barley is one of the most important crops in Europe and meeting the growing demand for food and feed requires continuous increase in yield. Genomic prediction (GP) has the potential to be a cost-efficient tool in breeding for complex traits; however, the rate of yield improvement in current barley varieties is unknown. This study therefore investigated historical and current genetic gains in spring barley and how accounting for row-type population stratification in a breeding population influences GP results. The genetic gain in yield was estimated using historical data from field trials from 2014 to 2022, with 22-60 market varieties grown yearly. The genetic gain was estimated to 1.07% per year for all varieties, serving as a reference point for future breeding progress. To analyse the potential of using GP in spring barley a population of 375 breeding lines of two-row and six-row barley were tested in multi-environment trials in 2019-2022. The genetic diversity of the row-types was examined and used as a factor in the predictions, and the potential to predict untested locations using yield data from other locations was explored. This resulted in an overall predictive ability of 0.61 for yield (kg/ha), with 0.57 and 0.19 for the separate two-row and the six-row breeding lines, respectively. Together this displays the potential of implementing GP in breeding programs and the genetic gain in spring barley market varieties developed through GP will help in quantifying the benefit of GP over conventional breeding in the future.
Collapse
Affiliation(s)
- Johanna Åstrand
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
- Lantmännen Lantbruk, Svalöv, Sweden
| | | | - Ramesh Vetukuri
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | | - Aakash Chawade
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| |
Collapse
|
3
|
Duarte D, Jurcic EJ, Dutour J, Villalba PV, Centurión C, Grattapaglia D, Cappa EP. Genomic selection in forest trees comes to life: unraveling its potential in an advanced four-generation Eucalyptus grandis population. FRONTIERS IN PLANT SCIENCE 2024; 15:1462285. [PMID: 39539292 PMCID: PMC11558521 DOI: 10.3389/fpls.2024.1462285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Accepted: 10/03/2024] [Indexed: 11/16/2024]
Abstract
Genomic Selection (GS) in tree breeding optimizes genetic gains by leveraging genomic data to enable early selection of seedlings without phenotypic data reducing breeding cycle and increasing selection intensity. Traditional assessments of the potential of GS in forest trees have typically focused on model performance using cross-validation within the same generation but evaluating effectively realized predictive ability (RPA) across generations is crucial. This study estimated RPAs for volume growth (VOL), wood density (WD), and pulp yield (PY) across four generations breeding of Eucalyptus grandis. The training set spanned three generations, including 34,461 trees with three-year growth data, 6,014 trees with wood quality trait data, and 1,918 trees with 12,695 SNPs (single nucleotide polymorphisms) data. Employing single-step genomic BLUP, we compared the genomic predictions of breeding values (GEBVs) for 1,153 fourth-generation full-sib seedlings in the greenhouse with their later-collected phenotypic estimated breeding values (EBVs) at age three years. RPAs were estimated using three GS targets (individual trees, trees within families, and families), two selection criteria (single- and multiple-trait), and training populations of either all 1,918 genotyped trees or the 67 direct ancestors of the selection candidates. RPAs were higher for wood quality traits (0.33 to 0.59) compared to VOL (0.14 to 0.19) and improved for wood traits (0.42 to 0.75) but not for VOL when trained only with direct ancestors, highlighting the challenges in accurately predicting growth traits. GS was more effective at excluding bottom-ranked candidates than selecting top-ranked ones. The between-family GS approach outperformed individual-tree selection for VOL (0.11 to 0.16) and PY (0.72 to 0.75), but not for WD (0.43 vs. 0.42). Furthermore, higher levels of relatedness and lower genotype by environment (G × E) interaction between training and testing populations enhanced RPAs for VOL (0.39). In summary, despite limited effectiveness in ranking top VOL individuals, GS effectively identified low-performing individuals and families. These multi-generational findings underscore GS's potential in tree breeding, stressing the importance of considering relatedness and G × E interaction for optimal performance.
Collapse
Affiliation(s)
| | - Esteban J. Jurcic
- Instituto Nacional de Tecnología Agropecuaria (INTA), Instituto de Recursos Biológicos, Centro de Investigación en Recursos Naturales, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - Pamela V. Villalba
- Instituto de Agrobiotecnología y Biología Molecular (IABiMo), INTA-CONICET, Buenos Aires, Argentina
| | | | - Dario Grattapaglia
- Plant Genetics Laboratory, EMBRAPA Genetic Resources and Biotechnology, Brasilia, Brazil
| | - Eduardo P. Cappa
- Instituto Nacional de Tecnología Agropecuaria (INTA), Instituto de Recursos Biológicos, Centro de Investigación en Recursos Naturales, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
4
|
Adunola P, Ferrão LFV, Benevenuto J, Azevedo CF, Munoz PR. Genomic selection optimization in blueberry: Data-driven methods for marker and training population design. THE PLANT GENOME 2024; 17:e20488. [PMID: 39087863 DOI: 10.1002/tpg2.20488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/25/2024] [Accepted: 06/04/2024] [Indexed: 08/02/2024]
Abstract
Genomic prediction is a modern approach that uses genome-wide markers to predict the genetic merit of unphenotyped individuals. With the potential to reduce the breeding cycles and increase the selection accuracy, this tool has been designed to rank genotypes and maximize genetic gains. Despite this importance, its practical implementation in breeding programs requires critical allocation of resources for its application in a predictive framework. In this study, we integrated genetic and data-driven methods to allocate resources for phenotyping and genotyping tailored to genomic prediction. To this end, we used a historical blueberry (Vaccinium corymbosun L.) breeding dataset containing more than 3000 individuals, genotyped using probe-based target sequencing and phenotyped for three fruit quality traits over several years. Our contribution in this study is threefold: (i) for the genotyping resource allocation, the use of genetic data-driven methods to select an optimal set of markers slightly improved prediction results for all the traits; (ii) for the long-term implication, we carried out a simulation study and emphasized that data-driven method results in a slight improvement in genetic gain over 30 cycles than random marker sampling; and (iii) for the phenotyping resource allocation, we compared different optimization algorithms to select training population, showing that it can be leveraged to increase predictive performances. Altogether, we provided a data-oriented decision-making approach for breeders by demonstrating that critical breeding decisions associated with resource allocation for genomic prediction can be tackled through a combination of statistics and genetic methods.
Collapse
Affiliation(s)
- Paul Adunola
- Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, Florida, USA
| | - Luis Felipe V Ferrão
- Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, Florida, USA
| | - Juliana Benevenuto
- Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, Florida, USA
| | - Camila F Azevedo
- Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, Florida, USA
- Statistics Department, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | - Patricio R Munoz
- Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, Florida, USA
| |
Collapse
|
5
|
Roscher-Ehrig L, Weber SE, Abbadi A, Malenica M, Abel S, Hemker R, Snowdon RJ, Wittkop B, Stahl A. Phenomic Selection for Hybrid Rapeseed Breeding. PLANT PHENOMICS (WASHINGTON, D.C.) 2024; 6:0215. [PMID: 39049840 PMCID: PMC11268845 DOI: 10.34133/plantphenomics.0215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 06/19/2024] [Indexed: 07/27/2024]
Abstract
Phenomic selection is a recent approach suggested as a low-cost, high-throughput alternative to genomic selection. Instead of using genetic markers, it employs spectral data to predict complex traits using equivalent statistical models. Phenomic selection has been shown to outperform genomic selection when using spectral data that was obtained within the same generation as the traits that were predicted. However, for hybrid breeding, the key question is whether spectral data from parental genotypes can be used to effectively predict traits in the hybrid generation. Here, we aimed to evaluate the potential of phenomic selection for hybrid rapeseed breeding. We performed predictions for various traits in a structured population of 410 test hybrids, grown in multiple environments, using near-infrared spectroscopy data obtained from harvested seeds of both the hybrids and their parental lines with different linear and nonlinear models. We found that phenomic selection within the hybrid generation outperformed genomic selection for seed yield and plant height, even when spectral data was collected at single locations, while being less affected by population structure. Furthermore, we demonstrate that phenomic prediction across generations is feasible, and selecting hybrids based on spectral data obtained from parental genotypes is competitive with genomic selection. We conclude that phenomic selection is a promising approach for rapeseed breeding that can be easily implemented without any additional costs or efforts as near-infrared spectroscopy is routinely assessed in rapeseed breeding.
Collapse
Affiliation(s)
| | - Sven E. Weber
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | | | | | | | | | - Rod J. Snowdon
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | - Benjamin Wittkop
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | - Andreas Stahl
- Julius Kuehn Institute (JKI), Federal Research Centre for Cultivated Plants,
Institute for Resistance Research and Stress Tolerance, Quedlinburg, Germany
| |
Collapse
|
6
|
Tadese D, Piepho HP, Hartung J. Accuracy of prediction from multi-environment trials for new locations using pedigree information and environmental covariates: the case of sorghum (Sorghum bicolor (L.) Moench) breeding. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:181. [PMID: 38985188 PMCID: PMC11236881 DOI: 10.1007/s00122-024-04684-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 06/25/2024] [Indexed: 07/11/2024]
Abstract
KEY MESSAGES We investigate a method of extracting and fitting synthetic environmental covariates and pedigree information in multilocation trial data analysis to predict genotype performances in untested locations. Plant breeding trials are usually conducted across multiple testing locations to predict genotype performances in the targeted population of environments. The predictive accuracy can be increased by the use of adequate statistical models. We compared linear mixed models with and without synthetic covariates (SCs) and pedigree information under the identity, the diagonal and the factor-analytic variance-covariance structures of the genotype-by-location interactions. A comparison was made to evaluate the accuracy of different models in predicting genotype performances in untested locations using the mean squared error of predicted differences (MSEPD) and the Spearman rank correlation between predicted and adjusted means. A multi-environmental trial (MET) dataset evaluated for yield performance in the dry lowland sorghum (Sorghum bicolor (L.) Moench) breeding program of Ethiopia was used. For validating our models, we followed a leave-one-location-out cross-validation strategy. A total of 65 environmental covariates (ECs) obtained from the sorghum test locations were considered. The SCs were extracted from the ECs using multivariate partial least squares analysis and subsequently fitted in the linear mixed model. Then, the model was extended accounting for pedigree information. According to the MSEPD, models accounting for SC improve predictive accuracy of genotype performances in the three of the variance-covariance structures compared to others without SC. The rank correlation was also higher for the model with the SC. When the SC was fitted, the rank correlation was 0.58 for the factor analytic, 0.51 for the diagonal and 0.46 for the identity variance-covariance structures. Our approach indicates improvement in predictive accuracy with SC in the context of genotype-by-location interactions of a sorghum breeding in Ethiopia.
Collapse
Affiliation(s)
- Diriba Tadese
- Biostatistics Unit, Institute of Crop Science, University of Hohenheim, Fruwirthstraße 23, 70599, Stuttgart, Germany.
| | - Hans-Peter Piepho
- Biostatistics Unit, Institute of Crop Science, University of Hohenheim, Fruwirthstraße 23, 70599, Stuttgart, Germany
| | - Jens Hartung
- Biostatistics Unit, Institute of Crop Science, University of Hohenheim, Fruwirthstraße 23, 70599, Stuttgart, Germany
| |
Collapse
|
7
|
Walser C, Spaccasassi A, Gradl K, Stark TD, Sterneder S, Wolter FP, Achatz F, Frank O, Somoza V, Hofmann T, Dawid C. Human Sensory, Taste Receptor, and Quantitation Studies on Kaempferol Glycosides Derived from Rapeseed/Canola Protein Isolates. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024; 72:14830-14843. [PMID: 38888424 PMCID: PMC11228994 DOI: 10.1021/acs.jafc.4c02342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/14/2024] [Accepted: 05/15/2024] [Indexed: 06/20/2024]
Abstract
Beyond the key bitter compound kaempferol 3-O-(2‴-O-sinapoyl-β-d-sophoroside) previously described in the literature (1), eight further bitter and astringent-tasting kaempferol glucosides (2-9) have been identified in rapeseed protein isolates (Brassica napus L.). The bitterness and astringency of these taste-active substances have been described with taste threshold concentrations ranging from 3.3 to 531.7 and 0.3 to 66.4 μmol/L, respectively, as determined by human sensory experiments. In this study, the impact of 1 and kaempferol 3-O-β-d-glucopyranoside (8) on TAS2R-linked proton secretion by HGT-1 cells was analyzed by quantification of the intracellular proton index. mRNA levels of bitter receptors TAS2R3, 4, 5, 13, 30, 31, 39, 40, 43, 45, 46, 50 and TAS2R8 were increased after treatment with compounds 1 and 8. Using quantitative UHPLC-MS/MSMRM measurements, the concentrations of 1-9 were determined in rapeseed/canola seeds and their corresponding protein isolates. Depending on the sample material, compounds 1, 3, and 5-9 exceeded dose over threshold (DoT) factors above one for both bitterness and astringency in selected protein isolates. In addition, an increase in the key bitter compound 1 during industrial protein production (apart from enrichment) was observed, allowing the identification of the potential precursor of 1 to be kaempferol 3-O-(2‴-O-sinapoyl-β-d-sophoroside)-7-O-β-d-glucopyranoside (3). These results may contribute to the production of less bitter and astringent rapeseed protein isolates through the optimization of breeding and postharvest downstream processing.
Collapse
Affiliation(s)
- Christoph Walser
- Chair of Food Chemistry and Molecular Sensory Science, TUM School of Life Sciences, Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
| | - Andrea Spaccasassi
- Chair of Food Chemistry and Molecular Sensory Science, TUM School of Life Sciences, Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
| | - Katrin Gradl
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
- TUM School of Life Sciences, Technical University of Munich, Alte Akademie 8a, 85354 Freising, Germany
| | - Timo D Stark
- Chair of Food Chemistry and Molecular Sensory Science, TUM School of Life Sciences, Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
| | - Sonja Sterneder
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
- Vienna Doctoral School in Chemistry, Faculty of Chemistry, University of Vienna, 1090 Vienna, Austria
- Department of Physiological Chemistry, Faculty of Chemistry, University of Vienna, 1090 Vienna, Austria
| | | | - Felicia Achatz
- Chair of Food Chemistry and Molecular Sensory Science, TUM School of Life Sciences, Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
| | - Oliver Frank
- Chair of Food Chemistry and Molecular Sensory Science, TUM School of Life Sciences, Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
| | - Veronika Somoza
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
- Department of Physiological Chemistry, Faculty of Chemistry, University of Vienna, 1090 Vienna, Austria
- ZIEL - Institute for Food & Health, Technical University of Munich, 85354 Freising, Germany
- Chair of Nutritional Systems Biology, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Thomas Hofmann
- Chair of Food Chemistry and Molecular Sensory Science, TUM School of Life Sciences, Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
| | - Corinna Dawid
- Chair of Food Chemistry and Molecular Sensory Science, TUM School of Life Sciences, Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
- ZIEL - Institute for Food & Health, Technical University of Munich, 85354 Freising, Germany
- Professorship for Functional Phytometabolomics, TUM School of Life Sciences, Technical University of Munich, Lise-Meitner-Str. 34, 85354 Freising, Germany
| |
Collapse
|
8
|
Weber SE, Roscher-Ehrig L, Kox T, Abbadi A, Stahl A, Snowdon RJ. Genomic prediction in Brassica napus: evaluating the benefit of imputed whole-genome sequencing data. Genome 2024; 67:210-222. [PMID: 38708850 DOI: 10.1139/gen-2023-0126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/07/2024]
Abstract
Advances in sequencing technology allow whole plant genomes to be sequenced with high quality. Combining genotypic and phenotypic data in genomic prediction helps breeders to select crossing partners in partially phenotyped populations. In plant breeding programs, the cost of sequencing entire breeding populations still exceeds available genotyping budgets. Hence, the method for genotyping is still mainly single nucleotide polymorphism (SNP) arrays; however, arrays are unable to assess the entire genome- and population-wide diversity. A compromise involves genotyping the entire population using an SNP array and a subset of the population with whole-genome sequencing. Both datasets can then be used to impute markers from whole-genome sequencing onto the entire population. Here, we evaluate whether imputation of whole-genome sequencing data enhances genomic predictions, using data from a nested association mapping population of rapeseed (Brassica napus). Employing two cross-validation schemes that mimic scenarios for the prediction of close and distant relatives, we show that imputed marker data do not significantly improve prediction accuracy, likely due to redundancy in relationship estimates and imputation errors. In simulation studies, only small improvements were observed, further corroborating the findings. We conclude that SNP arrays are already equipped with the information that is added by imputation through relationship and linkage disequilibrium.
Collapse
Affiliation(s)
- Sven E Weber
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Lennard Roscher-Ehrig
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | | | | | - Andreas Stahl
- Julius Kuehn Institute (JKI), Federal Research Centre for Cultivated Plants, Institute for Resistance Research and Stress Tolerance, Quedlinburg, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| |
Collapse
|
9
|
Hayatgheibi H, Hallingbäck HR, Lundqvist SO, Grahn T, Scheepers G, Nordström P, Chen ZQ, Kärkkäinen K, Wu HX, García-Gil MR. Implications of accounting for marker-based population structure in the quantitative genetic evaluation of genetic parameters related to growth and wood properties in Norway spruce. BMC Genom Data 2024; 25:60. [PMID: 38877416 PMCID: PMC11177499 DOI: 10.1186/s12863-024-01241-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Accepted: 06/05/2024] [Indexed: 06/16/2024] Open
Abstract
BACKGROUND Forest geneticists typically use provenances to account for population differences in their improvement schemes; however, the historical records of the imported materials might not be very precise or well-aligned with the genetic clusters derived from advanced molecular techniques. The main objective of this study was to assess the impact of marker-based population structure on genetic parameter estimates related to growth and wood properties and their trade-offs in Norway spruce, by either incorporating it as a fixed effect (model-A) or excluding it entirely from the analysis (model-B). RESULTS Our results indicate that models incorporating population structure significantly reduce estimates of additive genetic variance, resulting in substantial reduction of narrow-sense heritability. However, these models considerably improve prediction accuracies. This was particularly significant for growth and solid-wood properties, which showed to have the highest population genetic differentiation (QST) among the studied traits. Additionally, although the pattern of correlations remained similar across the models, their magnitude was slightly lower for models that included population structure as a fixed effect. This suggests that selection, consistently performed within populations, might be less affected by unfavourable genetic correlations compared to mass selection conducted without pedigree restrictions. CONCLUSION We conclude that the results of models properly accounting for population structure are more accurate and less biased compared to those neglecting this effect. This might have practical implications for breeders and forest managers where, decisions based on imprecise selections can pose a high risk to economic efficiency.
Collapse
Affiliation(s)
- Haleh Hayatgheibi
- Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences (SLU), Umeå, Sweden.
| | | | | | | | | | | | - Zhi-Qiang Chen
- Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences (SLU), Umeå, Sweden
| | | | - Harry X Wu
- Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences (SLU), Umeå, Sweden
- Beijing Advanced Innovation Centre for Tree Breeding By Molecular Design, Beijing Forestry University, Beijing, China
- Black Mountain Laboratory, CSIRO National Collection Research Australia, Canberra, Australia
| | - M Rosario García-Gil
- Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences (SLU), Umeå, Sweden
| |
Collapse
|
10
|
Laurençon M, Legrix J, Wagner MH, Demilly D, Baron C, Rolland S, Ducournau S, Laperche A, Nesi N. Genomic and phenomic predictions help capture low-effect alleles promoting seed germination in oilseed rape in addition to QTL analyses. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:156. [PMID: 38858297 PMCID: PMC11164772 DOI: 10.1007/s00122-024-04659-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Accepted: 05/25/2024] [Indexed: 06/12/2024]
Abstract
KEY MESSAGE Phenomic prediction implemented on a large diversity set can efficiently predict seed germination, capture low-effect favorable alleles that are not revealed by GWAS and identify promising genetic resources. Oilseed rape faces many challenges, especially at the beginning of its developmental cycle. Achieving rapid and uniform seed germination could help to ensure a successful establishment and therefore enabling the crop to compete with weeds and tolerate stresses during the earliest developmental stages. The polygenic nature of seed germination was highlighted in several studies, and more knowledge is needed about low- to moderate-effect underlying loci in order to enhance seed germination effectively by improving the genetic background and incorporating favorable alleles. A total of 17 QTL were detected for seed germination-related traits, for which the favorable alleles often corresponded to the most frequent alleles in the panel. Genomic and phenomic predictions methods provided moderate-to-high predictive abilities, demonstrating the ability to capture small additive and non-additive effects for seed germination. This study also showed that phenomic prediction estimated phenotypic values closer to phenotypic values than GEBV. Finally, as the predictive ability of phenomic prediction was less influenced by the genetic structure of the panel, it is worth using this prediction method to characterize genetic resources, particularly with a view to design prebreeding populations.
Collapse
Affiliation(s)
- Marianne Laurençon
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| | - Julie Legrix
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| | - Marie-Hélène Wagner
- Groupe d'Etude et de Contrôle des Variétés Et des Semences (GEVES), 49070, Beaucouzé, France
| | - Didier Demilly
- Groupe d'Etude et de Contrôle des Variétés Et des Semences (GEVES), 49070, Beaucouzé, France
| | - Cécile Baron
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| | - Sophie Rolland
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| | - Sylvie Ducournau
- Groupe d'Etude et de Contrôle des Variétés Et des Semences (GEVES), 49070, Beaucouzé, France
| | - Anne Laperche
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France.
| | - Nathalie Nesi
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| |
Collapse
|
11
|
Skøt L, Nay MM, Grieder C, Frey LA, Pégard M, Öhlund L, Amdahl H, Radovic J, Jaluvka L, Palmé A, Ruttink T, Lloyd D, Howarth CJ, Kölliker R. Including marker x environment interactions improves genomic prediction in red clover ( Trifolium pratense L.). FRONTIERS IN PLANT SCIENCE 2024; 15:1407609. [PMID: 38916032 PMCID: PMC11194335 DOI: 10.3389/fpls.2024.1407609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 05/20/2024] [Indexed: 06/26/2024]
Abstract
Genomic prediction has mostly been used in single environment contexts, largely ignoring genotype x environment interaction, which greatly affects the performance of plants. However, in the last decade, prediction models including marker x environment (MxE) interaction have been developed. We evaluated the potential of genomic prediction in red clover (Trifolium pratense L.) using field trial data from five European locations, obtained in the Horizon 2020 EUCLEG project. Three models were compared: (1) single environment (SingleEnv), (2) across environment (AcrossEnv), (3) marker x environment interaction (MxE). Annual dry matter yield (DMY) gave the highest predictive ability (PA). Joint analyses of DMY from years 1 and 2 from each location varied from 0.87 in Britain and Switzerland in year 1, to 0.40 in Serbia in year 2. Overall, crude protein (CP) was predicted poorly. PAs for date of flowering (DOF), however ranged from 0.87 to 0.67 for Britain and Switzerland, respectively. Across the three traits, the MxE model performed best and the AcrossEnv worst, demonstrating that including marker x environment effects can improve genomic prediction in red clover. Leaving out accessions from specific regions or from specific breeders' material in the cross validation tended to reduce PA, but the magnitude of reduction depended on trait, region and breeders' material, indicating that population structure contributed to the high PAs observed for DMY and DOF. Testing the genomic estimated breeding values on new phenotypic data from Sweden showed that DMY training data from Britain gave high PAs in both years (0.43-0.76), while DMY training data from Switzerland gave high PAs only for year 1 (0.70-0.87). The genomic predictions we report here underline the potential benefits of incorporating MxE interaction in multi-environment trials and could have perspectives for identifying markers with effects that are stable across environments, and markers with environment-specific effects.
Collapse
Affiliation(s)
- Leif Skøt
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, United Kingdom
| | - Michelle M. Nay
- Division of Plant Breeding, Fodder Plant Breeding, Agroscope, Zurich, Switzerland
| | - Christoph Grieder
- Division of Plant Breeding, Fodder Plant Breeding, Agroscope, Zurich, Switzerland
| | - Lea A. Frey
- Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Zurich, Switzerland
| | | | | | - Helga Amdahl
- Graminor Breeding Ltd., Bjørke Forsøksgård, Norway
| | | | | | - Anna Palmé
- The Nordic Genetic Resource Centre, Plant Section, Alnarp, Sweden
| | - Tom Ruttink
- Plant Sciences Unit, Flanders Research Institute for Agriculture, Fisheries and Food, Melle, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
| | - David Lloyd
- Germinal Horizon, Plas Gogerddan, Aberystwyth, United Kingdom
| | - Catherine J. Howarth
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, United Kingdom
| | - Roland Kölliker
- Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Zurich, Switzerland
| |
Collapse
|
12
|
Alam E, Moyer C, Verma S, Peres NA, Whitaker VM. Exploring the genetic basis of resistance to Neopestalotiopsis species in strawberry. THE PLANT GENOME 2024; 17:e20477. [PMID: 38822520 DOI: 10.1002/tpg2.20477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 04/30/2024] [Accepted: 05/03/2024] [Indexed: 06/03/2024]
Abstract
Aggressive strains of Neopestalotiopsis sp. have recently emerged as devastating pathogens of strawberry (Fragaria × ananassa Duchesne ex Rozier), infecting nearly all plant parts and causing severe outbreaks of leaf spot and fruit rot in Florida and globally. The development of host resistance is imperative due to the absence of fungicides that effectively inhibit Neopestalotiopsis sp. growth on an infected strawberry crop. Here, we analyzed 1578 individuals from the University of Florida's (UF) strawberry breeding program to identify and dissect genetic variation for resistance to Neopestalotiopsis sp. and to explore the feasibility of genomic selection. We found that less than 12% of elite UF germplasm exhibited resistance, with narrow-sense heritability estimates ranging from 0.28 to 0.69. Through genome-wide association studies (GWAS), we identified two loci accounting for 7%-16% of phenotypic variance across four trials and 3 years. Several candidate genes encoding pattern recognition receptors, intra-cellular nucleotide-binding leucine-rich repeats, and downstream components of plant defense pathways co-localized with the Neopestalotiopsis sp. resistance loci. Interestingly, favorable alleles at the largest-effect locus were rare in elite UF material and had previously been unintentionally introduced from an exotic cultivar. The array-based markers and candidate genes described herein provide the foundation for targeting this locus through marker-assisted selection. The predictive abilities of genomic selection models, with and without explicitly modeling peak GWAS markers as fixed effects, ranged between 0.25 and 0.59, suggesting that genomic selection holds promise for enhancing resistance to Neopestalotiopsis sp. in strawberry.
Collapse
Affiliation(s)
- Elissar Alam
- Plant Breeding Graduate Program, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, Florida, USA
| | - Catalina Moyer
- Horticultural Sciences Department, IFAS Gulf Coast Research and Education Center, University of Florida, Wimauma, Florida, USA
| | - Sujeet Verma
- Fall Creek Farm and Nursery Inc., Lowell, Oregon, USA
| | - Natalia A Peres
- Plant Pathology Department, IFAS Gulf Coast Research and Education Center, University of Florida, Wimauma, Florida, USA
| | - Vance M Whitaker
- Plant Breeding Graduate Program, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, Florida, USA
| |
Collapse
|
13
|
Brault C, Segura V, Roques M, Lamblin P, Bouckenooghe V, Pouzalgues N, Cunty C, Breil M, Frouin M, Garcin L, Camps L, Ducasse MA, Romieu C, Masson G, Julliard S, Flutre T, Le Cunff L. Enhancing grapevine breeding efficiency through genomic prediction and selection index. G3 (BETHESDA, MD.) 2024; 14:jkae038. [PMID: 38401528 PMCID: PMC10989862 DOI: 10.1093/g3journal/jkae038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 02/09/2024] [Accepted: 02/18/2024] [Indexed: 02/26/2024]
Abstract
Grapevine (Vitis vinifera) breeding reaches a critical point. New cultivars are released every year with resistance to powdery and downy mildews. However, the traditional process remains time-consuming, taking 20-25 years, and demands the evaluation of new traits to enhance grapevine adaptation to climate change. Until now, the selection process has relied on phenotypic data and a limited number of molecular markers for simple genetic traits such as resistance to pathogens, without a clearly defined ideotype, and was carried out on a large scale. To accelerate the breeding process and address these challenges, we investigated the use of genomic prediction, a methodology using molecular markers to predict genotypic values. In our study, we focused on 2 existing grapevine breeding programs: Rosé wine and Cognac production. In these programs, several families were created through crosses of emblematic and interspecific resistant varieties to powdery and downy mildews. Thirty traits were evaluated for each program, using 2 genomic prediction methods: Genomic Best Linear Unbiased Predictor and Least Absolute Shrinkage Selection Operator. The results revealed substantial variability in predictive abilities across traits, ranging from 0 to 0.9. These discrepancies could be attributed to factors such as trait heritability and trait characteristics. Moreover, we explored the potential of across-population genomic prediction by leveraging other grapevine populations as training sets. Integrating genomic prediction allowed us to identify superior individuals for each program, using multivariate selection index method. The ideotype for each breeding program was defined collaboratively with representatives from the wine-growing sector.
Collapse
Affiliation(s)
- Charlotte Brault
- UMT Geno-Vigne®, IFV, INRAE, Institut Agro Montpellier, Montpellier 34398, France
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
| | - Vincent Segura
- UMT Geno-Vigne®, IFV, INRAE, Institut Agro Montpellier, Montpellier 34398, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro Montpellier, Montpellier 34398, France
| | - Maryline Roques
- UMT Geno-Vigne®, IFV, INRAE, Institut Agro Montpellier, Montpellier 34398, France
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
| | - Pauline Lamblin
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
| | - Virginie Bouckenooghe
- UMT Geno-Vigne®, IFV, INRAE, Institut Agro Montpellier, Montpellier 34398, France
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
| | | | - Constance Cunty
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
- Centre du Rosé, Vidauban 83550, France
| | - Matthieu Breil
- UMT Geno-Vigne®, IFV, INRAE, Institut Agro Montpellier, Montpellier 34398, France
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
| | - Marina Frouin
- Conservatoire du Vignoble Charentais, Institut de Formation de Richemont, Cherves-Richemont 16370, France
| | - Léa Garcin
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
- Conservatoire du Vignoble Charentais, Institut de Formation de Richemont, Cherves-Richemont 16370, France
| | - Louise Camps
- Conservatoire du Vignoble Charentais, Institut de Formation de Richemont, Cherves-Richemont 16370, France
| | - Marie-Agnès Ducasse
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
| | - Charles Romieu
- UMT Geno-Vigne®, IFV, INRAE, Institut Agro Montpellier, Montpellier 34398, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro Montpellier, Montpellier 34398, France
| | - Gilles Masson
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
- Centre du Rosé, Vidauban 83550, France
| | - Sébastien Julliard
- Conservatoire du Vignoble Charentais, Institut de Formation de Richemont, Cherves-Richemont 16370, France
| | - Timothée Flutre
- INRAE, CNRS, AgroParisTech, Université Paris-Saclay, GQE—Le Moulon, Gif-sur-Yvette 91190, France
| | - Loïc Le Cunff
- UMT Geno-Vigne®, IFV, INRAE, Institut Agro Montpellier, Montpellier 34398, France
- Institut Français de la vigne et du vin, Pôle National Matériel Végétal, Le Grau du Roi 30240, France
| |
Collapse
|
14
|
Alemu A, Åstrand J, Montesinos-López OA, Isidro Y Sánchez J, Fernández-Gónzalez J, Tadesse W, Vetukuri RR, Carlsson AS, Ceplitis A, Crossa J, Ortiz R, Chawade A. Genomic selection in plant breeding: Key factors shaping two decades of progress. MOLECULAR PLANT 2024; 17:552-578. [PMID: 38475993 DOI: 10.1016/j.molp.2024.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/22/2024] [Accepted: 03/08/2024] [Indexed: 03/14/2024]
Abstract
Genomic selection, the application of genomic prediction (GP) models to select candidate individuals, has significantly advanced in the past two decades, effectively accelerating genetic gains in plant breeding. This article provides a holistic overview of key factors that have influenced GP in plant breeding during this period. We delved into the pivotal roles of training population size and genetic diversity, and their relationship with the breeding population, in determining GP accuracy. Special emphasis was placed on optimizing training population size. We explored its benefits and the associated diminishing returns beyond an optimum size. This was done while considering the balance between resource allocation and maximizing prediction accuracy through current optimization algorithms. The density and distribution of single-nucleotide polymorphisms, level of linkage disequilibrium, genetic complexity, trait heritability, statistical machine-learning methods, and non-additive effects are the other vital factors. Using wheat, maize, and potato as examples, we summarize the effect of these factors on the accuracy of GP for various traits. The search for high accuracy in GP-theoretically reaching one when using the Pearson's correlation as a metric-is an active research area as yet far from optimal for various traits. We hypothesize that with ultra-high sizes of genotypic and phenotypic datasets, effective training population optimization methods and support from other omics approaches (transcriptomics, metabolomics and proteomics) coupled with deep-learning algorithms could overcome the boundaries of current limitations to achieve the highest possible prediction accuracy, making genomic selection an effective tool in plant breeding.
Collapse
Affiliation(s)
- Admas Alemu
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| | - Johanna Åstrand
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden; Lantmännen Lantbruk, Svalöv, Sweden
| | | | - Julio Isidro Y Sánchez
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
| | - Javier Fernández-Gónzalez
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
| | - Wuletaw Tadesse
- International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat, Morocco
| | - Ramesh R Vetukuri
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | - Anders S Carlsson
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco, México 52640, Mexico
| | - Rodomiro Ortiz
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| | - Aakash Chawade
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| |
Collapse
|
15
|
Njuguna JN, Clark LV, Lipka AE, Anzoua KG, Bagmet L, Chebukin P, Dwiyanti MS, Dzyubenko E, Dzyubenko N, Ghimire BK, Jin X, Johnson DA, Kjeldsen JB, Nagano H, de Bem Oliveira I, Peng J, Petersen KK, Sabitov A, Seong ES, Yamada T, Yoo JH, Yu CY, Zhao H, Munoz P, Long SP, Sacks EJ. Impact of genotype-calling methodologies on genome-wide association and genomic prediction in polyploids. THE PLANT GENOME 2023; 16:e20401. [PMID: 37903749 DOI: 10.1002/tpg2.20401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 09/17/2023] [Accepted: 09/23/2023] [Indexed: 11/01/2023]
Abstract
Discovery and analysis of genetic variants underlying agriculturally important traits are key to molecular breeding of crops. Reduced representation approaches have provided cost-efficient genotyping using next-generation sequencing. However, accurate genotype calling from next-generation sequencing data is challenging, particularly in polyploid species due to their genome complexity. Recently developed Bayesian statistical methods implemented in available software packages, polyRAD, EBG, and updog, incorporate error rates and population parameters to accurately estimate allelic dosage across any ploidy. We used empirical and simulated data to evaluate the three Bayesian algorithms and demonstrated their impact on the power of genome-wide association study (GWAS) analysis and the accuracy of genomic prediction. We further incorporated uncertainty in allelic dosage estimation by testing continuous genotype calls and comparing their performance to discrete genotypes in GWAS and genomic prediction. We tested the genotype-calling methods using data from two autotetraploid species, Miscanthus sacchariflorus and Vaccinium corymbosum, and performed GWAS and genomic prediction. In the empirical study, the tested Bayesian genotype-calling algorithms differed in their downstream effects on GWAS and genomic prediction, with some showing advantages over others. Through subsequent simulation studies, we observed that at low read depth, polyRAD was advantageous in its effect on GWAS power and limit of false positives. Additionally, we found that continuous genotypes increased the accuracy of genomic prediction, by reducing genotyping error, particularly at low sequencing depth. Our results indicate that by using the Bayesian algorithm implemented in polyRAD and continuous genotypes, we can accurately and cost-efficiently implement GWAS and genomic prediction in polyploid crops.
Collapse
Affiliation(s)
- Joyce N Njuguna
- Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Lindsay V Clark
- Research Scientific Computing, Seattle Children's Research Institute, Seattle, Washington, USA
| | - Alexander E Lipka
- Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Kossonou G Anzoua
- Field Science Center for Northern Biosphere, Hokkaido University, Sapporo, Japan
| | - Larisa Bagmet
- Vavilov All-Russian Institute of Plant Genetic Resources, St. Petersburg, Russian Federation
| | - Pavel Chebukin
- FSBSI "FSC of Agricultural Biotechnology of the Far East named after A.K. Chaiki", Ussuriysk, Russian Federation
| | - Maria S Dwiyanti
- Field Science Center for Northern Biosphere, Hokkaido University, Sapporo, Japan
| | - Elena Dzyubenko
- Vavilov All-Russian Institute of Plant Genetic Resources, St. Petersburg, Russian Federation
| | - Nicolay Dzyubenko
- Vavilov All-Russian Institute of Plant Genetic Resources, St. Petersburg, Russian Federation
| | - Bimal Kumar Ghimire
- Department of Crop Science, College of Sanghuh Life Science, Konkuk University, Seoul, South Korea
| | - Xiaoli Jin
- Agronomy Department, Key Laboratory of Crop Germplasm Research of Zhejiang Province, Zhejiang University, Hangzhou, China
| | - Douglas A Johnson
- USDA-ARS Forage and Range Research Lab, Utah State University, Logan, Utah, USA
| | | | - Hironori Nagano
- Field Science Center for Northern Biosphere, Hokkaido University, Sapporo, Japan
| | | | - Junhua Peng
- Spring Valley Agriscience Co. Ltd., Jinan, China
| | | | - Andrey Sabitov
- Vavilov All-Russian Institute of Plant Genetic Resources, St. Petersburg, Russian Federation
| | - Eun Soo Seong
- Division of Bioresource Sciences, Kangwon National University, Chuncheon, South Korea
| | - Toshihiko Yamada
- Field Science Center for Northern Biosphere, Hokkaido University, Sapporo, Japan
| | - Ji Hye Yoo
- Bioherb Research Institute, Kangwon National University, Chuncheon, South Korea
| | - Chang Yeon Yu
- Bioherb Research Institute, Kangwon National University, Chuncheon, South Korea
| | - Hua Zhao
- Key Laboratory of Horticultural Plant Biology of Ministry of Education, Huazhong Agricultural University, Wuhan, China
| | - Patricio Munoz
- Horticultural Science Department, University of Florida, Gainesville, Florida, USA
| | - Stephen P Long
- Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Erik J Sacks
- Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
16
|
Meher PK, Gupta A, Rustgi S, Mir RR, Kumar A, Kumar J, Balyan HS, Gupta PK. Evaluation of eight Bayesian genomic prediction models for three micronutrient traits in bread wheat (Triticum aestivum L.). THE PLANT GENOME 2023; 16:e20332. [PMID: 37122189 DOI: 10.1002/tpg2.20332] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 02/21/2023] [Accepted: 03/13/2023] [Indexed: 06/19/2023]
Abstract
In wheat, genomic prediction accuracy (GPA) was assessed for three micronutrient traits (grain iron, grain zinc, and β-carotenoid concentrations) using eight Bayesian regression models. For this purpose, data on 246 accessions, each genotyped with 17,937 DArT markers, were utilized. The phenotypic data on traits were available for 2013-2014 from Powerkheda (Madhya Pradesh) and for 2014-2015 from Meerut (Uttar Pradesh), India. The accuracy of the models was measured in terms of reliability, which was computed following a repeated cross-validation approach. The predictions were obtained independently for each of the two environments after adjusting for the local effects and across environments after adjusting for the environmental effects. The Bayes ridge regression (BayesRR) model outperformed the other seven models, whereas BayesLASSO (BayesL) was the least efficient. The GPA increased with an increase in the size of the training set as well as with an increase in marker density. The GPA values differed for the three traits and were higher for the best linear unbiased estimate (BLUE) (obtained after adjusting for the environmental effects) relative to those for the two environments. The GPA also remained unaffected after accounting for the population structure. The results of the present study suggest that only the best model should be used for the estimations of genomic estimated breeding values (GEBVs) before their use for genomic selection to improve the grain micronutrient contents.
Collapse
Affiliation(s)
- Prabina Kumar Meher
- Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Ajit Gupta
- Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sachin Rustgi
- Department of Plant and Environmental Sciences, Pee Dee Research and Education Centre, Clemson University, Florence, South Carolina, USA
| | - Reyazul Rouf Mir
- Division of Genetics and Plant Breeding, SKUAST-Kashmir, Kashmir, India
| | - Anuj Kumar
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
- Laboratory of Immunity, Shantou University Medical College, Shantou, People's Republic of China
| | - Jitendra Kumar
- National Agri-Food Biotechnology Institute (NABI), Ajitgarh, India
| | - Harindra Singh Balyan
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, India
| | - Pushpendra Kumar Gupta
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, India
| |
Collapse
|
17
|
Sadeqi MB, Ballvora A, Dadshani S, Léon J. Genetic Parameter and Hyper-Parameter Estimation Underlie Nitrogen Use Efficiency in Bread Wheat. Int J Mol Sci 2023; 24:14275. [PMID: 37762585 PMCID: PMC10531695 DOI: 10.3390/ijms241814275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 09/07/2023] [Accepted: 09/14/2023] [Indexed: 09/29/2023] Open
Abstract
Estimation and prediction play a key role in breeding programs. Currently, phenotyping of complex traits such as nitrogen use efficiency (NUE) in wheat is still expensive, requires high-throughput technologies and is very time consuming compared to genotyping. Therefore, researchers are trying to predict phenotypes based on marker information. Genetic parameters such as population structure, genomic relationship matrix, marker density and sample size are major factors that increase the performance and accuracy of a model. However, they play an important role in adjusting the statistically significant false discovery rate (FDR) threshold in estimation. In parallel, there are many genetic hyper-parameters that are hidden and not represented in the given genomic selection (GS) model but have significant effects on the results, such as panel size, number of markers, minor allele frequency, number of call rates for each marker, number of cross validations and batch size in the training set of the genomic file. The main challenge is to ensure the reliability and accuracy of predicted breeding values (BVs) as results. Our study has confirmed the results of bias-variance tradeoff and adaptive prediction error for the ensemble-learning-based model STACK, which has the highest performance when estimating genetic parameters and hyper-parameters in a given GS model compared to other models.
Collapse
Affiliation(s)
- Mohammad Bahman Sadeqi
- INRES-Plant Breeding, Rheinische Friedrich-Wilhelms-Universität Bonn, 53113 Bonn, Germany; (M.B.S.); (J.L.)
| | - Agim Ballvora
- INRES-Plant Breeding, Rheinische Friedrich-Wilhelms-Universität Bonn, 53113 Bonn, Germany; (M.B.S.); (J.L.)
| | - Said Dadshani
- INRES-Plant Nutrition, Rheinische Friedrich-Wilhelms-Universität Bonn, 53113 Bonn, Germany;
| | - Jens Léon
- INRES-Plant Breeding, Rheinische Friedrich-Wilhelms-Universität Bonn, 53113 Bonn, Germany; (M.B.S.); (J.L.)
| |
Collapse
|
18
|
Weber SE, Frisch M, Snowdon RJ, Voss-Fels KP. Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets. FRONTIERS IN PLANT SCIENCE 2023; 14:1217589. [PMID: 37731980 PMCID: PMC10507710 DOI: 10.3389/fpls.2023.1217589] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/21/2023] [Indexed: 09/22/2023]
Abstract
In modern plant breeding, genomic selection is becoming the gold standard for selection of superior genotypes. The basis for genomic prediction models is a set of phenotyped lines along with their genotypic profile. With high marker density and linkage disequilibrium (LD) between markers, genotype data in breeding populations tends to exhibit considerable redundancy. Therefore, interest is growing in the use of haplotype blocks to overcome redundancy by summarizing co-inherited features. Moreover, haplotype blocks can help to capture local epistasis caused by interacting loci. Here, we compared genomic prediction methods that either used single SNPs or haplotype blocks with regards to their prediction accuracy for important traits in crop datasets. We used four published datasets from canola, maize, wheat and soybean. Different approaches to construct haplotype blocks were compared, including blocks based on LD, physical distance, number of adjacent markers and the algorithms implemented in the software "Haploview" and "HaploBlocker". The tested prediction methods included Genomic Best Linear Unbiased Prediction (GBLUP), Extended GBLUP to account for additive by additive epistasis (EGBLUP), Bayesian LASSO and Reproducing Kernel Hilbert Space (RKHS) regression. We found improved prediction accuracy in some traits when using haplotype blocks compared to SNP-based predictions, however the magnitude of improvement was very trait- and model-specific. Especially in settings with low marker density, haplotype blocks can improve genomic prediction accuracy. In most cases, physically large haplotype blocks yielded a strong decrease in prediction accuracy. Especially when prediction accuracy varies greatly across different prediction models, prediction based on haplotype blocks can improve prediction accuracy of underperforming models. However, there is no "best" method to build haplotype blocks, since prediction accuracy varied considerably across methods and traits. Hence, criteria used to define haplotype blocks should not be viewed as fixed biological parameters, but rather as hyperparameters that need to be adjusted for every dataset.
Collapse
Affiliation(s)
- Sven E. Weber
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Matthias Frisch
- Department of Biometry and Population Genetics, Justus Liebig University, Giessen, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Kai P. Voss-Fels
- Institute for Grapevine Breeding, Hochschule Geisenheim University, Geisenheim, Germany
| |
Collapse
|
19
|
Solovieva E, Sakai H. PSReliP: an integrated pipeline for analysis and visualization of population structure and relatedness based on genome-wide genetic variant data. BMC Bioinformatics 2023; 24:135. [PMID: 37020193 PMCID: PMC10074814 DOI: 10.1186/s12859-023-05169-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 02/02/2023] [Indexed: 04/07/2023] Open
Abstract
BACKGROUND Population structure and cryptic relatedness between individuals (samples) are two major factors affecting false positives in genome-wide association studies (GWAS). In addition, population stratification and genetic relatedness in genomic selection in animal and plant breeding can affect prediction accuracy. The methods commonly used for solving these problems are principal component analysis (to adjust for population stratification) and marker-based kinship estimates (to correct for the confounding effects of genetic relatedness). Currently, many tools and software are available that analyze genetic variation among individuals to determine population structure and genetic relationships. However, none of these tools or pipelines perform such analyses in a single workflow and visualize all the various results in a single interactive web application. RESULTS We developed PSReliP, a standalone, freely available pipeline for the analysis and visualization of population structure and relatedness between individuals in a user-specified genetic variant dataset. The analysis stage of PSReliP is responsible for executing all steps of data filtering and analysis and contains an ordered sequence of commands from PLINK, a whole-genome association analysis toolset, along with in-house shell scripts and Perl programs that support data pipelining. The visualization stage is provided by Shiny apps, an R-based interactive web application. In this study, we describe the characteristics and features of PSReliP and demonstrate how it can be applied to real genome-wide genetic variant data. CONCLUSIONS The PSReliP pipeline allows users to quickly analyze genetic variants such as single nucleotide polymorphisms and small insertions or deletions at the genome level to estimate population structure and cryptic relatedness using PLINK software and to visualize the analysis results in interactive tables, plots, and charts using Shiny technology. The analysis and assessment of population stratification and genetic relatedness can aid in choosing an appropriate approach for the statistical analysis of GWAS data and predictions in genomic selection. The various outputs from PLINK can be used for further downstream analysis. The code and manual for PSReliP are available at https://github.com/solelena/PSReliP .
Collapse
Affiliation(s)
- Elena Solovieva
- Research Center for Advanced Analysis, National Agriculture and Food Research Organization, Tsukuba, Ibaraki, Japan
| | - Hiroaki Sakai
- Research Center for Advanced Analysis, National Agriculture and Food Research Organization, Tsukuba, Ibaraki, Japan.
| |
Collapse
|
20
|
Nadeau S, Beaulieu J, Gezan SA, Perron M, Bousquet J, Lenz PRN. Increasing genomic prediction accuracy for unphenotyped full-sib families by modeling additive and dominance effects with large datasets in white spruce. FRONTIERS IN PLANT SCIENCE 2023; 14:1137834. [PMID: 37035077 PMCID: PMC10073444 DOI: 10.3389/fpls.2023.1137834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 02/14/2023] [Indexed: 06/19/2023]
Abstract
Introduction Genomic selection is becoming a standard technique in plant breeding and is now being introduced into forest tree breeding. Despite promising results to predict the genetic merit of superior material based on their additive breeding values, many studies and operational programs still neglect non-additive effects and their potential for enhancing genetic gains. Methods Using two large comprehensive datasets totaling 4,066 trees from 146 full-sib families of white spruce (Picea glauca (Moench) Voss), we evaluated the effect of the inclusion of dominance on the precision of genetic parameter estimates and on the accuracy of conventional pedigree-based (ABLUP-AD) and genomic-based (GBLUP-AD) models. Results While wood quality traits were mostly additively inherited, considerable non-additive effects and lower heritabilities were detected for growth traits. For growth, GBLUP-AD better partitioned the additive and dominance effects into roughly equal variances, while ABLUP-AD strongly overestimated dominance. The predictive abilities of breeding and total genetic value estimates were similar between ABLUP-AD and GBLUP-AD when predicting individuals from the same families as those included in the training dataset. However, GBLUP-AD outperformed ABLUP-AD when predicting for new unphenotyped families that were not represented in the training dataset, with, on average, 22% and 53% higher predictive ability of breeding and genetic values, respectively. Resampling simulations showed that GBLUP-AD required smaller sample sizes than ABLUP-AD to produce precise estimates of genetic variances and accurate predictions of genetic values. Still, regardless of the method used, large training datasets were needed to estimate additive and non-additive genetic variances precisely. Discussion This study highlights the different quantitative genetic architectures between growth and wood traits. Furthermore, the usefulness of genomic additive-dominance models for predicting new families should allow practicing mating allocation to maximize the total genetic values for the propagation of elite material.
Collapse
Affiliation(s)
- Simon Nadeau
- Natural Resources Canada, Canadian Forest Service, Canadian Wood Fibre Centre, Québec, QC, Canada
| | - Jean Beaulieu
- Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada
| | | | - Martin Perron
- Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada
- Direction de la Recherche Forestière, Ministère des Ressources Naturelles et des Forêts, Québec, QC, Canada
| | - Jean Bousquet
- Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada
| | - Patrick R. N. Lenz
- Natural Resources Canada, Canadian Forest Service, Canadian Wood Fibre Centre, Québec, QC, Canada
- Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada
| |
Collapse
|
21
|
Gevartosky R, Carvalho HF, Costa-Neto G, Montesinos-López OA, Crossa J, Fritsche-Neto R. Enviromic-based kernels may optimize resource allocation with multi-trait multi-environment genomic prediction for tropical Maize. BMC PLANT BIOLOGY 2023; 23:10. [PMID: 36604618 PMCID: PMC9814176 DOI: 10.1186/s12870-022-03975-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 11/24/2022] [Indexed: 06/17/2023]
Abstract
BACKGROUND Success in any genomic prediction platform is directly dependent on establishing a representative training set. This is a complex task, even in single-trait single-environment conditions and tends to be even more intricated wherein additional information from envirotyping and correlated traits are considered. Here, we aimed to design optimized training sets focused on genomic prediction, considering multi-trait multi-environment trials, and how those methods may increase accuracy reducing phenotyping costs. For that, we considered single-trait multi-environment trials and multi-trait multi-environment trials for three traits: grain yield, plant height, and ear height, two datasets, and two cross-validation schemes. Next, two strategies for designing optimized training sets were conceived, first considering only the genomic by environment by trait interaction (GET), while a second including large-scale environmental data (W, enviromics) as genomic by enviromic by trait interaction (GWT). The effective number of individuals (genotypes × environments × traits) was assumed as those that represent at least 98% of each kernel (GET or GWT) variation, in which those individuals were then selected by a genetic algorithm based on prediction error variance criteria to compose an optimized training set for genomic prediction purposes. RESULTS The combined use of genomic and enviromic data efficiently designs optimized training sets for genomic prediction, improving the response to selection per dollar invested by up to 145% when compared to the model without enviromic data, and even more when compared to cross validation scheme with 70% of training set or pure phenotypic selection. Prediction models that include G × E or enviromic data + G × E yielded better prediction ability. CONCLUSIONS Our findings indicate that a genomic by enviromic by trait interaction kernel associated with genetic algorithms is efficient and can be proposed as a promising approach to designing optimized training sets for genomic prediction when the variance-covariance matrix of traits is available. Additionally, great improvements in the genetic gains per dollar invested were observed, suggesting that a good allocation of resources can be deployed by using the proposed approach.
Collapse
Affiliation(s)
- Raysa Gevartosky
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil.
| | - Humberto Fanelli Carvalho
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM), Madrid, Spain
| | - Germano Costa-Neto
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil
- Institute for Genomics Diversity, Cornell University, Ithaca, NY, USA
| | | | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera Mexico-Veracruz, CP 52640, Texcoco, Edo. de México, Mexico
- Colegio de Postgraduados, CP 56230, Montecillos, Edo. de México, Mexico
| | - Roberto Fritsche-Neto
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil
| |
Collapse
|
22
|
Berkner MO, Schulthess AW, Zhao Y, Jiang Y, Oppermann M, Reif JC. Choosing the right tool: Leveraging of plant genetic resources in wheat (Triticum aestivum L.) benefits from selection of a suitable genomic prediction model. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:4391-4407. [PMID: 36182979 PMCID: PMC9734214 DOI: 10.1007/s00122-022-04227-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Accepted: 09/17/2022] [Indexed: 06/16/2023]
Abstract
Genomic prediction of genebank accessions benefits from the consideration of additive-by-additive epistasis and subpopulation-specific marker effects. Wheat (Triticum aestivum L.) and other species of the Triticum genus are well represented in genebank collections worldwide. The substantial genetic diversity harbored by more than 850,000 accessions can be explored for their potential use in modern plant breeding. Characterization of these large number of accessions is constrained by the required resources, and this fact limits their use so far. This limitation might be overcome by engaging genomic prediction. The present study compared ten different genomic prediction approaches to the prediction of four traits, namely flowering time, plant height, thousand grain weight, and yellow rust resistance, in a diverse set of 7745 accession samples from Germany's Federal ex situ genebank at the Leibniz Institute of Plant Genetics and Crop Plant Research in Gatersleben. Approaches were evaluated based on prediction ability and robustness to the confounding influence of strong population structure. The authors propose the wide application of extended genomic best linear unbiased prediction due to the observed benefit of incorporating additive-by-additive epistasis. General and subpopulation-specific additive ridge regression best linear unbiased prediction, which accounts for subpopulation-specific marker-effects, was shown to be a good option if contrasting clusters are encountered in the analyzed collection. The presented findings reaffirm that the trait's genetic architecture as well as the composition and relatedness of the training set and test set are major driving factors for the accuracy of genomic prediction.
Collapse
Affiliation(s)
- Marcel O Berkner
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Stadt Seeland, Germany
| | - Albert W Schulthess
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Stadt Seeland, Germany
| | - Yusheng Zhao
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Stadt Seeland, Germany
| | - Yong Jiang
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Stadt Seeland, Germany
| | - Markus Oppermann
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Stadt Seeland, Germany
| | - Jochen C Reif
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Stadt Seeland, Germany.
| |
Collapse
|
23
|
Wilkinson MJ, Yamashita R, James ME, Bally ISE, Dillon NL, Ali A, Hardner CM, Ortiz-Barrientos D. The influence of genetic structure on phenotypic diversity in the Australian mango (Mangifera indica) gene pool. Sci Rep 2022; 12:20614. [PMID: 36450793 PMCID: PMC9712640 DOI: 10.1038/s41598-022-24800-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 11/21/2022] [Indexed: 12/11/2022] Open
Abstract
Genomic selection is a promising breeding technique for tree crops to accelerate the development of new cultivars. However, factors such as genetic structure can create spurious associations between genotype and phenotype due to the shared history between populations with different trait values. Genetic structure can therefore reduce the accuracy of the genotype to phenotype map, a fundamental requirement of genomic selection models. Here, we employed 272 single nucleotide polymorphisms from 208 Mangifera indica accessions to explore whether the genetic structure of the Australian mango gene pool explained variation in trunk circumference, fruit blush colour and intensity. Multiple population genetic analyses indicate the presence of four genetic clusters and show that the most genetically differentiated cluster contains accessions imported from Southeast Asia (mainly those from Thailand). We find that genetic structure was strongly associated with three traits: trunk circumference, fruit blush colour and intensity in M. indica. This suggests that the history of these accessions could drive spurious associations between loci and key mango phenotypes in the Australian mango gene pool. Incorporating such genetic structure in associations between genotype and phenotype can improve the accuracy of genomic selection, which can assist the future development of new cultivars.
Collapse
Affiliation(s)
- Melanie J Wilkinson
- School of Biological Sciences, The University of Queensland, Brisbane, QLD, 4072, Australia.
- Australian Research Council Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, Brisbane, QLD, 4072, Australia.
| | - Risa Yamashita
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Maddie E James
- School of Biological Sciences, The University of Queensland, Brisbane, QLD, 4072, Australia
- Australian Research Council Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Ian S E Bally
- Queensland Department of Agriculture and Fisheries, Mareeba, QLD, 4880, Australia
| | - Natalie L Dillon
- Queensland Department of Agriculture and Fisheries, Mareeba, QLD, 4880, Australia
| | - Asjad Ali
- Queensland Department of Agriculture and Fisheries, Mareeba, QLD, 4880, Australia
| | - Craig M Hardner
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Daniel Ortiz-Barrientos
- School of Biological Sciences, The University of Queensland, Brisbane, QLD, 4072, Australia
- Australian Research Council Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, Brisbane, QLD, 4072, Australia
| |
Collapse
|
24
|
Hardner CM, Fikere M, Gasic K, da Silva Linge C, Worthington M, Byrne D, Rawandoozi Z, Peace C. Multi-environment genomic prediction for soluble solids content in peach ( Prunus persica). FRONTIERS IN PLANT SCIENCE 2022; 13:960449. [PMID: 36275520 PMCID: PMC9583944 DOI: 10.3389/fpls.2022.960449] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Accepted: 08/01/2022] [Indexed: 06/16/2023]
Abstract
Genotype-by-environment interaction (G × E) is a common phenomenon influencing genetic improvement in plants, and a good understanding of this phenomenon is important for breeding and cultivar deployment strategies. However, there is little information on G × E in horticultural tree crops, mostly due to evaluation costs, leading to a focus on the development and deployment of locally adapted germplasm. Using sweetness (measured as soluble solids content, SSC) in peach/nectarine assessed at four trials from three US peach-breeding programs as a case study, we evaluated the hypotheses that (i) complex data from multiple breeding programs can be connected using GBLUP models to improve the knowledge of G × E for breeding and deployment and (ii) accounting for a known large-effect quantitative trait locus (QTL) improves the prediction accuracy. Following a structured strategy using univariate and multivariate models containing additive and dominance genomic effects on SSC, a model that included a previously detected QTL and background genomic effects was a significantly better fit than a genome-wide model with completely anonymous markers. Estimates of an individual's narrow-sense and broad-sense heritability for SSC were high (0.57-0.73 and 0.66-0.80, respectively), with 19-32% of total genomic variance explained by the QTL. Genome-wide dominance effects and QTL effects were stable across environments. Significant G × E was detected for background genome effects, mostly due to the low correlation of these effects across seasons within a particular trial. The expected prediction accuracy, estimated from the linear model, was higher than the realised prediction accuracy estimated by cross-validation, suggesting that these two parameters measure different qualities of the prediction models. While prediction accuracy was improved in some cases by combining data across trials, particularly when phenotypic data for untested individuals were available from other trials, this improvement was not consistent. This study confirms that complex data can be combined into a single analysis using GBLUP methods to improve understanding of G × E and also incorporate known QTL effects. In addition, the study generated baseline information to account for population structure in genomic prediction models in horticultural crop improvement.
Collapse
Affiliation(s)
- Craig M. Hardner
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia
| | - Mulusew Fikere
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia
| | - Ksenija Gasic
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC, United States
| | - Cassia da Silva Linge
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC, United States
| | - Margaret Worthington
- Faculty Horticulture, University of Arkansas System Division of Agriculture, Fayetteville, AR, United States
| | - David Byrne
- College of Agriculture and Life Sciences, Texas A&M University, College Station, TX, United States
| | - Zena Rawandoozi
- College of Agriculture and Life Sciences, Texas A&M University, College Station, TX, United States
| | - Cameron Peace
- Department of Horticulture, Washington State University, Pullman, WA, United States
| |
Collapse
|
25
|
Cerioli T, Hernandez CO, Angira B, McCouch SR, Robbins KR, Famoso AN. Development and validation of an optimized marker set for genomic selection in southern U.S. rice breeding programs. THE PLANT GENOME 2022; 15:e20219. [PMID: 35611838 DOI: 10.1002/tpg2.20219] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 03/28/2022] [Indexed: 06/15/2023]
Abstract
The potential of genomic selection (GS) to increase the efficiency of breeding programs has been clearly demonstrated; however, the implementation of GS in rice (Oryza sativa L.) breeding programs has been limited. In recent years, efforts have begun to work toward implementing GS into the Louisiana State University (LSU) Agricultural Center rice breeding program. One of the first steps for successful GS implementation is to establish a suitable marker set for the target germplasm and a reliable, cost-effective genotyping platform capable of providing informative marker data with an adequate turnaround time. The objective of this study was to develop a marker set for routine GS and demonstrate its effectiveness in southern U.S. rice germplasm. The utility of the resulting marker set, the LSU500, for GS applications was demonstrated using four years of breeding data across 7,607 experimental lines and four elite biparental populations. The predictive ability of GS ranged from 0.13 to 0.78 for key traits across different market classes and yield trials. Comparisons between phenotypic selection and GS within biparental populations demonstrates similar performance of GS compared with phenotypic selection in predicting future performance. The prediction accuracies obtained with the LSU500 marker set demonstrates the utility of this marker set for cost-effective GS applications in southern U.S. rice breeding programs. The LSU500 marker set has been established through the genotyping service provider Agriplex Genomics, and in the future, it will undergo improvements to reduce the cost and increase the accuracy of GS.
Collapse
Affiliation(s)
- Tommaso Cerioli
- H. Rouse Caffey Rice Research Station, Louisiana State Univ. Agricultural Center, Rayne, LA, 70578, USA
| | - Christopher O Hernandez
- H. Rouse Caffey Rice Research Station, Louisiana State Univ. Agricultural Center, Rayne, LA, 70578, USA
| | - Brijesh Angira
- H. Rouse Caffey Rice Research Station, Louisiana State Univ. Agricultural Center, Rayne, LA, 70578, USA
| | - Susan R McCouch
- Section of Plant Breeding and Genetics, School of Integrative Plant Sciences, Cornell Univ., Ithaca, NY, 14850, USA
- Cornell Institute for Digital Agriculture, Cornell Univ., Ithaca, NY, 14850, USA
| | - Kelly R Robbins
- Section of Plant Breeding and Genetics, School of Integrative Plant Sciences, Cornell Univ., Ithaca, NY, 14850, USA
| | - Adam N Famoso
- H. Rouse Caffey Rice Research Station, Louisiana State Univ. Agricultural Center, Rayne, LA, 70578, USA
| |
Collapse
|
26
|
Semagn K, Iqbal M, Jarquin D, Randhawa H, Aboukhaddour R, Howard R, Ciechanowska I, Farzand M, Dhariwal R, Hiebert CW, N’Diaye A, Pozniak C, Spaner D. Genomic Prediction Accuracy of Stripe Rust in Six Spring Wheat Populations by Modeling Genotype by Environment Interaction. PLANTS (BASEL, SWITZERLAND) 2022; 11:1736. [PMID: 35807690 PMCID: PMC9269065 DOI: 10.3390/plants11131736] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 06/27/2022] [Accepted: 06/28/2022] [Indexed: 11/17/2022]
Abstract
Some previous studies have assessed the predictive ability of genome-wide selection on stripe (yellow) rust resistance in wheat, but the effect of genotype by environment interaction (GEI) in prediction accuracies has not been well studied in diverse genetic backgrounds. Here, we compared the predictive ability of a model based on phenotypic data only (M1), the main effect of phenotype and molecular markers (M2), and a model that incorporated GEI (M3) using three cross-validations (CV1, CV2, and CV0) scenarios of interest to breeders in six spring wheat populations. Each population was evaluated at three to eight field nurseries and genotyped with either the DArTseq technology or the wheat 90K single nucleotide polymorphism arrays, of which a subset of 1,058- 23,795 polymorphic markers were used for the analyses. In the CV1 scenario, the mean prediction accuracies of the M1, M2, and M3 models across the six populations varied from -0.11 to -0.07, from 0.22 to 0.49, and from 0.19 to 0.48, respectively. Mean accuracies obtained using the M3 model in the CV1 scenario were significantly greater than the M2 model in two populations, the same in three populations, and smaller in one population. In both the CV2 and CV0 scenarios, the mean prediction accuracies of the three models varied from 0.53 to 0.84 and were not significantly different in all populations, except the Attila/CDC Go in the CV2, where the M3 model gave greater accuracy than both the M1 and M2 models. Overall, the M3 model increased prediction accuracies in some populations by up to 12.4% and decreased accuracy in others by up to 17.4%, demonstrating inconsistent results among genetic backgrounds that require considering each population separately. This is the first comprehensive genome-wide prediction study that investigated details of the effect of GEI on stripe rust resistance across diverse spring wheat populations.
Collapse
Affiliation(s)
- Kassa Semagn
- Department of Agricultural, Food and Nutritional Science, 4-10 Agriculture-Forestry Centre, University of Alberta, Edmonton, AB T6G 2P5, Canada; (M.I.); (I.C.); (M.F.)
| | - Muhammad Iqbal
- Department of Agricultural, Food and Nutritional Science, 4-10 Agriculture-Forestry Centre, University of Alberta, Edmonton, AB T6G 2P5, Canada; (M.I.); (I.C.); (M.F.)
| | - Diego Jarquin
- Agronomy Department, University of Florida, Gainesville, FL 32611, USA;
| | - Harpinder Randhawa
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1, Canada; (H.R.); (R.A.); (R.D.)
| | - Reem Aboukhaddour
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1, Canada; (H.R.); (R.A.); (R.D.)
| | - Reka Howard
- Department of Statistics, University of Nebraska–Lincoln, Lincoln, NE 68583, USA;
| | - Izabela Ciechanowska
- Department of Agricultural, Food and Nutritional Science, 4-10 Agriculture-Forestry Centre, University of Alberta, Edmonton, AB T6G 2P5, Canada; (M.I.); (I.C.); (M.F.)
| | - Momna Farzand
- Department of Agricultural, Food and Nutritional Science, 4-10 Agriculture-Forestry Centre, University of Alberta, Edmonton, AB T6G 2P5, Canada; (M.I.); (I.C.); (M.F.)
| | - Raman Dhariwal
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1, Canada; (H.R.); (R.A.); (R.D.)
| | - Colin W. Hiebert
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Route 100, Morden, MB R6M 1Y5, Canada;
| | - Amidou N’Diaye
- Crop Development Centre and Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8, Canada; (A.N.); (C.P.)
| | - Curtis Pozniak
- Crop Development Centre and Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8, Canada; (A.N.); (C.P.)
| | - Dean Spaner
- Department of Agricultural, Food and Nutritional Science, 4-10 Agriculture-Forestry Centre, University of Alberta, Edmonton, AB T6G 2P5, Canada; (M.I.); (I.C.); (M.F.)
| |
Collapse
|
27
|
So D, Smith A, Sparry E, Lukens L. Genetics, not environment, contributed to winter wheat yield gains in Ontario, Canada. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:1893-1908. [PMID: 35348822 DOI: 10.1007/s00122-022-04082-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 03/12/2022] [Indexed: 06/14/2023]
Abstract
Changes in entries' market classes and genetic improvements within classes-not environmental changes-enhanced yields over thirty-one years of wheat trials. Correlations between yields and ancestries drove genomic prediction accuracies. Increasing crop yields is important for enhancing farmers' livelihoods, meeting market demands, and reducing the environmental impact of agriculture. We analyzed the yield trends of Ontario winter wheat variety trials between 1988 and 2018. Over this period, wheat yields steadily increased by 38 kg ha-1 yr-1, or 0.68% yr-1 relative to the mean. While fungicide treatment of trials contributed a one-time 670 kg ha-1 yield increase, yields were otherwise unaffected by long-term changes in agronomic practice, climate, or other non-genetic factors. Genetic improvement entirely accounted for yield improvement. Market class changes over the 31 year span accounted for some yield improvement. More importantly, genetic improvement occurred within each market class. Entry yield estimates calculated from genomic prediction models strongly correlated with field estimated yields with a mean r of 0.68. Genomic prediction accuracies were high because yields differed across genetically distinct subpopulations. Despite environmental changes, genetic improvement will likely increase Ontario winter wheat yields into the future.
Collapse
Affiliation(s)
- Delvin So
- Department of Plant Agriculture, University of Guelph, Guelph, ON, N1G2W1, Canada
| | - Alexandra Smith
- Department of Plant Agriculture, University of Guelph, Guelph, ON, N1G2W1, Canada
| | - Ellen Sparry
- C and M Seed, 6180 5th Line, Palmerston, ON, N0G2P0, Canada
| | - Lewis Lukens
- Department of Plant Agriculture, University of Guelph, Guelph, ON, N1G2W1, Canada.
| |
Collapse
|
28
|
Juliana P, He X, Poland J, Roy KK, Malaker PK, Mishra VK, Chand R, Shrestha S, Kumar U, Roy C, Gahtyari NC, Joshi AK, Singh RP, Singh PK. Genomic selection for spot blotch in bread wheat breeding panels, full-sibs and half-sibs and index-based selection for spot blotch, heading and plant height. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:1965-1983. [PMID: 35416483 PMCID: PMC9205839 DOI: 10.1007/s00122-022-04087-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 03/17/2022] [Indexed: 06/14/2023]
Abstract
KEY MESSAGE Genomic selection is a promising tool to select for spot blotch resistance and index-based selection can simultaneously select for spot blotch resistance, heading and plant height. A major biotic stress challenging bread wheat production in regions characterized by humid and warm weather is spot blotch caused by the fungus Bipolaris sorokiniana. Since genomic selection (GS) is a promising selection tool, we evaluated its potential for spot blotch in seven breeding panels comprising 6736 advanced lines from the International Maize and Wheat Improvement Center. Our results indicated moderately high mean genomic prediction accuracies of 0.53 and 0.40 within and across breeding panels, respectively which were on average 177.6% and 60.4% higher than the mean accuracies from fixed effects models using selected spot blotch loci. Genomic prediction was also evaluated in full-sibs and half-sibs panels and sibs were predicted with the highest mean accuracy (0.63) from a composite training population with random full-sibs and half-sibs. The mean accuracies when full-sibs were predicted from other full-sibs within families and when full-sibs panels were predicted from other half-sibs panels were 0.47 and 0.44, respectively. Comparison of GS with phenotypic selection (PS) of the top 10% of resistant lines suggested that GS could be an ideal tool to discard susceptible lines, as greater than 90% of the susceptible lines discarded by PS were also discarded by GS. We have also reported the evaluation of selection indices to simultaneously select non-late and non-tall genotypes with low spot blotch phenotypic values and genomic-estimated breeding values. Overall, this study demonstrates the potential of integrating GS and index-based selection for improving spot blotch resistance in bread wheat.
Collapse
Affiliation(s)
- Philomin Juliana
- Borlaug Institute for South Asia (BISA), Ludhiana, Punjab, India
| | - Xinyao He
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico, DF, Mexico
| | - Jesse Poland
- Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Krishna K Roy
- Bangladesh Wheat and Maize Research Institute, Nashipur, Dinajpur, 5200, Bangladesh
| | - Paritosh K Malaker
- Bangladesh Wheat and Maize Research Institute, Nashipur, Dinajpur, 5200, Bangladesh
| | - Vinod K Mishra
- Institute of Agricultural Sciences, Banaras Hindu University, Varanasi, Uttar Pradesh, India
| | - Ramesh Chand
- Institute of Agricultural Sciences, Banaras Hindu University, Varanasi, Uttar Pradesh, India
| | - Sandesh Shrestha
- Department of Plant Pathology, Wheat Genetics Resource Center, Kansas State University, Manhattan, KS, USA
| | - Uttam Kumar
- Borlaug Institute for South Asia (BISA), Ludhiana, Punjab, India
| | - Chandan Roy
- Department of Plant Breeding and Genetics, Bihar Agricultural University, Sabour, Bihar, 813210, India
| | - Navin C Gahtyari
- ICAR-Vivekanand Parvatiya Krishi Anushandhan Sansthan, Almora, Uttarakhand, 263601, India
| | - Arun K Joshi
- Borlaug Institute for South Asia (BISA), Ludhiana, Punjab, India
- CIMMYT-India, NASC Complex, DPS Marg, New Delhi, India
| | - Ravi P Singh
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico, DF, Mexico.
| | - Pawan K Singh
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico, DF, Mexico.
| |
Collapse
|
29
|
Brault C, Segura V, This P, Le Cunff L, Flutre T, François P, Pons T, Péros JP, Doligez A. Across-population genomic prediction in grapevine opens up promising prospects for breeding. HORTICULTURE RESEARCH 2022; 9:uhac041. [PMID: 35184162 PMCID: PMC9070645 DOI: 10.1093/hr/uhac041] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 02/01/2022] [Indexed: 05/15/2023]
Abstract
Crop breeding involves two selection steps: choosing progenitors and selecting individuals within progenies. Genomic prediction, based on genome-wide marker estimation of genetic values, could facilitate these steps. However, its potential usefulness in grapevine (Vitis vinifera L.) has only been evaluated in non-breeding contexts mainly through cross-validation within a single population. We tested across-population genomic prediction in a more realistic breeding configuration, from a diversity panel to ten bi-parental crosses connected within a half-diallel mating design. Prediction quality was evaluated over 15 traits of interest (related to yield, berry composition, phenology and vigour), for both the average genetic value of each cross (cross mean) and the genetic values of individuals within each cross (individual values). Genomic prediction in these conditions was found useful: for cross mean, average per-trait predictive ability was 0.6, while per-cross predictive ability was halved on average, but reached a maximum of 0.7. Mean predictive ability for individual values within crosses was 0.26, about half the within-half-diallel value taken as a reference. For some traits and/or crosses, these across-population predictive ability values are promising for implementing genomic selection in grapevine breeding. This study also provided key insights on variables affecting predictive ability. Per-cross predictive ability was well predicted by genetic distance between parents and when this predictive ability was below 0.6, it was improved by training set optimization. For individual values, predictive ability mostly depended on trait-related variables (magnitude of the cross effect and heritability). These results will greatly help designing grapevine breeding programs assisted by genomic prediction.
Collapse
Affiliation(s)
- Charlotte Brault
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
- Institut Français de la Vigne et du Vin, F-34398 Montpellier, France
| | - Vincent Segura
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Patrice This
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Loïc Le Cunff
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
- Institut Français de la Vigne et du Vin, F-34398 Montpellier, France
| | - Timothée Flutre
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE – Le Moulon, 91190, Gif-sur-Yvette, France
| | - Pierre François
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Thierry Pons
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Jean-Pierre Péros
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Agnès Doligez
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| |
Collapse
|
30
|
Lauer E, Holland J, Isik F. Prediction ability of genome-wide markers in Pinus taeda L. within and between population is affected by relatedness to the training population and trait genetic architecture. G3 (BETHESDA, MD.) 2022; 12:6440053. [PMID: 34849838 PMCID: PMC9210318 DOI: 10.1093/g3journal/jkab405] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 11/08/2021] [Indexed: 11/26/2022]
Abstract
Genomic prediction has the potential to significantly increase the rate of genetic gain in tree breeding programs. In this study, a clonally replicated population (n = 2063) was used to train a genomic prediction model. The model was validated both within the training population and in a separate population (n = 451). The prediction abilities from random (20% vs 80%) cross validation within the training population were 0.56 and 0.78 for height and stem form, respectively. Removal of all full-sib relatives within the training population resulted in ∼50% reduction in their genomic prediction ability for both traits. The average prediction ability for all 451 individual trees was 0.29 for height and 0.57 for stem form. The degree of genetic linkage (full-sib family, half sib family, unrelated) between the training and validation sets had a strong impact on prediction ability for stem form but not for height. A dominant dwarfing allele, the first to be reported in a conifer species, was discovered via genome-wide association studies on linkage Group 5 that conferred a 0.33-m mean height reduction. However, the QTL was family specific. The rapid decay of linkage disequilibrium, large genome size, and inconsistencies in marker-QTL linkage phase suggest that large, diverse training populations are needed for genomic selection in Pinus taeda L.
Collapse
Affiliation(s)
- Edwin Lauer
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC 27695, USA
| | - James Holland
- USDA-ARS Plant Science Research Unit, North Carolina State University, Raleigh, NC 27695, USA
| | - Fikret Isik
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC 27695, USA
| |
Collapse
|
31
|
Torres LG, de Oliveira EJ, Ogbonna AC, Bauchet GJ, Mueller LA, Azevedo CF, Fonseca e Silva F, Simiqueli GF, de Resende MDV. Can Cross-Country Genomic Predictions Be a Reasonable Strategy to Support Germplasm Exchange? - A Case Study With Hydrogen Cyanide in Cassava. FRONTIERS IN PLANT SCIENCE 2021; 12:742638. [PMID: 34956254 PMCID: PMC8692580 DOI: 10.3389/fpls.2021.742638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 11/08/2021] [Indexed: 06/14/2023]
Abstract
Genomic prediction (GP) offers great opportunities for accelerated genetic gains by optimizing the breeding pipeline. One of the key factors to be considered is how the training populations (TP) are composed in terms of genetic improvement, kinship/origin, and their impacts on GP. Hydrogen cyanide content (HCN) is a determinant trait to guide cassava's products usage and processing. This work aimed to achieve the following objectives: (i) evaluate the feasibility of using cross-country (CC) GP between germplasm's of Embrapa Mandioca e Fruticultura (Embrapa, Brazil) and The International Institute of Tropical Agriculture (IITA, Nigeria) for HCN; (ii) provide an assessment of population structure for the joint dataset; (iii) estimate the genetic parameters based on single nucleotide polymorphisms (SNPs) and a haplotype-approach. Datasets of HCN from Embrapa and IITA breeding programs were analyzed, separately and jointly, with 1,230, 590, and 1,820 clones, respectively. After quality control, ∼14K SNPs were used for GP. The genomic estimated breeding values (GEBVs) were predicted based on SNP effects from analyses with TP composed of the following: (i) Embrapa genotypic and phenotypic data, (ii) IITA genotypic and phenotypic data, and (iii) the joint datasets. Comparisons on GEBVs' estimation were made considering the hypothetical situation of not having the phenotypic characterization for a set of clones for a certain research institute/country and might need to use the markers' effects that were trained with data from other research institutes/country's germplasm to estimate their clones' GEBV. Fixation index (FST) among the genetic groups identified within the joint dataset ranged from 0.002 to 0.091. The joint dataset provided an improved accuracy (0.8-0.85) compared to the prediction accuracy of either germplasm's sources individually (0.51-0.67). CC GP proved to have potential use under the present study's scenario, the correlation between GEBVs predicted with TP from Embrapa and IITA was 0.55 for Embrapa's germplasm, whereas for IITA's it was 0.1. This seems to be among the first attempts to evaluate the CC GP in plants. As such, a lot of useful new information was provided on the subject, which can guide new research on this very important and emerging field.
Collapse
Affiliation(s)
- Lívia Gomes Torres
- Department of Plant Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | | | - Alex C. Ogbonna
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY, United States
- Boyce Thompson Institute, Ithaca, NY, United States
| | | | - Lukas A. Mueller
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY, United States
- Boyce Thompson Institute, Ithaca, NY, United States
| | | | | | | | - Marcos Deon Vilela de Resende
- Department of Forestry Engineering, Universidade Federal de Viçosa, Viçosa, Brazil
- Embrapa Café, Universidade Federal de Viçosa, Viçosa, Brazil
| |
Collapse
|
32
|
Michel S, Löschenberger F, Ametz C, Bürstmayr H. Genomic selection of parents and crosses beyond the native gene pool of a breeding program. THE PLANT GENOME 2021; 14:e20153. [PMID: 34651462 DOI: 10.1002/tpg2.20153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 08/03/2021] [Indexed: 06/13/2023]
Abstract
Genomic selection has become a valuable tool for selecting cultivar candidates in many plant breeding programs. Genomic selection of elite parents and crossing combinations with germplasm developed outside a breeding program has, however, hardly been explored until now. The aim of this study was to assess the potential of this method for commonly ranking and selecting elite germplasm developed within and beyond a given breeding program. A winter wheat (Triticum aestivum L.) population consisting of 611 in-house and 87 externally developed lines was used to compare training population compositions and statistical models for genomically predicting baking quality in this framework. Augmenting training populations with lines from other breeding programs had a larger influence on the prediction ability than adding in-house generated lines when aiming to commonly rank both germplasm sets. Exploiting preexisting information of secondary correlated traits resulted likewise in more accurate predictions both in empirical analyses and simulations. Genotyping germplasm developed beyond a given breeding program is moreover a convenient way to clarify its relationships with a breeder's own germplasm because pedigree information is oftentimes not available for this purpose. Genomic predictions can thus support a more informed diversity management, especially when integrating simply to phenotype correlated traits to partly circumvent resource reallocations for a costly phenotyping of germplasm from other programs.
Collapse
Affiliation(s)
- Sebastian Michel
- Dep. of Agrobiotechnology, IFA-Tulln, Univ. of Natural Resources and Life Sciences Vienna, Konrad-Lorenz-Str. 20, 3430 Tulln, Austria
| | | | - Christian Ametz
- Saatzucht Donau GesmbH & CoKG, Saatzuchtstrasse 11, 2301 Probstdorf, Austria
| | - Hermann Bürstmayr
- Dep. of Agrobiotechnology, IFA-Tulln, Univ. of Natural Resources and Life Sciences Vienna, Konrad-Lorenz-Str. 20, 3430 Tulln, Austria
| |
Collapse
|
33
|
Mbinda W, Mukami A. A Review of Recent Advances and Future Directions in the Management of Salinity Stress in Finger Millet. FRONTIERS IN PLANT SCIENCE 2021; 12:734798. [PMID: 34603359 PMCID: PMC8481900 DOI: 10.3389/fpls.2021.734798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 08/16/2021] [Indexed: 06/13/2023]
Abstract
Salinity stress is a major environmental impediment affecting the growth and production of crops. Finger millet is an important cereal grown in many arid and semi-arid areas of the world characterized by erratic rainfall and scarcity of good-quality water. Finger millet salinity stress is caused by the accumulation of soluble salts due to irrigation without a proper drainage system, coupled with the underlying rocks having a high salt content, which leads to the salinization of arable land. This problem is projected to be exacerbated by climate change. The use of new and efficient strategies that provide stable salinity tolerance across a wide range of environments can guarantee sustainable production of finger millet in the future. In this review, we analyze the strategies that have been used for salinity stress management in finger millet production and discuss potential future directions toward the development of salt-tolerant finger millet varieties. This review also describes how advanced biotechnological tools are being used to develop salt-tolerant plants. The biotechnological techniques discussed in this review are simple to implement, have design flexibility, low cost, and highly efficient. This information provides insights into enhancing finger millet salinity tolerance and improving production.
Collapse
Affiliation(s)
- Wilton Mbinda
- Department of Biochemistry and Biotechnology, Pwani University, Kilifi, Kenya
- Pwani University Biosciences Research Centre (PUBReC), Pwani University, Kilifi, Kenya
| | - Asunta Mukami
- Department of Life Sciences, South Eastern Kenya University, Kitui, Kenya
| |
Collapse
|
34
|
Fritsche-Neto R, Galli G, Borges KLR, Costa-Neto G, Alves FC, Sabadin F, Lyra DH, Morais PPP, Braatz de Andrade LR, Granato I, Crossa J. Optimizing Genomic-Enabled Prediction in Small-Scale Maize Hybrid Breeding Programs: A Roadmap Review. FRONTIERS IN PLANT SCIENCE 2021; 12:658267. [PMID: 34276721 PMCID: PMC8281958 DOI: 10.3389/fpls.2021.658267] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Accepted: 05/10/2021] [Indexed: 06/13/2023]
Abstract
The usefulness of genomic prediction (GP) for many animal and plant breeding programs has been highlighted for many studies in the last 20 years. In maize breeding programs, mostly dedicated to delivering more highly adapted and productive hybrids, this approach has been proved successful for both large- and small-scale breeding programs worldwide. Here, we present some of the strategies developed to improve the accuracy of GP in tropical maize, focusing on its use under low budget and small-scale conditions achieved for most of the hybrid breeding programs in developing countries. We highlight the most important outcomes obtained by the University of São Paulo (USP, Brazil) and how they can improve the accuracy of prediction in tropical maize hybrids. Our roadmap starts with the efforts for germplasm characterization, moving on to the practices for mating design, and the selection of the genotypes that are used to compose the training population in field phenotyping trials. Factors including population structure and the importance of non-additive effects (dominance and epistasis) controlling the desired trait are also outlined. Finally, we explain how the source of the molecular markers, environmental, and the modeling of genotype-environment interaction can affect the accuracy of GP. Results of 7 years of research in a public maize hybrid breeding program under tropical conditions are discussed, and with the great advances that have been made, we find that what is yet to come is exciting. The use of open-source software for the quality control of molecular markers, implementing GP, and envirotyping pipelines may reduce costs in an efficient computational manner. We conclude that exploring new models/tools using high-throughput phenotyping data along with large-scale envirotyping may bring more resolution and realism when predicting genotype performances. Despite the initial costs, mostly for genotyping, the GP platforms in combination with these other data sources can be a cost-effective approach for predicting the performance of maize hybrids for a large set of growing conditions.
Collapse
Affiliation(s)
- Roberto Fritsche-Neto
- Laboratory of Allogamous Plant Breeding, Genetics Department, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
| | - Giovanni Galli
- Laboratory of Allogamous Plant Breeding, Genetics Department, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
| | - Karina Lima Reis Borges
- Laboratory of Allogamous Plant Breeding, Genetics Department, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
| | - Germano Costa-Neto
- Laboratory of Allogamous Plant Breeding, Genetics Department, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
| | - Filipe Couto Alves
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI, United States
| | - Felipe Sabadin
- Laboratory of Allogamous Plant Breeding, Genetics Department, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
| | - Danilo Hottis Lyra
- Department of Computational and Analytical Sciences, Rothamsted Research, Harpenden, United Kingdom
| | | | | | - Italo Granato
- Laboratoire d'Ecophysiologie des Plantes sous Stress Environnementaux (LEPSE), Institut National de la Recherche Agronomique (INRA), Univ. Montpellier, SupAgro, Montpellier, France
| | - Jose Crossa
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Carretera México - Veracruz, Texcoco, Mexico
- Colegio de Posgraduado, Montecillo, Mexico
| |
Collapse
|