1
|
Beugnot A, Mary-Huard T, Bauland C, Combes V, Madur D, Lagardère B, Palaffre C, Charcosset A, Moreau L, Fievet JB. Identifying QTLs involved in hybrid performance and heterotic group complementarity: new GWAS models applied to factorial and admixed diallel maize hybrid panels. Theor Appl Genet 2023; 136:219. [PMID: 37816986 PMCID: PMC10564676 DOI: 10.1007/s00122-023-04431-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Accepted: 07/25/2023] [Indexed: 10/12/2023]
Abstract
KEY MESSAGE An original GWAS model integrating the ancestry of alleles was proposed and allowed the detection of background specific additive and dominance QTLs involved in heterotic group complementarity and hybrid performance. Maize genetic diversity is structured into genetic groups selected and improved relative to each other. This process increases group complementarity and differentiation over time and ensures that the hybrids produced from inter-group crosses exhibit high performances and heterosis. To identify loci involved in hybrid performance and heterotic group complementarity, we introduced an original association study model that disentangles allelic effects from the heterotic group origin of the alleles and compared it with a conventional additive/dominance model. This new model was applied on a factorial between Dent and Flint lines and a diallel between Dent-Flint admixed lines with two different layers of analysis: within each environment and in a multiple-environment context. We identified several strong additive QTLs for all traits, including some well-known additive QTLs for flowering time (in the region of Vgt1/2 on chromosome 8). Yield trait displayed significant non-additive effects in the diallel panel. Most of the detected Yield QTLs exhibited overdominance or, more likely, pseudo-overdominance effects. Apparent overdominance at these QTLs contributed to a part of the genetic group complementarity. The comparison between environments revealed a higher stability of additive QTL effects than non-additive ones. Several QTLs showed variations of effects according to the local heterotic group origin. We also revealed large chromosomic regions that display genetic group origin effects. Altogether, our results illustrate how admixed panels combined with dedicated GWAS modeling allow the identification of new QTLs that could not be revealed by a classical hybrid panel analyzed with traditional modeling.
Collapse
Affiliation(s)
- Aurélien Beugnot
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91272, Gif-Sur-Yvette, France
| | - Tristan Mary-Huard
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91272, Gif-Sur-Yvette, France
- Université Paris-Saclay, AgroParisTech, INRAE, UMR MIA Paris-Saclay, 91120, Palaiseau, France
| | - Cyril Bauland
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91272, Gif-Sur-Yvette, France
| | - Valerie Combes
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91272, Gif-Sur-Yvette, France
| | - Delphine Madur
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91272, Gif-Sur-Yvette, France
| | | | | | - Alain Charcosset
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91272, Gif-Sur-Yvette, France
| | - Laurence Moreau
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91272, Gif-Sur-Yvette, France
| | - Julie B Fievet
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91272, Gif-Sur-Yvette, France.
| |
Collapse
|
2
|
Liu R, Cui Y, Kong L, Zheng F, Zhao W, Meng Q, Yuan J, Zhang M, Chen Y. Evaluating the Genetic Background Effect on Dissecting the Genetic Basis of Kernel Traits in Reciprocal Maize Introgression Lines. Genes (Basel) 2023; 14:genes14051044. [PMID: 37239404 DOI: 10.3390/genes14051044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 05/03/2023] [Accepted: 05/04/2023] [Indexed: 05/28/2023] Open
Abstract
Maize yield is mostly determined by its grain size. Although numerous quantitative trait loci (QTL) have been identified for kernel-related traits, the application of these QTL in breeding programs has been strongly hindered because the populations used for QTL mapping are often different from breeding populations. However, the effect of genetic background on the efficiency of QTL and the accuracy of trait genomic prediction has not been fully studied. Here, we used a set of reciprocal introgression lines (ILs) derived from 417F × 517F to evaluate how genetic background affects the detection of QTLassociated with kernel shape traits. A total of 51 QTL for kernel size were identified by chromosome segment lines (CSL) and genome-wide association studies (GWAS) methods. These were subsequently clustered into 13 common QTL based on their physical position, including 7 genetic-background-independent and 6 genetic-background-dependent QTL, respectively. Additionally, different digenic epistatic marker pairs were identified in the 417F and 517F ILs. Therefore, our results demonstrated that genetic background strongly affected not only the kernel size QTL mapping via CSL and GWAS but also the genomic prediction accuracy and epistatic detection, thereby enhancing our understanding of how genetic background affects the genetic dissection of grain size-related traits.
Collapse
Affiliation(s)
- Ruixiang Liu
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Yakun Cui
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Lingjie Kong
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Fei Zheng
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Wenming Zhao
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Qingchang Meng
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Jianhua Yuan
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Meijing Zhang
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Yanping Chen
- Provincial Key Laboratory of Agrobiology, Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| |
Collapse
|
3
|
Schaal KA, Yu YTN, Vasse M, Velicer GJ. Allopatric divergence of cooperators confers cheating resistance and limits effects of a defector mutation. BMC Ecol Evol 2022; 22:141. [PMID: 36510120 PMCID: PMC9746145 DOI: 10.1186/s12862-022-02094-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 11/23/2022] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Social defectors may meet diverse cooperators. Genotype-by-genotype interactions may constrain the ranges of cooperators upon which particular defectors can cheat, limiting cheater spread. Upon starvation, the soil bacterium Myxococcus xanthus cooperatively develops into spore-bearing fruiting bodies, using a complex regulatory network and several intercellular signals. Some strains (cheaters) are unable to sporulate effectively in pure culture due to mutations that reduce signal production but can exploit and outcompete cooperators within mixed groups. RESULTS In this study, interactions between a cheater disrupted at the signaling gene csgA and allopatrically diversified cooperators reveal a very small cheating range. Expectedly, the cheater failed to cheat on all natural-isolate cooperators owing to non-cheater-specific antagonisms. Surprisingly, some lab-evolved cooperators had already exited the csgA mutant's cheating range after accumulating fewer than 20 mutations and without experiencing cheating during evolution. Cooperators might also diversify in the potential for a mutation to reduce expression of a cooperative trait or generate a cheating phenotype. A new csgA mutation constructed in several highly diverged cooperators generated diverse sporulation phenotypes, ranging from a complete defect to no defect, indicating that genetic backgrounds can limit the set of genomes in which a mutation creates a defector. CONCLUSIONS Our results demonstrate that natural populations may feature geographic mosaics of cooperators that have diversified in their susceptibility to particular cheaters, limiting defectors' cheating ranges and preventing them from spreading. This diversification may also lead to variation in the phenotypes generated by any given cooperation-gene mutation, further decreasing the chance of a cheater emerging which threatens the persistence of cooperation in the system.
Collapse
Affiliation(s)
- Kaitlin A. Schaal
- grid.5801.c0000 0001 2156 2780Institute of Integrative Biology, ETH Zürich, 8092 Zurich, Switzerland
| | - Yuen-Tsu Nicco Yu
- grid.5801.c0000 0001 2156 2780Institute of Integrative Biology, ETH Zürich, 8092 Zurich, Switzerland
| | - Marie Vasse
- grid.5801.c0000 0001 2156 2780Institute of Integrative Biology, ETH Zürich, 8092 Zurich, Switzerland ,grid.121334.60000 0001 2097 0141Institute MIVEGEC (UMR 5290 CNRS, IRD, UM), 34394 Montpellier, France
| | - Gregory J. Velicer
- grid.5801.c0000 0001 2156 2780Institute of Integrative Biology, ETH Zürich, 8092 Zurich, Switzerland
| |
Collapse
|
4
|
Santos IGD, Verzegnazzi AL, Edwards J, Frei UK, Boerman N, Tonello Zuffo L, Pires LPM, de La Fuente G, Lübberstedt T. Usefulness of temperate-adapted maize lines developed by doubled haploid and single-seed descent methods. Theor Appl Genet 2022; 135:1829-1841. [PMID: 35305125 DOI: 10.1007/s00122-022-04075-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 03/01/2022] [Indexed: 06/14/2023]
Abstract
Spontaneous haploid genome doubling is not associated with undesirable linkage drag effects. The presence of spontaneous doubling genes allows maximum exploitation of variability from the temperate-adapted BS39 population Tropical non-elite maize (Zea mays L.) germplasm, such as BS39, provides a unique opportunity for broadening the genetic base of U.S. Corn Belt germplasm. In vivo doubled haploid (DH) technology has been used to efficiently exploit non-elite germplasm. It can help to purge deleterious recessive alleles. The objectives of this study were to determine the usefulness of BS39-derived inbred lines using both SSD and DH methods, to determine the impact of spontaneous as compared with artificial haploid genome doubling on genetic variance among BS39-derived DH lines, and to identify SNP markers associated with agronomic traits among BS39 inbreds monitored at testcross level. We developed two sets of inbred lines directly from BS39 by DH and SSD methods, named BS39_DH and BS39_SSD. Additionally, two sets were derived from a cross between BS39 and A427 (SHGD donor) by DH and SSD methods, named BS39 × A427_DH and BS39 × A427_SSD, respectively. Grain yield, moisture, plant height, ear height, stalk lodging, and root lodging were measured to estimate genetic parameters. For genome-wide association analysis, inbred lines were genotyped using genotype-by-sequencing and Diversity Array Technology Sequencing (DArTSeq). Some BS39-derived inbred lines performed better than elite germplasm inbreds and all sets showed significant genetic variance. The presence of spontaneous haploid genome doubling genes did not affect performance of inbred lines. Five SNPs were significant and three of them located within genes related to plant development or abiotic stresses. These results demonstrate the potential of BS39 to add novel alleles to temperate elite germplasm.
Collapse
Affiliation(s)
| | | | - Jode Edwards
- USDA-ARS, Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - Ursula K Frei
- Department of Agronomy, Iowa State University, Ames, IA, USA
| | - Nicholas Boerman
- USDA-ARS, Southern Plains Range Research Station, Woodward, OK, USA
| | - Leandro Tonello Zuffo
- Department of Plant Sciences, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | | | | | | |
Collapse
|
5
|
Hartmann K, Seweryn M, Sadee W. Interpreting coronary artery disease GWAS results: A functional genomics approach assessing biological significance. PLoS One 2022; 17:e0244904. [PMID: 35192625 PMCID: PMC8863290 DOI: 10.1371/journal.pone.0244904] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 01/01/2022] [Indexed: 01/09/2023] Open
Abstract
Genome-wide association studies (GWAS) have implicated 58 loci in coronary artery disease (CAD). However, the biological basis for these associations, the relevant genes, and causative variants often remain uncertain. Since the vast majority of GWAS loci reside outside coding regions, most exert regulatory functions. Here we explore the complexity of each of these loci, using tissue specific RNA sequencing data from GTEx to identify genes that exhibit altered expression patterns in the context of GWAS-significant loci, expanding the list of candidate genes from the 75 currently annotated by GWAS to 245, with almost half of these transcripts being non-coding. Tissue specific allelic expression imbalance data, also from GTEx, allows us to uncover GWAS variants that mark functional variation in a locus, e.g., rs7528419 residing in the SORT1 locus, in liver specifically, and rs72689147 in the GUYC1A1 locus, across a variety of tissues. We consider the GWAS variant rs1412444 in the LIPA locus in more detail as an example, probing tissue and transcript specific effects of genetic variation in the region. By evaluating linkage disequilibrium (LD) between tissue specific eQTLs, we reveal evidence for multiple functional variants within loci. We identify 3 variants (rs1412444, rs1051338, rs2250781) that when considered together, each improve the ability to account for LIPA gene expression, suggesting multiple interacting factors. These results refine the assignment of 58 GWAS loci to likely causative variants in a handful of cases and for the remainder help to re-prioritize associated genes and RNA isoforms, suggesting that ncRNAs maybe a relevant transcript in almost half of CAD GWAS results. Our findings support a multi-factorial system where a single variant can influence multiple genes and each genes is regulated by multiple variants.
Collapse
Affiliation(s)
- Katherine Hartmann
- Department of Cancer Biology and Genetics, Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, United States of America
- * E-mail:
| | - Michał Seweryn
- Biobank Lab, Department of Molecular Biophysics, University of Lodz, Lodz, Poland
| | - Wolfgang Sadee
- Department of Cancer Biology and Genetics, Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, United States of America
| |
Collapse
|
6
|
Odell SG, Hudson AI, Praud S, Dubreuil P, Tixier MH, Ross-Ibarra J, Runcie DE. Modeling allelic diversity of multiparent mapping populations affects detection of quantitative trait loci. G3 (Bethesda) 2022; 12:6509518. [PMID: 35100382 PMCID: PMC8895984 DOI: 10.1093/g3journal/jkac011] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 10/27/2021] [Indexed: 12/02/2022]
Abstract
The search for quantitative trait loci that explain complex traits such as yield and drought tolerance has been ongoing in all crops. Methods such as biparental quantitative trait loci mapping and genome-wide association studies each have their own advantages and limitations. Multiparent advanced generation intercross populations contain more recombination events and genetic diversity than biparental mapping populations and are better able to estimate effect sizes of rare alleles than association mapping populations. Here, we discuss the results of using a multiparent advanced generation intercross population of doubled haploid maize lines created from 16 diverse founders to perform quantitative trait loci mapping. We compare 3 models that assume bi-allelic, founder, and ancestral haplotype allelic states for quantitative trait loci. The 3 methods have differing power to detect quantitative trait loci for a variety of agronomic traits. Although the founder approach finds the most quantitative trait loci, all methods are able to find unique quantitative trait loci, suggesting that each model has advantages for traits with different genetic architectures. A closer look at a well-characterized flowering time quantitative trait loci, qDTA8, which contains vgt1, highlights the strengths and weaknesses of each method and suggests a potential epistatic interaction. Overall, our results reinforce the importance of considering different approaches to analyzing genotypic datasets, and shows the limitations of binary SNP data for identifying multiallelic quantitative trait loci.
Collapse
Affiliation(s)
- Sarah G Odell
- Department of Plant Sciences, University of California, Davis, CA 95616, USA.,Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | - Asher I Hudson
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA.,Center for Population Biology, University of California, Davis, CA 95616, USA
| | - Sébastien Praud
- Limagrain, Centre de Recherche de Chappes, Chappes 63720, France
| | - Pierre Dubreuil
- Limagrain, Centre de Recherche de Chappes, Chappes 63720, France
| | | | - Jeffrey Ross-Ibarra
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA.,Center for Population Biology, University of California, Davis, CA 95616, USA.,Genome Center, University of California, Davis, CA 95616, USA
| | - Daniel E Runcie
- Department of Plant Sciences, University of California, Davis, CA 95616, USA
| |
Collapse
|
7
|
Brault C, Segura V, This P, Le Cunff L, Flutre T, François P, Pons T, Péros JP, Doligez A. Across-population genomic prediction in grapevine opens up promising prospects for breeding. Hortic Res 2022; 9:uhac041. [PMID: 35184162 PMCID: PMC9070645 DOI: 10.1093/hr/uhac041] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 02/01/2022] [Indexed: 05/15/2023]
Abstract
Crop breeding involves two selection steps: choosing progenitors and selecting individuals within progenies. Genomic prediction, based on genome-wide marker estimation of genetic values, could facilitate these steps. However, its potential usefulness in grapevine (Vitis vinifera L.) has only been evaluated in non-breeding contexts mainly through cross-validation within a single population. We tested across-population genomic prediction in a more realistic breeding configuration, from a diversity panel to ten bi-parental crosses connected within a half-diallel mating design. Prediction quality was evaluated over 15 traits of interest (related to yield, berry composition, phenology and vigour), for both the average genetic value of each cross (cross mean) and the genetic values of individuals within each cross (individual values). Genomic prediction in these conditions was found useful: for cross mean, average per-trait predictive ability was 0.6, while per-cross predictive ability was halved on average, but reached a maximum of 0.7. Mean predictive ability for individual values within crosses was 0.26, about half the within-half-diallel value taken as a reference. For some traits and/or crosses, these across-population predictive ability values are promising for implementing genomic selection in grapevine breeding. This study also provided key insights on variables affecting predictive ability. Per-cross predictive ability was well predicted by genetic distance between parents and when this predictive ability was below 0.6, it was improved by training set optimization. For individual values, predictive ability mostly depended on trait-related variables (magnitude of the cross effect and heritability). These results will greatly help designing grapevine breeding programs assisted by genomic prediction.
Collapse
Affiliation(s)
- Charlotte Brault
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
- Institut Français de la Vigne et du Vin, F-34398 Montpellier, France
| | - Vincent Segura
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Patrice This
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Loïc Le Cunff
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
- Institut Français de la Vigne et du Vin, F-34398 Montpellier, France
| | - Timothée Flutre
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE – Le Moulon, 91190, Gif-sur-Yvette, France
| | - Pierre François
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Thierry Pons
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Jean-Pierre Péros
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Agnès Doligez
- UMT Geno-Vigne®, IFV-INRAE-Institut Agro, F-34398 Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| |
Collapse
|
8
|
Roth M, Beugnot A, Mary-Huard T, Moreau L, Charcosset A, Fiévet JB. Improving genomic predictions with inbreeding and nonadditive effects in two admixed maize hybrid populations in single and multienvironment contexts. Genetics 2022; 220:6527635. [PMID: 35150258 PMCID: PMC8982028 DOI: 10.1093/genetics/iyac018] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Accepted: 01/28/2022] [Indexed: 11/12/2022] Open
Abstract
Genetic admixture, resulting from the recombination between structural groups, is frequently encountered in breeding populations. In hybrid breeding, crossing admixed lines can generate substantial nonadditive genetic variance and contrasted levels of inbreeding which can impact trait variation. This study aimed at testing recent methodological developments for the modeling of inbreeding and nonadditive effects in order to increase prediction accuracy in admixed populations. Using two maize (Zea mays L.) populations of hybrids admixed between dent and flint heterotic groups, we compared a suite of five genomic prediction models incorporating (or not) parameters accounting for inbreeding and nonadditive effects with the natural and orthogonal interaction approach in single and multienvironment contexts. In both populations, variance decompositions showed the strong impact of inbreeding on plant yield, height, and flowering time which was supported by the superiority of prediction models incorporating this effect (+0.038 in predictive ability for mean yield). In most cases dominance variance was reduced when inbreeding was accounted for. The model including additivity, dominance, epistasis, and inbreeding effects appeared to be the most robust for prediction across traits and populations (+0.054 in predictive ability for mean yield). In a multienvironment context, we found that the inclusion of nonadditive and inbreeding effects was advantageous when predicting hybrids not yet observed in any environment. Overall, comparing variance decompositions was helpful to guide model selection for genomic prediction. Finally, we recommend the use of models including inbreeding and nonadditive parameters following the natural and orthogonal interaction approach to increase prediction accuracy in admixed populations.
Collapse
Affiliation(s)
- Morgane Roth
- Plant Breeding Research Division, Agroscope, Wädenswil, 8820 Zurich, Switzerland,Corresponding author: INRAE GAFL, 67 Allée des Chênes 84140 Montfavet, France.
| | - Aurélien Beugnot
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| | - Tristan Mary-Huard
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91190 Gif-sur-Yvette, France,Université Paris-Saclay, INRAE, AgroParisTech, UMR MIA-Paris Paris, 75005 Paris, France
| | - Laurence Moreau
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| | - Alain Charcosset
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| | - Julie B Fiévet
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, UMR GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| |
Collapse
|
9
|
Affiliation(s)
- Kenneth Aase
- Centre for Biodiversity Dynamics, Department of Biology Norwegian University of Science and Technology Trondheim Norway
| | - Henrik Jensen
- Centre for Biodiversity Dynamics, Department of Biology Norwegian University of Science and Technology Trondheim Norway
| | - Stefanie Muff
- Centre for Biodiversity Dynamics, Department of Biology Norwegian University of Science and Technology Trondheim Norway
- Department of Mathematical Sciences, Norwegian University of Science and Technology Trondheim Norway
| |
Collapse
|
10
|
Litvinov DY, Karlov GI, Divashuk MG. Metabolomics for Crop Breeding: General Considerations. Genes (Basel) 2021; 12:1602. [PMID: 34680996 DOI: 10.3390/genes12101602] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 10/05/2021] [Accepted: 10/08/2021] [Indexed: 12/16/2022] Open
Abstract
The development of new, more productive varieties of agricultural crops is becoming an increasingly difficult task. Modern approaches for the identification of beneficial alleles and their use in elite cultivars, such as quantitative trait loci (QTL) mapping and marker-assisted selection (MAS), are effective but insufficient for keeping pace with the improvement of wheat or other crops. Metabolomics is a powerful but underutilized approach that can assist crop breeding. In this review, basic methodological information is summarized, and the current strategies of applications of metabolomics related to crop breeding are explored using recent examples. We briefly describe classes of plant metabolites, cellular localization of metabolic pathways, and the strengths and weaknesses of the main metabolomics technique. Among the commercialized genetically modified crops, about 50 with altered metabolic enzyme activities have been identified in the International Service for the Acquisition of Agri-biotech Applications (ISAAA) database. These plants are reviewed as encouraging examples of the application of knowledge of biochemical pathways. Based on the recent examples of metabolomic studies, we discuss the performance of metabolic markers, the integration of metabolic and genomic data in metabolic QTLs (mQTLs) and metabolic genome-wide association studies (mGWAS). The elucidation of metabolic pathways and involved genes will help in crop breeding and the introgression of alleles of wild relatives in a more targeted manner.
Collapse
|
11
|
Powell OM, Voss-Fels KP, Jordan DR, Hammer G, Cooper M. Perspectives on Applications of Hierarchical Gene-To-Phenotype (G2P) Maps to Capture Non-stationary Effects of Alleles in Genomic Prediction. Front Plant Sci 2021; 12:663565. [PMID: 34149761 PMCID: PMC8211918 DOI: 10.3389/fpls.2021.663565] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Accepted: 04/13/2021] [Indexed: 05/26/2023]
Abstract
Genomic prediction of complex traits across environments, breeding cycles, and populations remains a challenge for plant breeding. A potential explanation for this is that underlying non-additive genetic (GxG) and genotype-by-environment (GxE) interactions generate allele substitution effects that are non-stationary across different contexts. Such non-stationary effects of alleles are either ignored or assumed to be implicitly captured by most gene-to-phenotype (G2P) maps used in genomic prediction. The implicit capture of non-stationary effects of alleles requires the G2P map to be re-estimated across different contexts. We discuss the development and application of hierarchical G2P maps that explicitly capture non-stationary effects of alleles and have successfully increased short-term prediction accuracy in plant breeding. These hierarchical G2P maps achieve increases in prediction accuracy by allowing intermediate processes such as other traits and environmental factors and their interactions to contribute to complex trait variation. However, long-term prediction remains a challenge. The plant breeding community should undertake complementary simulation and empirical experiments to interrogate various hierarchical G2P maps that connect GxG and GxE interactions simultaneously. The existing genetic correlation framework can be used to assess the magnitude of non-stationary effects of alleles and the predictive ability of these hierarchical G2P maps in long-term, multi-context genomic predictions of complex traits in plant breeding.
Collapse
Affiliation(s)
- Owen M. Powell
- Queensland Alliance for Agriculture and Food Innovation, Centre for Crop Science, The University of Queensland, St Lucia, QLD, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, St Lucia, QLD, Australia
| | - Kai P. Voss-Fels
- Queensland Alliance for Agriculture and Food Innovation, Centre for Crop Science, The University of Queensland, St Lucia, QLD, Australia
| | - David R. Jordan
- Queensland Alliance for Agriculture and Food Innovation, Hermitage Research Facility, The University of Queensland, Warwick, QLD, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, St Lucia, QLD, Australia
| | - Graeme Hammer
- Queensland Alliance for Agriculture and Food Innovation, Centre for Crop Science, The University of Queensland, St Lucia, QLD, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, St Lucia, QLD, Australia
| | - Mark Cooper
- Queensland Alliance for Agriculture and Food Innovation, Centre for Crop Science, The University of Queensland, St Lucia, QLD, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, St Lucia, QLD, Australia
| |
Collapse
|
12
|
Diaz-Gallo LM, Brynedal B, Westerlind H, Sandberg R, Ramsköld D. Understanding interactions between risk factors, and assessing the utility of the additive and multiplicative models through simulations. PLoS One 2021; 16:e0250282. [PMID: 33901204 PMCID: PMC8075235 DOI: 10.1371/journal.pone.0250282] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 04/02/2021] [Indexed: 01/04/2023] Open
Abstract
Understanding the genetic background of complex diseases requires the expansion of studies beyond univariate associations. Therefore, it is important to use interaction assessments of risk factors in order to discover whether, and how genetic risk variants act together on disease development. The principle of interaction analysis is to explore the magnitude of the combined effect of risk factors on disease causation. In this study, we use simulations to investigate different scenarios of causation to show how the magnitude of the effect of two risk factors interact. We mainly focus on the two most commonly used interaction models, the additive and multiplicative risk scales, since there is often confusion regarding their use and interpretation. Our results show that the combined effect is multiplicative when two risk factors are involved in the same chain of events, an interaction called synergism. Synergism is often described as a deviation from additivity, which is a broader term. Our results also confirm that it is often relevant to estimate additive effect relationships, because they correspond to independent risk factors at low disease prevalence. Importantly, we evaluate the threshold of more than two required risk factors for disease causation, called the multifactorial threshold model. We found a simple mathematical relationship (square root) between the threshold and an additive-to-multiplicative linear effect scale (AMLES), where 0 corresponds to an additive effect and 1 to a multiplicative. We propose AMLES as a metric that could be used to test different effects relationships at the same time, given that it can simultaneously reveal additive, multiplicative and intermediate risk effects relationships. Finally, the utility of our simulation study was demonstrated using real data by analyzing and interpreting gene-gene interaction odds ratios from a rheumatoid arthritis case-control cohort.
Collapse
Affiliation(s)
- Lina-Marcela Diaz-Gallo
- Division of Rheumatology, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet, Karolinska University Hospital, Stockholm, Sweden
| | - Boel Brynedal
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Helga Westerlind
- Clinical Epidemiology Division, Department of Medicine Solna, Karolinska Institutet, Stockholm, Sweden
| | - Rickard Sandberg
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
| | - Daniel Ramsköld
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
- * E-mail:
| |
Collapse
|
13
|
Somineni HK, Nagpal S, Venkateswaran S, Cutler DJ, Okou DT, Haritunians T, Simpson CL, Begum F, Datta LW, Quiros AJ, Seminerio J, Mengesha E, Alexander JS, Baldassano RN, Dudley-Brown S, Cross RK, Dassopoulos T, Denson LA, Dhere TA, Iskandar H, Dryden GW, Hou JK, Hussain SZ, Hyams JS, Isaacs KL, Kader H, Kappelman MD, Katz J, Kellermayer R, Kuemmerle JF, Lazarev M, Li E, Mannon P, Moulton DE, Newberry RD, Patel AS, Pekow J, Saeed SA, Valentine JF, Wang MH, McCauley JL, Abreu MT, Jester T, Molle-Rios Z, Palle S, Scherl EJ, Kwon J, Rioux JD, Duerr RH, Silverberg MS, Zwick ME, Stevens C, Daly MJ, Cho JH, Gibson G, McGovern DP, Brant SR, Kugathasan S. Whole-genome sequencing of African Americans implicates differential genetic architecture in inflammatory bowel disease. Am J Hum Genet 2021; 108:431-445. [PMID: 33600772 PMCID: PMC8008495 DOI: 10.1016/j.ajhg.2021.02.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 02/01/2021] [Indexed: 12/20/2022] Open
Abstract
Whether or not populations diverge with respect to the genetic contribution to risk of specific complex diseases is relevant to understanding the evolution of susceptibility and origins of health disparities. Here, we describe a large-scale whole-genome sequencing study of inflammatory bowel disease encompassing 1,774 affected individuals and 1,644 healthy control Americans with African ancestry (African Americans). Although no new loci for inflammatory bowel disease are discovered at genome-wide significance levels, we identify numerous instances of differential effect sizes in combination with divergent allele frequencies. For example, the major effect at PTGER4 fine maps to a single credible interval of 22 SNPs corresponding to one of four independent associations at the locus in European ancestry individuals but with an elevated odds ratio for Crohn disease in African Americans. A rare variant aggregate analysis implicates Ca2+-binding neuro-immunomodulator CALB2 in ulcerative colitis. Highly significant overall overlap of common variant risk for inflammatory bowel disease susceptibility between individuals with African and European ancestries was observed, with 41 of 241 previously known lead variants replicated and overall correlations in effect sizes of 0.68 for combined inflammatory bowel disease. Nevertheless, subtle differences influence the performance of polygenic risk scores, and we show that ancestry-appropriate weights significantly improve polygenic prediction in the highest percentiles of risk. The median amount of variance explained per locus remains the same in African and European cohorts, providing evidence for compensation of effect sizes as allele frequencies diverge, as expected under a highly polygenic model of disease.
Collapse
|
14
|
Sehgal D, Mondal S, Crespo-Herrera L, Velu G, Juliana P, Huerta-Espino J, Shrestha S, Poland J, Singh R, Dreisigacker S. Haplotype-Based, Genome-Wide Association Study Reveals Stable Genomic Regions for Grain Yield in CIMMYT Spring Bread Wheat. Front Genet 2020; 11:589490. [PMID: 33335539 PMCID: PMC7737720 DOI: 10.3389/fgene.2020.589490] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 10/21/2020] [Indexed: 01/16/2023] Open
Abstract
We untangled key regions of the genetic architecture of grain yield (GY) in CIMMYT spring bread wheat by conducting a haplotype-based, genome-wide association study (GWAS), together with an investigation of epistatic interactions using seven large sets of elite yield trials (EYTs) consisting of a total of 6,461 advanced breeding lines. These lines were phenotyped under irrigated and stress environments in seven growing seasons (2011-2018) and genotyped with genotyping-by-sequencing markers. Genome-wide 519 haplotype blocks were constructed, using a linkage disequilibrium-based approach covering 14,036 Mb in the wheat genome. Haplotype-based GWAS identified 7, 4, 10, and 15 stable (significant in three or more EYTs) associations in irrigated (I), mild drought (MD), severe drought (SD), and heat stress (HS) testing environments, respectively. Considering all EYTs and the four testing environments together, 30 stable associations were deciphered with seven hotspots identified on chromosomes 1A, 1B, 2B, 4A, 5B, 6B, and 7B, where multiple haplotype blocks were associated with GY. Epistatic interactions contributed significantly to the genetic architecture of GY, explaining variation of 3.5-21.1%, 3.7-14.7%, 3.5-20.6%, and 4.4- 23.1% in I, MD, SD, and HS environments, respectively. Our results revealed the intricate genetic architecture of GY, controlled by both main and epistatic effects. The importance of these results for practical applications in the CIMMYT breeding program is discussed.
Collapse
Affiliation(s)
- Deepmala Sehgal
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| | - Suchismita Mondal
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| | | | - Govindan Velu
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| | - Philomin Juliana
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| | | | | | - Jesse Poland
- Kansas State University, Manhattan, KS, United States
| | - Ravi Singh
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| | | |
Collapse
|
15
|
Rio S, Moreau L, Charcosset A, Mary-Huard T. Accounting for Group-Specific Allele Effects and Admixture in Genomic Predictions: Theory and Experimental Evaluation in Maize. Genetics 2020; 216:27-41. [PMID: 32680885 PMCID: PMC7463286 DOI: 10.1534/genetics.120.303278] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 07/10/2020] [Indexed: 02/01/2023] Open
Abstract
Populations structured into genetic groups may display group-specific linkage disequilibrium, mutations, and/or interactions between quantitative trait loci and the genetic background. These factors lead to heterogeneous marker effects affecting the efficiency of genomic prediction, especially for admixed individuals. Such individuals have a genome that is a mosaic of chromosome blocks from different origins, and may be of interest to combine favorable group-specific characteristics. We developed two genomic prediction models adapted to the prediction of admixed individuals in presence of heterogeneous marker effects: multigroup admixed genomic best linear unbiased prediction random individual (MAGBLUP-RI), modeling the ancestry of alleles; and multigroup admixed genomic best linear unbiased prediction random allele effect (MAGBLUP-RAE), modeling group-specific distributions of allele effects. MAGBLUP-RI can estimate the segregation variance generated by admixture while MAGBLUP-RAE can disentangle the variability that is due to main allele effects from the variability that is due to group-specific deviation allele effects. Both models were evaluated for their genomic prediction accuracy using a maize panel including lines from the Dent and Flint groups, along with admixed individuals. Based on simulated traits, both models proved their efficiency to improve genomic prediction accuracy compared to standard GBLUP models. For real traits, a clear gain was observed at low marker densities whereas it became limited at high marker densities. The interest of including admixed individuals in multigroup training sets was confirmed using simulated traits, but was variable using real traits. Both MAGBLUP models and admixed individuals are of interest whenever group-specific SNP allele effects exist.
Collapse
Affiliation(s)
- Simon Rio
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| | - Laurence Moreau
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| | - Alain Charcosset
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| | - Tristan Mary-Huard
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE-Le Moulon, 91190 Gif-sur-Yvette, France
- MIA, INRAE, AgroParisTech, Université Paris-Saclay, 75005 Paris, France
| |
Collapse
|