1
|
Martins FB, Aono AH, Moraes ADCL, Ferreira RCU, Vilela MDM, Pessoa-Filho M, Rodrigues-Motta M, Simeão RM, de Souza AP. Genome-wide family prediction unveils molecular mechanisms underlying the regulation of agronomic traits in Urochloa ruziziensis. FRONTIERS IN PLANT SCIENCE 2023; 14:1303417. [PMID: 38148869 PMCID: PMC10749977 DOI: 10.3389/fpls.2023.1303417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 11/15/2023] [Indexed: 12/28/2023]
Abstract
Tropical forage grasses, particularly those belonging to the Urochloa genus, play a crucial role in cattle production and serve as the main food source for animals in tropical and subtropical regions. The majority of these species are apomictic and tetraploid, highlighting the significance of U. ruziziensis, a sexual diploid species that can be tetraploidized for use in interspecific crosses with apomictic species. As a means to support breeding programs, our study investigates the feasibility of genome-wide family prediction in U. ruziziensis families to predict agronomic traits. Fifty half-sibling families were assessed for green matter yield, dry matter yield, regrowth capacity, leaf dry matter, and stem dry matter across different clippings established in contrasting seasons with varying available water capacity. Genotyping was performed using a genotyping-by-sequencing approach based on DNA samples from family pools. In addition to conventional genomic prediction methods, machine learning and feature selection algorithms were employed to reduce the necessary number of markers for prediction and enhance predictive accuracy across phenotypes. To explore the regulation of agronomic traits, our study evaluated the significance of selected markers for prediction using a tree-based approach, potentially linking these regions to quantitative trait loci (QTLs). In a multiomic approach, genes from the species transcriptome were mapped and correlated to those markers. A gene coexpression network was modeled with gene expression estimates from a diverse set of U. ruziziensis genotypes, enabling a comprehensive investigation of molecular mechanisms associated with these regions. The heritabilities of the evaluated traits ranged from 0.44 to 0.92. A total of 28,106 filtered SNPs were used to predict phenotypic measurements, achieving a mean predictive ability of 0.762. By employing feature selection techniques, we could reduce the dimensionality of SNP datasets, revealing potential genotype-phenotype associations. The functional annotation of genes near these markers revealed associations with auxin transport and biosynthesis of lignin, flavonol, and folic acid. Further exploration with the gene coexpression network uncovered associations with DNA metabolism, stress response, and circadian rhythm. These genes and regions represent important targets for expanding our understanding of the metabolic regulation of agronomic traits and offer valuable insights applicable to species breeding. Our work represents an innovative contribution to molecular breeding techniques for tropical forages, presenting a viable marker-assisted breeding approach and identifying target regions for future molecular studies on these agronomic traits.
Collapse
Affiliation(s)
- Felipe Bitencourt Martins
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Alexandre Hild Aono
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Aline da Costa Lima Moraes
- Department of Plant Biology, Biology Institute, University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | | | | | - Marco Pessoa-Filho
- Embrapa Cerrados, Brazilian Agricultural Research Corporation, Brasília, Brazil
| | | | - Rosangela Maria Simeão
- Embrapa Gado de Corte, Brazilian Agricultural Research Corporation, Campo Grande, Mato Grosso, Brazil
| | - Anete Pereira de Souza
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
- Department of Plant Biology, Biology Institute, University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| |
Collapse
|
2
|
Maulana F, Perumal R, Serba DD, Tesso T. Genomic prediction of hybrid performance in grain sorghum ( Sorghum bicolor L.). FRONTIERS IN PLANT SCIENCE 2023; 14:1139896. [PMID: 37180401 PMCID: PMC10167770 DOI: 10.3389/fpls.2023.1139896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Accepted: 03/22/2023] [Indexed: 05/16/2023]
Abstract
Genomic selection is expected to improve selection efficiency and genetic gain in breeding programs. The objective of this study was to assess the efficacy of predicting the performance of grain sorghum hybrids using genomic information of parental genotypes. One hundred and two public sorghum inbred parents were genotyped using genotyping-by-sequencing. Ninty-nine of the inbreds were crossed to three tester female parents generating a total of 204 hybrids for evaluation at two environments. The hybrids were sorted in to three sets of 77,59 and 68 and evaluated along with two commercial checks using a randomized complete block design in three replications. The sequence analysis generated 66,265 SNP markers that were used to predict the performance of 204 F1 hybrids resulted from crosses between the parents. Both additive (partial model) and additive and dominance (full model) were constructed and tested using various training population (TP) sizes and cross-validation procedures. Increasing TP size from 41 to 163 increased prediction accuracies for all traits. With the partial model, the five-fold cross validated prediction accuracies ranged from 0.03 for thousand kernel weight (TKW) to 0.58 for grain yield (GY) while it ranged from 0.06 for TKW to 0.67 for GY with the full model. The results suggest that genomic prediction could become an effective tool for predicting the performance of sorghum hybrids based on parental genotypes.
Collapse
Affiliation(s)
- Frank Maulana
- Department of Agronomy, Kansas State University, Manhattan, KS, United States
| | - Ramasamy Perumal
- Kansas State University, Agricultural Research Center, Hays, KS, United States
| | - Desalegn D. Serba
- United States Department of Agriculture-Agricultural Research Service (USDA-ARS), U.S. Arid Land Agricultural Research Center, Maricopa, AZ, United States
| | - Tesfaye Tesso
- Department of Agronomy, Kansas State University, Manhattan, KS, United States
| |
Collapse
|
3
|
Assessment of the Genetic Diversity and Population Structure of the Peruvian Andean Legume, Tarwi (Lupinus mutabilis), with High Quality SNPs. DIVERSITY 2023. [DOI: 10.3390/d15030437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2023]
Abstract
Lupinus mutabilis Sweet (Fabaceae), “tarwi” or “chocho”, is an important grain legume in the Andean region. In Peru, studies on tarwi have mainly focused on morphological features; however, they have not been molecularly characterized. Currently, it is possible to explore the genetic parameters of plants with reliable and modern methods such as genotyping by sequencing (GBS). Here, for the first time, we used single nucleotide polymorphism (SNP) markers to infer the genetic diversity and population structure of 89 accessions of tarwi from nine Andean regions of Peru. A total of 5922 SNPs distributed along all chromosomes of tarwi were identified. STRUCTURE analysis revealed that this crop is grouped into two clusters. A dendrogram was generated using the UPGMA clustering algorithm and, like the principal coordinate analysis (PCoA), it showed two groups that correspond to the geographic origin of the tarwi samples. AMOVA showed a reduced variation between clusters (7.59%) and indicated that variability within populations is 92.41%. Population divergence (Fst) between clusters 1 and 2 revealed low genetic difference (0.019). We also detected a negative Fis for both populations, demonstrating that, like other Lupinus species, tarwi also depends on cross-pollination. SNP markers were powerful and effective for the genotyping process in this germplasm. We hope that this information is the beginning of the path towards a modern genetic improvement and conservation strategies of this important Andean legume.
Collapse
|
4
|
Fernández-González J, Akdemir D, Isidro Y Sánchez J. A comparison of methods for training population optimization in genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:30. [PMID: 36892603 PMCID: PMC9998580 DOI: 10.1007/s00122-023-04265-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 11/21/2022] [Indexed: 06/18/2023]
Abstract
Maximizing CDmean and Avg_GRM_self were the best criteria for training set optimization. A training set size of 50-55% (targeted) or 65-85% (untargeted) is needed to obtain 95% of the accuracy. With the advent of genomic selection (GS) as a widespread breeding tool, mechanisms to efficiently design an optimal training set for GS models became more relevant, since they allow maximizing the accuracy while minimizing the phenotyping costs. The literature described many training set optimization methods, but there is a lack of a comprehensive comparison among them. This work aimed to provide an extensive benchmark among optimization methods and optimal training set size by testing a wide range of them in seven datasets, six different species, different genetic architectures, population structure, heritabilities, and with several GS models to provide some guidelines about their application in breeding programs. Our results showed that targeted optimization (uses information from the test set) performed better than untargeted (does not use test set data), especially when heritability was low. The mean coefficient of determination was the best targeted method, although it was computationally intensive. Minimizing the average relationship within the training set was the best strategy for untargeted optimization. Regarding the optimal training set size, maximum accuracy was obtained when the training set was the entire candidate set. Nevertheless, a 50-55% of the candidate set was enough to reach 95-100% of the maximum accuracy in the targeted scenario, while we needed a 65-85% for untargeted optimization. Our results also suggested that a diverse training set makes GS robust against population structure, while including clustering information was less effective. The choice of the GS model did not have a significant influence on the prediction accuracies.
Collapse
Affiliation(s)
- Javier Fernández-González
- Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Madrid, Spain.
| | - Deniz Akdemir
- CIBMTR (Center for International Blood and Marrow Transplant Research), National Marrow Donor Program/Be The Match, Minneapolis, USA
| | - Julio Isidro Y Sánchez
- Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Madrid, Spain.
| |
Collapse
|
5
|
Genetic diversity and local adaption of alfalfa populations (Medicago sativa L.) under long-term grazing. Sci Rep 2023; 13:1632. [PMID: 36717619 PMCID: PMC9886962 DOI: 10.1038/s41598-023-28521-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 01/19/2023] [Indexed: 02/01/2023] Open
Abstract
Genomic information on alfalfa adaptation to long-term grazing is useful for alfalfa genetic improvement. In this study, 14 alfalfa populations were collected from long-term grazing sites (> 25 years) across four soil zones in western Canada. Alfalfa cultivars released between 1926 and 1980 were used to compare degree of genetic variation of the 14 populations. Six agro-morphological and three nutritive value traits were evaluated from 2018 to 2020. The genotyping-by-sequencing (GBS) data of the alfalfa populations and environmental data were used for genotype-environment association (GEA). Both STRUCTURE and UPGMA based on 19,853 SNPs showed that the 14 alfalfa populations from long-term grazing sites had varying levels of parentages from alfalfa sub-species Medicago sativa and M. falcata. The linear regression of STRUCTURE membership probability on phenotypic data indicated genetic variations of forage dry matter yield, spring vigor and plant height were low, but genetic variations of regrowth, fall plant height, days to flower and crude protein were still high for the 14 alfalfa populations from long-term grazing sites. The GEA identified 31 SNPs associated with 13 candidate genes that were mainly associated with six environmental factors of. Candidate genes underlying environmental factors were associated with a variety of proteins, which were involved in plant responses to abiotic stresses, i.e., drought, cold and salinity-alkali stresses.
Collapse
|
6
|
Feldmann MJ, Piepho HP, Knapp SJ. Average semivariance directly yields accurate estimates of the genomic variance in complex trait analyses. G3 GENES|GENOMES|GENETICS 2022; 12:6571389. [PMID: 35442424 PMCID: PMC9157152 DOI: 10.1093/g3journal/jkac080] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Accepted: 03/17/2022] [Indexed: 11/23/2022]
Abstract
Many important traits in plants, animals, and microbes are polygenic and challenging to improve through traditional marker-assisted selection. Genomic prediction addresses this by incorporating all genetic data in a mixed model framework. The primary method for predicting breeding values is genomic best linear unbiased prediction, which uses the realized genomic relationship or kinship matrix (K) to connect genotype to phenotype. Genomic relationship matrices share information among entries to estimate the observed entries’ genetic values and predict unobserved entries’ genetic values. One of the main parameters of such models is genomic variance (σg2), or the variance of a trait associated with a genome-wide sample of DNA polymorphisms, and genomic heritability (hg2); however, the seminal papers introducing different forms of K often do not discuss their effects on the model estimated variance components despite their importance in genetic research and breeding. Here, we discuss the effect of several standard methods for calculating the genomic relationship matrix on estimates of σg2 and hg2. With current approaches, we found that the genomic variance tends to be either overestimated or underestimated depending on the scaling and centering applied to the marker matrix (Z), the value of the average diagonal element of K, and the assortment of alleles and heterozygosity (H) in the observed population. Using the average semivariance, we propose a new matrix, KASV, that directly yields accurate estimates of σg2 and hg2 in the observed population and produces best linear unbiased predictors equivalent to routine methods in plants and animals.
Collapse
Affiliation(s)
- Mitchell J Feldmann
- Department of Plant Sciences, University of California , Davis, CA 95616, USA
| | - Hans-Peter Piepho
- Biostatistics Unit, Institute of Crop Science, University of Hohenheim , 70593 Stuttgart, Germany
| | - Steven J Knapp
- Department of Plant Sciences, University of California , Davis, CA 95616, USA
| |
Collapse
|
7
|
Sutherland J, Bell T, Trexler RV, Carlson JE, Lasky JR. Host genomic influence on bacterial composition in the switchgrass rhizosphere. Mol Ecol 2022; 31:3934-3950. [PMID: 35621390 PMCID: PMC10150372 DOI: 10.1111/mec.16549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 05/20/2022] [Accepted: 05/24/2022] [Indexed: 11/28/2022]
Abstract
Host genetic variation can shape the diversity and composition of associated microbiomes, which may reciprocally influence host traits and performance. While the genetic basis of phenotypic diversity of plant populations in nature has been studied, comparatively little research has investigated the genetics of host effects on their associated microbiomes. Switchgrass (Panicum virgatum) is a highly outcrossing, perennial, grass species with substantial locally adaptive diversity across its native North American range. Here, we compared 383 switchgrass accessions in a common garden to determine the host genotypic influence on rhizosphere bacterial composition. We hypothesized that the composition and diversity of rhizosphere bacterial assemblages would differentiate due to genotypic differences between hosts (potentially due to root phenotypes and associated life history variation). We observed higher alpha diversity of bacteria associated with upland ecotypes and tetraploids, compared to lowland ecotypes and octoploids, respectively. Alpha diversity correlated negatively with flowering time and plant height, indicating that bacterial composition varies along switchgrass life history axes. Narrow-sense heritability (h2 ) of the relative abundance of twenty-one core bacterial families was observed. Overall compositional differences among tetraploids, due to genetic variation, supports wide-spread genotypic influence on the rhizosphere microbiome. Tetraploids were only considered due to complexities associated with the octoploid genomes. Lastly, a genome-wide association study identified 1,861 single-nucleotide polymorphisms associated with 110 families and genes containing them related to potential regulatory functions. Our findings suggest that switchgrass genomic and life-history variation influences bacterial composition in the rhizosphere, potentially due to host adaptation to local environments.
Collapse
Affiliation(s)
- Jeremy Sutherland
- Department of Plant Pathology and Environmental Microbiology, The Pennsylvania State University, University Park, PA, USA.,Intercollege Graduate Degree Program in Bioinformatics and Genomics, The Pennsylvania State University, University Park, PA, USA.,Department of Biology, The Pennsylvania State University, University Park, PA, USA
| | - Terrence Bell
- Department of Plant Pathology and Environmental Microbiology, The Pennsylvania State University, University Park, PA, USA.,Intercollege Graduate Degree Program in Bioinformatics and Genomics, The Pennsylvania State University, University Park, PA, USA.,Intercollege Graduate Degree Program in Ecology, The Pennsylvania State University, University Park, PA, USA
| | - Ryan V Trexler
- Intercollege Graduate Degree Program in Ecology, The Pennsylvania State University, University Park, PA, USA.,Department of Ecosystem Science and Management, The Pennsylvania State University, University Park, PA, USA
| | - John E Carlson
- Intercollege Graduate Degree Program in Bioinformatics and Genomics, The Pennsylvania State University, University Park, PA, USA.,Department of Ecosystem Science and Management, The Pennsylvania State University, University Park, PA, USA
| | - Jesse R Lasky
- Department of Biology, The Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
8
|
Songsomboon K, Crawford R, Crawford J, Hansen J, Cummings J, Mattson N, Bergstrom GC, Viands DR. Genome-Wide Associations with Resistance to Bipolaris Leaf Spot (Bipolaris oryzae (Breda de Haan) Shoemaker) in a Northern Switchgrass Population (Panicum virgatum L.). PLANTS 2022; 11:plants11101362. [PMID: 35631787 PMCID: PMC9144872 DOI: 10.3390/plants11101362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 05/17/2022] [Accepted: 05/17/2022] [Indexed: 11/18/2022]
Abstract
Switchgrass (Panicum virgatum L.), a northern native perennial grass, suffers from yield reduction from Bipolaris leaf spot caused by Bipolaris oryzae (Breda de Haan) Shoemaker. This study aimed to determine the resistant populations via multiple phenotyping approaches and identify potential resistance genes from genome-wide association studies (GWAS) in the switchgrass northern association panel. The disease resistance was evaluated from both natural (field evaluations in Ithaca, New York and Phillipsburg, Philadelphia) and artificial inoculations (detached leaf and leaf disk assays). The most resistant populations based on a combination of three phenotyping approaches—detached leaf, leaf disk, and mean from two locations—were ‘SW788’, ‘SW806’, ‘SW802’, ‘SW793’, ‘SW781’, ‘SW797’, ‘SW798’, ‘SW803’, ‘SW795’, ‘SW805’. The GWAS from the association panel showed 27 significant SNPs on 12 chromosomes: 1K, 2K, 2N, 3K, 3N, 4N, 5K, 5N, 6N, 7K, 7N, and 9N. These markers accumulatively explained the phenotypic variance of the resistance ranging from 3.28 to 26.52%. Within linkage disequilibrium of 20 kb, these SNP markers linked with the potential resistance genes included the genes encoding for NBS-LRR, PPR, cell-wall related proteins, homeostatic proteins, anti-apoptotic proteins, and ABC transporter.
Collapse
Affiliation(s)
- Kittikun Songsomboon
- Section of Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA; (R.C.); (J.C.); (J.H.); (D.R.V.)
- Correspondence:
| | - Ryan Crawford
- Section of Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA; (R.C.); (J.C.); (J.H.); (D.R.V.)
| | - Jamie Crawford
- Section of Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA; (R.C.); (J.C.); (J.H.); (D.R.V.)
| | - Julie Hansen
- Section of Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA; (R.C.); (J.C.); (J.H.); (D.R.V.)
| | | | - Neil Mattson
- Section of Horticulture, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA;
| | - Gary C. Bergstrom
- Section of Plant Pathology and Plant-Microbe Biology, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA;
| | - Donald R. Viands
- Section of Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA; (R.C.); (J.C.); (J.H.); (D.R.V.)
| |
Collapse
|
9
|
Tilhou NW, Casler MD. Subsampling and DNA pooling can increase gains through genomic selection in switchgrass. THE PLANT GENOME 2021; 14:e20149. [PMID: 34626166 DOI: 10.1002/tpg2.20149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 07/22/2021] [Indexed: 06/13/2023]
Abstract
Genomic selection (GS) can accelerate breeding cycles in perennial crops such as the bioenergy grass switchgrass (Panicum virgatum L.). The sequencing costs of GS can be reduced by pooling DNA samples in the training population (TP), only sequencing TP phenotypic outliers, or pooling candidate population (CP) samples. These strategies were simulated for two traits (spring vigor and anthesis date) in three breeding populations. Sequencing only the outlier 50% of the TP phenotype distribution resulted in a penalty of <5% of the predictive ability, measured using cross-validation. Predictive ability also decreased when sequencing progressively fewer TP DNA pools, but TPs constructed from only two phenotypically contrasting DNA samples retained a mean of >80% predictive ability relative to individual TP sequencing. Novel group testing methods allowed greater than one CP individual to be screened per sequenced DNA sample but resulted in a predictive ability penalty. To determine the impact of reduced sequencing, genetic gain was calculated for seven GS scenarios with variable sequencing budgets. Reduced TP sequencing and most CP pooling methods were superior to individual sequence-based GS when sequencing resources were restricted (2,000 DNA samples per 5-yr cycle). Only one scenario was superior to individual sequencing when sequencing budgets were large (8,000 DNA samples per 5-yr cycle). This study highlights multiple routes for reduced sequencing costs in GS.
Collapse
Affiliation(s)
- Neal Wepking Tilhou
- Department of Agronomy, University of Wisconsin, 1575 Linden Dr, Madison, WI, 53706, USA
| | - Michael D Casler
- U.S. Dairy Forage Research Center, USDA-ARS, 1925 Linden Dr, Madison, WI, 53706-1108, USA
| |
Collapse
|
10
|
A classic approach for determining genomic prediction accuracy under terminal drought stress and well-watered conditions in wheat landraces and cultivars. PLoS One 2021; 16:e0247824. [PMID: 33667255 PMCID: PMC7935232 DOI: 10.1371/journal.pone.0247824] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 02/12/2021] [Indexed: 11/21/2022] Open
Abstract
The present study aimed to improve the accuracy of genomic prediction of 16 agronomic traits in a diverse bread wheat (Triticum aestivum L.) germplasm under terminal drought stress and well-watered conditions in semi-arid environments. An association panel including 87 bread wheat cultivars and 199 landraces from Iran bread wheat germplasm was planted under two irrigation systems in semi-arid climate zones. The whole association panel was genotyped with 9047 single nucleotide polymorphism markers using the genotyping-by-sequencing method. A number of 23 marker-trait associations were selected for traits under each condition, whereas 17 marker-trait associations were common between terminal drought stress and well-watered conditions. The identified marker-trait associations were mostly single nucleotide polymorphisms with minor allele effects. This study examined the effect of population structure, genomic selection method (ridge regression-best linear unbiased prediction, genomic best-linear unbiased predictions, and Bayesian ridge regression), training set size, and type of marker set on genomic prediction accuracy. The prediction accuracies were low (-0.32) to moderate (0.52). A marker set including 93 significant markers identified through genome-wide association studies with P values ≤ 0.001 increased the genomic prediction accuracy for all traits under both conditions. This study concluded that obtaining the highest genomic prediction accuracy depends on the extent of linkage disequilibrium, the genetic architecture of trait, genetic diversity of the population, and the genomic selection method. The results encouraged the integration of genome-wide association study and genomic selection to enhance genomic prediction accuracy in applied breeding programs.
Collapse
|
11
|
Lloyd JP, Soellner MB, Merajver SD, Li JZ. Impact of between-tissue differences on pan-cancer predictions of drug sensitivity. PLoS Comput Biol 2021; 17:e1008720. [PMID: 33630864 PMCID: PMC7906305 DOI: 10.1371/journal.pcbi.1008720] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 01/18/2021] [Indexed: 11/24/2022] Open
Abstract
Increased availability of drug response and genomics data for many tumor cell lines has accelerated the development of pan-cancer prediction models of drug response. However, it is unclear how much between-tissue differences in drug response and molecular characteristics may contribute to pan-cancer predictions. Also unknown is whether the performance of pan-cancer models could vary by cancer type. Here, we built a series of pan-cancer models using two datasets containing 346 and 504 cell lines, each with MEK inhibitor (MEKi) response and mRNA expression, point mutation, and copy number variation data, and found that, while the tissue-level drug responses are accurately predicted (between-tissue ρ = 0.88–0.98), only 5 of 10 cancer types showed successful within-tissue prediction performance (within-tissue ρ = 0.11–0.64). Between-tissue differences make substantial contributions to the performance of pan-cancer MEKi response predictions, as exclusion of between-tissue signals leads to a decrease in Spearman’s ρ from a range of 0.43–0.62 to 0.30–0.51. In practice, joint analysis of multiple cancer types usually has a larger sample size, hence greater power, than for one cancer type; and we observe that higher accuracy of pan-cancer prediction of MEKi response is almost entirely due to the sample size advantage. Success of pan-cancer prediction reveals how drug response in different cancers may invoke shared regulatory mechanisms despite tissue-specific routes of oncogenesis, yet predictions in different cancer types require flexible incorporation of between-cancer and within-cancer signals. As most datasets in genome sciences contain multiple levels of heterogeneity, careful parsing of group characteristics and within-group, individual variation is essential when making robust inference. One of the central goals for precision oncology is to tailor treatment of individual tumors by their molecular characteristics. While drug response predictions have traditionally been sought within each cancer type, it has long been hoped to develop more robust predictions by jointly considering diverse cancer types. While such pan-cancer approaches have improved in recent years, it remains unclear whether between-tissue differences are contributing to the reported pan-cancer prediction performance. This concern stems from the observation that, when cancer types differ in both molecular features and drug response, strong predictive information can come mainly from differences among tissue types. Our study finds that both between- and within-cancer type signals provide substantial contributions to pan-cancer drug response prediction models, and about half of the cancer types examined are poorly predicted despite strong overall performance across all cancer types. We also find that pan-cancer prediction models perform similarly or better than cancer type-specific models, and in many cases the advantage of pan-cancer models is due to the larger number of samples available for pan-cancer analysis. Our results highlight tissue-of-origin as a key consideration for pan-cancer drug response prediction models, and recommend cancer type-specific considerations when translating pan-cancer prediction models for clinical use.
Collapse
Affiliation(s)
- John P Lloyd
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America.,Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America.,Rogel Cancer Center, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Matthew B Soellner
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America.,Rogel Cancer Center, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Sofia D Merajver
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America.,Rogel Cancer Center, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Jun Z Li
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America.,Rogel Cancer Center, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
12
|
Genomic mechanisms of climate adaptation in polyploid bioenergy switchgrass. Nature 2021; 590:438-444. [PMID: 33505029 PMCID: PMC7886653 DOI: 10.1038/s41586-020-03127-1] [Citation(s) in RCA: 87] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 12/16/2020] [Indexed: 01/30/2023]
Abstract
Long-term climate change and periodic environmental extremes threaten food and fuel security1 and global crop productivity2-4. Although molecular and adaptive breeding strategies can buffer the effects of climatic stress and improve crop resilience5, these approaches require sufficient knowledge of the genes that underlie productivity and adaptation6-knowledge that has been limited to a small number of well-studied model systems. Here we present the assembly and annotation of the large and complex genome of the polyploid bioenergy crop switchgrass (Panicum virgatum). Analysis of biomass and survival among 732 resequenced genotypes, which were grown across 10 common gardens that span 1,800 km of latitude, jointly revealed extensive genomic evidence of climate adaptation. Climate-gene-biomass associations were abundant but varied considerably among deeply diverged gene pools. Furthermore, we found that gene flow accelerated climate adaptation during the postglacial colonization of northern habitats through introgression of alleles from a pre-adapted northern gene pool. The polyploid nature of switchgrass also enhanced adaptive potential through the fractionation of gene function, as there was an increased level of heritable genetic diversity on the nondominant subgenome. In addition to investigating patterns of climate adaptation, the genome resources and gene-trait associations developed here provide breeders with the necessary tools to increase switchgrass yield for the sustainable production of bioenergy.
Collapse
|
13
|
Abstract
A suitable pairwise relatedness estimation is key to genetic studies. Several methods are proposed to compute relatedness in autopolyploids based on molecular data. However, unlike diploids, autopolyploids still need further studies considering scenarios with many linked molecular markers with known dosage. In this study, we provide guidelines for plant geneticists and breeders to access trustworthy pairwise relatedness estimates. To this end, we simulated populations considering different ploidy levels, meiotic pairings patterns, number of loci and alleles, and inbreeding levels. Analysis were performed to access the accuracy of distinct methods and to demonstrate the usefulness of molecular marker in practical situations. Overall, our results suggest that at least 100 effective biallelic molecular markers are required to have good pairwise relatedness estimation if methods based on correlation is used. For this number of loci, current methods based on multiallelic markers show lower performance than biallelic ones. To estimate relatedness in cases of inbreeding or close relationships (as parent-offspring, full-sibs, or half-sibs) is more challenging. Methods to estimate pairwise relatedness based on molecular markers, for different ploidy levels or pedigrees were implemented in the AGHmatrix R package.
Collapse
|
14
|
Shabannejad M, Bihamta MR, Majidi-Hervan E, Alipour H, Ebrahimi A. A simple, cost-effective high-throughput image analysis pipeline improves genomic prediction accuracy for days to maturity in wheat. PLANT METHODS 2020; 16:146. [PMID: 33292394 PMCID: PMC7607823 DOI: 10.1186/s13007-020-00686-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 10/15/2020] [Indexed: 05/05/2023]
Abstract
BACKGROUND High-throughput phenotyping and genomic selection accelerate genetic gain in breeding programs by advances in phenotyping and genotyping methods. This study developed a simple, cost-effective high-throughput image analysis pipeline to quantify digital images taken in a panel of 286 Iran bread wheat accessions under terminal drought stress and well-watered conditions. The color proportion of green to yellow (tolerance ratio) and the color proportion of yellow to green (stress ratio) was assessed for each canopy using the pipeline. The estimated tolerance and stress ratios were used as covariates in the genomic prediction models to evaluate the effect of change in canopy color on the improvement of the genomic prediction accuracy of different agronomic traits in wheat. RESULTS The reliability of the high-throughput image analysis pipeline was proved by three to four times of improvement in the accuracy of genomic predictions for days to maturity with the use of tolerance and stress ratios as covariates in the univariate genomic selection models. The higher prediction accuracies were attained for days to maturity when both tolerance and stress ratios were used as fixed effects in the univariate models. The results of this study indicated that the Bayesian ridge regression and ridge regression-best linear unbiased prediction methods were superior to other genomic prediction methods which were used in this study under terminal drought stress and well-watered conditions, respectively. CONCLUSIONS This study provided a robust, quick, and cost-effective machine learning-enabled image-phenotyping pipeline to improve the genomic prediction accuracy for days to maturity in wheat. The results encouraged the integration of phenomics and genomics in breeding programs.
Collapse
Affiliation(s)
- Morteza Shabannejad
- Department of Plant Breeding and Biotechnology, Faculty of Agricultural Sciences and Food Industries, Science and Research Branch, Islamic Azad University, P.O. Box 14515/775, Tehran, Iran
| | - Mohammad-Reza Bihamta
- Department of Agronomy and Plant Breeding, Faculty of Agricultural Sciences and Engineering, College of Agriculture and Natural Resources, University of Tehran, P.O. Box 4111, Karaj, Alborz, Iran.
| | - Eslam Majidi-Hervan
- Department of Plant Breeding and Biotechnology, Faculty of Agricultural Sciences and Food Industries, Science and Research Branch, Islamic Azad University, P.O. Box 14515/775, Tehran, Iran
| | - Hadi Alipour
- Department of Plant Production and Genetics, Faculty of Agriculture and Natural Resources, Urmia University, P.O. Box 165, Urmia, Iran
| | - Asa Ebrahimi
- Department of Plant Breeding and Biotechnology, Faculty of Agricultural Sciences and Food Industries, Science and Research Branch, Islamic Azad University, P.O. Box 14515/775, Tehran, Iran
| |
Collapse
|
15
|
Baral K, Coulman B, Biligetu B, Fu YB. Advancing crested wheatgrass [Agropyron cristatum (L.) Gaertn.] breeding through genotyping-by-sequencing and genomic selection. PLoS One 2020; 15:e0239609. [PMID: 33031422 PMCID: PMC7544028 DOI: 10.1371/journal.pone.0239609] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Accepted: 09/09/2020] [Indexed: 11/18/2022] Open
Abstract
Crested wheatgrass [Agropyron cristatum (L.) Gaertn.] provides high quality, highly palatable forage for early season grazing. Genetic improvement of crested wheatgrass has been challenged by its complex genome, outcrossing nature, long breeding cycle, and lack of informative molecular markers. Genomic selection (GS) has potential for improving traits of perennial forage species, and genotyping-by-sequencing (GBS) has enabled the development of genome-wide markers in non-model polyploid plants. An attempt was made to explore the utility of GBS and GS in crested wheatgrass breeding. Sequencing and phenotyping 325 genotypes representing 10 diverse breeding lines were performed. Bioinformatics analysis identified 827, 3,616, 14,090 and 46,136 single nucleotide polymorphism markers at 20%, 30%, 40% and 50% missing marker levels, respectively. Four GS models (BayesA, BayesB, BayesCπ, and rrBLUP) were examined for the accuracy of predicting nine agro-morphological and three nutritive value traits. Moderate accuracy (0.20 to 0.32) was obtained for the prediction of heading days, leaf width, plant height, clump diameter, tillers per plant and early spring vigor for genotypes evaluated at Saskatoon, Canada. Similar accuracy (0.29 to 0.35) was obtained for predicting fall regrowth and plant height for genotypes evaluated at Swift Current, Canada. The Bayesian models displayed similar or higher accuracy than rrBLUP. These findings show the feasibility of GS application for a non-model species to advance plant breeding.
Collapse
Affiliation(s)
- Kiran Baral
- Department of Plant Sciences, College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Bruce Coulman
- Department of Plant Sciences, College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Bill Biligetu
- Department of Plant Sciences, College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Yong-Bi Fu
- Plant Gene Resources of Canada, Saskatoon Research and Development Centre, Agriculture and Agri-Food Canada, Saskatoon, Saskatchewan, Canada
| |
Collapse
|
16
|
Arojju SK, Cao M, Zulfi Jahufer MZ, Barrett BA, Faville MJ. Genomic Predictive Ability for Foliar Nutritive Traits in Perennial Ryegrass. G3 (BETHESDA, MD.) 2020; 10:695-708. [PMID: 31792009 PMCID: PMC7003077 DOI: 10.1534/g3.119.400880] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 11/25/2019] [Indexed: 11/24/2022]
Abstract
Forage nutritive value impacts animal nutrition, which underpins livestock productivity, reproduction and health. Genetic improvement for nutritive traits in perennial ryegrass has been limited, as they are typically expensive and time-consuming to measure through conventional methods. Genomic selection is appropriate for such complex and expensive traits, enabling cost-effective prediction of breeding values using genome-wide markers. The aims of the present study were to assess the potential of genomic selection for a range of nutritive traits in a multi-population training set, and to quantify contributions of family, location and family-by-location variance components to trait variation and heritability for nutritive traits. The training set consisted of a total of 517 half-sibling (half-sib) families, from five advanced breeding populations, evaluated in two distinct New Zealand grazing environments. Autumn-harvested samples were analyzed for 18 nutritive traits and maternal parents of the half-sib families were genotyped using genotyping-by-sequencing. Significant (P < 0.05) family variance was detected for all nutritive traits and genomic heritability (h2g ) was moderate to high (0.20 to 0.74). Family-by-location interactions were significant and particularly large for water soluble carbohydrate (WSC), crude fat, phosphorus (P) and crude protein. GBLUP, KGD-GBLUP and BayesCπ genomic prediction models displayed similar predictive ability, estimated by 10-fold cross validation, for all nutritive traits with values ranging from r = 0.16 to 0.45 using phenotypes from across two locations. High predictive ability was observed for the mineral traits sulfur (0.44), sodium (0.45) and magnesium (0.45) and the lowest values were observed for P (0.16), digestibility (0.22) and high molecular weight WSC (0.23). Predictive ability estimates for most nutritive traits were retained when marker number was reduced from one million to as few as 50,000. The moderate to high predictive abilities observed suggests implementation of genomic selection is feasible for most of the nutritive traits examined.
Collapse
Affiliation(s)
- Sai Krishna Arojju
- AgResearch Ltd, Grasslands Research Centre, PB 11008, Palmerston North, New Zealand
| | - Mingshu Cao
- AgResearch Ltd, Grasslands Research Centre, PB 11008, Palmerston North, New Zealand
| | - M Z Zulfi Jahufer
- AgResearch Ltd, Grasslands Research Centre, PB 11008, Palmerston North, New Zealand
| | - Brent A Barrett
- AgResearch Ltd, Grasslands Research Centre, PB 11008, Palmerston North, New Zealand
| | - Marty J Faville
- AgResearch Ltd, Grasslands Research Centre, PB 11008, Palmerston North, New Zealand
| |
Collapse
|
17
|
Qin J, Shi A, Song Q, Li S, Wang F, Cao Y, Ravelombola W, Song Q, Yang C, Zhang M. Genome Wide Association Study and Genomic Selection of Amino Acid Concentrations in Soybean Seeds. FRONTIERS IN PLANT SCIENCE 2019; 10:1445. [PMID: 31803203 PMCID: PMC6873630 DOI: 10.3389/fpls.2019.01445] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 10/17/2019] [Indexed: 05/15/2023]
Abstract
Soybean is a major source of protein for human consumption and animal feed. Releasing new cultivars with high nutritional value is one of the major goals in soybean breeding. To achieve this goal, genome-wide association studies of seed amino acid contents were conducted based on 249 soybean accessions from China, US, Japan, and South Korea. The accessions were evaluated for 15 amino acids and genotyped by sequencing. Significant genetic variation was observed for amino acids among the accessions. Among the 231 single nucleotide polymorphisms (SNPs) significantly associated with variations in amino acid contents, fifteen SNPs localized near 14 candidate genes involving in amino acid metabolism. The amino acids were classified into two groups with five in one group and seven amino acids in the other. Correlation coefficients among the amino acids within each group were high and positive, but the correlation coefficients of amino acids between the two groups were negative. Twenty-five SNP markers associated with multiple amino acids can be used to simultaneously improve multi-amino acid concentration in soybean. Genomic selection analysis of amino acid concentration showed that selection efficiency of amino acids based on the markers significantly associated with all 15 amino acids was higher than that based on random markers or markers only associated with individual amino acid. The identified markers could facilitate selection of soybean varieties with improved seed quality.
Collapse
Affiliation(s)
- Jun Qin
- National Soybean Improvement Center Shijiazhuang Sub-Center, North China Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, Laboratory of Crop Genetics and Breeding of Hebei, Cereal & Oil Crop Institute, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang, China
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Ainong Shi
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Qijian Song
- Soybean Genomics and Improvement Lab, USDA-ARS, Beltsville, MD, United States
| | - Song Li
- Crop and Soil Environmental Science, Virginia Tech, Blacksburg, VA, United States
| | - Fengmin Wang
- National Soybean Improvement Center Shijiazhuang Sub-Center, North China Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, Laboratory of Crop Genetics and Breeding of Hebei, Cereal & Oil Crop Institute, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang, China
| | - Yinghao Cao
- Bioinformatics Center, Allife Medical Science and Technology Co., Ltd, Beijing, China
| | - Waltram Ravelombola
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Qi Song
- Crop and Soil Environmental Science, Virginia Tech, Blacksburg, VA, United States
| | - Chunyan Yang
- National Soybean Improvement Center Shijiazhuang Sub-Center, North China Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, Laboratory of Crop Genetics and Breeding of Hebei, Cereal & Oil Crop Institute, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang, China
| | - Mengchen Zhang
- National Soybean Improvement Center Shijiazhuang Sub-Center, North China Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, Laboratory of Crop Genetics and Breeding of Hebei, Cereal & Oil Crop Institute, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang, China
| |
Collapse
|
18
|
Azodi CB, Bolger E, McCarren A, Roantree M, de Los Campos G, Shiu SH. Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits. G3 (BETHESDA, MD.) 2019; 9:3691-3702. [PMID: 31533955 PMCID: PMC6829122 DOI: 10.1534/g3.119.400498] [Citation(s) in RCA: 65] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Accepted: 09/09/2019] [Indexed: 12/21/2022]
Abstract
The usefulness of genomic prediction in crop and livestock breeding programs has prompted efforts to develop new and improved genomic prediction algorithms, such as artificial neural networks and gradient tree boosting. However, the performance of these algorithms has not been compared in a systematic manner using a wide range of datasets and models. Using data of 18 traits across six plant species with different marker densities and training population sizes, we compared the performance of six linear and six non-linear algorithms. First, we found that hyperparameter selection was necessary for all non-linear algorithms and that feature selection prior to model training was critical for artificial neural networks when the markers greatly outnumbered the number of training lines. Across all species and trait combinations, no one algorithm performed best, however predictions based on a combination of results from multiple algorithms (i.e., ensemble predictions) performed consistently well. While linear and non-linear algorithms performed best for a similar number of traits, the performance of non-linear algorithms vary more between traits. Although artificial neural networks did not perform best for any trait, we identified strategies (i.e., feature selection, seeded starting weights) that boosted their performance to near the level of other algorithms. Our results highlight the importance of algorithm selection for the prediction of trait values.
Collapse
Affiliation(s)
| | - Emily Bolger
- Department of Mathematics, Moravian College, Bethlehem, PA
| | - Andrew McCarren
- Insight Centre for Data Analytics, School of Computing, Dublin City University, Dublin 9, Ireland
| | - Mark Roantree
- Insight Centre for Data Analytics, School of Computing, Dublin City University, Dublin 9, Ireland
| | - Gustavo de Los Campos
- Department of Epidemiology & Biostatistics
- Department of Statistics & Probability
- Institute for Quantitative Health Science and Engineering, and
| | - Shin-Han Shiu
- Department of Plant Biology
- Department of Computational, Mathematics, Science, and Engineering, Michigan State University, East Lansing, MI, 48824
| |
Collapse
|
19
|
de C Lara LA, Santos MF, Jank L, Chiari L, Vilela MDM, Amadeu RR, Dos Santos JPR, Pereira GDS, Zeng ZB, Garcia AAF. Genomic Selection with Allele Dosage in Panicum maximum Jacq. G3 (BETHESDA, MD.) 2019; 9:2463-2475. [PMID: 31171567 PMCID: PMC6686918 DOI: 10.1534/g3.118.200986] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Accepted: 05/23/2019] [Indexed: 12/21/2022]
Abstract
Genomic selection is an efficient approach to get shorter breeding cycles in recurrent selection programs and greater genetic gains with selection of superior individuals. Despite advances in genotyping techniques, genetic studies for polyploid species have been limited to a rough approximation of studies in diploid species. The major challenge is to distinguish the different types of heterozygotes present in polyploid populations. In this work, we evaluated different genomic prediction models applied to a recurrent selection population of 530 genotypes of Panicum maximum, an autotetraploid forage grass. We also investigated the effect of the allele dosage in the prediction, i.e., considering tetraploid (GS-TD) or diploid (GS-DD) allele dosage. A longitudinal linear mixed model was fitted for each one of the six phenotypic traits, considering different covariance matrices for genetic and residual effects. A total of 41,424 genotyping-by-sequencing markers were obtained using 96-plex and Pst1 restriction enzyme, and quantitative genotype calling was performed. Six predictive models were generalized to tetraploid species and predictive ability was estimated by a replicated fivefold cross-validation process. GS-TD and GS-DD models were performed considering 1,223 informative markers. Overall, GS-TD data yielded higher predictive abilities than with GS-DD data. However, different predictive models had similar predictive ability performance. In this work, we provide bioinformatic and modeling guidelines to consider tetraploid dosage and observed that genomic selection may lead to additional gains in recurrent selection program of P. maximum.
Collapse
Affiliation(s)
- Letícia A de C Lara
- Luiz de Queiroz College of Agriculture / University of São Paulo (ESALQ/USP), Piracicaba, SP, Brazil
| | | | - Liana Jank
- Embrapa Beef Cattle, Campo Grande, MS, Brazil, and
| | | | | | - Rodrigo R Amadeu
- Luiz de Queiroz College of Agriculture / University of São Paulo (ESALQ/USP), Piracicaba, SP, Brazil
| | - Jhonathan P R Dos Santos
- Luiz de Queiroz College of Agriculture / University of São Paulo (ESALQ/USP), Piracicaba, SP, Brazil
| | | | | | - Antonio Augusto F Garcia
- Luiz de Queiroz College of Agriculture / University of São Paulo (ESALQ/USP), Piracicaba, SP, Brazil
| |
Collapse
|
20
|
Genomic Prediction for Winter Survival of Lowland Switchgrass in the Northern USA. G3-GENES GENOMES GENETICS 2019; 9:1921-1931. [PMID: 30971392 PMCID: PMC6553536 DOI: 10.1534/g3.119.400094] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The lowland ecotype of switchgrass has generated considerable interest because of its higher biomass yield and late flowering characteristics compared to the upland ecotype. However, lowland ecotypes planted in northern latitudes exhibit very low winter survival. Implementation of genomic selection could potentially enhance switchgrass breeding for winter survival by reducing generation time while eliminating the dependence on weather. The objectives of this study were to assess the potential of genomic selection for winter survival in lowland switchgrass by combining multiple populations in the training set and applying the selected model in two independent testing datasets for validation. Marker data were generated using exome capture sequencing. Validation was conducted using (1) indirect indicators of winter adaptation based on geographic and climatic variables of accessions from different source locations and (2) winter survival estimates of the phenotype. The prediction accuracies were significantly higher when the training dataset comprising all populations was used in fivefold cross validation but its application was not useful in the independent validation dataset. Nevertheless, modeling for population heterogeneity improved the prediction accuracy to some extent but the genetic relationship between the training and validation populations was found to be more influential. The predicted winter survival of lowland switchgrass indicated latitudinal and longitudinal variability, with the northeast USA the region for most cold tolerant lowland populations. Our results suggested that GS could provide valuable opportunities for improving winter survival and accelerate the lowland switchgrass breeding programs toward the development of cold tolerant cultivars suitable for northern latitudes.
Collapse
|
21
|
Ramstein GP, Casler MD. Extensions of BLUP Models for Genomic Prediction in Heterogeneous Populations: Application in a Diverse Switchgrass Sample. G3 (BETHESDA, MD.) 2019; 9:789-805. [PMID: 30651285 PMCID: PMC6404615 DOI: 10.1534/g3.118.200969] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 01/10/2019] [Indexed: 11/18/2022]
Abstract
Genomic prediction is a useful tool to accelerate genetic gain in selection using DNA marker information. However, this technology typically relies on standard prediction procedures, such as genomic BLUP, that are not designed to accommodate population heterogeneity resulting from differences in marker effects across populations. In this study, we assayed different prediction procedures to capture marker-by-population interactions in genomic prediction models. Prediction procedures included genomic BLUP and two kernel-based extensions of genomic BLUP which explicitly accounted for population heterogeneity. To model population heterogeneity, dissemblance between populations was either depicted by a unique coefficient (as previously reported), or a more flexible function of genetic distance between populations (proposed herein). Models under investigation were applied in a diverse switchgrass sample under two validation schemes: whole-sample calibration, where all individuals except selection candidates are included in the calibration set, and cross-population calibration, where the target population is entirely excluded from the calibration set. First, we showed that using fixed effects, from principal components or putative population groups, appeared detrimental to prediction accuracy, especially in cross-population calibration. Then we showed that modeling population heterogeneity by our proposed procedure resulted in highly significant improvements in model fit. In such cases, gains in accuracy were often positive. These results suggest that population heterogeneity may be parsimoniously captured by kernel methods. However, in cases where improvement in model fit by our proposed procedure is null-to-moderate, ignoring heterogeneity should probably be preferred due to the robustness and simplicity of the standard genomic BLUP model.
Collapse
Affiliation(s)
| | - Michael D Casler
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI 53706
- Agricultural Research Service, United States Department of Agriculture, Madison, WI 53706
| |
Collapse
|
22
|
Ramstein GP, Evans J, Nandety A, Saha MC, Brummer EC, Kaeppler SM, Buell CR, Casler MD. Candidate Variants for Additive and Interactive Effects on Bioenergy Traits in Switchgrass ( Panicum virgatum L.) Identified by Genome-Wide Association Analyses. THE PLANT GENOME 2018; 11:180002. [PMID: 30512032 DOI: 10.3835/plantgenome2018.01.0002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Switchgrass ( L.) is a promising herbaceous energy crop, but further gains in biomass yield and quality must be achieved to enable a viable bioenergy industry. Developing DNA markers can contribute to such progress, but depiction of genetic bases should be reliable, involving simple additive marker effects and also interactions with genetic backgrounds (e.g., ecotypes) or synergies with other markers. We analyzed plant height, C content, N content, and mineral concentration in a diverse panel consisting of 512 genotypes of upland and lowland ecotypes. We performed association analyses based on exome capture sequencing and tested 439,170 markers for marginal effects, 83,290 markers for marker × ecotype interactions, and up to 311,445 marker pairs for pairwise interactions. Analyses of pairwise interactions focused on subsets of marker pairs preselected on the basis of marginal marker effects, gene ontology annotation, and pairwise marker associations. Our tests identified 12 significant effects. Homology and gene expression information corroborated seven effects and indicated plausible causal pathways: flowering time and lignin synthesis for plant height; plant growth and senescence for C content and mineral concentration. Four pairwise interactions were detected, including three interactions preselected on the basis of pairwise marker correlations. Furthermore, a marker × ecotype interaction and a pairwise interaction were confirmed in an independent switchgrass panel. Our analyses identified reliable candidate variants for important bioenergy traits. Moreover, they exemplified the importance of interactive effects for depicting genetic bases and illustrated the usefulness of preselecting marker pairs for identifying pairwise marker interactions in association studies.
Collapse
|
23
|
Kenaley SC, Quan M, Aime MC, Bergstrom GC. New insight into the species diversity and life cycles of rust fungi (Pucciniales) affecting bioenergy switchgrass (Panicum virgatum) in the Eastern and Central United States. Mycol Prog 2018. [DOI: 10.1007/s11557-018-1434-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
24
|
Covarrubias-Pazaran G, Schlautman B, Diaz-Garcia L, Grygleski E, Polashock J, Johnson-Cicalese J, Vorsa N, Iorizzo M, Zalapa J. Multivariate GBLUP Improves Accuracy of Genomic Selection for Yield and Fruit Weight in Biparental Populations of Vaccinium macrocarpon Ait. FRONTIERS IN PLANT SCIENCE 2018; 9:1310. [PMID: 30258453 PMCID: PMC6144488 DOI: 10.3389/fpls.2018.01310] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Accepted: 08/20/2018] [Indexed: 05/10/2023]
Abstract
The development of high-throughput genotyping has made genome-wide association (GWAS) and genomic selection (GS) applications possible for both model and non-model species. The exploitation of genome-assisted approaches could greatly benefit breeding efforts in American cranberry (Vaccinium macrocarpon) and other minor crops. Using biparental populations with different degrees of relatedness, we evaluated multiple GS methods for total yield (TY) and mean fruit weight (MFW). Specifically, we compared predictive ability (PA) differences between univariate and multivariate genomic best linear unbiased predictors (GBLUP and MGBLUP, respectively). We found that MGBLUP provided higher predictive ability (PA) than GBLUP, in scenarios with medium genetic correlation (8-17% increase with corg~0.6) and high genetic correlations (25-156% with corg~0.9), but found no increase when genetic correlation was low. In addition, we found that only a few hundred single nucleotide polymorphism (SNP) markers are needed to reach a plateau in PA for both traits in the biparental populations studied (in full linkage disequilibrium). We observed that higher resemblance among individuals in the training (TP) and validation (VP) populations provided greater PA. Although multivariate GS methods are available, genetic correlations and other factors need to be carefully considered when applying these methods for genetic improvement.
Collapse
Affiliation(s)
| | | | - Luis Diaz-Garcia
- Department of Horticulture, University of Wisconsin Madison, Madison, WI, United States
- Instituto Nacional de Investigaciones, Forestales, Agrícolas y Pecuarias, Campo Experimental Pabellón, Aguascalientes, Mexico
| | | | - James Polashock
- Genetic Improvement of Fruits and Vegetables Laboratory, USDA-ARS, Chatsworth, NJ, United States
| | - Jennifer Johnson-Cicalese
- Blueberry and Cranberry Research and Extension Center, Rutgers University, Chatsworth, NJ, United States
| | - Nicholi Vorsa
- Blueberry and Cranberry Research and Extension Center, Rutgers University, Chatsworth, NJ, United States
| | - Massimo Iorizzo
- Department of Horticulture Sciences, Plants for Human Health Institute, North Carolina State University, Kannapolis, NC, United States
| | - Juan Zalapa
- Vegetable Crops Research Unit, USDA-ARS, University of Wisconsin, Madison, WI, United States
| |
Collapse
|
25
|
Fiedler JD, Lanzatella C, Edmé SJ, Palmer NA, Sarath G, Mitchell R, Tobias CM. Genomic prediction accuracy for switchgrass traits related to bioenergy within differentiated populations. BMC PLANT BIOLOGY 2018; 18:142. [PMID: 29986667 PMCID: PMC6038187 DOI: 10.1186/s12870-018-1360-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Accepted: 07/02/2018] [Indexed: 05/16/2023]
Abstract
BACKGROUND Switchgrass breeders need to improve the rates of genetic gain in many bioenergy-related traits in order to create improved cultivars that are higher yielding and have optimal biomass composition. One way to achieve this is through genomic selection. However, the heritability of traits needs to be determined as well as the accuracy of prediction in order to determine if efficient selection is possible. RESULTS Using five distinct switchgrass populations comprised of three lowland, one upland and one hybrid accession, the accuracy of genomic predictions under different cross-validation strategies and prediction methods was investigated. Individual genotypes were collected using GBS while kin-BLUP, partial least squares, sparse partial least squares, and BayesB methods were employed to predict yield, morphological, and NIRS-based compositional data collected in 2012-2013 from a replicated Nebraska field trial. Population structure was assessed by F statistics which ranged from 0.3952 between lowland and upland accessions to 0.0131 among the lowland accessions. Prediction accuracy ranged from 0.57-0.52 for cell wall soluble glucose and fructose respectively, to insignificant for traits with low repeatability. Ratios of heritability across to within-population ranged from 15 to 0.6. CONCLUSIONS Accuracy was significantly affected by both cross-validation strategy and trait. Accounting for population structure with a cross-validation strategy constrained by accession resulted in accuracies that were 69% lower than apparent accuracies using unconstrained cross-validation. Less accurate genomic selection is anticipated when most of the phenotypic variation exists between populations such as with spring regreening and yield phenotypes.
Collapse
Affiliation(s)
- Jason D. Fiedler
- Department of Plant Sciences, North Dakota State University, 166 Loftsgard Hall, Fargo, ND 58108-6050 USA
| | - Christina Lanzatella
- USDA-ARS Crop Improvement and Genetics Research Unit, Western Regional Research Center, Albany, CA 94710 USA
| | - Serge J. Edmé
- USDA-ARS Wheat, Sorghum and Forage Research Unit, 251 Filley Hall/Food Ind. University of Nebraska, East Campus, Lincoln, NE 68583 USA
| | - Nathan A. Palmer
- USDA-ARS Wheat, Sorghum and Forage Research Unit, 251 Filley Hall/Food Ind. University of Nebraska, East Campus, Lincoln, NE 68583 USA
| | - Gautam Sarath
- USDA-ARS Wheat, Sorghum and Forage Research Unit, 251 Filley Hall/Food Ind. University of Nebraska, East Campus, Lincoln, NE 68583 USA
| | - Rob Mitchell
- USDA-ARS Wheat, Sorghum and Forage Research Unit, 251 Filley Hall/Food Ind. University of Nebraska, East Campus, Lincoln, NE 68583 USA
| | - Christian M. Tobias
- USDA-ARS Crop Improvement and Genetics Research Unit, Western Regional Research Center, Albany, CA 94710 USA
| |
Collapse
|
26
|
Genotyping by Sequencing Reasserts the Close Relationship between Tef and Its Putative Wild Eragrostis Progenitors. DIVERSITY-BASEL 2018. [DOI: 10.3390/d10020017] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
27
|
Fernandes SB, Dias KOG, Ferreira DF, Brown PJ. Efficiency of multi-trait, indirect, and trait-assisted genomic selection for improvement of biomass sorghum. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2018; 131:747-755. [PMID: 29218378 PMCID: PMC5814553 DOI: 10.1007/s00122-017-3033-y] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Accepted: 12/01/2017] [Indexed: 05/18/2023]
Abstract
We compare genomic selection methods that use correlated traits to help predict biomass yield in sorghum, and find that trait-assisted genomic selection performs best. Genomic selection (GS) is usually performed on a single trait, but correlated traits can also help predict a focal trait through indirect or multi-trait GS. In this study, we use a pre-breeding population of biomass sorghum to compare strategies that use correlated traits to improve prediction of biomass yield, the focal trait. Correlated traits include moisture, plant height measured at monthly intervals between planting and harvest, and the area under the growth progress curve. In addition to single- and multi-trait direct and indirect GS, we test a new strategy called trait-assisted GS, in which correlated traits are used along with marker data in the validation population to predict a focal trait. Single-trait GS for biomass yield had a prediction accuracy of 0.40. Indirect GS performed best using area under the growth progress curve to predict biomass yield, with a prediction accuracy of 0.37, and did not differ from indirect multi-trait GS that also used moisture information. Multi-trait GS and single-trait GS yielded similar results, indicating that correlated traits did not improve prediction of biomass yield in a standard GS scenario. However, trait-assisted GS increased prediction accuracy by up to [Formula: see text] when using plant height in both the training and validation populations to help predict yield in the validation population. Coincidence between selected genotypes in phenotypic and genomic selection was also highest in trait-assisted GS. Overall, these results suggest that trait-assisted GS can be an efficient strategy when correlated traits are obtained earlier or more inexpensively than a focal trait.
Collapse
Affiliation(s)
- Samuel B Fernandes
- Department of Crop Sciences, University of Illinois, 1206 W Gregory Drive, IL, Urbana, 61801, USA.
| | - Kaio O G Dias
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, PO Box 83, Piracicaba, SP, 13400-970, Brazil
| | - Daniel F Ferreira
- Departamento de Estatística, Universidade Federal de Lavras, 3037, Lavras, MG, 37200-000, Brazil
| | - Patrick J Brown
- Department of Crop Sciences, University of Illinois, 1206 W Gregory Drive, IL, Urbana, 61801, USA
| |
Collapse
|
28
|
Sogbohossou EOD, Achigan-Dako EG, Maundu P, Solberg S, Deguenon EMS, Mumm RH, Hale I, Van Deynze A, Schranz ME. A roadmap for breeding orphan leafy vegetable species: a case study of Gynandropsis gynandra (Cleomaceae). HORTICULTURE RESEARCH 2018; 5:2. [PMID: 29423232 PMCID: PMC5798814 DOI: 10.1038/s41438-017-0001-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2017] [Revised: 10/23/2017] [Accepted: 11/29/2017] [Indexed: 05/24/2023]
Abstract
Despite an increasing awareness of the potential of "orphan" or unimproved crops to contribute to food security and enhanced livelihoods for farmers, coordinated research agendas to facilitate production and use of orphan crops by local communities are generally lacking. We provide an overview of the current knowledge on leafy vegetables with a focus on Gynandropsis gynandra, a highly nutritious species used in Africa and Asia, and highlight general and species-specific guidelines for participatory, genomics-assisted breeding of orphan crops. Key steps in genome-enabled orphan leafy vegetables improvement are identified and discussed in the context of Gynandropsis gynandra breeding, including: (1) germplasm collection and management; (2) product target definition and refinement; (3) characterization of the genetic control of key traits; (4) design of the 'process' for cultivar development; (5) integration of genomic data to optimize that 'process'; (6) multi-environmental participatory testing and end-user evaluation; and (7) crop value chain development. The review discusses each step in detail, with emphasis on improving leaf yield, phytonutrient content, organoleptic quality, resistance to biotic and abiotic stresses and post-harvest management.
Collapse
Affiliation(s)
- E. O. Deedi Sogbohossou
- Biosystematics Group, Wageningen University, Postbus 647 6700AP, Wageningen, The Netherlands
- Laboratory of Genetics, Horticulture and Seed Sciences, Faculty of Agronomic Sciences, University of Abomey-Calavi, BP 2549 Abomey-Calavi, Benin
| | - Enoch G. Achigan-Dako
- Laboratory of Genetics, Horticulture and Seed Sciences, Faculty of Agronomic Sciences, University of Abomey-Calavi, BP 2549 Abomey-Calavi, Benin
| | - Patrick Maundu
- Kenya Resource Center for Indigenous Knowledge (KENRIK), Centre for Biodiversity, National Museums of Kenya, Museum Hill, P.O. Box 40658, Nairobi, 00100 Kenya
| | - Svein Solberg
- World Vegetable Center (AVRDC), P.O. Box 42, Shanhua, Tainan 74199 Taiwan
| | | | - Rita H. Mumm
- Department of Crop Sciences, University of Illinois, Urbana-Champaign, IL 61801 USA
| | - Iago Hale
- Department of Agriculture, Nutrition, and Food Systems, University of New Hampshire, Durham, NH 03824 USA
| | - Allen Van Deynze
- Department of Plant Sciences, University of California, Davis, CA 95616 USA
| | - M. Eric Schranz
- Biosystematics Group, Wageningen University, Postbus 647 6700AP, Wageningen, The Netherlands
| |
Collapse
|
29
|
Zhang J, Naik HS, Assefa T, Sarkar S, Reddy RVC, Singh A, Ganapathysubramanian B, Singh AK. Computer vision and machine learning for robust phenotyping in genome-wide studies. Sci Rep 2017; 7:44048. [PMID: 28272456 PMCID: PMC5358742 DOI: 10.1038/srep44048] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2016] [Accepted: 02/02/2017] [Indexed: 12/15/2022] Open
Abstract
Traditional evaluation of crop biotic and abiotic stresses are time-consuming and labor-intensive limiting the ability to dissect the genetic basis of quantitative traits. A machine learning (ML)-enabled image-phenotyping pipeline for the genetic studies of abiotic stress iron deficiency chlorosis (IDC) of soybean is reported. IDC classification and severity for an association panel of 461 diverse plant-introduction accessions was evaluated using an end-to-end phenotyping workflow. The workflow consisted of a multi-stage procedure including: (1) optimized protocols for consistent image capture across plant canopies, (2) canopy identification and registration from cluttered backgrounds, (3) extraction of domain expert informed features from the processed images to accurately represent IDC expression, and (4) supervised ML-based classifiers that linked the automatically extracted features with expert-rating equivalent IDC scores. ML-generated phenotypic data were subsequently utilized for the genome-wide association study and genomic prediction. The results illustrate the reliability and advantage of ML-enabled image-phenotyping pipeline by identifying previously reported locus and a novel locus harboring a gene homolog involved in iron acquisition. This study demonstrates a promising path for integrating the phenotyping pipeline into genomic prediction, and provides a systematic framework enabling robust and quicker phenotyping through ground-based systems.
Collapse
Affiliation(s)
- Jiaoping Zhang
- Department of Agronomy, Iowa State University, Ames, IA, USA
| | - Hsiang Sing Naik
- Department of Mechanical Engineering, Iowa State University, Ames, IA, USA
| | - Teshale Assefa
- Department of Agronomy, Iowa State University, Ames, IA, USA
| | - Soumik Sarkar
- Department of Mechanical Engineering, Iowa State University, Ames, IA, USA
| | | | - Arti Singh
- Department of Agronomy, Iowa State University, Ames, IA, USA
| | | | | |
Collapse
|
30
|
Grabowski PP, Evans J, Daum C, Deshpande S, Barry KW, Kennedy M, Ramstein G, Kaeppler SM, Buell CR, Jiang Y, Casler MD. Genome-wide associations with flowering time in switchgrass using exome-capture sequencing data. THE NEW PHYTOLOGIST 2017; 213:154-169. [PMID: 27443672 DOI: 10.1111/nph.14101] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Accepted: 06/10/2016] [Indexed: 05/20/2023]
Abstract
Flowering time is a major determinant of biomass yield in switchgrass (Panicum virgatum), a perennial bioenergy crop, because later flowering allows for an extended period of vegetative growth and increased biomass production. A better understanding of the genetic regulation of flowering time in switchgrass will aid the development of switchgrass varieties with increased biomass yields, particularly at northern latitudes, where late-flowering but southern-adapted varieties have high winter mortality. We use genotypes derived from recently published exome-capture sequencing, which mitigates challenges related to the large, highly repetitive and polyploid switchgrass genome, to perform genome-wide association studies (GWAS) using flowering time data from a switchgrass association panel in an effort to characterize the genetic architecture and genes underlying flowering time regulation in switchgrass. We identify associations with flowering time at multiple loci, including in a homolog of FLOWERING LOCUS T and in a locus containing TIMELESS, a homolog of a key circadian regulator in animals. Our results suggest that flowering time variation in switchgrass is due to variation at many positions across the genome. The relationship of flowering time and geographic origin indicates likely roles for genes in the photoperiod and autonomous pathways in generating switchgrass flowering time variation.
Collapse
Affiliation(s)
- Paul P Grabowski
- US Dairy Forage Research Center, USDA-ARS, 1925 Linden Dr. W, Madison, WI, 53706, USA
| | - Joseph Evans
- DuPont Pioneer, Johnston, IA, 50131, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI, 48824, USA
| | - Chris Daum
- DOE Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | | - Kerrie W Barry
- DOE Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Megan Kennedy
- DOE Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Guillaume Ramstein
- Department of Agronomy, University of Wisconsin-Madison, 1575 Linden Dr, Madison, WI, 53706, USA
| | - Shawn M Kaeppler
- Department of Agronomy, University of Wisconsin-Madison, 1575 Linden Dr, Madison, WI, 53706, USA
- DOE Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, 1552 University Ave, Madison, WI, 53726, USA
| | - C Robin Buell
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI, 48824, USA
| | - Yiwei Jiang
- Department of Agronomy, Purdue University, 915 West State Street, West Lafayette, IN, 47907, USA
| | - Michael D Casler
- US Dairy Forage Research Center, USDA-ARS, 1925 Linden Dr. W, Madison, WI, 53706, USA
| |
Collapse
|
31
|
Bhat JA, Ali S, Salgotra RK, Mir ZA, Dutta S, Jadon V, Tyagi A, Mushtaq M, Jain N, Singh PK, Singh GP, Prabhu KV. Genomic Selection in the Era of Next Generation Sequencing for Complex Traits in Plant Breeding. Front Genet 2016; 7:221. [PMID: 28083016 PMCID: PMC5186759 DOI: 10.3389/fgene.2016.00221] [Citation(s) in RCA: 123] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Accepted: 12/12/2016] [Indexed: 12/31/2022] Open
Abstract
Genomic selection (GS) is a promising approach exploiting molecular genetic markers to design novel breeding programs and to develop new markers-based models for genetic evaluation. In plant breeding, it provides opportunities to increase genetic gain of complex traits per unit time and cost. The cost-benefit balance was an important consideration for GS to work in crop plants. Availability of genome-wide high-throughput, cost-effective and flexible markers, having low ascertainment bias, suitable for large population size as well for both model and non-model crop species with or without the reference genome sequence was the most important factor for its successful and effective implementation in crop species. These factors were the major limitations to earlier marker systems viz., SSR and array-based, and was unimaginable before the availability of next-generation sequencing (NGS) technologies which have provided novel SNP genotyping platforms especially the genotyping by sequencing. These marker technologies have changed the entire scenario of marker applications and made the use of GS a routine work for crop improvement in both model and non-model crop species. The NGS-based genotyping have increased genomic-estimated breeding value prediction accuracies over other established marker platform in cereals and other crop species, and made the dream of GS true in crop breeding. But to harness the true benefits from GS, these marker technologies will be combined with high-throughput phenotyping for achieving the valuable genetic gain from complex traits. Moreover, the continuous decline in sequencing cost will make the WGS feasible and cost effective for GS in near future. Till that time matures the targeted sequencing seems to be more cost-effective option for large scale marker discovery and GS, particularly in case of large and un-decoded genomes.
Collapse
Affiliation(s)
- Javaid A Bhat
- Division of Genetics, Indian Agricultural Research Institute New Delhi, India
| | - Sajad Ali
- National Research Centre for Plant Biotechnology New Delhi, India
| | - Romesh K Salgotra
- School of Biotechnology, Sher-e-Kashmir University of Agricultural Sciences and Technology of Jammu Chatha, India
| | - Zahoor A Mir
- National Research Centre for Plant Biotechnology New Delhi, India
| | - Sutapa Dutta
- Division of Genetics, Indian Agricultural Research Institute New Delhi, India
| | - Vasudha Jadon
- Division of Genetics, Indian Agricultural Research Institute New Delhi, India
| | - Anshika Tyagi
- National Research Centre for Plant Biotechnology New Delhi, India
| | - Muntazir Mushtaq
- School of Biotechnology, Sher-e-Kashmir University of Agricultural Sciences and Technology of Jammu Chatha, India
| | - Neelu Jain
- Division of Genetics, Indian Agricultural Research Institute New Delhi, India
| | - Pradeep K Singh
- Division of Genetics, Indian Agricultural Research Institute New Delhi, India
| | - Gyanendra P Singh
- Division of Genetics, Indian Agricultural Research Institute New Delhi, India
| | - K V Prabhu
- Division of Genetics, Indian Agricultural Research Institute New Delhi, India
| |
Collapse
|
32
|
Arbizu CI, Ellison SL, Senalik D, Simon PW, Spooner DM. Genotyping-by-sequencing provides the discriminating power to investigate the subspecies of Daucus carota (Apiaceae). BMC Evol Biol 2016; 16:234. [PMID: 27793080 PMCID: PMC5084430 DOI: 10.1186/s12862-016-0806-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Accepted: 10/14/2016] [Indexed: 12/05/2022] Open
Abstract
BACKGROUND The majority of the subspecies of Daucus carota have not yet been discriminated clearly by various molecular or morphological methods and hence their phylogeny and classification remains unresolved. Recent studies using 94 nuclear orthologs and morphological characters, and studies employing other molecular approaches were unable to distinguish clearly many of the subspecies. Fertile intercrosses among traditionally recognized subspecies are well documented. We here explore the utility of single nucleotide polymorphisms (SNPs) generated by genotyping-by-sequencing (GBS) to serve as an effective molecular method to discriminate the subspecies of the D. carota complex. RESULTS We used GBS to obtain SNPs covering all nine Daucus carota chromosomes from 162 accessions of Daucus and two related genera. To study Daucus phylogeny, we scored a total of 10,814 or 38,920 SNPs with a maximum of 10 or 30 % missing data, respectively. To investigate the subspecies of D. carota, we employed two data sets including 150 accessions: (i) rate of missing data 10 % with a total of 18,565 SNPs, and (ii) rate of missing data 30 %, totaling 43,713 SNPs. Consistent with prior results, the topology of both data sets separated species with 2n = 18 chromosome from all other species. Our results place all cultivated carrots (D. carota subsp. sativus) in a single clade. The wild members of D. carota from central Asia were on a clade with eastern members of subsp. sativus. The other subspecies of D. carota were in four clades associated with geographic groups: (1) the Balkan Peninsula and the Middle East, (2) North America and Europe, (3) North Africa exclusive of Morocco, and (4) the Iberian Peninsula and Morocco. Daucus carota subsp. maximus was discriminated, but neither it, nor subsp. gummifer (defined in a broad sense) are monophyletic. CONCLUSIONS Our study suggests that (1) the morphotypes identified as D. carota subspecies gummifer (as currently broadly circumscribed), all confined to areas near the Atlantic Ocean and the western Mediterranean Sea, have separate origins from sympatric members of other subspecies of D. carota, (2) D. carota subsp. maximus, on two clades with some accessions of subsp. carota, can be distinguished from each other but only with poor morphological support, (3) D. carota subsp. capillifolius, well distinguished morphologically, is an apospecies relative to North African populations of D. carota subsp. carota, (4) the eastern cultivated carrots have origins closer to wild carrots from central Asia than to western cultivated carrots, and (5) large SNP data sets are suitable for species-level phylogenetic studies in Daucus.
Collapse
Affiliation(s)
- Carlos I Arbizu
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1590, USA
| | - Shelby L Ellison
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1590, USA
| | - Douglas Senalik
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1590, USA
- USDA-Agricultural Research Service, Vegetable Crops Research Unit, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1590, USA
| | - Philipp W Simon
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1590, USA
- USDA-Agricultural Research Service, Vegetable Crops Research Unit, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1590, USA
| | - David M Spooner
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1590, USA.
- USDA-Agricultural Research Service, Vegetable Crops Research Unit, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1590, USA.
| |
Collapse
|
33
|
Zhang X, Sallam A, Gao L, Kantarski T, Poland J, DeHaan LR, Wyse DL, Anderson JA. Establishment and Optimization of Genomic Selection to Accelerate the Domestication and Improvement of Intermediate Wheatgrass. THE PLANT GENOME 2016; 9. [PMID: 27898759 DOI: 10.3835/plantgenome2015.07.0059] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Intermediate wheatgrass (IWG) is a perennial species and has edible and nutritious grain and desirable agronomic traits, including large seed size, high grain yield, and biomass. It also has the potential to provide ecosystem services and an economic return to farmers. However, because of its allohexaploidy and self-incompatibility, developing molecular markers for genetic analysis and molecular breeding has been challenging. In the present study, using genotyping-by-sequencing (GBS) technology, 3436 genome-wide markers discovered in a biparental population with 178 genets, were mapped to 21 linkage groups (LG) corresponding to 21 chromosomes of IWG. Genomic prediction models were developed using 3883 markers discovered in a breeding population containing 1126 representative genets from 58 half-sib families. High predictive ability was observed for seven agronomic traits using cross-validation, ranging from 0.46 for biomass to 0.67 for seed weight. Optimization results indicated that 8 to 10 genets from each half-sib family can form a good training population to predict the breeding value of their siblings, and 1600 genome-wide markers are adequate to capture the genetic variation in the current breeding population for genomic selection. Thus, with the advances in sequencing-based marker technologies, it was practical to perform molecular genetic analysis and molecular breeding on a new and challenging species like IWG, and genomic selection could increase the efficiency of recurrent selection and accelerate the domestication and improvement of IWG.
Collapse
|
34
|
Grinberg NF, Lovatt A, Hegarty M, Lovatt A, Skøt KP, Kelly R, Blackmore T, Thorogood D, King RD, Armstead I, Powell W, Skøt L. Implementation of Genomic Prediction in Lolium perenne (L.) Breeding Populations. FRONTIERS IN PLANT SCIENCE 2016; 7:133. [PMID: 26904088 PMCID: PMC4751346 DOI: 10.3389/fpls.2016.00133] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 01/25/2016] [Indexed: 05/23/2023]
Abstract
Perennial ryegrass (Lolium perenne L.) is one of the most widely grown forage grasses in temperate agriculture. In order to maintain and increase its usage as forage in livestock agriculture, there is a continued need for improvement in biomass yield, quality, disease resistance, and seed yield. Genetic gain for traits such as biomass yield has been relatively modest. This has been attributed to its long breeding cycle, and the necessity to use population based breeding methods. Thanks to recent advances in genotyping techniques there is increasing interest in genomic selection from which genomically estimated breeding values are derived. In this paper we compare the classical RRBLUP model with state-of-the-art machine learning techniques that should yield themselves easily to use in GS and demonstrate their application to predicting quantitative traits in a breeding population of L. perenne. Prediction accuracies varied from 0 to 0.59 depending on trait, prediction model and composition of the training population. The BLUP model produced the highest prediction accuracies for most traits and training populations. Forage quality traits had the highest accuracies compared to yield related traits. There appeared to be no clear pattern to the effect of the training population composition on the prediction accuracies. The heritability of the forage quality traits was generally higher than for the yield related traits, and could partly explain the difference in accuracy. Some population structure was evident in the breeding populations, and probably contributed to the varying effects of training population on the predictions. The average linkage disequilibrium between adjacent markers ranged from 0.121 to 0.215. Higher marker density and larger training population closely related with the test population are likely to improve the prediction accuracy.
Collapse
Affiliation(s)
| | - Alan Lovatt
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| | - Matt Hegarty
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| | - Andi Lovatt
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| | - Kirsten P. Skøt
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| | - Rhys Kelly
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| | - Tina Blackmore
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| | - Danny Thorogood
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| | - Ross D. King
- Manchester Institute of Biotechnology, University of ManchesterManchester, UK
| | - Ian Armstead
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| | - Wayne Powell
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
- CGIAR Consortium, CGIAR Consortium OfficeMontpellier, France
| | - Leif Skøt
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth UniversityAberystwyth, UK
| |
Collapse
|
35
|
Zhang J, Song Q, Cregan PB, Jiang GL. Genome-wide association study, genomic prediction and marker-assisted selection for seed weight in soybean (Glycine max). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2016; 129:117-30. [PMID: 26518570 PMCID: PMC4703630 DOI: 10.1007/s00122-015-2614-x] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 09/29/2015] [Indexed: 05/18/2023]
Abstract
KEY MESSAGE Twenty-two loci for soybean SW and candidate genes conditioning seed development were identified; and prediction accuracies of GS and MAS were estimated through cross-validation and validation with unrelated populations. Soybean (Glycine max) is a major crop for plant protein and oil production, and seed weight (SW) is important for yield and quality in food/vegetable uses of soybean. However, our knowledge of genes controlling SW remains limited. To better understand the molecular mechanism underlying the trait and explore marker-based breeding approaches, we conducted a genome-wide association study in a population of 309 soybean germplasm accessions using 31,045 single nucleotide polymorphisms (SNPs), and estimated the prediction accuracy of genomic selection (GS) and marker-assisted selection (MAS) for SW. Twenty-two loci of minor effect associated with SW were identified, including hotspots on Gm04 and Gm19. The mixed model containing these loci explained 83.4% of phenotypic variation. Candidate genes with Arabidopsis orthologs conditioning SW were also proposed. The prediction accuracies of GS and MAS by cross-validation were 0.75-0.87 and 0.62-0.75, respectively, depending on the number of SNPs used and the size of training population. GS also outperformed MAS when the validation was performed using unrelated panels across a wide range of maturities, with an average prediction accuracy of 0.74 versus 0.53. This study convincingly demonstrated that soybean SW is controlled by numerous minor-effect loci. It greatly enhances our understanding of the genetic basis of SW in soybean and facilitates the identification of genes controlling the trait. It also suggests that GS holds promise for accelerating soybean breeding progress. The results are helpful for genetic improvement and genomic prediction of yield in soybean.
Collapse
Affiliation(s)
- Jiaoping Zhang
- Plant Science Department, South Dakota State University, Brookings, SD, 57006, USA
| | - Qijian Song
- Soybean Genomics and Improvement Laboratory, US Department of Agriculture, Agricultural Research Service (USDA-ARS), 10300 Baltimore Ave, Beltsville, MD, 20705, USA
| | - Perry B Cregan
- Soybean Genomics and Improvement Laboratory, US Department of Agriculture, Agricultural Research Service (USDA-ARS), 10300 Baltimore Ave, Beltsville, MD, 20705, USA
| | - Guo-Liang Jiang
- Agricultural Research Station, Virginia State University, Carter G. Woodson Ave, P.O. Box 9061, Petersburg, VA, 23806, USA.
| |
Collapse
|
36
|
Arruda MP, Brown PJ, Lipka AE, Krill AM, Thurber C, Kolb FL. Genomic Selection for Predicting Fusarium Head Blight Resistance in a Wheat Breeding Program. THE PLANT GENOME 2015; 8:eplantgenome2015.01.0003. [PMID: 33228272 DOI: 10.3835/plantgenome2015.01.0003] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2015] [Accepted: 06/23/2015] [Indexed: 05/20/2023]
Abstract
Genomic selection (GS) is a breeding method that uses marker-trait models to predict unobserved phenotypes. This study developed GS models for predicting traits associated with resistance to Fusarium head blight (FHB) in wheat (Triticum aestivum L.). We used genotyping-by-sequencing (GBS) to identify 5054 single-nucleotide polymorphisms (SNPs), which were then treated as predictor variables in GS analysis. We compared how the prediction accuracy of the genomic-estimated breeding values (GEBVs) was affected by (i) five genotypic imputation methods (random forest imputation [RFI], expectation maximization imputation [EMI], k-nearest neighbor imputation [kNNI], singular value decomposition imputation [SVDI], and the mean imputation [MNI]); (ii) three statistical models (ridge-regression best linear unbiased predictor [RR-BLUP], least absolute shrinkage and operator selector [LASSO], and elastic net); (iii) marker density (p = 500, 1500, 3000, and 4500 SNPs); (iv) training population (TP) size (nTP = 96, 144, 192, and 218); (v) marker-based and pedigree-based relationship matrices; and (vi) control for relatedness in TPs and validation populations (VPs). No discernable differences in prediction accuracy were observed among imputation methods. The RR-BLUP outperformed other models in nearly all scenarios. Accuracies decreased substantially when marker number decreased to 3000 or 1500 SNPs, depending on the trait; when sample size of the training set was less than 192; when using pedigree-based instead of marker-based matrix; or when no control for relatedness was implemented. Overall, moderate to high prediction accuracies were observed in this study, suggesting that GS is a very promising breeding strategy for FHB resistance in wheat.
Collapse
Affiliation(s)
- Marcio P Arruda
- Dep. of Crop Sciences, Univ. of Illinois, 1102 S. Goodwin Ave., Urbana, IL, 61801
| | - Patrick J Brown
- Dep. of Crop Sciences, Univ. of Illinois, 1102 S. Goodwin Ave., Urbana, IL, 61801
| | - Alexander E Lipka
- Dep. of Crop Sciences, Univ. of Illinois, 1102 S. Goodwin Ave., Urbana, IL, 61801
| | - Allison M Krill
- Dep. of Crop Sciences, Univ. of Illinois, 1102 S. Goodwin Ave., Urbana, IL, 61801
| | - Carrie Thurber
- School of Science and Mathematics, Abraham Baldwin Agricultural College, 2802 Moore Hwy., Tifton, GA, 31793
| | - Frederic L Kolb
- Dep. of Crop Sciences, Univ. of Illinois, 1102 S. Goodwin Ave., Urbana, IL, 61801
| |
Collapse
|
37
|
Evans J, Crisovan E, Barry K, Daum C, Jenkins J, Kunde-Ramamoorthy G, Nandety A, Ngan CY, Vaillancourt B, Wei CL, Schmutz J, Kaeppler SM, Casler MD, Buell CR. Diversity and population structure of northern switchgrass as revealed through exome capture sequencing. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2015; 84:800-15. [PMID: 26426343 DOI: 10.1111/tpj.13041] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Revised: 08/31/2015] [Accepted: 09/03/2015] [Indexed: 05/11/2023]
Abstract
Panicum virgatum L. (switchgrass) is a polyploid, perennial grass species that is native to North America, and is being developed as a future biofuel feedstock crop. Switchgrass is present primarily in two ecotypes: a northern upland ecotype, composed of tetraploid and octoploid accessions, and a southern lowland ecotype, composed of primarily tetraploid accessions. We employed high-coverage exome capture sequencing (~2.4 Tb) to genotype 537 individuals from 45 upland and 21 lowland populations. From these data, we identified ~27 million single-nucleotide polymorphisms (SNPs), of which 1 590 653 high-confidence SNPs were used in downstream analyses of diversity within and between the populations. From the 66 populations, we identified five primary population groups within the upland and lowland ecotypes, a result that was further supported through genetic distance analysis. We identified conserved, ecotype-restricted, non-synonymous SNPs that are predicted to affect the protein function of CONSTANS (CO) and EARLY HEADING DATE 1 (EHD1), key genes involved in flowering, which may contribute to the phenotypic differences between the two ecotypes. We also identified, relative to the near-reference Kanlow population, 17 228 genes present in more copies than in the reference genome (up-CNVs), 112 630 genes present in fewer copies than in the reference genome (down-CNVs) and 14 430 presence/absence variants (PAVs), affecting a total of 9979 genes, including two upland-specific CNV clusters. In total, 45 719 genes were affected by an SNP, CNV, or PAV across the panel, providing a firm foundation to identify functional variation associated with phenotypic traits of interest for biofuel feedstock production.
Collapse
Affiliation(s)
- Joseph Evans
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Emily Crisovan
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Kerrie Barry
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Chris Daum
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Jerry Jenkins
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | | | - Aruna Nandety
- Department of Microbiology and Plant Biology, University of Oklahoma, Norman, OK, 73019, USA
| | - Chew Yee Ngan
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Brieanne Vaillancourt
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Chia-Lin Wei
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Jeremy Schmutz
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Shawn M Kaeppler
- DOE Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706, USA
- Department of Agronomy, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706, USA
| | - Michael D Casler
- DOE Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706, USA
- USDA-ARS, U.S. Dairy Forage Research Center, 1925 Linden Dr., Madison, WI, 53706-1108, USA
| | - Carol Robin Buell
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| |
Collapse
|
38
|
Muranty H, Troggio M, Sadok IB, Rifaï MA, Auwerkerken A, Banchi E, Velasco R, Stevanato P, van de Weg WE, Di Guardo M, Kumar S, Laurens F, Bink MCAM. Accuracy and responses of genomic selection on key traits in apple breeding. HORTICULTURE RESEARCH 2015; 2:15060. [PMID: 26744627 PMCID: PMC4688998 DOI: 10.1038/hortres.2015.60] [Citation(s) in RCA: 75] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
The application of genomic selection in fruit tree crops is expected to enhance breeding efficiency by increasing prediction accuracy, increasing selection intensity and decreasing generation interval. The objectives of this study were to assess the accuracy of prediction and selection response in commercial apple breeding programmes for key traits. The training population comprised 977 individuals derived from 20 pedigreed full-sib families. Historic phenotypic data were available on 10 traits related to productivity and fruit external appearance and genotypic data for 7829 SNPs obtained with an Illumina 20K SNP array. From these data, a genome-wide prediction model was built and subsequently used to calculate genomic breeding values of five application full-sib families. The application families had genotypes at 364 SNPs from a dedicated 512 SNP array, and these genotypic data were extended to the high-density level by imputation. These five families were phenotyped for 1 year and their phenotypes were compared to the predicted breeding values. Accuracy of genomic prediction across the 10 traits reached a maximum value of 0.5 and had a median value of 0.19. The accuracies were strongly affected by the phenotypic distribution and heritability of traits. In the largest family, significant selection response was observed for traits with high heritability and symmetric phenotypic distribution. Traits that showed non-significant response often had reduced and skewed phenotypic variation or low heritability. Among the five application families the accuracies were uncorrelated to the degree of relatedness to the training population. The results underline the potential of genomic prediction to accelerate breeding progress in outbred fruit tree crops that still need to overcome long generation intervals and extensive phenotyping costs.
Collapse
Affiliation(s)
- Hélène Muranty
- Institut de Recherche en Horticulture et Semences UMR1345, INRA, SFR 4207 QUASAV, F-49071 Beaucouze, France
- ()
| | - Michela Troggio
- Research and Innovation Center, Fondazione Edmund Mach, San Michele all’Adige, Trento, Italy
| | - Inès Ben Sadok
- Institut de Recherche en Horticulture et Semences UMR1345, INRA, SFR 4207 QUASAV, F-49071 Beaucouze, France
| | - Mehdi Al Rifaï
- Institut de Recherche en Horticulture et Semences UMR1345, INRA, SFR 4207 QUASAV, F-49071 Beaucouze, France
| | | | - Elisa Banchi
- Research and Innovation Center, Fondazione Edmund Mach, San Michele all’Adige, Trento, Italy
| | - Riccardo Velasco
- Research and Innovation Center, Fondazione Edmund Mach, San Michele all’Adige, Trento, Italy
| | - Piergiorgio Stevanato
- DAFNAE, Dipartimento di Agronomia Animali Alimenti Risorse Naturali e Ambiente, viale Università 16, 35020 Legnaro (PD), Università, degli Studi di Padova, Italy
| | - W Eric van de Weg
- Wageningen UR Plant Breeding, Wageningen University and Research Center, Wageningen, The Netherlands
| | - Mario Di Guardo
- Research and Innovation Center, Fondazione Edmund Mach, San Michele all’Adige, Trento, Italy
- Wageningen UR Plant Breeding, Wageningen University and Research Center, Wageningen, The Netherlands
| | - Satish Kumar
- The New Zealand Institute for Plant & Food Research Limited, Private Bag 1401, Havelock North 4157, New Zealand
| | - François Laurens
- Institut de Recherche en Horticulture et Semences UMR1345, INRA, SFR 4207 QUASAV, F-49071 Beaucouze, France
| | - Marco C A M Bink
- Biometris, Wageningen University and Research Center, Wageningen, The Netherlands
- ()
| |
Collapse
|