1
|
Genomic prediction of synthetic hexaploid wheat upon tetraploid durum and diploid Aegilops parental pools. THE PLANT GENOME 2024:e20464. [PMID: 38764312 DOI: 10.1002/tpg2.20464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 04/04/2024] [Accepted: 04/09/2024] [Indexed: 05/21/2024]
Abstract
Bread wheat (Triticum aestivum L.) is a globally important food crop, which was domesticated about 8-10,000 years ago. Bread wheat is an allopolyploid, and it evolved from two hybridization events of three species. To widen the genetic base in breeding, bread wheat has been re-synthesized by crossing durum wheat (Triticum turgidum ssp. durum) and goat grass (Aegilops tauschii Coss), leading to so-called synthetic hexaploid wheat (SHW). We applied the quantitative genetics tools of "hybrid prediction"-originally developed for the prediction of wheat hybrids generated from different heterotic groups - to a situation of allopolyploidization. Our use-case predicts the phenotypes of SHW for three quantitatively inherited global wheat diseases, namely tan spot (TS), septoria nodorum blotch (SNB), and spot blotch (SB). Our results revealed prediction abilities comparable to studies in 'traditional' elite or hybrid wheat. Prediction abilities were highest using a marker model and performing random cross-validation, predicting the performance of untested SHW (0.483 for SB to 0.730 for TS). When testing parents not necessarily used in SHW, combination prediction abilities were slightly lower (0.378 for SB to 0.718 for TS), yet still promising. Despite the limited phenotypic data, our results provide a general example for predictive models targeting an allopolyploidization event and a method that can guide the use of genetic resources available in gene banks.
Collapse
|
2
|
Long-Lasting Silver Nanoparticles Synthesized with Tagetes erecta and Their Antibacterial Activity against Erwinia amylovora, a Serious Rosaceous Pathogen. PLANTS (BASEL, SWITZERLAND) 2024; 13:981. [PMID: 38611509 PMCID: PMC11013423 DOI: 10.3390/plants13070981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 03/14/2024] [Accepted: 03/20/2024] [Indexed: 04/14/2024]
Abstract
A rapid, eco-friendly, and simple method for the synthesis of long-lasting (2 years) silver nanoparticles (AgNPs) is reported using aqueous leaf and petal extracts of Tagetes erecta L. The particles were characterized using UV-Visible spectrophotometry and the analytical and crystallographic techniques of transmission electron microscopy (TEM). The longevity of the AgNPs was studied using UV-Vis and high-resolution TEM. The antibacterial activity of the particles against Erwinia amylovora was evaluated using the Kirby-Bauer disk diffusion method. The results were analyzed using ANOVA and Tukey's test (p ≤ 0.05). Both the leaf and petal extracts produced AgNPs, but the leaf extract (1 mL) was long-lasting and quasi-spherical (17.64 ± 8.87 nm), with an absorbance of UV-Vis λmax 433 and a crystalline structure (fcc, 111). Phenols, flavonoids, tannins, and terpenoids which are associated with -OH, C=O, and C=C were identified in the extracts and could act as reducing and stabilizing agents. The best antibacterial activity was obtained with a nanoparticle concentration of 50 mg AgNPs L-1. The main contribution of the present research is to present a sustainable method for producing nanoparticles which are stable for 2 years and with antibacterial activity against E. amylovora, one of most threatening pathogens to pear and apple productions.
Collapse
|
3
|
A fast algorithm to factorize high-dimensional tensor product matrices used in genetic models. G3 (BETHESDA, MD.) 2024; 14:jkae001. [PMID: 38180089 PMCID: PMC11090460 DOI: 10.1093/g3journal/jkae001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 12/26/2023] [Accepted: 12/28/2023] [Indexed: 01/06/2024]
Abstract
Many genetic models (including models for epistatic effects as well as genetic-by-environment) involve covariance structures that are Hadamard products of lower rank matrices. Implementing these models requires factorizing large Hadamard product matrices. The available algorithms for factorization do not scale well for big data, making the use of some of these models not feasible with large sample sizes. Here, based on properties of Hadamard products and (related) Kronecker products, we propose an algorithm that produces an approximate decomposition that is orders of magnitude faster than the standard eigenvalue decomposition. In this article, we describe the algorithm, show how it can be used to factorize large Hadamard product matrices, present benchmarks, and illustrate the use of the method by presenting an analysis of data from the northern testing locations of the G × E project from the Genomes to Fields Initiative (n ∼ 60,000). We implemented the proposed algorithm in the open-source "tensorEVD" R package.
Collapse
|
4
|
Deep learning methods improve genomic prediction of wheat breeding. FRONTIERS IN PLANT SCIENCE 2024; 15:1324090. [PMID: 38504889 PMCID: PMC10949530 DOI: 10.3389/fpls.2024.1324090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 02/19/2024] [Indexed: 03/21/2024]
Abstract
In the field of plant breeding, various machine learning models have been developed and studied to evaluate the genomic prediction (GP) accuracy of unseen phenotypes. Deep learning has shown promise. However, most studies on deep learning in plant breeding have been limited to small datasets, and only a few have explored its application in moderate-sized datasets. In this study, we aimed to address this limitation by utilizing a moderately large dataset. We examined the performance of a deep learning (DL) model and compared it with the widely used and powerful best linear unbiased prediction (GBLUP) model. The goal was to assess the GP accuracy in the context of a five-fold cross-validation strategy and when predicting complete environments using the DL model. The results revealed the DL model outperformed the GBLUP model in terms of GP accuracy for two out of the five included traits in the five-fold cross-validation strategy, with similar results in the other traits. This indicates the superiority of the DL model in predicting these specific traits. Furthermore, when predicting complete environments using the leave-one-environment-out (LOEO) approach, the DL model demonstrated competitive performance. It is worth noting that the DL model employed in this study extends a previously proposed multi-modal DL model, which had been primarily applied to image data but with small datasets. By utilizing a moderately large dataset, we were able to evaluate the performance and potential of the DL model in a context with more information and challenging scenario in plant breeding.
Collapse
|
5
|
Quantitative trait loci and genomic prediction for grain sugar and mineral concentrations of cowpea [Vigna unguiculata (L.) Walp.]. Sci Rep 2024; 14:4567. [PMID: 38403625 PMCID: PMC10894872 DOI: 10.1038/s41598-024-55214-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Accepted: 02/21/2024] [Indexed: 02/27/2024] Open
Abstract
Development of high yielding cowpea varieties coupled with good taste and rich in essential minerals can promote consumption and thus nutrition and profitability. The sweet taste of cowpea grain is determined by its sugar content, which comprises mainly sucrose and galacto-oligosaccharides (GOS) including raffinose and stachyose. However, GOS are indigestible and their fermentation in the colon can produce excess intestinal gas, causing undesirable bloating and flatulence. In this study, we aimed to examine variation in grain sugar and mineral concentrations, then map quantitative trait loci (QTLs) and estimate genomic-prediction (GP) accuracies for possible application in breeding. Grain samples were collected from a multi-parent advanced generation intercross (MAGIC) population grown in California during 2016-2017. Grain sugars were assayed using high-performance liquid chromatography. Grain minerals were determined by inductively coupled plasma-optical emission spectrometry and combustion. Considerable variation was observed for sucrose (0.6-6.9%) and stachyose (2.3-8.4%). Major QTLs for sucrose (QSuc.vu-1.1), stachyose (QSta.vu-7.1), copper (QCu.vu-1.1) and manganese (QMn.vu-5.1) were identified. Allelic effects of major sugar QTLs were validated using the MAGIC grain samples grown in West Africa in 2017. GP accuracies for minerals were moderate (0.4-0.58). These findings help guide future breeding efforts to develop mineral-rich cowpea varieties with desirable sugar content.
Collapse
|
6
|
A Bayesian optimization R package for multitrait parental selection. THE PLANT GENOME 2024:e20433. [PMID: 38385985 DOI: 10.1002/tpg2.20433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 12/26/2023] [Accepted: 01/02/2024] [Indexed: 02/23/2024]
Abstract
Selecting and mating parents in conventional phenotypic and genomic selection are crucial. Plant breeding programs aim to improve the economic value of crops, considering multiple traits simultaneously. When traits are negatively correlated and/or when there are missing records in some traits, selection becomes more complex. To address this problem, we propose a multitrait selection approach using the Multitrait Parental Selection (MPS) R package-an efficient tool for genetic improvement, precision breeding, and conservation genetics. The package employs Bayesian optimization algorithms and three loss functions (Kullback-Leibler, Energy Score, and Multivariate Asymmetric Loss) to identify parental candidates with desirable traits. The software's functionality includes three main functions-EvalMPS, FastMPS, and ApproxMPS-catering to different data availability scenarios. Through the presented application examples, the MPS R package proves effective in multitrait genomic selection, enabling breeders to make informed decisions and achieve strong performance across multiple traits.
Collapse
|
7
|
Inbreeding Effects on the Performance and Genomic Prediction for Polysomic Tetraploid Potato Offspring Grown at High Nordic Latitudes. Genes (Basel) 2023; 14:1302. [PMID: 37372482 DOI: 10.3390/genes14061302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 06/18/2023] [Accepted: 06/19/2023] [Indexed: 06/29/2023] Open
Abstract
Inbreeding depression (ID) is caused by increased homozygosity in the offspring after selfing. Although the self-compatible, highly heterozygous, tetrasomic polyploid potato (Solanum tuberosum L.) suffers from ID, some argue that the potential genetic gains from using inbred lines in a sexual propagation system of potato are too large to be ignored. The aim of this research was to assess the effects of inbreeding on potato offspring performance under a high latitude and the accuracy of the genomic prediction of breeding values (GEBVs) for further use in selection. Four inbred (S1) and two hybrid (F1) offspring and their parents (S0) were used in the experiment, with a field layout of an augmented design with the four S0 replicated in nine incomplete blocks comprising 100, four-plant plots at Umeå (63°49'30″ N 20°15'50″ E), Sweden. S0 was significantly (p < 0.01) better than both S1 and F1 offspring for tuber weight (total and according to five grading sizes), tuber shape and size uniformity, tuber eye depth and reducing sugars in the tuber flesh, while F1 was significantly (p < 0.01) better than S1 for all tuber weight and uniformity traits. Some F1 hybrid offspring (15-19%) had better total tuber yield than the best-performing parent. The GEBV accuracy ranged from -0.3928 to 0.4436. Overall, tuber shape uniformity had the highest GEBV accuracy, while tuber weight traits exhibited the lowest accuracy. The F1 full sib's GEBV accuracy was higher, on average, than that of S1. Genomic prediction may facilitate eliminating undesired inbred or hybrid offspring for further use in the genetic betterment of potato.
Collapse
|
8
|
Partial least squares enhance multi-trait genomic prediction of potato cultivars in new environments. Sci Rep 2023; 13:9947. [PMID: 37336933 DOI: 10.1038/s41598-023-37169-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 06/17/2023] [Indexed: 06/21/2023] Open
Abstract
It is of paramount importance in plant breeding to have methods dealing with large numbers of predictor variables and few sample observations, as well as efficient methods for dealing with high correlation in predictors and measured traits. This paper explores in terms of prediction performance the partial least squares (PLS) method under single-trait (ST) and multi-trait (MT) prediction of potato traits. The first prediction was for tested lines in tested environments under a five-fold cross-validation (5FCV) strategy and the second prediction was for tested lines in untested environments (herein denoted as leave one environment out cross validation, LOEO). There was a good performance in terms of predictions (with accuracy mostly > 0.5 for Pearson's correlation) the accuracy of 5FCV was better than LOEO. Hence, we have empirical evidence that the ST and MT PLS framework is a very valuable tool for prediction in the context of potato breeding data.
Collapse
|
9
|
Multimodal deep learning methods enhance genomic prediction of wheat breeding. G3 (BETHESDA, MD.) 2023; 13:jkad045. [PMID: 36869747 PMCID: PMC10151399 DOI: 10.1093/g3journal/jkad045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 02/21/2023] [Accepted: 02/22/2023] [Indexed: 03/05/2023]
Abstract
While several statistical machine learning methods have been developed and studied for assessing the genomic prediction (GP) accuracy of unobserved phenotypes in plant breeding research, few methods have linked genomics and phenomics (imaging). Deep learning (DL) neural networks have been developed to increase the GP accuracy of unobserved phenotypes while simultaneously accounting for the complexity of genotype-environment interaction (GE); however, unlike conventional GP models, DL has not been investigated for when genomics is linked with phenomics. In this study we used 2 wheat data sets (DS1 and DS2) to compare a novel DL method with conventional GP models. Models fitted for DS1 were GBLUP, gradient boosting machine (GBM), support vector regression (SVR) and the DL method. Results indicated that for 1 year, DL provided better GP accuracy than results obtained by the other models. However, GP accuracy obtained for other years indicated that the GBLUP model was slightly superior to the DL. DS2 is comprised only of genomic data from wheat lines tested for 3 years, 2 environments (drought and irrigated) and 2-4 traits. DS2 results showed that when predicting the irrigated environment with the drought environment, DL had higher accuracy than the GBLUP model in all analyzed traits and years. When predicting drought environment with information on the irrigated environment, the DL model and GBLUP model had similar accuracy. The DL method used in this study is novel and presents a strong degree of generalization as several modules can potentially be incorporated and concatenated to produce an output for a multi-input data structure.
Collapse
|
10
|
Results from rapid-cycle recurrent genomic selection in spring bread wheat. G3 (BETHESDA, MD.) 2023; 13:jkad025. [PMID: 36702618 PMCID: PMC10085763 DOI: 10.1093/g3journal/jkad025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 01/18/2023] [Accepted: 01/19/2023] [Indexed: 01/28/2023]
Abstract
Genomic selection (GS) in wheat breeding programs is of great interest for predicting the genotypic values of individuals, where both additive and nonadditive effects determine the final breeding value of lines. While several simulation studies have shown the efficiency of rapid-cycling GS strategies for parental selection or population improvement, their practical implementations are still lacking in wheat and other crops. In this study, we demonstrate the potential of rapid-cycle recurrent GS (RCRGS) to increase genetic gain for grain yield (GY) in wheat. Our results showed a consistent realized genetic gain for GY after 3 cycles of recombination (C1, C2, and C3) of bi-parental F1s, when summarized across 2 years of phenotyping. For both evaluation years combined, genetic gain through RCRGS reached 12.3% from cycle C0 to C3 and realized gain was 0.28 ton ha-1 per cycle with a GY from C0 (6.88 ton ha-1) to C3 (7.73 ton ha-1). RCRGS was also associated with some changes in important agronomic traits that were measured (days to heading, days to maturity, and plant height) but not selected for. To account for these changes, we recommend implementing GS together with multi-trait prediction models.
Collapse
|
11
|
Fine mapping and accurate prediction of complex traits using Bayesian Variable Selection models applied to biobank-size data. Eur J Hum Genet 2023; 31:313-320. [PMID: 35853950 PMCID: PMC9995454 DOI: 10.1038/s41431-022-01135-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 04/06/2022] [Accepted: 06/13/2022] [Indexed: 11/08/2022] Open
Abstract
Modern GWAS studies use an enormous sample size and ultra-high density SNP genotypes. These conditions reduce the mapping resolution of marginal association tests-the method most often used in GWAS. Multi-locus Bayesian Variable Selection (BVS) offers a one-stop solution for powerful and precise mapping of risk variants and polygenic risk score (PRS) prediction. We show (with an extensive simulation) that multi-locus BVS methods can achieve high power with a low false discovery rate and a much better mapping resolution than marginal association tests. We demonstrate the performance of BVS for mapping and PRS prediction using data from blood biomarkers from the UK-Biobank (~300,000 samples and ~5.5 million SNPs). The article is accompanied by open-source R-software that implement the methods used in the study and scales to biobank-sized data.
Collapse
|
12
|
Single-step genomic evaluation for growth traits in a Mexican Braunvieh cattle population. Anim Biosci 2023:ab.22.0158. [PMID: 36915917 DOI: 10.5713/ab.22.0158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 02/27/2023] [Indexed: 03/03/2023] Open
Abstract
Objective The objective was to compare (pedigree-based) BLUP, genomic BLUP (GBLUP), and single-step GBLUP (ssGBLUP) methods for genomic evaluation of growth traits in a Mexican Braunvieh cattle population. Methods Birth (BW), weaning (WW), and yearling weight (YW) data of a Mexican Braunvieh cattle population were analyzed with BLUP, GBLUP, and ssGBLUP methods. These methods are differentiated by the additive genetic relationship matrix included in the model and the animals under evaluation. The predictive ability of the model was evaluated using random partitions of the data in training and testing sets, consistently predicting about 20% of genotyped animals on all occasions. For each partition, the Pearson correlation coefficient between adjusted phenotypes for fixed effects and non-genetic random effects and the estimated breeding values (EBV) were computed. Results The random contemporary group (CG) effect explained about 50, 45, and 35% of the phenotypic variance in BW, WW, and YW, respectively. For the three methods, the CG effect explained the highest proportion of the phenotypic variances (except for YW-GBLUP). The heritability estimate obtained with GBLUP was the lowest for BW, while the highest heritability was obtained with BLUP. For WW, the highest heritability estimate was obtained with BLUP, the estimates obtained with GBLUP and ssGBLUP were similar. For YW, the heritability estimates obtained with GBLUP and BLUP were similar, and the lowest heritability was obtained with ssGBLUP. Pearson correlation coefficients between adjusted phenotypes for non-genetic effects and EBVs were the highest for BLUP, followed by ssBLUP and GBLUP. Conclusion The successful implementation of genetic evaluations that include genotyped and non-genotyped animals in our study indicate a promising method for use in genetic improvement programs of Braunvieh cattle. Our findings showed that simultaneous evaluation of genotyped and non-genotyped animals improved prediction accuracy for growth traits even with a limited number of genotyped animals.
Collapse
|
13
|
Genetic analyses of tropical maize lines under artificial infestation of fall armyworm and foliar diseases under optimum conditions. FRONTIERS IN PLANT SCIENCE 2023; 14:1086757. [PMID: 36743507 PMCID: PMC9896009 DOI: 10.3389/fpls.2023.1086757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 01/04/2023] [Indexed: 06/18/2023]
Abstract
Development and deployment of high-yielding maize varieties with native resistance to Fall armyworm (FAW), turcicum leaf blight (TLB), and gray leaf spot (GLS) infestation is critical for addressing the food insecurity in sub-Saharan Africa. The objectives of this study were to determine the inheritance of resistance for FAW, identity hybrids which in addition to FAW resistance, also show resistance to TLB and GLS, and investigate the usefulness of models based on general combining ability (GCA) and SNP markers in predicting the performance of new untested hybrids. Half-diallel mating scheme was used to generate 105 F1 hybrids from 15 parents and another 55 F1 hybrids from 11 parents. These were evaluated in two experiments, each with commercial checks in multiple locations under FAW artificial infestation and optimum management in Kenya. Under artificial FAW infestation, significant mean squares among hybrids and hybrids x environment were observed for most traits in both experiments, including at least one of the three assessments carried out for foliar damage caused by FAW. Interaction of GCA x environment and specific combining ability (SCA) x environment interactions were significant for all traits under FAW infestation and optimal conditions. Moderate to high heritability estimates were observed for GY under both management conditions. Correlation between GY and two of the three scorings (one and three weeks after infestation) for foliar damage caused by FAW were negative (-0.27 and -0.38) and significant. Positive and significant correlation (0.84) was observed between FAW-inflicted ear damage and the percentage of rotten ears. We identified many superior-performing hybrids compared to the best commercial checks for both GY and FAW resistance associated traits. Inbred lines CML312, CML567, CML488, DTPYC9-F46-1-2-1-2, CKDHL164288, CKDHL166062, and CLRCY039 had significant and positive GCA for GY (positive) and FAW resistance-associated traits (negative). CML567 was a parent in four of the top ten hybrids under optimum and FAW conditions. Both additive and non-additive gene action were important in the inheritance of FAW resistance. Both GCA and marker-based models showed high correlation with field performance, but marker-based models exhibited considerably higher correlation. The best performing hybrids identified in this study could be used as potential single cross testers in the development of three-way FAW resistance hybrids. Overall, our results provide insights that help breeders to design effective breeding strategies to develop FAW resistant hybrids that are high yielding under FAW and optimum conditions.
Collapse
|
14
|
Bayesian modelling of phosphorus content in wheat grain using hyperspectral reflectance data. PLANT METHODS 2023; 19:6. [PMID: 36670477 PMCID: PMC9854047 DOI: 10.1186/s13007-023-00980-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 12/31/2022] [Indexed: 06/17/2023]
Abstract
BACKGROUND As a result of the technological progress, the use of sensors for crop survey has substantially increased, generating valuable information for modelling agricultural data. Plant spectroscopy jointly with statistical modeling can potentially help to assess certain chemical components of interest present in plants, which may be laborious and expensive to obtain by direct measurements. In this research, the phosphorus content in wheat grain is modeled using reflectance information measured by a hyperspectral sensor at different wavelengths. A Bayesian procedure for selecting variables was used to identify the set of the most important spectral bands. Additionally, three different models were evaluated: the first model assumes that the observations are independent, the other two models assume that the observations are spatially correlated: one of the proposed models, assumes spatial dependence using a Conditionally Autoregressive Model (CAR), and the other through an exponential correlogram. The goodness of fit of the models was evaluated by means of the Deviance Information Criterion, and the predictive power is evaluated using cross validation. RESULTS We have found that CAR was the model that best fits and predicts the data. Additionally, the selection variable procedure in the CAR model reveals which wavelengths in the range of 500-690 nm are the most important. Comparing the vegetative indices with the CAR model, it was observed that the average correlation of the CAR model exceeded that of the vegetative indices by 23.26%, - 1.2% and 22.78% for the year 2010, 2011 and 2012 respectively; therefore, the use of the proposed methodology outperformed the vegetative indices in prediction. CONCLUSIONS The proposal to predict the phosphorus content in wheat grain using Bayesian approach, reflect with the results as a good alternative.
Collapse
|
15
|
Regresión cuantil para predicción de caracteres complejos en bovinos Suizo Europeo usando marcadores SNP y pedigrí. REV MEX CIENC PECU 2022. [DOI: 10.22319/rmcp.v14i1.6182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Los modelos de predicción genómica generalmente suponen que los errores se distribuyen como variables aleatorias normales, independientes e idénticamente distribuidas con media cero e igual varianza. Esto no siempre se cumple, además puede haber fenotipos distantes de la media poblacional, los que usualmente se eliminan al realizar la predicción. La regresión cuantil (QR) afronta aspectos estadísticos como alta dimensionalidad, multicolinealidad y distribución fenotípica diferente de la normal. El objetivo de este trabajo fue comparar QR utilizando información de marcadores y pedigrí con los métodos alternativos tales como mejor predicción lineal insesgada genómica (GBLUP) y mejor predicción lineal insesgada genómica en un solo paso (ssGBLUP) para analizar los pesos al nacimiento (PN), destete (PD) y al año (PA) de bovinos Suizo Europeo y datos simulados con diferente grado de asimetría y proporción de datos atípicos. La capacidad predictiva de los modelos se evaluó mediante validación cruzada. El desempeño predictivo de QR tanto sólo con información de marcadores como con marcadores más pedigrí, con el conjunto de datos reales, fue mejor que las metodologías GBLUP y ssGBLUP para PD y PA. Para PN GBLUP y ssGBLUP fueron mejores, sin embargo, solo se utilizaron los cuantiles 0.25, 0.50 y 0.75, y la distribución de PN no fue asimétrica. En el experimento de datos simulados, las correlaciones entre efectos de marcador “verdadero” y efectos estimados, así como las correlaciones de señales “verdaderas” y estimadas fueron más altas cuando se usó QR comparado con GBLUP. Las ventajas de QR fueron más notorias con distribución asimétrica de los fenotipos y con mayor proporción de datos atípicos, como fue el caso del conjunto de datos simulados.
Collapse
|
16
|
Sparse kernel models provide optimization of training set design for genomic prediction in multiyear wheat breeding data. THE PLANT GENOME 2022; 15:e20254. [PMID: 36043341 DOI: 10.1002/tpg2.20254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Accepted: 07/17/2022] [Indexed: 06/15/2023]
Abstract
The success of genomic selection (GS) in breeding schemes relies on its ability to provide accurate predictions of unobserved lines at early stages. Multigeneration data provides opportunities to increase the training data size and thus, the likelihood of extracting useful information from ancestors to improve prediction accuracy. The genomic best linear unbiased predictions (GBLUPs) are performed by borrowing information through kinship relationships between individuals. Multigeneration data usually becomes heterogeneous with complex family relationship patterns that are increasingly entangled with each generation. Under these conditions, historical data may not be optimal for model training as the accuracy could be compromised. The sparse selection index (SSI) is a method for training set (TRN) optimization, in which training individuals provide predictions to some but not all predicted subjects. We added an additional trimming process to the original SSI (trimmed SSI) to remove less important training individuals for prediction. Using a large multigeneration (8 yr) wheat (Triticum aestivum L.) grain yield dataset (n = 68,836), we found increases in accuracy as more years are included in the TRN, with improvements of ∼0.05 in the GBLUP accuracy when using 5 yr of historical data relative to when using only 1 yr. The SSI method showed a small gain over the GBLUP accuracy but with an important reduction on the TRN size. These reduced TRNs were formed with a similar number of subjects from each training generation. Our results suggest that the SSI provides a more stable ranking of genotypes than the GBLUP as the TRN becomes larger.
Collapse
|
17
|
Multi-trait Bayesian Shrinkage and Variable Selection Models with the BGLR R-package. Genetics 2022; 222:6655691. [PMID: 35924977 PMCID: PMC9434216 DOI: 10.1093/genetics/iyac112] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 07/14/2022] [Indexed: 12/02/2022] Open
Abstract
The BGLR-R package implements various types of single-trait shrinkage/variable selection Bayesian regressions. The package was first released in 2014, since then it has become a software very often used in genomic studies. We recently develop functionality for multitrait models. The implementation allows users to include an arbitrary number of random-effects terms. For each set of predictors, users can choose diffuse, Gaussian, and Gaussian–spike–slab multivariate priors. Unlike other software packages for multitrait genomic regressions, BGLR offers many specifications for (co)variance parameters (unstructured, diagonal, factor analytic, and recursive). Samples from the posterior distribution of the models implemented in the multitrait function are generated using a Gibbs sampler, which is implemented by combining code written in the R and C programming languages. In this article, we provide an overview of the models and methods implemented BGLR’s multitrait function, present examples that illustrate the use of the package, and benchmark the performance of the software.
Collapse
|
18
|
HDBRR: a statistical package for high-dimensional Bayesian ridge regression without MCMC. J STAT COMPUT SIM 2022. [DOI: 10.1080/00949655.2022.2081968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
19
|
Genetic diversity in reproductive traits of Braunvieh cattle determined with SNP markers. Vet Med Sci 2022; 8:1709-1720. [PMID: 35545927 PMCID: PMC9297803 DOI: 10.1002/vms3.836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Braunvieh is an important dual‐purpose breed in the Mexican tropics. The study of its genetic diversity is key to implementing genetic improvement programs. This study was conducted to determine genetic diversity of reproductive traits in a Mexican Braunvieh beef cattle population using single nucleotide polymorphisms in candidate genes. Information from 24 genes with 52 intra‐genic loci reported in literature to be associated with productive life, pregnancy rate and cow and heifer conception rate of 150 Braunvieh males and females was considered. Observed heterozygosity (Ho) revealed high genetic diversity for the studied traits, Ho = 0.42 ± 0.087, relative to that of other populations of the same breed. Cluster analyses were carried out using the Ward and K‐means algorithms. These analyses revealed high genetic diversity that was observed in the biplot of non‐metric multi‐dimensional scaling. It was found that clustering strategy allowed visualisation of distant groups by genotype but not by favourable alleles in all the loci. We found that the genes CSNK1E, DNAH11, DSC2, IBSP and OCLN affected most of the traits in our study and they were highly informative. Therefore, they represent a potential resource for selection and crossbreeding programs of the traits studied in Braunvieh. The analyses showed that the Mexican Braunvieh population has a high level of genetic diversity, arguably due to decades‐long adaptation to the Mexican tropics.
Collapse
|
20
|
Genome-Wide Association Study for Resistance to Tan Spot in Synthetic Hexaploid Wheat. PLANTS (BASEL, SWITZERLAND) 2022; 11:433. [PMID: 35161413 PMCID: PMC8839754 DOI: 10.3390/plants11030433] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2022] [Revised: 01/28/2022] [Accepted: 02/01/2022] [Indexed: 06/14/2023]
Abstract
Synthetic hexaploid wheat (SHW) has shown effective resistance to a diversity of diseases and insects, including tan spot, which is caused by Pyrenophora tritici-repentis, being an important foliar disease that can attack all types of wheat and several grasses. In this study, 443 SHW plants were evaluated for their resistance to tan spot under controlled environmental conditions. Additionally, a genome-wide association study was conducted by genotyping all entries with the DArTSeq technology to identify marker-trait associations for tan spot resistance. Of the 443 SHW plants, 233 showed resistant and 183 moderately resistant reactions, and only 27 were moderately susceptible or susceptible to tan spot. Durum wheat (DW) parents of the SHW showed moderately susceptible to susceptible reactions. A total of 30 significant marker-trait associations were found on chromosomes 1B (4 markers), 1D (1 marker), 2A (1 marker), 2D (2 markers), 3A (4 markers), 3D (3 markers), 4B (1 marker), 5A (4 markers), 6A (6 markers), 6B (1 marker) and 7D (3 markers). Increased resistance in the SHW in comparison to the DW parents, along with the significant association of resistance with the A and B genome, supported the concept of activating epistasis interaction across the three wheat genomes. Candidate genes coding for F-box and cytochrome P450 proteins that play significant roles in biotic stress resistance were identified for the significant markers. The identified resistant SHW lines can be deployed in wheat breeding for tan spot resistance.
Collapse
|
21
|
Response to Early Generation Genomic Selection for Yield in Wheat. FRONTIERS IN PLANT SCIENCE 2022; 12:718611. [PMID: 35087542 PMCID: PMC8787636 DOI: 10.3389/fpls.2021.718611] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 10/22/2021] [Indexed: 06/14/2023]
Abstract
We investigated increasing genetic gain for grain yield using early generation genomic selection (GS). A training set of 1,334 elite wheat breeding lines tested over three field seasons was used to generate Genomic Estimated Breeding Values (GEBVs) for grain yield under irrigated conditions applying markers and three different prediction methods: (1) Genomic Best Linear Unbiased Predictor (GBLUP), (2) GBLUP with the imputation of missing genotypic data by Ridge Regression BLUP (rrGBLUP_imp), and (3) Reproducing Kernel Hilbert Space (RKHS) a.k.a. Gaussian Kernel (GK). F2 GEBVs were generated for 1,924 individuals from 38 biparental cross populations between 21 parents selected from the training set. Results showed that F2 GEBVs from the different methods were not correlated. Experiment 1 consisted of selecting F2s with the highest average GEBVs and advancing them to form genomically selected bulks and make intercross populations aiming to combine favorable alleles for yield. F4:6 lines were derived from genomically selected bulks, intercrosses, and conventional breeding methods with similar numbers from each. Results of field-testing for Experiment 1 did not find any difference in yield with genomic compared to conventional selection. Experiment 2 compared the predictive ability of the different GEBV calculation methods in F2 using a set of single plant-derived F2:4 lines from randomly selected F2 plants. Grain yield results from Experiment 2 showed a significant positive correlation between observed yields of F2:4 lines and predicted yield GEBVs of F2 single plants from GK (the predictive ability of 0.248, P < 0.001) and GBLUP (0.195, P < 0.01) but no correlation with rrGBLUP_imp. Results demonstrate the potential for the application of GS in early generations of wheat breeding and the importance of using the appropriate statistical model for GEBV calculation, which may not be the same as the best model for inbreds.
Collapse
|
22
|
The power of genomic estimated breeding values for selection when using a finite population size in genetic improvement of tetraploid potato. G3 (BETHESDA, MD.) 2022; 12:6407142. [PMID: 34849763 PMCID: PMC8728039 DOI: 10.1093/g3journal/jkab362] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 10/08/2021] [Indexed: 12/02/2022]
Abstract
Potato breeding relies heavily on visual phenotypic scoring for clonal selection. Obtaining robust phenotypic data can be labor intensive and expensive, especially in the early cycles of a potato breeding program where the number of genotypes is very large. We have investigated the power of genomic estimated breeding values (GEBVs) for selection from a limited population size in potato breeding. We collected genotypic data from 669 tetraploid potato clones from all cycles of a potato breeding program, as well as phenotypic data for eight important breeding traits. The genotypes were partitioned into a training and a test population distinguished by cycle of selection in the breeding program. GEBVs for seven traits were predicted for individuals from the first stage of the breeding program (T1) which had not undergone any selection, or individuals selected at least once in the field (T2). An additional approach in which GEBVs were predicted within and across full-sib families from unselected material (T1) was tested for four breeding traits. GEBVs were obtained by using a Bayesian Ridge Regression model estimating single marker effects and phenotypic data from individuals at later stages of selection of the breeding program. Our results suggest that, for most traits included in this study, information from individuals from later stages of selection cannot be utilized to make selections based on GEBVs in earlier clonal generations. Predictions of GEBVs across full-sib families yielded similarly low prediction accuracies as across generations. The most promising approach for selection using GEBVs was found to be making predictions within full-sib families.
Collapse
|
23
|
Genome and Environment Based Prediction Models and Methods of Complex Traits Incorporating Genotype × Environment Interaction. Methods Mol Biol 2022; 2467:245-283. [PMID: 35451779 DOI: 10.1007/978-1-0716-2205-6_9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Genomic-enabled prediction models are of paramount importance for the successful implementation of genomic selection (GS) based on breeding values. As opposed to animal breeding, plant breeding includes extensive multienvironment and multiyear field trial data. Hence, genomic-enabled prediction models should include genotype × environment (G × E) interaction, which most of the time increases the prediction performance when the response of lines are different from environment to environment. In this chapter, we describe a historical timeline since 2012 related to advances of the GS models that take into account G × E interaction. We describe theoretical and practical aspects of those GS models, including the gains in prediction performance when including G × E structures for both complex continuous and categorical scale traits. Then, we detailed and explained the main G × E genomic prediction models for complex traits measured in continuous and noncontinuous (categorical) scale. Related to G × E interaction models this review also examine the analyses of the information generated with high-throughput phenotype data (phenomic) and the joint analyses of multitrait and multienvironment field trial data that is also employed in the general assessment of multitrait G × E interaction. The inclusion of nongenomic data in increasing the accuracy and biological reliability of the G × E approach is also outlined. We show the recent advances in large-scale envirotyping (enviromics), and how the use of mechanistic computational modeling can derive the crop growth and development aspects useful for predicting phenotypes and explaining G × E.
Collapse
|
24
|
Genetic Diversity and Population Structure for Resistance and Susceptibility to Mastitis in Braunvieh Cattle. Vet Sci 2021; 8:vetsci8120329. [PMID: 34941856 PMCID: PMC8707377 DOI: 10.3390/vetsci8120329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 12/05/2021] [Accepted: 12/12/2021] [Indexed: 11/18/2022] Open
Abstract
Mastitis is a disease that causes significant economic losses, since resistance to mastitis is a difficult trait to be improved due to its multifactorial occurrence. Therefore, our objective was to characterize a Mexican Braunvieh cattle population for genetic resistance and susceptibility to mastitis. We used 66 SNP markers for 45 candidate genes in 150 animals. The average heterozygosity was 0.445 ± 0.076, a value higher than those reported for some European breeds. The inbreeding coefficient was slightly negative for resistance to subclinical (−0.058 ± 0.055) and clinical (−0.034 ± 0.076) mastitis, possibly due to low selection for the immunological candidate genes that influence these traits. The genotypic profiles for the candidate loci per K-means group were obtained, as well as the group distribution through the graphics of the principal component analysis. The genotypic profiles showed high genetic diversity among groups. Resistance to clinical mastitis had the lowest presence of the heterozygous genotypes. Although the percentage of highly inbred animals (>50%) is up to 13.3%, there are highly heterozygous groups in terms of the studied traits, a favorable indicator of the presence of genetic diversity. The results of this study constitute evidence of the genetic potential of the Mexican Braunvieh population to improve mastitis-related traits.
Collapse
|
25
|
A Comparison of the Adoption of Genomic Selection Across Different Breeding Institutions. FRONTIERS IN PLANT SCIENCE 2021; 12:728567. [PMID: 34868114 PMCID: PMC8640095 DOI: 10.3389/fpls.2021.728567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 09/30/2021] [Indexed: 06/13/2023]
|
26
|
Multi-generation genomic prediction of maize yield using parametric and non-parametric sparse selection indices. Heredity (Edinb) 2021; 127:423-432. [PMID: 34564692 PMCID: PMC8551287 DOI: 10.1038/s41437-021-00474-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 09/10/2021] [Accepted: 09/11/2021] [Indexed: 02/07/2023] Open
Abstract
Genomic prediction models are often calibrated using multi-generation data. Over time, as data accumulates, training data sets become increasingly heterogeneous. Differences in allele frequency and linkage disequilibrium patterns between the training and prediction genotypes may limit prediction accuracy. This leads to the question of whether all available data or a subset of it should be used to calibrate genomic prediction models. Previous research on training set optimization has focused on identifying a subset of the available data that is optimal for a given prediction set. However, this approach does not contemplate the possibility that different training sets may be optimal for different prediction genotypes. To address this problem, we recently introduced a sparse selection index (SSI) that identifies an optimal training set for each individual in a prediction set. Using additive genomic relationships, the SSI can provide increased accuracy relative to genomic-BLUP (GBLUP). Non-parametric genomic models using Gaussian kernels (KBLUP) have, in some cases, yielded higher prediction accuracies than standard additive models. Therefore, here we studied whether combining SSIs and kernel methods could further improve prediction accuracy when training genomic models using multi-generation data. Using four years of doubled haploid maize data from the International Maize and Wheat Improvement Center (CIMMYT), we found that when predicting grain yield the KBLUP outperformed the GBLUP, and that using SSI with additive relationships (GSSI) lead to 5-17% increases in accuracy, relative to the GBLUP. However, differences in prediction accuracy between the KBLUP and the kernel-based SSI were smaller and not always significant.
Collapse
|
27
|
Genome-enabled prediction for sparse testing in multi-environmental wheat trials. THE PLANT GENOME 2021; 14:e20151. [PMID: 34510790 DOI: 10.1002/tpg2.20151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Accepted: 07/22/2021] [Indexed: 06/13/2023]
Abstract
Sparse testing in genome-enabled prediction in plant breeding can be emulated throughout different line allocations where some lines are observed in all environments (overlap) and others are observed in only one environment (nonoverlap). We studied three general cases of the composition of the sparse testing allocation design for genome-enabled prediction of wheat (Triticum aestivum L.) breeding: (a) completely nonoverlapping wheat lines in environments, (b) completely overlapping wheat lines in all environments, and (c) a proportion of nonoverlapping/overlapping wheat lines allocated in the environments. We also studied several cases in which the size of the testing population was systematically decreased. The study used three extensive wheat data sets (W1, W2, and W3). Three different genome-enabled prediction models (M1-M3) were used to study the effect of the sparse testing in terms of the genomic prediction accuracy. Model M1 included only main effects of environments and lines; M2 included main effects of environments, lines, and genomic effects; whereas the remaining model (M3) also incorporated the genomic × environment interaction (GE). The results show that the GE component of the genome-based model M3 captures a larger genetic variability than the main genomic effects term from models M1 and M2. In addition, model M3 provides higher prediction accuracy than models M1 and M2 for the same allocation designs (different combinations of nonoverlapping/overlapping lines in environments and training set sizes). Overlapped sets of 30-50 lines in all the environments provided stable genomic-enabled prediction accuracy. Reducing the size of the testing populations under all allocation designs decreases the prediction accuracy, which recovers when more lines are tested in all environments. Model M3 offers the possibility of maintaining the prediction accuracy throughout both extreme situations of all nonoverlapping lines and all overlapping lines.
Collapse
|
28
|
Multi-trait genomic-enabled prediction enhances accuracy in multi-year wheat breeding trials. G3-GENES GENOMES GENETICS 2021; 11:6332007. [PMID: 34568924 PMCID: PMC8496321 DOI: 10.1093/g3journal/jkab270] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 07/25/2021] [Indexed: 11/14/2022]
Abstract
Implementing genomic-based prediction models in genomic selection requires an understanding of the measures for evaluating prediction accuracy from different models and methods using multi-trait data. In this study, we compared prediction accuracy using six large multi-trait wheat data sets (quality and grain yield). The data were used to predict 1 year (testing) from the previous year (training) to assess prediction accuracy using four different prediction models. The results indicated that the conventional Pearson’s correlation between observed and predicted values underestimated the true correlation value, whereas the corrected Pearson’s correlation calculated by fitting a bivariate model was higher than the division of the Pearson’s correlation by the squared root of the heritability across traits, by 2.53–11.46%. Across the datasets, the corrected Pearson’s correlation was higher than the uncorrected by 5.80–14.01%. Overall, we found that for grain yield the prediction performance was highest using a multi-trait compared to a single-trait model. The higher the absolute genetic correlation between traits the greater the benefits of multi-trait models for increasing the genomic-enabled prediction accuracy of traits.
Collapse
|
29
|
Application of Genomic Selection at the Early Stage of Breeding Pipeline in Tropical Maize. FRONTIERS IN PLANT SCIENCE 2021; 12:685488. [PMID: 34262585 PMCID: PMC8274566 DOI: 10.3389/fpls.2021.685488] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 05/31/2021] [Indexed: 06/13/2023]
Abstract
In maize, doubled haploid (DH) line production capacity of large-sized maize breeding programs often exceeds the capacity to phenotypically evaluate the complete set of testcross candidates in multi-location trials. The ability to partially select DH lines based on genotypic data while maintaining or improving genetic gains for key traits using phenotypic selection can result in significant resource savings. The present study aimed to evaluate genomic selection (GS) prediction scenarios for grain yield and agronomic traits of one of the tropical maize breeding pipelines of CIMMYT in eastern Africa, based on multi-year empirical data for designing a GS-based strategy at the early stages of the pipeline. We used field data from 3,068 tropical maize DH lines genotyped using rAmpSeq markers and evaluated as test crosses in well-watered (WW) and water-stress (WS) environments in Kenya from 2017 to 2019. Three prediction schemes were compared: (1) 1 year of performance data to predict a second year; (2) 2 years of pooled data to predict performance in the third year, and (3) using individual or pooled data plus converting a certain proportion of individuals from the testing set (TST) to the training set (TRN) to predict the next year's data. Employing five-fold cross-validation, the mean prediction accuracies for grain yield (GY) varied from 0.19 to 0.29 under WW and 0.22 to 0.31 under WS, when the 1-year datasets were used training set to predict a second year's data as a testing set. The mean prediction accuracies increased to 0.32 under WW and 0.31 under WS when the 2-year datasets were used as a training set to predict the third-year data set. In a forward prediction scenario, good predictive abilities (0.53 to 0.71) were found when the training set consisted of the previous year's breeding data and converting 30% of the next year's data from the testing set to the training set. The prediction accuracy for anthesis date and plant height across WW and WS environments obtained using 1-year data and integrating 10, 30, 50, 70, and 90% of the TST set to TRN set was much higher than those trained in individual years. We demonstrate that by increasing the TRN set to include genotypic and phenotypic data from the previous year and combining only 10-30% of the lines from the year of testing, the predicting accuracy can be increased, which in turn could be used to replace the first stage of field-based screening partially, thus saving significant costs associated with the testcross formation and multi-location testcross evaluation.
Collapse
|
30
|
Abstract
Genomic selection (GS) is a technology used for genetic improvement, and it has many advantages over phenotype-based selection. There are several statistical models that adequately approach the statistical challenges in GS, such as in linear mixed models (LMMs). An active area of research is the development of software for fitting LMMs mainly used to make genome-based predictions. The lme4 is the standard package for fitting linear and generalized LMMs in the R-package, but its use for genetic analysis is limited because it does not allow the correlation between individuals or groups of individuals to be defined. This article describes the new lme4GS package for R, which is focused on fitting LMMs with covariance structures defined by the user, bandwidth selection, and genomic prediction. The new package is focused on genomic prediction of the models used in GS and can fit LMMs using different variance–covariance matrices. Several examples of GS models are presented using this package as well as the analysis using real data.
Collapse
|
31
|
Author Correction: A data-driven simulation platform to predict cultivars' performances under uncertain weather conditions. Nat Commun 2021; 12:1966. [PMID: 33762573 PMCID: PMC7990950 DOI: 10.1038/s41467-021-22384-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
32
|
Additive genetic variance and covariance between relatives in synthetic wheat crosses with variable parental ploidy levels. Genetics 2021; 217:iyaa048. [PMID: 33724416 PMCID: PMC8045691 DOI: 10.1093/genetics/iyaa048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Accepted: 12/17/2020] [Indexed: 12/04/2022] Open
Abstract
Cultivated bread wheat (Triticum aestivum L.) is an allohexaploid species resulting from the natural hybridization and chromosome doubling of allotetraploid durum wheat (T. turgidum) and a diploid goatgrass Aegilops tauschii Coss (Ae. tauschii). Synthetic hexaploid wheat (SHW) was developed through the interspecific hybridization of Ae. tauschii and T. turgidum, and then crossed to T. aestivum to produce synthetic hexaploid wheat derivatives (SHWDs). Owing to this founding variability, one may infer that the genetic variances of native wild populations vs improved wheat may vary due to their differential origin and evolutionary history. In this study, we partitioned the additive variance of SHW and SHWD with respect to their breed origin by fitting a hierarchical Bayesian model with heterogeneous covariance structure for breeding values to estimate variance components for each breed category, and segregation variance. Two data sets were used to test the proposed hierarchical Bayesian model, one from a multi-year multi-location field trial of SHWD and the other comprising the two species of SHW. For the SHWD, the Bayesian estimates of additive variances of grain yield from each breed category were similar for T. turgidum and Ae. tauschii, but smaller for T. aestivum. Segregation variances between Ae. tauschii-T. aestivum and T. turgidum-T. aestivum populations explained a sizable proportion of the phenotypic variance. Bayesian additive variance components and the Best Linear Unbiased Predictors (BLUPs) estimated by two well-known software programs were similar for multi-breed origin and for the sum of the breeding values by origin for both data sets. Our results support the suitability of models with heterogeneous additive genetic variances to predict breeding values in wheat crosses with variable ploidy levels.
Collapse
|
33
|
Abstract
BACKGROUND Several conventional genomic Bayesian (or no Bayesian) prediction methods have been proposed including the standard additive genetic effect model for which the variance components are estimated with mixed model equations. In recent years, deep learning (DL) methods have been considered in the context of genomic prediction. The DL methods are nonparametric models providing flexibility to adapt to complicated associations between data and output with the ability to adapt to very complex patterns. MAIN BODY We review the applications of deep learning (DL) methods in genomic selection (GS) to obtain a meta-picture of GS performance and highlight how these tools can help solve challenging plant breeding problems. We also provide general guidance for the effective use of DL methods including the fundamentals of DL and the requirements for its appropriate use. We discuss the pros and cons of this technique compared to traditional genomic prediction approaches as well as the current trends in DL applications. CONCLUSIONS The main requirement for using DL is the quality and sufficiently large training data. Although, based on current literature GS in plant and animal breeding we did not find clear superiority of DL in terms of prediction power compared to conventional genome based prediction models. Nevertheless, there are clear evidences that DL algorithms capture nonlinear patterns more efficiently than conventional genome based. Deep learning algorithms are able to integrate data from different sources as is usually needed in GS assisted breeding and it shows the ability for improving prediction accuracy for large plant breeding data. It is important to apply DL to large training-testing data sets.
Collapse
|
34
|
Target Population of Environments for Wheat Breeding in India: Definition, Prediction and Genetic Gains. FRONTIERS IN PLANT SCIENCE 2021; 12:638520. [PMID: 34108977 PMCID: PMC8181127 DOI: 10.3389/fpls.2021.638520] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 04/06/2021] [Indexed: 05/02/2023]
Abstract
In this study, we defined the target population of environments (TPE) for wheat breeding in India, the largest wheat producer in South Asia, and estimated the correlated response to the selection and prediction ability of five selection environments (SEs) in Mexico. We also estimated grain yield (GY) gains in each TPE. Our analysis used meteorological, soil, and GY data from the international Elite Spring Wheat Yield Trials (ESWYT) distributed by the International Maize and Wheat Improvement Center (CIMMYT) from 2001 to 2016. We identified three TPEs: TPE 1, the optimally irrigated Northwestern Plain Zone; TPE 2, the optimally irrigated, heat-stressed North Eastern Plains Zone; and TPE 3, the drought-stressed Central-Peninsular Zone. The correlated response to selection ranged from 0.4 to 0.9 within each TPE. The highest prediction accuracies for GY per TPE were derived using models that included genotype-by-environment interaction and/or meteorological information and their interaction with the lines. The highest prediction accuracies for TPEs 1, 2, and 3 were 0.37, 0.46, and 0.51, respectively, and the respective GY gains were 118, 46, and 123 kg/ha/year. These results can help fine-tune the breeding of elite wheat germplasm with stable yields to reduce farmers' risk from year-to-year environmental variation in India's wheat lands, which cover 30 million ha, account for 100 million tons of grain or more each year, and provide food and livelihoods for hundreds of millions of farmers and consumers in South Asia.
Collapse
|
35
|
Approximate Genome-Based Kernel Models for Large Data Sets Including Main Effects and Interactions. Front Genet 2020; 11:567757. [PMID: 33193659 PMCID: PMC7594507 DOI: 10.3389/fgene.2020.567757] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2020] [Accepted: 08/28/2020] [Indexed: 11/23/2022] Open
Abstract
The rapid development of molecular markers and sequencing technologies has made it possible to use genomic prediction (GP) and selection (GS) in animal and plant breeding. However, when the number of observations (n) is large (thousands or millions), computational difficulties when handling these large genomic kernel relationship matrices (inverting and decomposing) increase exponentially. This problem increases when genomic × environment interaction and multi-trait kernels are included in the model. In this research we propose selecting a small number of lines m(m < n) for constructing an approximate kernel of lower rank than the original and thus exponentially decreasing the required computing time. First, we describe the full genomic method for single environment (FGSE) with a covariance matrix (kernel) including all n lines. Second, we select m lines and approximate the original kernel for the single environment model (APSE). Similarly, but including main effects and G × E, we explain a full genomic method with genotype × environment model (FGGE), and including m lines, we approximated the kernel method with G × E (APGE). We applied the proposed method to two different wheat data sets of different sizes (n) using the standard linear kernel Genomic Best Linear Unbiased Predictor (GBLUP) and also using eigen value decomposition. In both data sets, we compared the prediction performance and computing time for FGSE versus APSE; we also compared FGGE versus APGE. Results showed a competitive prediction performance of the approximated methods with a significant reduction in computing time. Genomic prediction accuracy depends on the decay of the eigenvalues (amount of variance information loss) of the original kernel as well as on the size of the selected lines m.
Collapse
|
36
|
Genomic prediction across years in a maize doubled haploid breeding program to accelerate early-stage testcross testing. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2020; 133:2869-2879. [PMID: 32607592 PMCID: PMC7782462 DOI: 10.1007/s00122-020-03638-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 06/16/2020] [Indexed: 05/20/2023]
Abstract
Genomic selection with a multiple-year training population dataset could accelerate early-stage testcross testing by skipping the first-stage yield testing, which significantly saves the time and cost of early-stage testcross testing. With the development of doubled haploid (DH) technology, the main task for a maize breeder is to estimate the breeding values of thousands of DH lines annually. In early-stage testcross testing, genomic selection (GS) offers the opportunity of replacing expensive multiple-environment phenotyping and phenotypic selection with lower-cost genotyping and genomic estimated breeding value (GEBV)-based selection. In the present study, a total of 1528 maize DH lines, phenotyped in multiple-environment trials in three consecutive years and genotyped with a low-cost per-sample genotyping platform of rAmpSeq, were used to explore how to implement GS to accelerate early-stage testcross testing. Results showed that the average prediction accuracy estimated from the cross-validation schemes was above 0.60 across all the scenarios. The average prediction accuracies estimated from the independent validation schemes ranged from 0.23 to 0.32 across all the scenarios, when the one-year datasets were used as training population (TRN) to predict the other year data as testing population (TST). The average prediction accuracies increased to a range from 0.31 to 0.42 across all the scenarios, when the two-years datasets were used as TRN. The prediction accuracies increased to a range from 0.50 to 0.56, when the TRN consisted of two-years of breeding data and 50% of third year's data converted from TST to TRN. This information showed that GS with a multiple-year TRN set offers the opportunity to accelerate early-stage testcross testing by skipping the first-stage yield testing, which significantly saves the time and cost of early-stage testcross testing.
Collapse
|
37
|
|
38
|
Litterfall Hg deposition to an oak forest soil from southwestern Europe. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2020; 269:110858. [PMID: 32561026 DOI: 10.1016/j.jenvman.2020.110858] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 05/22/2020] [Accepted: 05/25/2020] [Indexed: 06/11/2023]
Abstract
Litterfall constitutes one of the main vectors for mercury (Hg) transfer to forested ecosystems, so we studied the deposition of Hg through senescent vegetation (oak leaves, twigs and miscellaneous) in a deciduous forest plot of Southwest Europe dominated by Quercus robur in 2015 and 2016. Total Hg concentrations increased in the following order: bole wood (1.4 μg kg-1) < bark (8.3 μg kg-1) < twigs (12.2 μg kg-1) < miscellaneous (36.0 μg kg-1) < oak leaves (39.3 μg kg-1) < mineral soil (42.4 μg kg-1) < Oi horizons (48.7 μg kg-1) < Oe + Oa horizons (71.6 μg kg-1). Mercury accumulation rates in oak leaves during the growing season were 0.15-0.18 μg kg-1 day-1. Mercury deposition fluxes were 26 and 21 μg m-2 yr-1 for 2015 and 2016, respectively, with oak leaves being the fraction that contributed the most. Mercury determination in litterfall sorted biomass fractions lead to a more accurate estimation of the total annual Hg deposition fluxes through litterfall. Higher Hg content was obtained for organic horizons (average of 60.2 μg kg-1) than for mineral soil (mean of 42.4 μg kg-1), but the soil Hg pool was higher in the latter. The results confirmed the necessity of taking into account the Hg pool in the deeper mineral soil layers as they accumulate substantial quantities of Hg associated to organic C and Al compounds, preventing its mobilization to other compartments of the terrestrial ecosystems.
Collapse
|
39
|
Genomic Prediction with Genotype by Environment Interaction Analysis for Kernel Zinc Concentration in Tropical Maize Germplasm. G3 (BETHESDA, MD.) 2020; 10:2629-2639. [PMID: 32482728 PMCID: PMC7407456 DOI: 10.1534/g3.120.401172] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2020] [Accepted: 05/28/2020] [Indexed: 01/25/2023]
Abstract
Zinc (Zn) deficiency is a major risk factor for human health, affecting about 30% of the world's population. To study the potential of genomic selection (GS) for maize with increased Zn concentration, an association panel and two doubled haploid (DH) populations were evaluated in three environments. Three genomic prediction models, M (M1: Environment + Line, M2: Environment + Line + Genomic, and M3: Environment + Line + Genomic + Genomic x Environment) incorporating main effects (lines and genomic) and the interaction between genomic and environment (G x E) were assessed to estimate the prediction ability (rMP ) for each model. Two distinct cross-validation (CV) schemes simulating two genomic prediction breeding scenarios were used. CV1 predicts the performance of newly developed lines, whereas CV2 predicts the performance of lines tested in sparse multi-location trials. Predictions for Zn in CV1 ranged from -0.01 to 0.56 for DH1, 0.04 to 0.50 for DH2 and -0.001 to 0.47 for the association panel. For CV2, rMP values ranged from 0.67 to 0.71 for DH1, 0.40 to 0.56 for DH2 and 0.64 to 0.72 for the association panel. The genomic prediction model which included G x E had the highest average rMP for both CV1 (0.39 and 0.44) and CV2 (0.71 and 0.51) for the association panel and DH2 population, respectively. These results suggest that GS has potential to accelerate breeding for enhanced kernel Zn concentration by facilitating selection of superior genotypes.
Collapse
|
40
|
Genome-based prediction of Bayesian linear and non-linear regression models for ordinal data. THE PLANT GENOME 2020; 13:e20021. [PMID: 33016610 DOI: 10.1002/tpg2.20021] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Revised: 03/21/2020] [Accepted: 03/28/2020] [Indexed: 06/11/2023]
Abstract
Linear and non-linear models used in applications of genomic selection (GS) can fit different types of responses (e.g., continuous, ordinal, binary). In recent years, several genomic-enabled prediction models have been developed for predicting complex traits in genomic-assisted animal and plant breeding. These models include linear, non-linear and non-parametric models, mostly for continuous responses and less frequently for categorical responses. Several linear and non-linear models are special cases of a more general family of statistical models known as artificial neural networks, which provide better prediction ability than other models. In this paper, we propose a Bayesian Regularized Neural Network (BRNNO) for modelling ordinal data. The proposed model was fitted using a Bayesian framework; we used the data augmentation algorithm to facilitate computations. The proposed model was fitted using the Gibbs Maximum a Posteriori and Generalized EM algorithm implemented by combining code written in C and R programming languages. The new model was tested with two real maize datasets evaluated for Septoria and GLS diseases and was compared with the Bayesian Ordered Probit Model (BOPM). Results indicated that the BRNNO model performed better in terms of genomic-based prediction than the BOPM model.
Collapse
|
41
|
Strategies for Effective Use of Genomic Information in Crop Breeding Programs Serving Africa and South Asia. FRONTIERS IN PLANT SCIENCE 2020; 11:353. [PMID: 32292411 PMCID: PMC7119190 DOI: 10.3389/fpls.2020.00353] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 03/10/2020] [Indexed: 05/20/2023]
Abstract
Much of the world's population growth will occur in regions where food insecurity is prevalent, with large increases in food demand projected in regions of Africa and South Asia. While improving food security in these regions will require a multi-faceted approach, improved performance of crop varieties in these regions will play a critical role. Current rates of genetic gain in breeding programs serving Africa and South Asia fall below rates achieved in other regions of the world. Given resource constraints, increased genetic gain in these regions cannot be achieved by simply expanding the size of breeding programs. New approaches to breeding are required. The Genomic Open-source Breeding informatics initiative (GOBii) and Excellence in Breeding Platform (EiB) are working with public sector breeding programs to build capacity, develop breeding strategies, and build breeding informatics capabilities to enable routine use of new technologies that can improve the efficiency of breeding programs and increase genetic gains. Simulations evaluating breeding strategies indicate cost-effective implementations of genomic selection (GS) are feasible using relatively small training sets, and proof-of-concept implementations have been validated in the International Maize and Wheat Improvement Center (CIMMYT) maize breeding program. Progress on GOBii, EiB, and implementation of GS in CIMMYT and International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) breeding programs are discussed, as well as strategies for routine implementation of GS in breeding programs serving Africa and South Asia.
Collapse
|
42
|
Impact of Social Isolation Due to COVID-19 on Health in Older People: Mental and Physical Effects and Recommendations. J Nutr Health Aging 2020; 24:938-947. [PMID: 33155618 PMCID: PMC7597423 DOI: 10.1007/s12603-020-1469-2] [Citation(s) in RCA: 127] [Impact Index Per Article: 31.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 06/10/2020] [Indexed: 12/27/2022]
Abstract
OBJECTIVES To review the impact of social isolation during COVID-19 pandemic on mental and physical health of older people and the recommendations for patients, caregivers and health professionals. DESIGN Narrative review. SETTING Non-institutionalized community-living people. PARTICIPANTS 20.069 individuals from ten descriptive cross-sectional papers. MEASUREMENTS Articles since 2019 to 2020 published on Pubmed, Scielo and Google Scholar databases with the following MeSh terms ('COVID-19', 'coronavirus', 'aging', 'older people', 'elderly', 'social isolation' and 'quarantine') in English, Spanish or Portuguese were included. The studies not including people over 60 were excluded. Guidelines, recommendations, and update documents from different international organizations related to mental and physical activity were also analysed. RESULTS 41 documents have been included in this narrative review, involving a total of 20.069 individuals (58% women), from Asia, Europe and America. 31 articles included recommendations and 10 addressed the impact of social distancing on mental or physical health. The main outcomes reported were anxiety, depression, poor sleep quality and physical inactivity during the isolation period. Cognitive strategies and increasing physical activity levels using apps, online videos, telehealth, are the main international recommendations. CONCLUSION Mental and physical health in older people are negatively affected during the social distancing for COVID-19. Therefore, a multicomponent program with exercise and psychological strategies are highly recommended for this population during the confinement. Future investigations are necessary in this field.
Collapse
|
43
|
Deep Kernel and Deep Learning for Genome-Based Prediction of Single Traits in Multienvironment Breeding Trials. Front Genet 2019; 10:1168. [PMID: 31921277 PMCID: PMC6913188 DOI: 10.3389/fgene.2019.01168] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 10/23/2019] [Indexed: 11/13/2022] Open
Abstract
Deep learning (DL) is a promising method for genomic-enabled prediction. However, the implementation of DL is difficult because many hyperparameters (number of hidden layers, number of neurons, learning rate, number of epochs, batch size, etc.) need to be tuned. For this reason, deep kernel methods, which only require defining the number of layers, may be an attractive alternative. Deep kernel methods emulate DL models with a large number of neurons, but are defined by relatively easily computed covariance matrices. In this research, we compared the genome-based prediction of DL to a deep kernel (arc-cosine kernel, AK), to the commonly used non-additive Gaussian kernel (GK), as well as to the conventional additive genomic best linear unbiased predictor (GBLUP/GB). We used two real wheat data sets for benchmarking these methods. On average, AK and GK outperformed DL and GB. The gain in terms of prediction performance of AK and GK over DL and GB was not large, but AK and GK have the advantage that only one parameter, the number of layers (AK) or the bandwidth parameter (GK), has to be tuned in each method. Furthermore, although AK and GK had similar performance, deep kernel AK is easier to implement than GK, since the parameter "number of layers" is more easily determined than the bandwidth parameter of GK. Comparing AK and DL for the data set of year 2015-2016, the difference in performance of the two methods was bigger, with AK predicting much better than DL. On this data, the optimization of the hyperparameters for DL was difficult and the finally used parameters may have been suboptimal. Our results suggest that AK is a good alternative to DL with the advantage that practically no tuning process is required.
Collapse
|
44
|
Empirical Comparison of Tropical Maize Hybrids Selected Through Genomic and Phenotypic Selections. FRONTIERS IN PLANT SCIENCE 2019; 10:1502. [PMID: 31824533 PMCID: PMC6883373 DOI: 10.3389/fpls.2019.01502] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 10/29/2019] [Indexed: 05/20/2023]
Abstract
Genomic selection predicts the genomic estimated breeding values (GEBVs) of individuals not previously phenotyped. Several studies have investigated the accuracy of genomic predictions in maize but there is little empirical evidence on the practical performance of lines selected based on phenotype in comparison with those selected solely on GEBVs in advanced testcross yield trials. The main objectives of this study were to (1) empirically compare the performance of tropical maize hybrids selected through phenotypic selection (PS) and genomic selection (GS) under well-watered (WW) and managed drought stress (WS) conditions in Kenya, and (2) compare the cost-benefit analysis of GS and PS. For this study, we used two experimental maize data sets (stage I and stage II yield trials). The stage I data set consisted of 1492 doubled haploid (DH) lines genotyped with rAmpSeq SNPs. A subset of these lines (855) representing various DH populations within the stage I cohort was crossed with an individual single-cross tester chosen to complement each population. These testcross hybrids were evaluated in replicated trials under WW and WS conditions for grain yield and other agronomic traits, while the remaining 637 DH lines were predicted using the 855 lines as a training set. The second data set (stage II) consists of 348 DH lines from the first data set. Among these 348 best DH lines, 172 lines selected were solely based on GEBVs, and 176 lines were selected based on phenotypic performance. Each of the 348 DH lines were crossed with three common testers from complementary heterotic groups, and the resulting 1042 testcross hybrids and six commercial checks were evaluated in four to five WW locations and one WS condition in Kenya. For stage I trials, the cross-validated prediction accuracy for grain yield was 0.67 and 0.65 under WW and WS conditions, respectively. We found similar responses to selection using PS and GS for grain yield other agronomic traits under WW and WS conditions. The top 15% of hybrids advanced through GS and PS gave 21%-23% higher grain yield under WW and 51%-52% more grain yield under WS than the mean of the checks. The GS reduced the cost by 32% over the PS with similar selection gains. We concluded that the use of GS for yield under WW and WS conditions in maize can produce selection candidates with similar performance as those generated from conventional PS, but at a lower cost, and therefore, should be incorporated into maize breeding pipelines to increase breeding program efficiency.
Collapse
|
45
|
Patterns of Oviposition and Feeding in the Monophagous Fly Anastrepha spatulata (Diptera: Tephritidae) on its Larval Host Plant Schoepfia schreberi. ENVIRONMENTAL ENTOMOLOGY 2019; 48:1178-1186. [PMID: 31298272 DOI: 10.1093/ee/nvz088] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Indexed: 06/10/2023]
Abstract
Monophagous insects that use discrete resources for oviposition and feeding are especially sensitive to variations in host quality and availability because their opportunities to find these resources are scarce. The monophagous tephritid fly Anastrepha spatulata Stone is a tephritid fly that uses as hosts the fruits of the non-economically important Schoepfia schreberi J. F. Gmel. Scant information of host utilization behavior in the field is available for this species. Wild individually marked flies were observed during the fruiting season. Observations of oviposition, feeding and resting on three trees were taken hourly from 0900 to 1800 hours on days with benign weather. Our results suggest that females can use fruits for oviposition or for feeding according to a temporal scale. Females were significantly more likely to feed on smaller hosts and oviposit in larger ones. Additionally, individual variation in host patch exploitation was detected. However, females that fed on a natural food source such as host fruit juice oviposited fewer eggs than females provided an artificial diet of sucrose and hydrolyzed yeast. Results indicate that females use different foraging tactics during the fruiting season and confirm that, in this case, the host plant is not the center of activity.
Collapse
|
46
|
Abstract
The presence of earthworm species in crop fields is as old as agriculture itself. The earthworms Pontoscolex corethrurus (invasive) and Balanteodrilus pearsei (native) are associated with the emergence of agriculture and sedentism in the region Amazon and Maya, respectively. Both species have shifted their preference from their natural habitat to the cropland niche. They contrast in terms of intensification of agricultural land use (anthropic impact to the symbiotic soil microbiome). P. corethrurus inhabits conventional agroecosystems, while B. pearsei thrives in traditional agroecosystems, i.e., P. corethrurus has not yet been recorded in soils where B. pearsei dwells. The demographic behavior of these two earthworm species was assessed in the laboratory over 100 days, according to their origin (OE; P. corethrurus and B. pearsei) food quality (FQ; soil only, maize stubble, Mucuna pruriens), and soil moisture (SM; 25, 33, 42%). The results showed that OE, FQ, SM, and the OE x FQ interaction were highly significant for the survival, growth, and reproduction of earthworms. P. corethrurus showed a lower survival rate (> mortality). P. corethrurus survivors fed a diet of low-to-intermediate nutritional quality (soil and stubble maize, respectively) showed a greater capacity to grow and reproduce; however, it was surpassed by the native earthworm when fed a high-quality diet (M. pruriens). Besides, P. corethrurus displayed a low cocoon hatching (emergence of juveniles). These results suggest that the presence of the invasive species was associated with a negative interaction with the soil microbiota where the native species dwells, and with the absence of natural mutualistic bacteria (gut, nephridia, and cocoons). These results are consistent with the absence of P. corethrurus in milpa and pasture-type agricultural niches managed by peasants (agroecologists) to grow food regularly through biological soil management. Results reported here suggest that P. corethrurus is an invasive species that is neither wild nor domesticated, that is, its eco-evolutionary phylogeny needs to be derived based on its symbionts.
Collapse
|
47
|
SNP and Haplotype-Based Genomic Selection of Quantitative Traits in Eucalyptus globulus. PLANTS 2019; 8:plants8090331. [PMID: 31492041 PMCID: PMC6783840 DOI: 10.3390/plants8090331] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 09/02/2019] [Accepted: 09/03/2019] [Indexed: 01/02/2023]
Abstract
Eucalyptus globulus (Labill.) is one of the most important cultivated eucalypts in temperate and subtropical regions and has been successfully subjected to intensive breeding. In this study, Bayesian genomic models that include the effects of haplotype and single nucleotide polymorphisms (SNP) were assessed to predict quantitative traits related to wood quality and tree growth in a 6-year-old breeding population. To this end, the following markers were considered: (a) ~14 K SNP markers (SNP), (b) ~3 K haplotypes (HAP), and (c) haplotypes and SNPs that were not assigned to a haplotype (HAP-SNP). Predictive ability values (PA) were dependent on the genomic prediction models and markers. On average, Bayesian ridge regression (BRR) and Bayes C had the highest PA for the majority of traits. Notably, genomic models that included the haplotype effect (either HAP or HAP-SNP) significantly increased the PA of low-heritability traits. For instance, BRR based on HAP had the highest PA (0.58) for stem straightness. Consistently, the heritability estimates from genomic models were higher than the pedigree-based estimates for these traits. The results provide additional perspectives for the implementation of genomic selection in Eucalyptus breeding programs, which could be especially beneficial for improving traits with low heritability.
Collapse
|
48
|
Deep Kernel for Genomic and Near Infrared Predictions in Multi-environment Breeding Trials. G3 (BETHESDA, MD.) 2019; 9:2913-2924. [PMID: 31289023 PMCID: PMC6723142 DOI: 10.1534/g3.119.400493] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2019] [Accepted: 07/04/2019] [Indexed: 01/15/2023]
Abstract
Kernel methods are flexible and easy to interpret and have been successfully used in genomic-enabled prediction of various plant species. Kernel methods used in genomic prediction comprise the linear genomic best linear unbiased predictor (GBLUP or GB) kernel, and the Gaussian kernel (GK). In general, these kernels have been used with two statistical models: single-environment and genomic × environment (GE) models. Recently near infrared spectroscopy (NIR) has been used as an inexpensive and non-destructive high-throughput phenotyping method for predicting unobserved line performance in plant breeding trials. In this study, we used a non-linear arc-cosine kernel (AK) that emulates deep learning artificial neural networks. We compared AK prediction accuracy with the prediction accuracy of GB and GK kernel methods in four genomic data sets, one of which also includes pedigree and NIR information. Results show that for all four data sets, AK and GK kernels achieved higher prediction accuracy than the linear GB kernel for the single-environment and GE multi-environment models. In addition, AK achieved similar or slightly higher prediction accuracy than the GK kernel. For all data sets, the GE model achieved higher prediction accuracy than the single-environment model. For the data set that includes pedigree, markers and NIR, results show that the NIR wavelength alone achieved lower prediction accuracy than the genomic information alone; however, the pedigree plus NIR information achieved only slightly lower prediction accuracy than the marker plus the NIR high-throughput data.
Collapse
|
49
|
Copper and zinc in rhizospheric soil of wild plants growing in long-term acid vineyard soils. Insights on availability and metal remediation. THE SCIENCE OF THE TOTAL ENVIRONMENT 2019; 672:389-399. [PMID: 30965255 DOI: 10.1016/j.scitotenv.2019.03.301] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Revised: 02/21/2019] [Accepted: 03/20/2019] [Indexed: 06/09/2023]
Abstract
Total and available Cu and Zn levels were assessed in plant biomass, as well as in two rhizosphere fractions (tightly adhering rhizosphere (TAR), and loosely adhering rhizosphere (LAR)), in wild plants species from vineyard soils. Both TAR and LAR fractions were enriched in total Cu and Zn (1.7 and 1.6 times, respectively), and in available Cu and Zn (2.2 and 19.5 times, respectively), with the former being significantly higher for TAR than for LAR fractions. Mean values for total Cu accumulation in root and aerial biomass of the studied wild plants were 84 and 66 mg kg-1, respectively, being 57 and 79 mg kg-1 for Zn. No correlations were found among metal contents in plant biomass and available Cu and Zn concentrations in the rhizosphere fractions. Translocation factor (TF) values for Zn (range 1.0-3.5) indicate preferential accumulation in the aerial biomass in all the studied wild plants. On the contrary, TF for Cu shows a greater variability, depending on plant species, and ranging from 0.2 to 5.9. Regarding bioaccumulation factor (BAF), ranges were 0.03-0.27 and 0.13-0.58, for Cu and Zn, respectively. Results suggest that D. sanguinalis, P. hieracioides, S. viridis, and T. barbata could be useful for Cu remediation in the studied soils, by means of phytostabilization processes.
Collapse
|
50
|
Maize responsiveness to Azospirillum brasilense: Insights into genetic control, heterosis and genomic prediction. PLoS One 2019; 14:e0217571. [PMID: 31173600 PMCID: PMC6555527 DOI: 10.1371/journal.pone.0217571] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2019] [Accepted: 05/14/2019] [Indexed: 12/30/2022] Open
Abstract
Several studies have shown differences in the abilities of maize genotypes to facilitate or impede Azospirillum brasilense colonization and to receive benefits from this association. Hence, our aim was to study the genetic control, heterosis effect and the prediction accuracy of the shoot and root traits of maize in response to A. brasilense. For that, we evaluated 118 hybrids under two contrasting scenarios: i) N stress (control) and ii) N stress plus A. brasilense inoculation. The diallel analyses were performed using mixed model equations, and the genomic prediction models accounted for the general and specific combining ability (GCA and SCA, respectively) and the presence or not of G×E effects. In addition, the genomic models were fitted considering parametric (G-BLUP) and semi-parametric (RKHS) kernels. The genotypes showed significant inoculation effect for five root traits, and the GCA and SCA were significant for both. The GCA in the inoculated treatment presented a greater magnitude than the control, whereas the opposite was observed for SCA. Heterosis was weakly influenced by the inoculation, and the heterozygosity and N status in the plant can have a role in the benefits that can be obtained from this Plant Growth-Promoting Bacteria (PGPB). Prediction accuracies for N stress plus A. brasilense ranged from 0.42 to 0.78, depending on the scenario and trait, and were higher, in most cases, than the non-inoculated treatment. Finally, our findings provide an understanding of the quantitative variation of maize responsiveness to A. brasilense and important insights to be applied in maize breeding aiming the development of superior hybrids for this association.
Collapse
|