1
|
Nazzicari N, Franguelli N, Ferrari B, Pecetti L, Annicchiarico P. The Effect of Genome Parametrization and SNP Marker Subsetting on Genomic Selection in Autotetraploid Alfalfa. Genes (Basel) 2024; 15:449. [PMID: 38674384 PMCID: PMC11050091 DOI: 10.3390/genes15040449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 03/21/2024] [Accepted: 03/27/2024] [Indexed: 04/28/2024] Open
Abstract
BACKGROUND Alfalfa, the most economically important forage legume worldwide, features modest genetic progress due to long selection cycles and the extent of the non-additive genetic variance associated with its autotetraploid genome. METHODS To improve the efficiency of genomic selection in alfalfa, we explored the effects of genome parametrization (as tetraploid and diploid dosages, plus allele ratios) and SNP marker subsetting (all available SNPs, only genic regions, and only non-genic regions) on genomic regressions, together with various levels of filtering on reading depth and missing rates. We used genotyping by sequencing-generated data and focused on traits of different genetic complexity, i.e., dry biomass yield in moisture-favorable (FE) and drought stress (SE) environments, leaf size, and the onset of flowering, which were assessed in 143 genotyped plants from a genetically broad European reference population and their phenotyped half-sib progenies. RESULTS On average, the allele ratio improved the predictive ability compared with other genome parametrizations (+7.9% vs. tetraploid dosage, +12.6% vs. diploid dosage), while using all the SNPs offered an advantage compared with any specific SNP subsetting (+3.7% vs. genic regions, +7.6% vs. non-genic regions). However, when focusing on specific traits, different combinations of genome parametrization and subsetting achieved better performances. We also released Legpipe2, an SNP calling pipeline tailored for reduced representation (GBS, RAD) in medium-sized genotyping experiments.
Collapse
Affiliation(s)
- Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Center for Animal Production and Aquaculture, Viale Piacenza 29, 26900 Lodi, Italy
| | | | | | | | | |
Collapse
|
2
|
Crosta M, Romani M, Nazzicari N, Ferrari B, Annicchiarico P. Genomic prediction and allele mining of agronomic and morphological traits in pea ( Pisum sativum) germplasm collections. Front Plant Sci 2023; 14:1320506. [PMID: 38186592 PMCID: PMC10766761 DOI: 10.3389/fpls.2023.1320506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 11/30/2023] [Indexed: 01/09/2024]
Abstract
Well-performing genomic prediction (GP) models for polygenic traits and molecular marker sets for oligogenic traits could be useful for identifying promising genetic resources in germplasm collections, setting core collections, and establishing molecular variety distinction. This study aimed at (i) defining GP models and key marker sets for predicting 15 agronomic or morphological traits in germplasm collections, (ii) verifying the GP model usefulness also for selection in breeding programs, (iii) investigating the consistency between molecular and phenotypic diversity patterns, and (iv) identifying genomic regions associated with to the target traits. The study was based on phenotyping data and over 41,000 genotyping-by-sequencing-generated SNP markers of 220 landraces or old cultivars belonging to a world germplasm collection and 11 modern cultivars. Non-metric multi-dimensional scaling (NMDS) and an analysis of population genetic structure indicated a high level of genetic differentiation of material from Western Asia, a major West-East diversity gradient, and quite limited genetic diversity of the improved germplasm. Mantel's test revealed a low correlation (r = 0.12) between phenotypic and molecular diversity, which increased (r = 0.45) when considering only the molecular diversity relative to significant SNPs from genome-wide association analyses. These analyses identified, inter alia, several areas of chromosome 6 involved in a largely pleiotropic control of vegetative or reproductive organ pigmentation. We found various significant SNPs for grain and straw yield under severe drought and onset of flowering, and one SNP on chromosome 5 for grain protein content. GP models displayed moderately high predictive ability (0.43 to 0.61) for protein content, grain and straw yield, and onset of flowering, and high predictive ability (0.76) for individual seed weight, based on intra-population, intra-environment cross-validations. The inter-population, inter-environment assessment of the models trained on the germplasm collection for breeding material of three recombinant inbred line (RIL) populations, which was challenged by much narrower diversity of the material, over eight-fold less available markers and quite different test environments, led to an overall loss of predictive ability of about 40% for seed weight, 50% for protein content and straw yield, and 60% for onset of flowering, and no prediction for grain yield. Within-RIL population predictive ability differed among populations.
Collapse
Affiliation(s)
- Margherita Crosta
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
- Department of Sustainable Crop Production, Catholic University of Sacred Heart, Piacenza, Italy
| | - Massimo Romani
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Barbara Ferrari
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Paolo Annicchiarico
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| |
Collapse
|
3
|
D'Incecco P, Bettera L, Bancalari E, Rosi V, Sindaco M, Gobbi S, Candotti P, Nazzicari N, Limbo S, Gatti M, Pellegrino L. High-speed cold centrifugation of milk modifies the microbiota, the ripening process and the sensory characteristics of raw-milk hard cheeses. Food Res Int 2023; 172:113102. [PMID: 37689872 DOI: 10.1016/j.foodres.2023.113102] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 06/01/2023] [Accepted: 06/09/2023] [Indexed: 09/11/2023]
Abstract
The microbial population of raw milk plays a crucial role in the development of distinctive traits of raw-milk cheeses particularly appreciated by consumers. It was previously demonstrated that the microbial population of raw milk is modified by a high-speed centrifugation (also called bactofugation) conducted at 39 °C. The aim of the present study was to evaluate the effects of this process, performed once or twice, on the microbial, compositional, biochemical, and sensory characteristics of the derived hard cheeses. Experimental and control cheesemaking were conducted in parallel at a cheese factory during a 13-month period. Cheeses were analysed after 9, 15 and 20 months of ripening for microbial count, composition, proteolysis extent, volatile compounds, and sensory profile. Results evidenced that experimental cheeses were characterized by lower numbers of viable lactobacilli respect to control. Experimental cheeses also showed differences in the progress of primary and secondary proteolysis which, in turn, caused different patterns of free amino acids at all ripening times. Experimental cheeses had significantly lower content of esters and were differentiated from control for some traits by assessors. In conclusion, use of high-speed centrifugation of milk shall be discouraged if characteristic traits of raw-milk cheeses, particularly PDO cheeses, want to be retained.
Collapse
Affiliation(s)
- Paolo D'Incecco
- Department of Food, Environmental and Nutritional Sciences, University of Milan, 20133 Milan, Italy.
| | - Luca Bettera
- Department of Food and Drug, University of Parma, Parma 43124, Italy
| | - Elena Bancalari
- Department of Food and Drug, University of Parma, Parma 43124, Italy
| | - Veronica Rosi
- Department of Food, Environmental and Nutritional Sciences, University of Milan, 20133 Milan, Italy
| | - Marta Sindaco
- Department of Food, Environmental and Nutritional Sciences, University of Milan, 20133 Milan, Italy
| | - Serena Gobbi
- Department of Food, Environmental and Nutritional Sciences, University of Milan, 20133 Milan, Italy
| | - Paolo Candotti
- National Reference Centre for Animal Welfare, IZSLER, 25124 Brescia, Italy
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, 26900 Lodi, Italy
| | - Sara Limbo
- Department of Food, Environmental and Nutritional Sciences, University of Milan, 20133 Milan, Italy
| | - Monica Gatti
- Department of Food and Drug, University of Parma, Parma 43124, Italy
| | - Luisa Pellegrino
- Department of Food, Environmental and Nutritional Sciences, University of Milan, 20133 Milan, Italy
| |
Collapse
|
4
|
Pozzi A, Nazzicari N, Capoferri R, Radovic S, Bongioni G. Assessment of residual plant DNA in bulk milk for Grana Padano PDO production by a metabarcoding approach. PLoS One 2023; 18:e0289108. [PMID: 37490502 PMCID: PMC10368264 DOI: 10.1371/journal.pone.0289108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 07/11/2023] [Indexed: 07/27/2023] Open
Abstract
The aim of this study was to evaluate the ability of DNA metabarcoding, by rbcl as barcode marker, to identify and classify the small traces of plant DNA isolated from raw milk used to produce Grana Padano (GP) cheese. GP is one of the most popular Italian PDO (Protected Designation of Origin) produced in Italy in accordance with the GP PDO specification rules that define which forage can be used for feeding cows. A total of 42 GP bulk tank milk samples were collected from 14 dairies located in the Grana Padano production area. For the taxonomic classification, a local database with the rbcL sequences available in NCBI on September 2020/March 2021 for the Italian flora was generated. A total of 8,399,591 reads were produced with an average of 204,868 per sample (range 37,002-408,724) resulting in 16, 31 and 28 dominant OTUs at family, genus and species level, respectively. The taxonomic analysis of plant species in milk samples identified 7 families, 14 genera and 14 species, the statistical analysis conducted using alpha and beta diversity approaches, did not highlight differences among the investigated samples. However, the milk samples are featured by a high plant variability and the lack of differences at multiple taxonomic levels could be due to the standardisation of the feed rationing, as requested by the GP rules. The results suggest that DNA metabarcoding is a valuable resource to explore plant DNA traces in a complex matrix such as milk.
Collapse
Affiliation(s)
- Anna Pozzi
- Istituto Sperimentale Lazzaro Spallanzani, Localita' La Quercia, Rivolta d'Adda (CR), Italy
| | - Nelson Nazzicari
- CREA-Council for Agricultural Research and Analysis of Agricultural Economics, Research Centre for Animal Production and Aquaculture, Viale Piacenza, Lodi, Italy
| | - Rossana Capoferri
- Istituto Sperimentale Lazzaro Spallanzani, Localita' La Quercia, Rivolta d'Adda (CR), Italy
| | | | - Graziella Bongioni
- Istituto Sperimentale Lazzaro Spallanzani, Localita' La Quercia, Rivolta d'Adda (CR), Italy
| |
Collapse
|
5
|
Andrijanić Z, Nazzicari N, Šarčević H, Sudarić A, Annicchiarico P, Pejić I. Genetic Diversity and Population Structure of European Soybean Germplasm Revealed by Single Nucleotide Polymorphism. Plants (Basel) 2023; 12:plants12091837. [PMID: 37176892 PMCID: PMC10180984 DOI: 10.3390/plants12091837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 04/25/2023] [Accepted: 04/27/2023] [Indexed: 05/15/2023]
Abstract
Soybean is the most grown high-protein crop in the world. Despite the rapid increase of acreage and production volume, European soybean production accounts for only 34% of its consumption in Europe. This study aims to support the optimal exploitation of genetic resources by European breeding programs by investigating the genetic diversity and the genetic structure of 207 European cultivars or American introductions registered in Europe, which were genotyped by the SoySNP50K array. The expected heterozygosity (He) was 0.34 for the entire collection and ranged among countries from 0.24 for Swiss cultivars to 0.32 for American cultivars (partly reflecting differences in sample size between countries). Cluster analysis grouped all genotypes into two main clusters with eight subgroups that corresponded to the country of origin of cultivars and their maturity group. Pairwise Fst values between countries of origin showed the highest differentiation of Swiss cultivars from the rest of the European gene pool, while the lowest mean differentiation was found between American introductions and all other European countries. On the other hand, Fst values between maturity groups were much lower compared to those observed between countries. In analysis of molecular variance, the total genetic variation was partitioned either by country of origin or by maturity group, explaining 9.1% and 3.5% of the total genetic variance, respectively. On the whole, our results suggest that the European soybean gene pool still has sufficient diversity due to the different historical breeding practices in western and eastern countries and the relatively short period of breeding in Europe.
Collapse
Affiliation(s)
- Zoe Andrijanić
- Centre of Excellence for Biodiversity and Molecular Plant Breeding, Svetošimunska Cesta 25, 10000 Zagreb, Croatia
| | - Nelson Nazzicari
- Research Centre for Animal Production and Aquaculture, Council for Agricultural Research and Economics (CREA), Viale Piacenza 29, 26900 Lodi, Italy
| | - Hrvoje Šarčević
- Centre of Excellence for Biodiversity and Molecular Plant Breeding, Svetošimunska Cesta 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska Cesta 25, 10000 Zagreb, Croatia
| | - Aleksandra Sudarić
- Centre of Excellence for Biodiversity and Molecular Plant Breeding, Svetošimunska Cesta 25, 10000 Zagreb, Croatia
- Agricultural Institute Osijek, Južno Predgrađe 17, 31000 Osijek, Croatia
| | - Paolo Annicchiarico
- Research Centre for Animal Production and Aquaculture, Council for Agricultural Research and Economics (CREA), Viale Piacenza 29, 26900 Lodi, Italy
| | - Ivan Pejić
- Centre of Excellence for Biodiversity and Molecular Plant Breeding, Svetošimunska Cesta 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska Cesta 25, 10000 Zagreb, Croatia
| |
Collapse
|
6
|
Annicchiarico P, de Buck AJ, Vlachostergios DN, Heupink D, Koskosidis A, Nazzicari N, Crosta M. White Lupin Adaptation to Moderately Calcareous Soils: Phenotypic Variation and Genome-Enabled Prediction. Plants (Basel) 2023; 12:1139. [PMID: 36903997 PMCID: PMC10005150 DOI: 10.3390/plants12051139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 02/10/2023] [Accepted: 02/23/2023] [Indexed: 06/18/2023]
Abstract
White lupin is a promising high-protein crop, the cultivation of which is limited by a lack of adaptation to soils that are even just mildly calcareous. This study aimed to assess the phenotypic variation, the trait architecture based on a GWAS, and the predictive ability of genome-enabled models for grain yield and contributing traits of a genetically-broad population of 140 lines grown in an autumn-sown environment of Greece (Larissa) and a spring-sown environment of the Netherlands (Ens) that featured moderately calcareous and alkaline soils. We found large genotype × environment interaction and modest or nil genetic correlation for line responses across locations for grain yield, a lime susceptibility score, and other traits, with the exception of individual seed weight and plant height. The GWAS identified significant SNP markers associated with various traits that were markedly inconsistent across locations, while providing direct or indirect evidence for widespread polygenic trait control. Genomic selection proved to be a feasible strategy, owing to a moderate predictive ability for yield and lime susceptibility in Larissa (the site featuring greater lime soil stress). Other supporting results for breeding programs where the identification of a candidate gene for lime tolerance and the high reliability of genome-enabled predictions for individual seed weight.
Collapse
Affiliation(s)
- Paolo Annicchiarico
- Research Centre for Animal Production and Aquaculture, Council for Agricultural Research and Economics, 26900 Lodi, Italy
| | | | - Dimitrios N. Vlachostergios
- Institute of Industrial and Forage Crops, Hellenic Agricultural Organization “Demeter”, 41335 Larissa, Greece
| | | | - Avraam Koskosidis
- Institute of Industrial and Forage Crops, Hellenic Agricultural Organization “Demeter”, 41335 Larissa, Greece
| | - Nelson Nazzicari
- Research Centre for Animal Production and Aquaculture, Council for Agricultural Research and Economics, 26900 Lodi, Italy
| | - Margherita Crosta
- Research Centre for Animal Production and Aquaculture, Council for Agricultural Research and Economics, 26900 Lodi, Italy
| |
Collapse
|
7
|
Pecetti L, Annicchiarico P, Crosta M, Notario T, Ferrari B, Nazzicari N. White Lupin Drought Tolerance: Genetic Variation, Trait Genetic Architecture, and Genome-Enabled Prediction. Int J Mol Sci 2023; 24:ijms24032351. [PMID: 36768674 PMCID: PMC9916572 DOI: 10.3390/ijms24032351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 01/16/2023] [Accepted: 01/21/2023] [Indexed: 01/27/2023] Open
Abstract
White lupin is a high-protein crop requiring drought tolerance improvement. This study focused on a genetically-broad population of 138 lines to investigate the phenotypic variation and genotype × environment interaction (GEI) for grain yield and other traits across drought-prone and moisture-favourable managed environments, the trait genetic architecture and relevant genomic regions by a GWAS using 9828 mapped SNP markers, and the predictive ability of genomic selection (GS) models. Water treatments across two late cropping months implied max. available soil water content of 60-80% for favourable conditions and from wilting point to 15% for severe drought. Line yield responses across environments featured a genetic correlation of 0.84. Relatively better line yield under drought was associated with an increased harvest index. Two significant QTLs emerged for yield in each condition that differed across conditions. Line yield under stress displayed an inverse linear relationship with the onset of flowering, confirmed genomically by a common major QTL. An adjusted grain yield computed as deviation from phenology-predicted yield acted as an indicator of intrinsic drought tolerance. On the whole, the yield in both conditions and the adjusted yield were polygenic, heritable, and exploitable by GS with a high predictive ability (0.62-0.78). Our results can support selection for climatically different drought-prone regions.
Collapse
|
8
|
Annicchiarico P, Nazzicari N, Bouizgaren A, Hayek T, Laouar M, Cornacchione M, Basigalup D, Monterrubio Martin C, Brummer EC, Pecetti L. Alfalfa genomic selection for different stress-prone growing regions. Plant Genome 2022; 15:e20264. [PMID: 36222346 DOI: 10.1002/tpg2.20264] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Accepted: 08/25/2022] [Indexed: 06/16/2023]
Abstract
Alfalfa (Medicago sativa L.) selection for stress-prone regions has high priority for sustainable crop-livestock systems. This study assessed the genomic selection (GS) ability to predict alfalfa breeding values for drought-prone agricultural sites of Algeria, Morocco, and Argentina; managed-stress (MS) environments of Italy featuring moderate or intense drought; and one Tunisian site irrigated with moderately saline water. Additional aims were to investigate genotype × environment interaction (GEI) patterns and the effect on GS predictions of three single-nucleotide polymorphism (SNP) calling procedures, 12 statistical models that exclude or incorporate GEI, and allele dosage information. Our study included 127 genotypes from a Mediterranean reference population originated from three geographically contrasting populations, genotyped via genotyping-by-sequencing and phenotyped based on multi-year biomass dry matter yield of their dense-planted half-sib progenies. The GEI was very large, as shown by 27-fold greater additive genetic variance × environment interaction relative to the additive genetic variance and low genetic correlation for progeny yield responses across environments. The predictive ability of GS (using at least 37,969 SNP markers) exceeded 0.20 for moderate MS (representing Italian stress-prone sites) and the sites of Algeria and Argentina while being quite low for the Tunisian site and intense MS. Predictions of GS were complicated by rapid linkage disequilibrium decay. The weighted GBLUP model, GEI incorporation into GS models, and SNP calling based on a mock reference genome exhibited a predictive ability advantage for some environments. Our results support the specific breeding for each target region and suggest a positive role for GS in most regions when considering the challenges associated with phenotypic selection.
Collapse
Affiliation(s)
- Paolo Annicchiarico
- Consiglio per la Ricerca in Agricoltura e l'Analisi dell'Economia Agraria, Centro di ricerca Zootecnia e Acquacoltura, 29 viale Piacenza, Lodi, 26900, Italy
| | - Nelson Nazzicari
- Consiglio per la Ricerca in Agricoltura e l'Analisi dell'Economia Agraria, Centro di ricerca Zootecnia e Acquacoltura, 29 viale Piacenza, Lodi, 26900, Italy
| | - Abdelaziz Bouizgaren
- Institut National de la Recherche Agronomique, Centre Régional de Marrakech, BP 533, Marrakech, 40000, Morocco
| | - Taoufik Hayek
- Institut des Régions Arides, Route du Jorf, Médenine, 4119, Tunisia
| | - Meriem Laouar
- Ecole Nationale Supérieure Agronomique, Dép. de Productions Végétales. Laboratoire d'Amélioration Intégrative des Productions Végétales (C2711100), Rue Hassen Badi, El Harrach 16200, Alger, Algérie
| | - Monica Cornacchione
- Instituto Nacional de Tecnología Agropecuaria, Estación Experimental Santiago del Estero, Jujuy 850, Santiago del Estero, 4200, Argentina
| | - Daniel Basigalup
- Instituto Nacional de Tecnología Agropecuaria, Estación Experimental Manfredi, Ruta Nacional no. 9 km 636, Manfredi, Córdoba, X5988, Argentina
| | - Cristina Monterrubio Martin
- Consiglio per la Ricerca in Agricoltura e l'Analisi dell'Economia Agraria, Centro di ricerca Zootecnia e Acquacoltura, 29 viale Piacenza, Lodi, 26900, Italy
| | - Edward Charles Brummer
- Plant Breeding Center, Dep. of Plant Sciences, Univ. of California, Davis, CA, 95616, USA
| | - Luciano Pecetti
- Consiglio per la Ricerca in Agricoltura e l'Analisi dell'Economia Agraria, Centro di ricerca Zootecnia e Acquacoltura, 29 viale Piacenza, Lodi, 26900, Italy
| |
Collapse
|
9
|
Nazzicari N, Biscarini F. Stacked kinship CNN vs. GBLUP for genomic predictions of additive and complex continuous phenotypes. Sci Rep 2022; 12:19889. [PMID: 36400808 PMCID: PMC9674857 DOI: 10.1038/s41598-022-24405-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 11/15/2022] [Indexed: 11/19/2022] Open
Abstract
Deep learning is impacting many fields of data science with often spectacular results. However, its application to whole-genome predictions in plant and animal science or in human biology has been rather limited, with mostly underwhelming results. While most works focus on exploring alternative network architectures, in this study we propose an innovative representation of marker genotype data and tested it against the GBLUP (Genomic BLUP) benchmark with linear and nonlinear phenotypes. From publicly available cattle SNP genotype data, different types of genomic kinship matrices are stacked together in a 3D pile from where 2D grayscale slices are extracted and fed to a deep convolutional neural network (DNN). We simulated nine phenotype scenarios with combinations of additivity, dominance and epistasis, and compared the DNN to GBLUP-A (computed using only the additive kinship matrix) and GBLUP-optim (additive, dominance, and epistasis kinship matrices, as needed). Results varied depending on the accuracy metric employed, with DNN performing better in terms of root mean squared error (1-12% lower than GBLUP-A; 1-9% lower than GBLUP-optim) but worse in terms of Pearson's correlation (0.505 for DNN compared to 0.672 and 0.669 of GBLUP-A and GBLUP-optim for fully additive case; 0.274 for DNN, 0.279 for GBLUP-A, and 0.477 for GBLUP-optim for fully dominant case). The proposed approach offers a basis to explore further the application of DNN to tabular data in whole-genome predictions.
Collapse
Affiliation(s)
- Nelson Nazzicari
- CREA Council for Agricultural Research and Analysis of Agricultural Economics, Research Centre for Animal Production and Aquaculture, Viale Piacenza 29, 26900 Lodi, Italy
| | - Filippo Biscarini
- grid.510304.3CNR: National Research Council, Institute of Agricultural Biology and Biotechnology, Via Bassini 15, Milan, 20133 Italy
| |
Collapse
|
10
|
Tarchi L, Fantoni T, Pisano T, Damiani S, La Torraca Vittori P, Marini S, Nazzicari N, Castellini G, Politi P, Ricca V. i-ECO: a novel method for the analysis and visualization of fMRI results in Psychiatry. Eur Psychiatry 2022. [PMCID: PMC9567001 DOI: 10.1192/j.eurpsy.2022.554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Introduction
The high technical barrier to entry in the field of neuroimaging can hinder early insight from promising results and the development of evidence-based clinical practice.
Objectives
The working group focused on published literature in order to develop a new methodology in the analysis, visualization, and representation of fMRI data in the psychiatric setting.
Methods
Three valid and established measures were chosen, in order to achieve dimensionality reduction, stability and explainability of results, namely Regional-Homogeneity; fractional Amplitude of Low-Frequency Fluctuations; Eigenvector-Centrality. Each measure was color coded and individual images per subject compiled, averaging results by functional networks as described the FIND lab of the University of Stanford. 272 individual scans were processed (130 neurotypicals, 50 patients with Schizophrenia, 49 with Bipolar Disorder, 43 with ADHD).
Results
The discriminative power between clinical groups of the novel method was significant both by human eye, and later confirmation by statistical tests, and by computer vision algorithms (Convolutional Neural Networks). The precision-recall Area Under the Curve, dividing by 80/20 proportion between train and test sets, was >84.5% for each group. The group of patients with Bipolar Disorder showed a partial overlap with the group of patients suffering from Schizophrenia – by a dominance of Eigenvector-Centrality and Regional-Homogeneity, as well as a lower prevalence of fractional Amplitude of Low-Frequency Fluctuations, for both in comparison to controls.
Conclusions
The present study offers preliminary evidence for the adoption of i-ECO (integrated-Explainability through Color Coding) in fMRI analyses during rest in the Psychiatric field.
Disclosure
No significant relationships.
Collapse
|
11
|
Alkemade JA, Nazzicari N, Messmer MM, Annicchiarico P, Ferrari B, Voegele RT, Finckh MR, Arncken C, Hohmann P. Genome-wide association study reveals white lupin candidate gene involved in anthracnose resistance. Theor Appl Genet 2022; 135:1011-1024. [PMID: 34988630 PMCID: PMC8942938 DOI: 10.1007/s00122-021-04014-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 12/06/2021] [Indexed: 05/11/2023]
Abstract
GWAS identifies candidate gene controlling resistance to anthracnose disease in white lupin. White lupin (Lupinus albus L.) is a promising grain legume to meet the growing demand for plant-based protein. Its cultivation, however, is severely threatened by anthracnose disease caused by the fungal pathogen Colletotrichum lupini. To dissect the genetic architecture for anthracnose resistance, genotyping by sequencing was performed on white lupin accessions collected from the center of domestication and traditional cultivation regions. GBS resulted in 4611 high-quality single-nucleotide polymorphisms (SNPs) for 181 accessions, which were combined with resistance data observed under controlled conditions to perform a genome-wide association study (GWAS). Obtained disease phenotypes were shown to highly correlate with overall three-year disease assessments under Swiss field conditions (r > 0.8). GWAS results identified two significant SNPs associated with anthracnose resistance on gene Lalb_Chr05_g0216161 encoding a RING zinc-finger E3 ubiquitin ligase which is potentially involved in plant immunity. Population analysis showed a remarkably fast linkage disequilibrium decay, weak population structure and grouping of commercial varieties with landraces, corresponding to the slow domestication history and scarcity of modern breeding efforts in white lupin. Together with 15 highly resistant accessions identified in the resistance assay, our findings show promise for further crop improvement. This study provides the basis for marker-assisted selection, genomic prediction and studies aimed at understanding anthracnose resistance mechanisms in white lupin and contributes to improving breeding programs worldwide.
Collapse
Affiliation(s)
- Joris A Alkemade
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| | - Nelson Nazzicari
- Research Centre for Animal Production and Aquaculture, CREA, Lodi, Italy
| | - Monika M Messmer
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland.
| | | | - Barbara Ferrari
- Research Centre for Animal Production and Aquaculture, CREA, Lodi, Italy
| | - Ralf T Voegele
- Institute of Phytomedicine, University of Hohenheim, Stuttgart, Germany
| | - Maria R Finckh
- Department of Ecological Plant Protection, University of Kassel, Witzenhausen, Germany
| | - Christine Arncken
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| | - Pierre Hohmann
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| |
Collapse
|
12
|
Pavan S, Delvento C, Nazzicari N, Ferrari B, D’Agostino N, Taranto F, Lotti C, Ricciardi L, Annicchiarico P. Merging genotyping-by-sequencing data from two ex situ collections provides insights on the pea evolutionary history. Hortic Res 2022; 9:uhab062. [PMID: 35043171 PMCID: PMC8935929 DOI: 10.1093/hr/uhab062] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 11/09/2021] [Accepted: 11/09/2021] [Indexed: 05/13/2023]
Abstract
Pea (Pisum sativum L. subsp. sativum) is one of the oldest domesticated species and a widely cultivated legume. In this study, we combined next generation sequencing (NGS) data referring to two genotyping-by-sequencing (GBS) libraries, each one prepared from a different Pisum germplasm collection. The selection of single nucleotide polymorphism (SNP) loci called in both germplasm collections caused some loss of information; however, this did not prevent the obtainment of one of the largest datasets ever used to explore pea biodiversity, consisting of 652 accessions and 22 127 markers. The analysis of population structure reflected genetic variation based on geographic patterns and allowed the definition of a model for the expansion of pea cultivation from the domestication centre to other regions of the world. In genetically distinct populations, the average decay of linkage disequilibrium (LD) ranged from a few bases to hundreds of kilobases, thus indicating different evolutionary histories leading to their diversification. Genome-wide scans resulted in the identification of putative selective sweeps associated with domestication and breeding, including genes known to regulate shoot branching, cotyledon colour and resistance to lodging, and the correct mapping of two Mendelian genes. In addition to providing information of major interest for fundamental and applied research on pea, our work describes the first successful example of integration of different GBS datasets generated from ex situ collections - a process of potential interest for a variety of purposes, including conservation genetics, genome-wide association studies, and breeding.
Collapse
Affiliation(s)
- Stefano Pavan
- Department of Soil, Plant and Food Sciences, University of Bari “Aldo Moro”, Via Amendola 165/A, 70126 Bari, Italy
| | - Chiara Delvento
- Department of Soil, Plant and Food Sciences, University of Bari “Aldo Moro”, Via Amendola 165/A, 70126 Bari, Italy
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics, Research Centre for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy
| | - Barbara Ferrari
- Council for Agricultural Research and Economics, Research Centre for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy
| | - Nunzio D’Agostino
- Department of Agricultural Sciences, University of Naples Federico II, via Università 100, 80055 Portici, Italy
| | - Francesca Taranto
- Institute of Biosciences and Bioresources, National Research Council of Italy, Via Amendola 165/A, 70126 Bari,
Italy
| | - Concetta Lotti
- Department of Agriculture, Food, Natural Resources and Engineering, University of Foggia, Via Napoli 25, 71100 Foggia, Italy
| | - Luigi Ricciardi
- Department of Soil, Plant and Food Sciences, University of Bari “Aldo Moro”, Via Amendola 165/A, 70126 Bari, Italy
| | - Paolo Annicchiarico
- Council for Agricultural Research and Economics, Research Centre for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy
| |
Collapse
|
13
|
Crosta M, Nazzicari N, Ferrari B, Pecetti L, Russi L, Romani M, Cabassi G, Cavalli D, Marocco A, Annicchiarico P. Pea Grain Protein Content Across Italian Environments: Genetic Relationship With Grain Yield, and Opportunities for Genome-Enabled Selection for Protein Yield. Front Plant Sci 2022; 12:718713. [PMID: 35046967 PMCID: PMC8761899 DOI: 10.3389/fpls.2021.718713] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 11/18/2021] [Indexed: 06/14/2023]
Abstract
Wider pea (Pisum sativum L.) cultivation has great interest for European agriculture, owing to its favorable environmental impact and provision of high-protein feedstuff. This work aimed to investigate the extent of genotype × environment interaction (GEI), genetically based trade-offs and polygenic control for crude protein content and grain yield of pea targeted to Italian environments, and to assess the efficiency of genomic selection (GS) as an alternative to phenotypic selection (PS) to increase protein yield per unit area. Some 306 genotypes belonging to three connected recombinant inbred line (RIL) populations derived from paired crosses between elite cultivars were genotyped through genotyping-by-sequencing and phenotyped for grain yield and protein content on a dry matter basis in three autumn-sown environments of northern or central Italy. Line variation for mean protein content ranged from 21.7 to 26.6%. Purely genetic effects, compared with GEI effects, were over two-fold larger for protein content, and over 2-fold smaller for grain and protein yield per unit area. Grain yield and protein content exhibited no inverse genetic correlation. A genome-wide association study revealed a definite polygenic control not only for grain yield but also for protein content, with small amounts of trait variation accounted for by individual loci. On average, the GS predictive ability for individual RIL populations based on the rrBLUP model (which was selected out of four tested models) using by turns two environments for selection and one for validation was moderately high for protein content (0.53) and moderate for grain yield (0.40) and protein yield (0.41). These values were about halved for inter-environment, inter-population predictions using one RIL population for model construction to predict data of the other populations. The comparison between GS and PS for protein yield based on predicted gains per unit time and similar evaluation costs indicated an advantage of GS for model construction including the target RIL population and, in case of multi-year PS, even for model training based on data of a non-target population. In conclusion, protein content is less challenging than grain yield for phenotypic or genome-enabled improvement, and GS is promising for the simultaneous improvement of both traits.
Collapse
Affiliation(s)
- Margherita Crosta
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Barbara Ferrari
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Luciano Pecetti
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Luigi Russi
- Department of Agricultural, Food and Environmental Science, University of Perugia, Perugia, Italy
| | - Massimo Romani
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Giovanni Cabassi
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Daniele Cavalli
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | - Adriano Marocco
- Department of Sustainable Crop Production, Catholic University of Sacred Heart, Piacenza, Italy
| | - Paolo Annicchiarico
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, Lodi, Italy
| |
Collapse
|
14
|
Tarchi L, Damiani S, La Torraca Vittori P, Marini S, Nazzicari N, Castellini G, Pisano T, Politi P, Ricca V. The colors of our brain: an integrated approach for dimensionality reduction and explainability in fMRI through color coding (i-ECO). Brain Imaging Behav 2021; 16:977-990. [PMID: 34689318 PMCID: PMC9107439 DOI: 10.1007/s11682-021-00584-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/06/2021] [Indexed: 11/29/2022]
Abstract
Several systematic reviews have highlighted the role of multiple sources in the investigation of psychiatric illness. For what concerns fMRI, the focus of recent literature preferentially lies on three lines of research, namely: functional connectivity, network analysis and spectral analysis. Data was gathered from the UCLA Consortium for Neuropsychiatric Phenomics. The sample was composed by 130 neurotypicals, 50 participants diagnosed with Schizophrenia, 49 with Bipolar disorder and 43 with ADHD. Single fMRI scans were reduced in their dimensionality by a novel method (i-ECO) averaging results per Region of Interest and through an additive color method (RGB): local connectivity values (Regional Homogeneity), network centrality measures (Eigenvector Centrality), spectral dimensions (fractional Amplitude of Low-Frequency Fluctuations). Average images per diagnostic group were plotted and described. The discriminative power of this novel method for visualizing and analyzing fMRI results in an integrative manner was explored through the usage of convolutional neural networks. The new methodology of i-ECO showed between-groups differences that could be easily appreciated by the human eye. The precision-recall Area Under the Curve (PR-AUC) of our models was > 84.5% for each diagnostic group as evaluated on the test-set – 80/20 split. In conclusion, this study provides evidence for an integrative and easy-to-understand approach in the analysis and visualization of fMRI results. A high discriminative power for psychiatric conditions was reached. This proof-of-work study may serve to investigate further developments over more extensive datasets covering a wider range of psychiatric diagnoses.
Collapse
Affiliation(s)
- Livio Tarchi
- Psychiatry Unit, Department of Health Sciences, University of Florence, viale della Maternità, Padiglione 8b, AOU Careggi, Firenze, Florence, FI, 50134, Italy.
| | - Stefano Damiani
- Department of Brain and Behavioral Sciences, University of Pavia, Pavia, PV, Italy
| | | | - Simone Marini
- Department of Epidemiology, University of Florida, Gainesville, FL, USA
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, Lodi, LO, Italy
| | - Giovanni Castellini
- Psychiatry Unit, Department of Health Sciences, University of Florence, viale della Maternità, Padiglione 8b, AOU Careggi, Firenze, Florence, FI, 50134, Italy
| | - Tiziana Pisano
- Pediatric Neurology, Neurogenetics and Neurobiology Unit and Laboratories, Neuroscience Department, Meyer Children's Hospital, University of Florence, Florence, Italy
| | - Pierluigi Politi
- Department of Brain and Behavioral Sciences, University of Pavia, Pavia, PV, Italy
| | - Valdo Ricca
- Psychiatry Unit, Department of Health Sciences, University of Florence, viale della Maternità, Padiglione 8b, AOU Careggi, Firenze, Florence, FI, 50134, Italy
| |
Collapse
|
15
|
Annicchiarico P, Nazzicari N, Notario T, Monterrubio Martin C, Romani M, Ferrari B, Pecetti L. Pea Breeding for Intercropping With Cereals: Variation for Competitive Ability and Associated Traits, and Assessment of Phenotypic and Genomic Selection Strategies. Front Plant Sci 2021; 12:731949. [PMID: 34630481 PMCID: PMC8495324 DOI: 10.3389/fpls.2021.731949] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 07/19/2021] [Indexed: 06/12/2023]
Abstract
Mixed stand (MS) cropping of pea with small-grain cereals can produce more productive and environment-friendly grain crops relative to pure stand (PS) crops but may require selection to alleviate the pea competitive disadvantage. This study aimed to assess the pea variation for competitive ability and its associated traits and the efficiency of four phenotypic or genomic selection strategies. A set of 138 semi-leafless, semi-dwarf pea lines belonging to six recombinant inbred line populations and six parent lines were genotyped using genotyping-by-sequencing and grown in PS and in MS simultaneously with one barley and one bread wheat cultivar in two autumn-sown trials in Northern Italy. Cereal companions were selected in a preliminary study that highlighted the paucity of cultivars with sufficient earliness for association. Pea was severely outcompeted in both years albeit with variation for pea proportion ranging from nearly complete suppression (<3%) to values approaching a balanced mixture. Greater pea proportion in MS was associated with greater total yield of the mixture (r ≥ 0.46). The genetic correlation for pea yield across MS and PS conditions slightly exceeded 0.40 in both years. Later onset of flowering and taller plant height at flowering onset displayed a definite correlation with pea yield in MS (r ≥ 0.46) but not in PS, whereas tolerance to ascochyta blight exhibited the opposite pattern. Comparisons of phenotypic selection strategies within or across populations based on predicted or actual yield gains for independent years indicated an efficiency of 52-64% for indirect selection based on pea yield in PS relative to pea yield selection in MS. The efficiency of an indirect selection index including onset of flowering, plant height, and grain yield in PS was comparable to that of pea yield selection in MS. A genome-wide association study based on 5,909 SNP markers revealed the substantial diversity of genomic areas associated with pea yield in MS and PS. Genomic selection for pea yield in MS displayed an efficiency close to that of phenotypic selection for pea yield in MS, and nearly two-fold greater efficiency when also taking into account its shorter selection cycle and smaller evaluation cost.
Collapse
|
16
|
Zago M, Rossetti L, Bardelli T, Carminati D, Nazzicari N, Giraffa G. Bacterial Community of Grana Padano PDO Cheese and Generical Hard Cheeses: DNA Metabarcoding and DNA Metafingerprinting Analysis to Assess Similarities and Differences. Foods 2021; 10:1826. [PMID: 34441603 PMCID: PMC8392751 DOI: 10.3390/foods10081826] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 07/29/2021] [Accepted: 08/06/2021] [Indexed: 12/18/2022] Open
Abstract
The microbiota of Protected Designation of Origin (PDO) cheeses plays an essential role in defining their quality and typicity and could be applied to protect these products from counterfeiting. To study the possible role of cheese microbiota in distinguishing Grana Padano (GP) cheese from generical hard cheeses (HC), the microbial structure of 119 GP cheese samples was studied by DNA metabarcoding and DNA metafingerprinting and compared with 49 samples of generical hard cheeses taken from retail. DNA metabarcoding highlighted the presence, as dominant taxa, of Lacticaseibacillus rhamnosus, Lactobacillus helveticus, Streptococcus thermophilus, Limosilactobacillus fermentum, Lactobacillus delbrueckii, Lactobacillus spp., and Lactococcus spp. in both GP cheese and HC. Differential multivariate statistical analysis of metataxonomic and metafingerprinting data highlighted significant differences in the Shannon index, bacterial composition, and species abundance within both dominant and subdominant taxa between the two cheese groups. A supervised Neural Network (NN) classification tool, trained by metagenotypic data, was implemented, allowing to correctly classify GP cheese and HC samples. Further implementation and validation to increase the robustness and improve the predictive capacity of the NN classifier will be needed. Nonetheless, the proposed tool opens interesting perspectives in helping protection and valorization of GP and other PDO cheeses.
Collapse
Affiliation(s)
- Miriam Zago
- Research Centre for Animal Production and Aquaculture (CREA-ZA), Council for Agricultural Research and Economics, 26900 Lodi, Italy; (M.Z.); (L.R.); (D.C.); (N.N.)
| | - Lia Rossetti
- Research Centre for Animal Production and Aquaculture (CREA-ZA), Council for Agricultural Research and Economics, 26900 Lodi, Italy; (M.Z.); (L.R.); (D.C.); (N.N.)
| | - Tommaso Bardelli
- Research Centre for Plant Protection and Certification (CREA-DC), Council for Agricultural Research and Economics, 26900 Lodi, Italy;
| | - Domenico Carminati
- Research Centre for Animal Production and Aquaculture (CREA-ZA), Council for Agricultural Research and Economics, 26900 Lodi, Italy; (M.Z.); (L.R.); (D.C.); (N.N.)
| | - Nelson Nazzicari
- Research Centre for Animal Production and Aquaculture (CREA-ZA), Council for Agricultural Research and Economics, 26900 Lodi, Italy; (M.Z.); (L.R.); (D.C.); (N.N.)
| | - Giorgio Giraffa
- Research Centre for Animal Production and Aquaculture (CREA-ZA), Council for Agricultural Research and Economics, 26900 Lodi, Italy; (M.Z.); (L.R.); (D.C.); (N.N.)
| |
Collapse
|
17
|
Alkemade JA, Messmer MM, Arncken C, Leska A, Annicchiarico P, Nazzicari N, Książkiewicz M, Voegele RT, Finckh MR, Hohmann P. A High-Throughput Phenotyping Tool to Identify Field-Relevant Anthracnose Resistance in White Lupin. Plant Dis 2021; 105:1719-1727. [PMID: 33337235 DOI: 10.1094/pdis-07-20-1531-re] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
The seed- and air-borne pathogen Colletotrichum lupini, the causal agent of lupin anthracnose, is the most important disease in white lupin (Lupinus albus) worldwide and can cause total yield loss. The aims of this study were to establish a reliable high-throughput phenotyping tool to identify anthracnose resistance in white lupin germplasm and to evaluate a genomic prediction model, accounting for previously reported resistance quantitative trait loci, on a set of independent lupin genotypes. Phenotyping under controlled conditions, performing stem inoculation on seedlings, showed to be applicable for high throughput, and its disease score strongly correlated with field plot disease assessments (r = 0.95, P < 0.0001) and yield (r = -0.64, P = 0.035). Traditional one-row field disease phenotyping showed no significant correlation with field plot disease assessments (r = 0.31, P = 0.34) and yield (r = -0.45, P = 0.17). Genomically predicted resistance values showed no correlation with values observed under controlled or field conditions, and the parental lines of the recombinant inbred line population used for constructing the prediction model exhibited a resistance pattern opposite to that displayed in the original (Australian) environment used for model construction. Differing environmental conditions, inoculation procedures, or population structure may account for this result. Phenotyping a diverse set of 40 white lupin accessions under controlled conditions revealed eight accessions with improved resistance to anthracnose. The standardized area under the disease progress curves (sAUDPC) ranged from 2.1 to 2.8, compared with the susceptible reference accession with a sAUDPC of 3.85. These accessions can be incorporated into white lupin breeding programs. In conclusion, our data support stem inoculation-based disease phenotyping under controlled conditions as a time-effective approach to identify field-relevant resistance, which can now be applied to further identify sources of resistance and their underlying genetics.
Collapse
Affiliation(s)
- Joris A Alkemade
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| | - Monika M Messmer
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| | - Christine Arncken
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| | - Agata Leska
- Getreidezüchtung Peter Kunz (gzpk), Feldbach, Switzerland
| | | | - Nelson Nazzicari
- CREA, Research Centre for Animal Production and Aquaculture, Lodi, Italy
| | | | - Ralf T Voegele
- Institute of Phytomedicine, University of Hohenheim, Stuttgart, Germany
| | - Maria R Finckh
- Department of Ecological Plant Protection, University of Kassel, Witzenhausen, Germany
| | - Pierre Hohmann
- Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL), Frick, Switzerland
| |
Collapse
|
18
|
Rychel-Bielska S, Nazzicari N, Plewiński P, Bielski W, Annicchiarico P, Książkiewicz M. Development of PCR-based markers and whole-genome selection model for anthracnose resistance in white lupin (Lupinus albus L.). J Appl Genet 2020; 61:531-545. [PMID: 32968972 PMCID: PMC7652745 DOI: 10.1007/s13353-020-00585-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Revised: 09/07/2020] [Accepted: 09/14/2020] [Indexed: 12/15/2022]
Abstract
White lupin (Lupinus albus L.) is a high-protein grain legume crop, grown since ancient Greece and Rome. Despite long domestication history, its cultivation remains limited, partly because of susceptibility to anthracnose. Only some late-flowering, bitter, low-yielding landraces from Ethiopian mountains displayed resistance to this devastating disease. The resistance is controlled by various genes, thereby complicating the breeding efforts. The objective of this study was developing tools for molecular tracking of Ethiopian resistance genes based on genotyping-by-sequencing (GBS) data, envisaging linkage mapping and genomic selection approaches. Twenty GBS markers from two major quantitative trait loci (QTLs), antr04_1/antr05_1 and antr04_2/antr05_2, were converted to PCR-based markers using assigned transcriptome sequences. Newly developed markers improved mapping resolution around both anthracnose resistance loci, providing more precise QTL estimation. PCR-based screening of diversified domesticated and primitive germplasm revealed the high specificity of two markers for the antr04_1/antr05_1 locus (TP222136 and TP47110) and one for the antr04_2/antr05_2 locus (TP338761), highlighted by simple matching coefficients of 0.96 and 0.89, respectively. Moreover, a genomic selection approach based on GBS data of a recombinant inbred line mapping population was assessed, providing an average predictive ability of 0.56. These tools can be used for preselection of candidate white lupin germplasm for anthracnose resistance assays.
Collapse
Affiliation(s)
- Sandra Rychel-Bielska
- Department of Genetics, Plant Breeding and Seed Production, Wroclaw University of Environmental and Life Sciences, Plac Grunwaldzki 24A, 50-363, Wrocław, Poland.,Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479, Poznań, Poland
| | - Nelson Nazzicari
- CREA-FLC, Council for Agricultural Research and Economics, Research Centre for Fodder Crops and Dairy Production, Viale Piacenza 29, 26900, Lodi, Italy
| | - Piotr Plewiński
- Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479, Poznań, Poland
| | - Wojciech Bielski
- Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479, Poznań, Poland
| | - Paolo Annicchiarico
- CREA-FLC, Council for Agricultural Research and Economics, Research Centre for Fodder Crops and Dairy Production, Viale Piacenza 29, 26900, Lodi, Italy
| | - Michał Książkiewicz
- Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479, Poznań, Poland.
| |
Collapse
|
19
|
Annicchiarico P, Nazzicari N, Laouar M, Thami-Alami I, Romani M, Pecetti L. Development and Proof-of-Concept Application of Genome-Enabled Selection for Pea Grain Yield under Severe Terminal Drought. Int J Mol Sci 2020; 21:E2414. [PMID: 32244428 PMCID: PMC7177262 DOI: 10.3390/ijms21072414] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 03/27/2020] [Accepted: 03/27/2020] [Indexed: 11/16/2022] Open
Abstract
Terminal drought is the main stress limiting pea (Pisum sativum L.) grain yield in Mediterranean environments. This study aimed to investigate genotype × environment (GE) interaction patterns, define a genomic selection (GS) model for yield under severe drought based on single nucleotide polymorphism (SNP) markers from genotyping-by-sequencing, and compare GS with phenotypic selection (PS) and marker-assisted selection (MAS). Some 288 lines belonging to three connected RIL populations were evaluated in a managed-stress (MS) environment of Northern Italy, Marchouch (Morocco), and Alger (Algeria). Intra-environment, cross-environment, and cross-population predictive ability were assessed by Ridge Regression best linear unbiased prediction (rrBLUP) and Bayesian Lasso models. GE interaction was particularly large across moderate-stress and severe-stress environments. In proof-of-concept experiments performed in a MS environment, GS models constructed from MS environment and Marchouch data applied to independent material separated top-performing lines from mid- and bottom-performing ones, and produced actual yield gains similar to PS. The latter result would imply somewhat greater GS efficiency when considering same selection costs, in partial agreement with predicted efficiency results. GS, which exploited drought escape and intrinsic drought tolerance, exhibited 18% greater selection efficiency than MAS (albeit with non-significant difference between selections) and moderate to high cross-population predictive ability. GS can be cost-efficient to raise yields under severe drought.
Collapse
Affiliation(s)
- Paolo Annicchiarico
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy; (N.N.); (M.R.); (L.P.)
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy; (N.N.); (M.R.); (L.P.)
| | - Meriem Laouar
- Ecole Nationale Supérieure Agronomique (ENSA), Laboratoire d’Amélioration Intégrative des Productions Végétales (C2711100), Rue Hassen Badi, El Harrach, Alger DZ16200, Algeria;
| | - Imane Thami-Alami
- Institut National de la Recherche Agronomique (INRA), Centre Régional de Rabat, Av. de la Victoire, Rabat BP 415, Morocco;
| | - Massimo Romani
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy; (N.N.); (M.R.); (L.P.)
| | - Luciano Pecetti
- Council for Agricultural Research and Economics (CREA), Research Centre for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy; (N.N.); (M.R.); (L.P.)
| |
Collapse
|
20
|
Nazzicari N, Vella D, Coronnello C, Di Silvestre D, Bellazzi R, Marini S. MTGO-SC, A Tool to Explore Gene Modules in Single-Cell RNA Sequencing Data. Front Genet 2019; 10:953. [PMID: 31649730 PMCID: PMC6794379 DOI: 10.3389/fgene.2019.00953] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 09/05/2019] [Indexed: 01/08/2023] Open
Abstract
The identification of functional modules in gene interaction networks is a key step in understanding biological processes. Network interpretation is essential for unveiling biological mechanisms, candidate biomarkers, or potential targets for drug discovery/repositioning. Plenty of biological module identification algorithms are available, although none is explicitly designed to perform the task on single-cell RNA sequencing (scRNA-seq) data. Here, we introduce MTGO-SC, an adaptation for scRNA-seq of our biological network module detection algorithm MTGO. MTGO-SC isolates gene functional modules by leveraging on both the network topological structure and the annotations characterizing the nodes (genes). These annotations are provided by an external source, such as databases and literature repositories (e.g., the Gene Ontology, Reactome). Thanks to the depth of single-cell data, it is possible to define one network for each cell cluster (typically, cell type or state) composing each sample, as opposed to traditional bulk RNA-seq, where the emerging gene network is averaged over the whole sample. MTGO-SC provides two complexity levels for interpretation: the gene-gene interaction and the intermodule interaction networks. MTGO-SC is versatile in letting the users define the rules to extract the gene network and integrated with the Seurat scRNA-seq analysis pipeline. MTGO-SC is available at https://github.com/ne1s0n/MTGOsc.
Collapse
Affiliation(s)
- Nelson Nazzicari
- Research Centre for Fodder Crops and Dairy Productions, Council for Agricultural Research and Economics (CREA), Lodi, Italy
| | - Danila Vella
- Bioengineering Unit, Ri. MED Foundation, Palermo, Italy.,Istituti Clinici Scientifici Maugeri, Pavia, Italy
| | | | - Dario Di Silvestre
- Institute of Biomedical Technologies, National Research Council, Segrate, Italy
| | - Riccardo Bellazzi
- Istituti Clinici Scientifici Maugeri, Pavia, Italy.,Department of Electrical, Computer and Biomedical Engineering; Centre for Health, Technologies, University of Pavia, Pavia, Italy
| | - Simone Marini
- Department of Electrical, Computer and Biomedical Engineering; Centre for Health, Technologies, University of Pavia, Pavia, Italy.,Department of Surgery, University of Michigan, Ann Arbor, MI, United States
| |
Collapse
|
21
|
Abstract
BACKGROUND A thorough verification of the ability of genomic selection (GS) to predict estimated breeding values for pea (Pisum sativum L.) grain yield is pending. Prediction for different environments (inter-environment prediction) has key importance when breeding for target environments featuring high genotype × environment interaction (GEI). The interest of GS would increase if it could display acceptable prediction accuracies in different environments also for germplasm that was not used in model training (inter-population prediction). RESULTS Some 306 genotypes belonging to three connected RIL populations derived from paired crosses between elite cultivars were genotyped through genotyping-by-sequencing and phenotyped for grain yield, onset of flowering, lodging susceptibility, seed weight and winter plant survival in three autumn-sown environments of northern or central Italy. The large GEI for grain yield and its pattern (implying larger variation across years than sites mainly due to year-to-year variability for low winter temperatures) encouraged the breeding for wide adaptation. Wider within-population than between-population variation was observed for nearly all traits, supporting GS application to many lines of relatively few elite RIL populations. Bayesian Lasso without structure imputation and 1% maximum genotype missing rate (including 6058 polymorphic SNP markers) was selected for GS modelling after assessing different GS models and data configurations. On average, inter-environment predictive ability using intra-population predictions reached 0.30 for yield, 0.65 for onset of flowering, 0.64 for seed weight, and 0.28 for lodging susceptibility. Using inter-population instead of intra-population predictions reduced the inter-environment predictive ability to 0.19 for grain yield, 0.40 for onset of flowering, 0.28 for seed weight, and 0.22 for lodging susceptibility. A comparison of GS vs phenotypic selection (PS) based on predicted genetic gains per unit time for same selection costs suggested greater efficiency of GS for all traits under various selection scenarios. For yield, the advantage in predicted efficiency of GS over PS was at least 80% using intra-population predictions and 20% using inter-population predictions. A genome-wide association study confirmed the highly polygenic control of most traits. CONCLUSIONS Genome-enabled predictions can increase the efficiency of pea line selection for wide adaptation to Italian environments relative to phenotypic selection.
Collapse
Affiliation(s)
- Paolo Annicchiarico
- Council for Agricultural Research and Economics (CREA), Research Center for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Center for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy
| | - Luciano Pecetti
- Council for Agricultural Research and Economics (CREA), Research Center for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy
| | - Massimo Romani
- Council for Agricultural Research and Economics (CREA), Research Center for Animal Production and Aquaculture, viale Piacenza 29, 26900 Lodi, Italy
| | - Luigi Russi
- Department of Agricultural, Food and Environmental Science, University of Perugia, Borgo XX Giugno 74, 06121 Perugia, Italy
| |
Collapse
|
22
|
Laurens F, Aranzana MJ, Arus P, Bassi D, Bink M, Bonany J, Caprera A, Corelli-Grappadelli L, Costes E, Durel CE, Mauroux JB, Muranty H, Nazzicari N, Pascal T, Patocchi A, Peil A, Quilot-Turion B, Rossini L, Stella A, Troggio M, Velasco R, van de Weg E. An integrated approach for increasing breeding efficiency in apple and peach in Europe. Hortic Res 2018; 5:11. [PMID: 29507735 PMCID: PMC5830435 DOI: 10.1038/s41438-018-0016-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Accepted: 12/23/2017] [Indexed: 05/02/2023]
Abstract
Despite the availability of whole genome sequences of apple and peach, there has been a considerable gap between genomics and breeding. To bridge the gap, the European Union funded the FruitBreedomics project (March 2011 to August 2015) involving 28 research institutes and private companies. Three complementary approaches were pursued: (i) tool and software development, (ii) deciphering genetic control of main horticultural traits taking into account allelic diversity and (iii) developing plant materials, tools and methodologies for breeders. Decisive breakthroughs were made including the making available of ready-to-go DNA diagnostic tests for Marker Assisted Breeding, development of new, dense SNP arrays in apple and peach, new phenotypic methods for some complex traits, software for gene/QTL discovery on breeding germplasm via Pedigree Based Analysis (PBA). This resulted in the discovery of highly predictive molecular markers for traits of horticultural interest via PBA and via Genome Wide Association Studies (GWAS) on several European genebank collections. FruitBreedomics also developed pre-breeding plant materials in which multiple sources of resistance were pyramided and software that can support breeders in their selection activities. Through FruitBreedomics, significant progresses were made in the field of apple and peach breeding, genetics, genomics and bioinformatics of which advantage will be made by breeders, germplasm curators and scientists. A major part of the data collected during the project has been stored in the FruitBreedomics database and has been made available to the public. This review covers the scientific discoveries made in this major endeavour, and perspective in the apple and peach breeding and genomics in Europe and beyond.
Collapse
Affiliation(s)
- Francois Laurens
- IRHS, INRA, Agrocampus-Ouest, Université d’Angers, SFR 4207 QuaSaV, Université Bretagne Loire, 42 rue Georges Morel, Beaucouzé, 49071 France
| | - Maria José Aranzana
- IRTA (Institut de Recerca i Tecnologia Agroalimentàries), Barcelona, Spain
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra, Barcelona Spain
| | - Pere Arus
- IRTA (Institut de Recerca i Tecnologia Agroalimentàries), Barcelona, Spain
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra, Barcelona Spain
| | - Daniele Bassi
- Università degli Studi di Milano - DiSAA, Via Celoria 2, Milan, 20133 Italy
| | - Marco Bink
- Biometris, Wageningen University and Research, Droevendaalsesteeg 1, Wageningen, 6708PB The Netherlands
- Present Address: Hendrix Genetics Research, Technology & Services, Boxmeer, 5830 AC The Netherlands
| | - Joan Bonany
- IRTA-Mas Badia, Mas Badia, La Tallada, 17134 Spain
| | - Andrea Caprera
- Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, Lodi, 26900 Italy
| | | | | | - Charles-Eric Durel
- IRHS, INRA, Agrocampus-Ouest, Université d’Angers, SFR 4207 QuaSaV, Université Bretagne Loire, 42 rue Georges Morel, Beaucouzé, 49071 France
| | | | - Hélène Muranty
- IRHS, INRA, Agrocampus-Ouest, Université d’Angers, SFR 4207 QuaSaV, Université Bretagne Loire, 42 rue Georges Morel, Beaucouzé, 49071 France
| | - Nelson Nazzicari
- Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, Lodi, 26900 Italy
| | | | - Andrea Patocchi
- Agroscope, Research Division Plant Breeding, Schloss 1, Wädenswil, 8820 Switzerland
| | - Andreas Peil
- Julius Kühn-Institute (JKI); Federal Research Centre for Cultivated Plants, Institute for Breeding Research on Fruit Crops, Pillnitzer Platz 3a, Dresden, 01326 Germany
| | | | - Laura Rossini
- Università degli Studi di Milano - DiSAA, Via Celoria 2, Milan, 20133 Italy
- Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, Lodi, 26900 Italy
| | - Alessandra Stella
- Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, Lodi, 26900 Italy
| | - Michela Troggio
- Research and Innovation Center, Fondazione Edmund Mach, San Michele all’Adige, Trento, Italy
| | - Riccardo Velasco
- Research and Innovation Center, Fondazione Edmund Mach, San Michele all’Adige, Trento, Italy
- CREA-VE, Center of Viticulture and Enology, via XXVIII Aprile 26, Conegliano (TV), 31015 Italy
| | - Eric van de Weg
- Plant Breeding, Wageningen University and Research, Droevendaalsesteeg 1, P.O.Box 386, Wageningen, 6700AJ The Netherlands
| |
Collapse
|
23
|
Annicchiarico P, Nazzicari N, Pecetti L, Romani M, Ferrari B, Wei Y, Brummer EC. GBS-Based Genomic Selection for Pea Grain Yield under Severe Terminal Drought. Plant Genome 2017; 10. [PMID: 28724076 DOI: 10.3835/plantgenome2016.07.0072] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2016] [Accepted: 01/23/2017] [Indexed: 05/18/2023]
Abstract
Terminal drought is the main stress that limits pea ( L.) grain yield in Mediterranean-climate regions. This study provides an unprecedented assessment of the predictive ability of genomic selection (GS) for grain yield under severe terminal drought using genotyping-by-sequencing (GBS) data. Additional aims were to assess the GS predictive ability for different GBS data quality filters and GS models, comparing intrapopulation with interpopulation GS predictive ability and to perform genome-wide association (GWAS) studies. The yield and onset of flowering of 315 lines from three recombinant inbred line (RIL) populations issued by connected crosses between three elite cultivars were assessed under a field rainout shelter. We defined an adjusted yield, which is associated with intrinsic drought tolerance, as the yield deviation from the value expected as a function of onset of flowering (which correlated negatively with grain yield). Total polymorphic markers ranged from approximately 100 (minimum of eight reads per locus, maximum 10% genotype missing data) to over 7500 markers (minimum of four reads, maximum 50% missing rate). Best predictions were provided by Bayesian Lasso (BL) or ridge regression best linear unbiased prediction (rrBLUP), rather than support vector regression (SVR) models, with at least 400-500 markers. Intrapopulation GS predictive ability exceeded 0.5 for yield and onset of flowering in all populations and approached 0.4 for the adjusted yield of a population with high trait variation. Genomic selection was preferable to phenotypic selection in terms of predicted yield gains. Interpopulation GS predictive ability varied largely depending on the pair of populations. GWAS revealed extensive colocalization of markers associated with high yield and early flowering and suggested that they are concentrated in a few genomic regions.
Collapse
|
24
|
Marini S, Nazzicari N, Biscarini F, Wang GZ. Dscam1 web server: online prediction of Dscam1 self- and hetero-affinity. Bioinformatics 2017; 33:1879-1880. [PMID: 28137710 DOI: 10.1093/bioinformatics/btx039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Accepted: 01/24/2017] [Indexed: 11/14/2022] Open
Abstract
Motivation Formation of homodimers by identical Dscam1 protein isomers on cell surface is the key factor for the self-avoidance of growing neurites. Dscam1 immense diversity has a critical role in the formation of arthropod neuronal circuit, showing unique evolutionary properties when compared to other cell surface proteins. Experimental measures are available for 89 self-binding and 1722 hetero-binding protein samples, out of more than 19 thousands (self-binding) and 350 millions (hetero-binding) possible isomer combinations. Results We developed Dscam1 Web Server to quickly predict Dscam1 self- and hetero- binding affinity for batches of Dscam1 isomers. The server can help the study of Dscam1 affinity and help researchers navigate through the tens of millions of possible isomer combinations to isolate the strong-binding ones. Availability and Implementation Dscam1 Web Server is freely available at: http://bioinformatics.tecnoparco.org/Dscam1-webserver . Web server code is available at https://gitlab.com/ne1s0n/Dscam1-binding . Contact simone.marini@unipv.it or guangzhong.wang@picb.ac.cn. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Simone Marini
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, Japan
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, Lodi, Italy
| | - Filippo Biscarini
- Department of Bioinformatics and Biostatistics, PTP Science Park, Lodi, Italy.,Faculty of Bioscience and Technology for Food, Agriculture and Environment, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Guang-Zhong Wang
- Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
25
|
Hernández Mora JR, Micheletti D, Bink M, Van de Weg E, Cantín C, Nazzicari N, Caprera A, Dettori MT, Micali S, Banchi E, Campoy JA, Dirlewanger E, Lambert P, Pascal T, Troggio M, Bassi D, Rossini L, Verde I, Quilot-Turion B, Laurens F, Arús P, Aranzana MJ. Integrated QTL detection for key breeding traits in multiple peach progenies. BMC Genomics 2017; 18:404. [PMID: 28583082 PMCID: PMC5460339 DOI: 10.1186/s12864-017-3783-6] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 05/10/2017] [Indexed: 01/23/2023] Open
Abstract
Background Peach (Prunus persica (L.) Batsch) is a major temperate fruit crop with an intense breeding activity. Breeding is facilitated by knowledge of the inheritance of the key traits that are often of a quantitative nature. QTLs have traditionally been studied using the phenotype of a single progeny (usually a full-sib progeny) and the correlation with a set of markers covering its genome. This approach has allowed the identification of various genes and QTLs but is limited by the small numbers of individuals used and by the narrow transect of the variability analyzed. In this article we propose the use of a multi-progeny mapping strategy that used pedigree information and Bayesian approaches that supports a more precise and complete survey of the available genetic variability. Results Seven key agronomic characters (data from 1 to 3 years) were analyzed in 18 progenies from crosses between occidental commercial genotypes and various exotic lines including accessions of other Prunus species. A total of 1467 plants from these progenies were genotyped with a 9 k SNP array. Forty-seven QTLs were identified, 22 coinciding with major genes and QTLs that have been consistently found in the same populations when studied individually and 25 were new. A substantial part of the QTLs observed (47%) would not have been detected in crosses between only commercial materials, showing the high value of exotic lines as a source of novel alleles for the commercial gene pool. Our strategy also provided estimations on the narrow sense heritability of each character, and the estimation of the QTL genotypes of each parent for the different QTLs and their breeding value. Conclusions The integrated strategy used provides a broader and more accurate picture of the variability available for peach breeding with the identification of many new QTLs, information on the sources of the alleles of interest and the breeding values of the potential donors of such valuable alleles. These results are first-hand information for breeders and a step forward towards the implementation of DNA-informed strategies to facilitate selection of new cultivars with improved productivity and quality. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3783-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- José R Hernández Mora
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB; Campus UAB, Bellaterra (Cerdanyola del Vallès), 08193, Barcelona, Spain
| | - Diego Micheletti
- Research and Innovation Centre, Fondazione Edmund Mach (FEM), Via Mach 1, 38010, San Michele all'Adige, TN, Italy
| | - Marco Bink
- Hendrix Genetics Research, Technology & Services B.V., P.O. Box 114, 5830AC, Boxmeer, The Netherlands
| | - Eric Van de Weg
- Plant Breeding, Wageningen University and Research Droevendaalsesteeg 1, P.O. Box 386, 6700AJ, Wageningen, The Netherlands
| | - Celia Cantín
- IRTA, FruitCentreParc Cientific i Tecnològic Agroalimentari de Lleida (PCiTAL), Lleida, Spain
| | - Nelson Nazzicari
- PTP Science Park, Via Einstein, 26900, Lodi, Italy.,Council for Agricultural Research and Economics (CREA) Research Centre for Fodder Crops and Dairy Productions, Lodi, Italy
| | | | - Maria Teresa Dettori
- Consiglio per la Ricerca in Agricoltura e L'analisi Dell'economia Agraria (CREA) - Centro di Ricerca per la Frutticoltura, 00134, Roma, Italy
| | - Sabrina Micali
- Consiglio per la Ricerca in Agricoltura e L'analisi Dell'economia Agraria (CREA) - Centro di Ricerca per la Frutticoltura, 00134, Roma, Italy
| | - Elisa Banchi
- Research and Innovation Centre, Fondazione Edmund Mach (FEM), Via Mach 1, 38010, San Michele all'Adige, TN, Italy
| | | | | | | | | | - Michela Troggio
- Research and Innovation Centre, Fondazione Edmund Mach (FEM), Via Mach 1, 38010, San Michele all'Adige, TN, Italy
| | - Daniele Bassi
- Università degli Studi di Milano, DiSAA, Via Celoria 2, 20133, Milan, Italy
| | - Laura Rossini
- PTP Science Park, Via Einstein, 26900, Lodi, Italy.,Università degli Studi di Milano, DiSAA, Via Celoria 2, 20133, Milan, Italy
| | - Ignazio Verde
- Consiglio per la Ricerca in Agricoltura e L'analisi Dell'economia Agraria (CREA) - Centro di Ricerca per la Frutticoltura, 00134, Roma, Italy
| | | | | | - Pere Arús
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB; Campus UAB, Bellaterra (Cerdanyola del Vallès), 08193, Barcelona, Spain
| | - Maria José Aranzana
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB; Campus UAB, Bellaterra (Cerdanyola del Vallès), 08193, Barcelona, Spain.
| |
Collapse
|
26
|
Biscarini F, Nazzicari N, Bink M, Arús P, Aranzana MJ, Verde I, Micali S, Pascal T, Quilot-Turion B, Lambert P, da Silva Linge C, Pacheco I, Bassi D, Stella A, Rossini L. Genome-enabled predictions for fruit weight and quality from repeated records in European peach progenies. BMC Genomics 2017; 18:432. [PMID: 28583089 PMCID: PMC5460546 DOI: 10.1186/s12864-017-3781-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Accepted: 05/10/2017] [Indexed: 11/16/2022] Open
Abstract
Background Highly polygenic traits such as fruit weight, sugar content and acidity strongly influence the agroeconomic value of peach varieties. Genomic Selection (GS) can accelerate peach yield and quality gain if predictions show higher levels of accuracy compared to phenotypic selection. The available IPSC 9K SNP array V1 allows standardized and highly reliable genotyping, preparing the ground for GS in peach. Results A repeatability model (multiple records per individual plant) for genome-enabled predictions in eleven European peach populations is presented. The analysis included 1147 individuals derived from both commercial and non-commercial peach or peach-related accessions. Considered traits were average fruit weight (FW), sugar content (SC) and titratable acidity (TA). Plants were genotyped with the 9K IPSC array, grown in three countries (France, Italy, Spain) and phenotyped for 3–5 years. An analysis of imputation accuracy of missing genotypic data was conducted using the software Beagle, showing that two of the eleven populations were highly sensitive to increasing levels of missing data. The regression model produced, for each trait and each population, estimates of heritability (FW:0.35, SC:0.48, TA:0.53, on average) and repeatability (FW:0.56, SC:0.63, TA:0.62, on average). Predictive ability was estimated in a five-fold cross validation scheme within population as the correlation of true and predicted phenotypes. Results differed by populations and traits, but predictive abilities were in general high (FW:0.60, SC:0.72, TA:0.65, on average). Conclusions This study assessed the feasibility of Genomic Selection in peach for highly polygenic traits linked to yield and fruit quality. The accuracy of imputing missing genotypes was as high as 96%, and the genomic predictive ability was on average 0.65, but could be as high as 0.84 for fruit weight or 0.83 for titratable acidity. The estimated repeatability may prove very useful in the management of the typical long cycles involved in peach productions. All together, these results are very promising for the application of genomic selection to peach breeding programmes. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3781-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Filippo Biscarini
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy.,IBBA-CNR, Via Edoardo Bassini, 15, Milan, 20133, Italy
| | - Nelson Nazzicari
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy.,Council for Agricultural Research and Economics (CREA) Research Centre for Fodder Crops and Dairy Productions, Lodi, Italy
| | - Marco Bink
- Wageningen UR Biometris, Wageningen, The Netherlands.,Present Address: Hendrix Genetics Research, Technology & Services B.V., P.O. Box 114, Boxmeer NL, 5830AC, The Netherlands
| | - Pere Arús
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra (Cerdanyola del Vallés), Barcelona, Spain
| | - Maria José Aranzana
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra (Cerdanyola del Vallés), Barcelona, Spain
| | - Ignazio Verde
- Consiglio per la ricerca in agricoltura e l'analisi dell'economia agraria (CREA) - Centro di Ricerca per la Frutticoltura (CREA-FRU), Via di Fioranello 52, Roma, Italy
| | - Sabrina Micali
- Consiglio per la ricerca in agricoltura e l'analisi dell'economia agraria (CREA) - Centro di Ricerca per la Frutticoltura (CREA-FRU), Via di Fioranello 52, Roma, Italy
| | | | | | - Patrick Lambert
- Università degli Studi di Milano - DiSAA, Via Celoria 2, Milano, Italy
| | | | - Igor Pacheco
- Università degli Studi di Milano - DiSAA, Via Celoria 2, Milano, Italy.,Institute of Nutrition and Food Technology - INTA, Universidad de Chile, Av El Líbano 5524, Santiago, Chile
| | - Daniele Bassi
- Università degli Studi di Milano - DiSAA, Via Celoria 2, Milano, Italy
| | - Alessandra Stella
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy.,IBBA-CNR, Via Edoardo Bassini, 15, Milan, 20133, Italy
| | - Laura Rossini
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy. .,Università degli Studi di Milano - DiSAA, Via Celoria 2, Milano, Italy.
| |
Collapse
|
27
|
Annicchiarico P, Nazzicari N, Wei Y, Pecetti L, Brummer EC. Genotyping-by-Sequencing and Its Exploitation for Forage and Cool-Season Grain Legume Breeding. Front Plant Sci 2017; 8:679. [PMID: 28536584 PMCID: PMC5423274 DOI: 10.3389/fpls.2017.00679] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Accepted: 04/13/2017] [Indexed: 05/21/2023]
Abstract
Genotyping-by-Sequencing (GBS) may drastically reduce genotyping costs compared with single nucleotide polymorphism (SNP) array platforms. However, it may require optimization for specific crops to maximize the number of available markers. Exploiting GBS-generated markers may require optimization, too (e.g., to cope with missing data). This study aimed (i) to compare elements of GBS protocols on legume species that differ for genome size, ploidy, and breeding system, and (ii) to show successful applications and challenges of GBS data on legume species. Preliminary work on alfalfa and Medicago truncatula suggested the greater interest of ApeKI over PstI:MspI DNA digestion. We compared KAPA and NEB Taq polymerases in combination with primer extensions that were progressively more selective on restriction sites, and found greater number of polymorphic SNP loci in pea, white lupin and diploid alfalfa when adopting KAPA with a non-selective primer. This protocol displayed a slight advantage also for tetraploid alfalfa (where SNP calling requires higher read depth). KAPA offered the further advantage of more uniform amplification than NEB over fragment sizes and GC contents. The number of GBS-generated polymorphic markers exceeded 6,500 in two tetraploid alfalfa reference populations and a world collection of lupin genotypes, and 2,000 in different sets of pea or lupin recombinant inbred lines. The predictive ability of GBS-based genomic selection was influenced by the genotype missing data threshold and imputation, as well as by the genomic selection model, with the best model depending on traits and data sets. We devised a simple method for comparing phenotypic vs. genomic selection in terms of predicted yield gain per year for same evaluation costs, whose application to preliminary data for alfalfa and pea in a hypothetical selection scenario for each crop indicated a distinct advantage of genomic selection.
Collapse
Affiliation(s)
- Paolo Annicchiarico
- Centro di Ricerca per le Produzioni Foraggere e Lattiero-Casearie (FLC), CREALodi, Italy
| | - Nelson Nazzicari
- Centro di Ricerca per le Produzioni Foraggere e Lattiero-Casearie (FLC), CREALodi, Italy
| | - Yanling Wei
- Centro di Ricerca per le Produzioni Foraggere e Lattiero-Casearie (FLC), CREALodi, Italy
- Plant Breeding Center, Department of Plant Sciences, University of California, Davis, DavisCA, USA
| | - Luciano Pecetti
- Centro di Ricerca per le Produzioni Foraggere e Lattiero-Casearie (FLC), CREALodi, Italy
| | - Edward C. Brummer
- Plant Breeding Center, Department of Plant Sciences, University of California, Davis, DavisCA, USA
| |
Collapse
|
28
|
Biazzi E, Nazzicari N, Pecetti L, Brummer EC, Palmonari A, Tava A, Annicchiarico P. Genome-Wide Association Mapping and Genomic Selection for Alfalfa (Medicago sativa) Forage Quality Traits. PLoS One 2017; 12:e0169234. [PMID: 28068350 PMCID: PMC5222375 DOI: 10.1371/journal.pone.0169234] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2016] [Accepted: 12/13/2016] [Indexed: 12/03/2022] Open
Abstract
Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the most-grown forage legume worldwide. This study aimed at exploring opportunities for marker-assisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3-0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits (by genomic selection or MAS) and forage yield.
Collapse
Affiliation(s)
- Elisa Biazzi
- Council for Agricultural Research and Economics—Research Centre for Fodder Crops and Dairy Productions (CREA-FLC), Lodi, Italy
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics—Research Centre for Fodder Crops and Dairy Productions (CREA-FLC), Lodi, Italy
| | - Luciano Pecetti
- Council for Agricultural Research and Economics—Research Centre for Fodder Crops and Dairy Productions (CREA-FLC), Lodi, Italy
| | - E. Charles Brummer
- Plant Breeding Center, Department of Plant Sciences, University of California, Davis, CA, United States of America
| | - Alberto Palmonari
- Department of Veterinary Medicine, University of Bologna, Bologna, Italy
| | - Aldo Tava
- Council for Agricultural Research and Economics—Research Centre for Fodder Crops and Dairy Productions (CREA-FLC), Lodi, Italy
| | - Paolo Annicchiarico
- Council for Agricultural Research and Economics—Research Centre for Fodder Crops and Dairy Productions (CREA-FLC), Lodi, Italy
| |
Collapse
|
29
|
Biscarini F, Nazzicari N, Broccanello C, Stevanato P, Marini S. "Noisy beets": impact of phenotyping errors on genomic predictions for binary traits in Beta vulgaris. Plant Methods 2016; 12:36. [PMID: 27437026 PMCID: PMC4949885 DOI: 10.1186/s13007-016-0136-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Accepted: 07/06/2016] [Indexed: 06/06/2023]
Abstract
BACKGROUND Noise (errors) in scientific data is endemic and may have a detrimental effect on statistical analyses and experimental results. The effects of noisy data have been assessed in genome-wide association studies for case-control experiments in human medicine. Little is known, however, on the impact of noisy data on genomic predictions, a widely used statistical application in plant and animal breeding. RESULTS In this study, the sensitivity to noise in the data of five classification methods (K-nearest neighbours-KNN, random forest-RF, ridge logistic regression-LR, and support vector machines with linear or radial basis function kernels) was investigated. A sugar beet population of 123 plants phenotyped for a binary trait and genotyped for 192 SNP (single nucleotide polymorphism) markers was used. Labels (0/1 phenotype) were randomly sampled to generate noise. From the base scenario without errors in the labels, increasing proportions of noisy labels-up to 50 %-were generated and introduced in the data. CONCLUSIONS Local classification methods-KNN and RF-showed higher tolerance to noisy labels compared to methods that leverage global data properties-LR and the two SVM models. In particular, KNN outperformed all other classifiers with AUC (area under the ROC curve) higher than 0.95 up to 20 % noisy labels. The runner-up method, RF, had an AUC of 0.941 with 20 % noise.
Collapse
Affiliation(s)
- Filippo Biscarini
- />Department of Bioinformatics and Biostatistics, PTP Science Park, Via Einstein - Loc. Cascina Codazza, 26900 Lodi, Italy
| | - Nelson Nazzicari
- />Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, Lodi, Italy
| | | | | | - Simone Marini
- />Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
| |
Collapse
|
30
|
Annicchiarico P, Nazzicari N, Ananta A, Carelli M, Wei Y, Brummer EC. Assessment of Cultivar Distinctness in Alfalfa: A Comparison of Genotyping-by-Sequencing, Simple-Sequence Repeat Marker, and Morphophysiological Observations. Plant Genome 2016; 9. [PMID: 27898838 DOI: 10.3835/plantgenome2015.10.0105] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Cultivar registration agencies typically require morphophysiological trait-based distinctness of candidate cultivars. This requirement is difficult to achieve for cultivars of major perennial forages because of their genetic structure and ever-increasing number of registered material, leading to possible rejection of agronomically valuable cultivars. This study aimed to explore the value of molecular markers applied to replicated bulked plants (three bulks of 100 independent plants each per cultivar) to assess alfalfa ( L. subsp. ) cultivar distinctness. We compared genotyping-by-sequencing information based on 2902 polymorphic single-nucleotide polymorphism (SNP) markers (>30 reads per DNA sample) with morphophysiological information based on 11 traits and with simple-sequence repeat (SSR) marker information from 41 polymorphic markers for their ability to distinguish 11 alfalfa landraces representative of the germplasm from northern Italy. Three molecular criteria, one based on cultivar differences for individual SSR bands and two based on overall SNP marker variation assessed either by statistically significant cultivar differences on principal component axes or discriminant analysis, distinctly outperformed the morphophysiological criterion. Combining the morphophysiological criterion with either molecular marker method increased discrimination among cultivars, since morphophysiological diversity was unrelated to SSR marker-based diversity ( = 0.04) and poorly related to SNP marker-based diversity ( = 0.23, < 0.15). The criterion based on statistically significant SNP allele frequency differences was less discriminating than morphophysiological variation. Marker-based distinctness, which can be assessed at low cost and without interactions with testing conditions, could validly substitute for (or complement) morphophysiological distinctness in alfalfa cultivar registration schemes. It also has interest in sui generis registration systems aimed at marketing alfalfa landraces.
Collapse
|
31
|
Annicchiarico P, Nazzicari N, Li X, Wei Y, Pecetti L, Brummer EC. Accuracy of genomic selection for alfalfa biomass yield in different reference populations. BMC Genomics 2015; 16:1020. [PMID: 26626170 PMCID: PMC4667460 DOI: 10.1186/s12864-015-2212-y] [Citation(s) in RCA: 83] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Accepted: 11/13/2015] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Genomic selection based on genotyping-by-sequencing (GBS) data could accelerate alfalfa yield gains, if it displayed moderate ability to predict parent breeding values. Its interest would be enhanced by predicting ability also for germplasm/reference populations other than those for which it was defined. Predicting accuracy may be influenced by statistical models, SNP calling procedures and missing data imputation strategies. RESULTS Landrace and variety material from two genetically-contrasting reference populations, i.e., 124 elite genotypes adapted to the Po Valley (sub-continental climate; PV population) and 154 genotypes adapted to Mediterranean-climate environments (Me population), were genotyped by GBS and phenotyped in separate environments for dry matter yield of their dense-planted half-sib progenies. Both populations showed no sub-population genetic structure. Predictive accuracy was higher by joint rather than separate SNP calling for the two data sets, and using random forest imputation of missing data. Highest accuracy was obtained using Support Vector Regression (SVR) for PV, and Ridge Regression BLUP and SVR for Me germplasm. Bayesian methods (Bayes A, Bayes B and Bayesian Lasso) tended to be less accurate. Random Forest Regression was the least accurate model. Accuracy attained about 0.35 for Me in the range of 0.30-0.50 missing data, and 0.32 for PV at 0.50 missing data, using at least 10,000 SNP markers. Cross-population predictions based on a smaller subset of common SNPs implied a relative loss of accuracy of about 25% for Me and 30% for PV. Genome-wide association analyses based on large subsets of M. truncatula-aligned markers revealed many SNPs with modest association with yield, and some genome areas hosting putative QTLs. A comparison of genomic vs. conventional selection for parent breeding value assuming 1-year vs. 5-year selection cycles, respectively, indicated over three-fold greater predicted yield gain per unit time for genomic selection. CONCLUSIONS Genomic selection for alfalfa yield is promising, based on its moderate prediction accuracy, moderate value of cross-population predictions, and lack of sub-population structure. There is limited scope for searching individual QTLs with overwhelming effect on yield. Some of our results can contribute to better design of genomic selection experiments for alfalfa and other crops with similar mating systems.
Collapse
Affiliation(s)
- Paolo Annicchiarico
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, 29 viale Piacenza, 26900, Lodi, Italy.
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, 29 viale Piacenza, 26900, Lodi, Italy.
| | - Xuehui Li
- Department of Plant Sciences, North Dakota State University, 1340 Administration Avenue, Fargo, ND, 58108, USA.
| | - Yanling Wei
- Plant Sciences Department, University of California, Davis, Plant Breeding Center, One Shields Avenue, Davis, CA, 95616, USA.
| | - Luciano Pecetti
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, 29 viale Piacenza, 26900, Lodi, Italy.
| | - E Charles Brummer
- Plant Sciences Department, University of California, Davis, Plant Breeding Center, One Shields Avenue, Davis, CA, 95616, USA.
| |
Collapse
|
32
|
Annicchiarico P, Nazzicari N, Li X, Wei Y, Pecetti L, Brummer EC. Accuracy of genomic selection for alfalfa biomass yield in different reference populations. BMC Genomics 2015. [PMID: 26626170 DOI: 10.1186/s12864‐015‐2212‐y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Genomic selection based on genotyping-by-sequencing (GBS) data could accelerate alfalfa yield gains, if it displayed moderate ability to predict parent breeding values. Its interest would be enhanced by predicting ability also for germplasm/reference populations other than those for which it was defined. Predicting accuracy may be influenced by statistical models, SNP calling procedures and missing data imputation strategies. RESULTS Landrace and variety material from two genetically-contrasting reference populations, i.e., 124 elite genotypes adapted to the Po Valley (sub-continental climate; PV population) and 154 genotypes adapted to Mediterranean-climate environments (Me population), were genotyped by GBS and phenotyped in separate environments for dry matter yield of their dense-planted half-sib progenies. Both populations showed no sub-population genetic structure. Predictive accuracy was higher by joint rather than separate SNP calling for the two data sets, and using random forest imputation of missing data. Highest accuracy was obtained using Support Vector Regression (SVR) for PV, and Ridge Regression BLUP and SVR for Me germplasm. Bayesian methods (Bayes A, Bayes B and Bayesian Lasso) tended to be less accurate. Random Forest Regression was the least accurate model. Accuracy attained about 0.35 for Me in the range of 0.30-0.50 missing data, and 0.32 for PV at 0.50 missing data, using at least 10,000 SNP markers. Cross-population predictions based on a smaller subset of common SNPs implied a relative loss of accuracy of about 25% for Me and 30% for PV. Genome-wide association analyses based on large subsets of M. truncatula-aligned markers revealed many SNPs with modest association with yield, and some genome areas hosting putative QTLs. A comparison of genomic vs. conventional selection for parent breeding value assuming 1-year vs. 5-year selection cycles, respectively, indicated over three-fold greater predicted yield gain per unit time for genomic selection. CONCLUSIONS Genomic selection for alfalfa yield is promising, based on its moderate prediction accuracy, moderate value of cross-population predictions, and lack of sub-population structure. There is limited scope for searching individual QTLs with overwhelming effect on yield. Some of our results can contribute to better design of genomic selection experiments for alfalfa and other crops with similar mating systems.
Collapse
Affiliation(s)
- Paolo Annicchiarico
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, 29 viale Piacenza, 26900, Lodi, Italy.
| | - Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, 29 viale Piacenza, 26900, Lodi, Italy.
| | - Xuehui Li
- Department of Plant Sciences, North Dakota State University, 1340 Administration Avenue, Fargo, ND, 58108, USA.
| | - Yanling Wei
- Plant Sciences Department, University of California, Davis, Plant Breeding Center, One Shields Avenue, Davis, CA, 95616, USA.
| | - Luciano Pecetti
- Council for Agricultural Research and Economics (CREA), Research Centre for Fodder Crops and Dairy Productions, 29 viale Piacenza, 26900, Lodi, Italy.
| | - E Charles Brummer
- Plant Sciences Department, University of California, Davis, Plant Breeding Center, One Shields Avenue, Davis, CA, 95616, USA.
| |
Collapse
|
33
|
Nicolazzi EL, Biffani S, Biscarini F, Orozco Ter Wengel P, Caprera A, Nazzicari N, Stella A. Software solutions for the livestock genomics SNP array revolution. Anim Genet 2015; 46:343-53. [PMID: 25907889 DOI: 10.1111/age.12295] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/07/2015] [Indexed: 02/04/2023]
Abstract
Since the beginning of the genomic era, the number of available single nucleotide polymorphism (SNP) arrays has grown considerably. In the bovine species alone, 11 SNP chips not completely covered by intellectual property are currently available, and the number is growing. Genomic/genotype data are not standardized, and this hampers its exchange and integration. In addition, software used for the analyses of these data usually requires not standard (i.e. case specific) input files which, considering the large amount of data to be handled, require at least some programming skills in their production. In this work, we describe a software toolkit for SNP array data management, imputation, genome-wide association studies, population genetics and genomic selection. However, this toolkit does not solve the critical need for standardization of the genotypic data and software input files. It only highlights the chaotic situation each researcher has to face on a daily basis and gives some helpful advice on the currently available tools in order to navigate the SNP array data complexity.
Collapse
Affiliation(s)
- E L Nicolazzi
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - S Biffani
- Istituto di biologia e biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - F Biscarini
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - P Orozco Ter Wengel
- School of Biosciences, Cardiff University, Museum Avenue, Cardiff, CF10 3AX, UK
| | - A Caprera
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - N Nazzicari
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - A Stella
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy.,Istituto di biologia e biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| |
Collapse
|
34
|
Nicolazzi EL, Caprera A, Nazzicari N, Cozzi P, Strozzi F, Lawley C, Pirani A, Soans C, Brew F, Jorjani H, Evans G, Simpson B, Tosser-Klopp G, Brauning R, Williams JL, Stella A. SNPchiMp v.3: integrating and standardizing single nucleotide polymorphism data for livestock species. BMC Genomics 2015; 16:283. [PMID: 25881165 PMCID: PMC4399246 DOI: 10.1186/s12864-015-1497-1] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Accepted: 03/27/2015] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND In recent years, the use of genomic information in livestock species for genetic improvement, association studies and many other fields has become routine. In order to accommodate different market requirements in terms of genotyping cost, manufacturers of single nucleotide polymorphism (SNP) arrays, private companies and international consortia have developed a large number of arrays with different content and different SNP density. The number of currently available SNP arrays differs among species: ranging from one for goats to more than ten for cattle, and the number of arrays available is increasing rapidly. However, there is limited or no effort to standardize and integrate array- specific (e.g. SNP IDs, allele coding) and species-specific (i.e. past and current assemblies) SNP information. RESULTS Here we present SNPchiMp v.3, a solution to these issues for the six major livestock species (cow, pig, horse, sheep, goat and chicken). Original data was collected directly from SNP array producers and specific international genome consortia, and stored in a MySQL database. The database was then linked to an open-access web tool and to public databases. SNPchiMp v.3 ensures fast access to the database (retrieving within/across SNP array data) and the possibility of annotating SNP array data in a user-friendly fashion. CONCLUSIONS This platform allows easy integration and standardization, and it is aimed at both industry and research. It also enables users to easily link the information available from the array producer with data in public databases, without the need of additional bioinformatics tools or pipelines. In recognition of the open-access use of Ensembl resources, SNPchiMp v.3 was officially credited as an Ensembl E!mpowered tool. Availability at http://bioinformatics.tecnoparco.org/SNPchimp.
Collapse
Affiliation(s)
- Ezequiel L Nicolazzi
- Bioinformatics and Biostatistical Genomics group, Fondazione Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, 26900, Lodi, Italy.
| | - Andrea Caprera
- Bioinformatics and Biostatistical Genomics group, Fondazione Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, 26900, Lodi, Italy.
| | - Nelson Nazzicari
- Bioinformatics and Biostatistical Genomics group, Fondazione Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, 26900, Lodi, Italy.
| | - Paolo Cozzi
- Bioinformatics and Biostatistical Genomics group, Fondazione Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, 26900, Lodi, Italy.
| | - Francesco Strozzi
- Bioinformatics and Biostatistical Genomics group, Fondazione Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, 26900, Lodi, Italy.
| | - Cindy Lawley
- Illumina Inc, 5200 Illumina Way, San Diego, CA, 92121, USA.
| | - Ali Pirani
- Affymetrix Inc, 3420 Central Expressway, Santa Clara, CA, 95051, USA.
| | | | - Fiona Brew
- Affymetrix UK Ltd, Mercury Park, Wycombe Lane, High Wycombe, HP10 0HH, UK.
| | | | - Gary Evans
- GeneSeek, a Neogen Company, Auchincruive, Ayr KA6 5HU, Scotland, UK.
| | - Barry Simpson
- GeneSeek, a Neogen Company, Lincoln, NE, 68504, USA.
| | - Gwenola Tosser-Klopp
- Génétique, Physiologie et Systèmes d'Élevage, Institut National de la Recherche Agronomique & Ecole Nationale Vétérinaire de Toulouse & Ecole Nationale Supérieure Agronomique de Toulouse, Castanet-Tolosan, 31326, France.
| | - Rudiger Brauning
- AgResearch, Invermay Agricultural Centre, PB 50034, Mosgiel, New Zealand.
| | - John L Williams
- Bioinformatics and Biostatistical Genomics group, Fondazione Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, 26900, Lodi, Italy.
| | - Alessandra Stella
- Bioinformatics and Biostatistical Genomics group, Fondazione Parco Tecnologico Padano, Via Einstein, Loc. Cascina Codazza, 26900, Lodi, Italy.
- Istituto di Biologia e Biotecnologia Agraria, Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi 26900, Italy.
| |
Collapse
|
35
|
Micheletti D, Dettori MT, Micali S, Aramini V, Pacheco I, Da Silva Linge C, Foschi S, Banchi E, Barreneche T, Quilot-Turion B, Lambert P, Pascal T, Iglesias I, Carbó J, Wang LR, Ma RJ, Li XW, Gao ZS, Nazzicari N, Troggio M, Bassi D, Rossini L, Verde I, Laurens F, Arús P, Aranzana MJ. Whole-Genome Analysis of Diversity and SNP-Major Gene Association in Peach Germplasm. PLoS One 2015; 10:e0136803. [PMID: 26352671 PMCID: PMC4564248 DOI: 10.1371/journal.pone.0136803] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2015] [Accepted: 08/07/2015] [Indexed: 12/24/2022] Open
Abstract
Peach was domesticated in China more than four millennia ago and from there it spread world-wide. Since the middle of the last century, peach breeding programs have been very dynamic generating hundreds of new commercial varieties, however, in most cases such varieties derive from a limited collection of parental lines (founders). This is one reason for the observed low levels of variability of the commercial gene pool, implying that knowledge of the extent and distribution of genetic variability in peach is critical to allow the choice of adequate parents to confer enhanced productivity, adaptation and quality to improved varieties. With this aim we genotyped 1,580 peach accessions (including a few closely related Prunus species) maintained and phenotyped in five germplasm collections (four European and one Chinese) with the International Peach SNP Consortium 9K SNP peach array. The study of population structure revealed the subdivision of the panel in three main populations, one mainly made up of Occidental varieties from breeding programs (POP1OCB), one of Occidental landraces (POP2OCT) and the third of Oriental accessions (POP3OR). Analysis of linkage disequilibrium (LD) identified differential patterns of genome-wide LD blocks in each of the populations. Phenotypic data for seven monogenic traits were integrated in a genome-wide association study (GWAS). The significantly associated SNPs were always in the regions predicted by linkage analysis, forming haplotypes of markers. These diagnostic haplotypes could be used for marker-assisted selection (MAS) in modern breeding programs.
Collapse
Affiliation(s)
- Diego Micheletti
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra (Cerdanyola del Vallès), Barcelona, Spain
| | - Maria Teresa Dettori
- Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria (CRA), Centro di Ricerca per la Frutticoltura, Roma, Italy
| | - Sabrina Micali
- Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria (CRA), Centro di Ricerca per la Frutticoltura, Roma, Italy
| | - Valeria Aramini
- Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria (CRA), Centro di Ricerca per la Frutticoltura, Roma, Italy
| | - Igor Pacheco
- Università degli Studi di Milano, DiSAA, Milan, Italy
| | | | | | - Elisa Banchi
- Research and Innovation Centre, Fondazione Edmund Mach (FEM), San Michele all'Adige (TN), Italy
| | - Teresa Barreneche
- INRA, UMR 1332 de Biologie du Fruit et Pathologie, Villenave d’Ornon, France
- Univ. Bordeaux, UMR 1332 de Biologie du Fruit et Pathologie, Villenave d’Ornon, France
| | | | | | - Thierry Pascal
- INRA UR1052 GAFL, Domaine Saint Maurice, Montfavet, France
| | - Ignasi Iglesias
- IRTA, Estació Experimental de Lleida, Parc de Gardeny, Edifici Fruitcentre, Lleida, Spain
| | - Joaquim Carbó
- IRTA, Estacio Experimental Mas Badia, La Tallada d'Empordà, Girona, Spain
| | - Li-rong Wang
- Zhenzhou Fruit Research Institute, CAAS, Zhengzhou, China
| | - Rui-juan Ma
- Horticultural Institute, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Xiong-wei Li
- Department of Horticulture, Zhejiang University, Hangzhou, China
| | - Zhong-shan Gao
- Department of Horticulture, Zhejiang University, Hangzhou, China
| | | | - Michela Troggio
- Research and Innovation Centre, Fondazione Edmund Mach (FEM), San Michele all'Adige (TN), Italy
| | - Daniele Bassi
- Università degli Studi di Milano, DiSAA, Milan, Italy
| | - Laura Rossini
- Università degli Studi di Milano, DiSAA, Milan, Italy
- Parco Tecnologico Padano, Lodi, Italy
| | - Ignazio Verde
- Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria (CRA), Centro di Ricerca per la Frutticoltura, Roma, Italy
| | - François Laurens
- INRA, UMR 1345 Institut de Recherche en Horticulture et Semences, Beaucouzé, France
- Université d’Angers, UMR 1345 Institut de Recherche en Horticulture et Semences, SFR 4207 QUASAV, PRES L’UNAM, Angers, France
- AgroCampus-Ouest, UMR 1345 Institut de Recherche en Horticulture et Semences, Angers, France
| | - Pere Arús
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra (Cerdanyola del Vallès), Barcelona, Spain
| | - Maria José Aranzana
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra (Cerdanyola del Vallès), Barcelona, Spain
- * E-mail:
| |
Collapse
|
36
|
|