1
|
Nazzicari N, Franguelli N, Ferrari B, Pecetti L, Annicchiarico P. The Effect of Genome Parametrization and SNP Marker Subsetting on Genomic Selection in Autotetraploid Alfalfa. Genes (Basel) 2024; 15:449. [PMID: 38674384 PMCID: PMC11050091 DOI: 10.3390/genes15040449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 03/21/2024] [Accepted: 03/27/2024] [Indexed: 04/28/2024] Open
Abstract
BACKGROUND Alfalfa, the most economically important forage legume worldwide, features modest genetic progress due to long selection cycles and the extent of the non-additive genetic variance associated with its autotetraploid genome. METHODS To improve the efficiency of genomic selection in alfalfa, we explored the effects of genome parametrization (as tetraploid and diploid dosages, plus allele ratios) and SNP marker subsetting (all available SNPs, only genic regions, and only non-genic regions) on genomic regressions, together with various levels of filtering on reading depth and missing rates. We used genotyping by sequencing-generated data and focused on traits of different genetic complexity, i.e., dry biomass yield in moisture-favorable (FE) and drought stress (SE) environments, leaf size, and the onset of flowering, which were assessed in 143 genotyped plants from a genetically broad European reference population and their phenotyped half-sib progenies. RESULTS On average, the allele ratio improved the predictive ability compared with other genome parametrizations (+7.9% vs. tetraploid dosage, +12.6% vs. diploid dosage), while using all the SNPs offered an advantage compared with any specific SNP subsetting (+3.7% vs. genic regions, +7.6% vs. non-genic regions). However, when focusing on specific traits, different combinations of genome parametrization and subsetting achieved better performances. We also released Legpipe2, an SNP calling pipeline tailored for reduced representation (GBS, RAD) in medium-sized genotyping experiments.
Collapse
Affiliation(s)
- Nelson Nazzicari
- Council for Agricultural Research and Economics (CREA), Research Center for Animal Production and Aquaculture, Viale Piacenza 29, 26900 Lodi, Italy
| | | | | | | | | |
Collapse
|
2
|
Lu G, Liu P, Wu Q, Zhang S, Zhao P, Zhang Y, Que Y. Sugarcane breeding: a fantastic past and promising future driven by technology and methods. FRONTIERS IN PLANT SCIENCE 2024; 15:1375934. [PMID: 38525140 PMCID: PMC10957636 DOI: 10.3389/fpls.2024.1375934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 02/21/2024] [Indexed: 03/26/2024]
Abstract
Sugarcane is the most important sugar and energy crop in the world. During sugarcane breeding, technology is the requirement and methods are the means. As we know, seed is the cornerstone of the development of the sugarcane industry. Over the past century, with the advancement of technology and the expansion of methods, sugarcane breeding has continued to improve, and sugarcane production has realized a leaping growth, providing a large amount of essential sugar and clean energy for the long-term mankind development, especially in the face of the future threats of world population explosion, reduction of available arable land, and various biotic and abiotic stresses. Moreover, due to narrow genetic foundation, serious varietal degradation, lack of breakthrough varieties, as well as long breeding cycle and low probability of gene polymerization, it is particularly important to realize the leapfrog development of sugarcane breeding by seizing the opportunity for the emerging Breeding 4.0, and making full use of modern biotechnology including but not limited to whole genome selection, transgene, gene editing, and synthetic biology, combined with information technology such as remote sensing and deep learning. In view of this, we focus on sugarcane breeding from the perspective of technology and methods, reviewing the main history, pointing out the current status and challenges, and providing a reasonable outlook on the prospects of smart breeding.
Collapse
Affiliation(s)
- Guilong Lu
- National Key Laboratory of Tropical Crop Breeding, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences/Yunan Academy of Agricultural Sciences, Sanya/Kaiyuan, China
- College of Horticulture and Landscape Architecture, Henan Institute of Science and Technology, Xinxiang, China
| | - Purui Liu
- College of Horticulture and Landscape Architecture, Henan Institute of Science and Technology, Xinxiang, China
| | - Qibin Wu
- National Key Laboratory of Tropical Crop Breeding, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences/Yunan Academy of Agricultural Sciences, Sanya/Kaiyuan, China
- Key Laboratory of Sugarcane Biology and Genetic Breeding, Ministry of Agriculture and Rural Affairs, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Shuzhen Zhang
- National Key Laboratory of Tropical Crop Breeding, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences/Yunan Academy of Agricultural Sciences, Sanya/Kaiyuan, China
| | - Peifang Zhao
- National Key Laboratory of Tropical Crop Breeding, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences/Yunan Academy of Agricultural Sciences, Sanya/Kaiyuan, China
| | - Yuebin Zhang
- National Key Laboratory of Tropical Crop Breeding, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences/Yunan Academy of Agricultural Sciences, Sanya/Kaiyuan, China
| | - Youxiong Que
- National Key Laboratory of Tropical Crop Breeding, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences/Yunan Academy of Agricultural Sciences, Sanya/Kaiyuan, China
- Key Laboratory of Sugarcane Biology and Genetic Breeding, Ministry of Agriculture and Rural Affairs, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, China
| |
Collapse
|
3
|
Bilton TP, Sharma SK, Schofield MR, Black MA, Jacobs JME, Bryan GJ, Dodds KG. Construction of relatedness matrices in autopolyploid populations using low-depth high-throughput sequencing data. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:64. [PMID: 38430392 PMCID: PMC10908621 DOI: 10.1007/s00122-024-04568-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 01/30/2024] [Indexed: 03/03/2024]
Abstract
KEY MESSAGE An improved estimator of genomic relatedness using low-depth high-throughput sequencing data for autopolyploids is developed. Its outputs strongly correlate with SNP array-based estimates and are available in the package GUSrelate. High-throughput sequencing (HTS) methods have reduced sequencing costs and resources compared to array-based tools, facilitating the investigation of many non-model polyploid species. One important quantity that can be computed from HTS data is the genetic relatedness between all individuals in a population. However, HTS data are often messy, with multiple sources of errors (i.e. sequencing errors or missing parental alleles) which, if not accounted for, can lead to bias in genomic relatedness estimates. We derive a new estimator for constructing a genomic relationship matrix (GRM) from HTS data for autopolyploid species that accounts for errors associated with low sequencing depths, implemented in the R package GUSrelate. Simulations revealed that GUSrelate performed similarly to existing GRM methods at high depth but reduced bias in self-relatedness estimates when the sequencing depth was low. Using a panel consisting of 351 tetraploid potato genotypes, we found that GUSrelate produced GRMs from genotyping-by-sequencing (GBS) data that were highly correlated with a GRM computed from SNP array data, and less biased than existing methods when benchmarking against the array-based GRM estimates. GUSrelate provides researchers with a tool to reliably construct GRMs from low-depth HTS data.
Collapse
Affiliation(s)
- Timothy P Bilton
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand.
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand.
| | - Sanjeev Kumar Sharma
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, UK
| | - Matthew R Schofield
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand
| | - Michael A Black
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | | | - Glenn J Bryan
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, UK
| | - Ken G Dodds
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| |
Collapse
|
4
|
Islam MS, Corak K, McCord P, Hulse-Kemp AM, Lipka AE. A first look at the ability to use genomic prediction for improving the ratooning ability of sugarcane. FRONTIERS IN PLANT SCIENCE 2023; 14:1205999. [PMID: 37600177 PMCID: PMC10433174 DOI: 10.3389/fpls.2023.1205999] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 07/03/2023] [Indexed: 08/22/2023]
Abstract
The sugarcane ratooning ability (RA) is the most important target trait for breeders seeking to enhance the profitability of sugarcane production by reducing the planting cost. Understanding the genetics governing the RA could help breeders by identifying molecular markers that could be used for genomics-assisted breeding (GAB). A replicated field trial was conducted for three crop cycles (plant cane, first ratoon, and second ratoon) using 432 sugarcane clones and used for conducting genome-wide association and genomic prediction of five sugar and yield component traits of the RA. The RA traits for economic index (EI), stalk population (SP), stalk weight (SW), tonns of cane per hectare (TCH), and tonns of sucrose per hectare (TSH) were estimated from the yield and sugar data. A total of six putative quantitative trait loci and eight nonredundant single-nucleotide polymorphism (SNP) markers were associated with all five tested RA traits and appear to be unique. Seven putative candidate genes were colocated with significant SNPs associated with the five RA traits. The genomic prediction accuracies for those tested traits were moderate and ranged from 0.21 to 0.36. However, the models fitting fixed effects for the most significant associated markers for each respective trait did not give any advantages over the standard models without fixed effects. As a result of this study, more robust markers could be used in the future for clone selection in sugarcane, potentially helping resolve the genetic control of the RA in sugarcane.
Collapse
Affiliation(s)
| | - Keo Corak
- Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC, United States
| | - Per McCord
- Sugarcane Field Station, USDA-ARS, Canal Point, FL, United States
- Irrigated Agriculture Research and Extension Center, Washington State University, Prosser, WA, United States
| | - Amanda M. Hulse-Kemp
- Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC, United States
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC, United States
| | - Alexander E. Lipka
- Department of Crop Sciences, University of Illinois, Urbana-Champaign, IL, United States
| |
Collapse
|
5
|
Ortiz R, Reslow F, Vetukuri R, García-Gil MR, Pérez-Rodríguez P, Crossa J. Inbreeding Effects on the Performance and Genomic Prediction for Polysomic Tetraploid Potato Offspring Grown at High Nordic Latitudes. Genes (Basel) 2023; 14:1302. [PMID: 37372482 DOI: 10.3390/genes14061302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 06/18/2023] [Accepted: 06/19/2023] [Indexed: 06/29/2023] Open
Abstract
Inbreeding depression (ID) is caused by increased homozygosity in the offspring after selfing. Although the self-compatible, highly heterozygous, tetrasomic polyploid potato (Solanum tuberosum L.) suffers from ID, some argue that the potential genetic gains from using inbred lines in a sexual propagation system of potato are too large to be ignored. The aim of this research was to assess the effects of inbreeding on potato offspring performance under a high latitude and the accuracy of the genomic prediction of breeding values (GEBVs) for further use in selection. Four inbred (S1) and two hybrid (F1) offspring and their parents (S0) were used in the experiment, with a field layout of an augmented design with the four S0 replicated in nine incomplete blocks comprising 100, four-plant plots at Umeå (63°49'30″ N 20°15'50″ E), Sweden. S0 was significantly (p < 0.01) better than both S1 and F1 offspring for tuber weight (total and according to five grading sizes), tuber shape and size uniformity, tuber eye depth and reducing sugars in the tuber flesh, while F1 was significantly (p < 0.01) better than S1 for all tuber weight and uniformity traits. Some F1 hybrid offspring (15-19%) had better total tuber yield than the best-performing parent. The GEBV accuracy ranged from -0.3928 to 0.4436. Overall, tuber shape uniformity had the highest GEBV accuracy, while tuber weight traits exhibited the lowest accuracy. The F1 full sib's GEBV accuracy was higher, on average, than that of S1. Genomic prediction may facilitate eliminating undesired inbred or hybrid offspring for further use in the genetic betterment of potato.
Collapse
Affiliation(s)
- Rodomiro Ortiz
- Department of Plant Breeding, Swedish University of Agricultural Sciences (SLU), SE 23436 Lomma, Sweden
- Umeå Plant Science Center, SLU Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences (SLU), SE 90183 Umeå, Sweden
| | - Fredrik Reslow
- Department of Plant Breeding, Swedish University of Agricultural Sciences (SLU), SE 23436 Lomma, Sweden
| | - Ramesh Vetukuri
- Department of Plant Breeding, Swedish University of Agricultural Sciences (SLU), SE 23436 Lomma, Sweden
| | - M Rosario García-Gil
- Umeå Plant Science Center, SLU Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences (SLU), SE 90183 Umeå, Sweden
| | | | - José Crossa
- Colegio de Postgraduados (COLPOS), Montecillos 56230, Edo. de México, Mexico
- International Maize and Wheat Improvement Center (CIMMYT), El Batán, Texcoco 56237, Edo. de México, Mexico
| |
Collapse
|
6
|
Endelman JB. Fully efficient, two-stage analysis of multi-environment trials with directional dominance and multi-trait genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:65. [PMID: 36949348 PMCID: PMC10033618 DOI: 10.1007/s00122-023-04298-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 01/02/2023] [Indexed: 06/18/2023]
Abstract
R/StageWise enables fully efficient, two-stage analysis of multi-environment, multi-trait datasets for genomic selection, including support for dominance heterosis and polyploidy. Plant breeders interested in genomic selection often face challenges to fully utilizing multi-trait, multi-environment datasets. R package StageWise was developed to go beyond the capabilities of most specialized software for genomic prediction, without requiring the programming skills needed for more general-purpose software for mixed models. As the name suggests, one of the core features is a fully efficient, two-stage analysis for multiple environments, in which the full variance-covariance matrix of the Stage 1 genotype means is used in Stage 2. Another feature is directional dominance, including for polyploids, to account for inbreeding depression in outbred crops. StageWise enables selection with multi-trait indices, including restricted indices with one or more traits constrained to have zero response. For a potato dataset with 943 genotypes evaluated over 6 years, including the Stage 1 errors in Stage 2 reduced the Akaike Information Criterion (AIC) by 29, 67, and 104 for maturity, yield, and fry color, respectively. The proportion of variation explained by heterosis was largest for yield but still only 0.03, likely because of limited variation for the genomic inbreeding coefficient. Due to the large additive genetic correlation (0.57) between yield and maturity, naïve selection on an index combining yield and fry color led to an undesirable response for later maturity. The restricted index coefficients to maximize genetic merit without delaying maturity were identified. The software and three vignettes are available at https://github.com/jendelman/StageWise .
Collapse
Affiliation(s)
- Jeffrey B Endelman
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| |
Collapse
|
7
|
Mbo Nkoulou LF, Ngalle HB, Cros D, Adje COA, Fassinou NVH, Bell J, Achigan-Dako EG. Perspective for genomic-enabled prediction against black sigatoka disease and drought stress in polyploid species. FRONTIERS IN PLANT SCIENCE 2022; 13:953133. [PMID: 36388523 PMCID: PMC9650417 DOI: 10.3389/fpls.2022.953133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 09/28/2022] [Indexed: 06/16/2023]
Abstract
Genomic selection (GS) in plant breeding is explored as a promising tool to solve the problems related to the biotic and abiotic threats. Polyploid plants like bananas (Musa spp.) face the problem of drought and black sigatoka disease (BSD) that restrict their production. The conventional plant breeding is experiencing difficulties, particularly phenotyping costs and long generation interval. To overcome these difficulties, GS in plant breeding is explored as an alternative with a great potential for reducing costs and time in selection process. So far, GS does not have the same success in polyploid plants as with diploid plants because of the complexity of their genome. In this review, we present the main constraints to the application of GS in polyploid plants and the prospects for overcoming these constraints. Particular emphasis is placed on breeding for BSD and drought-two major threats to banana production-used in this review as a model of polyploid plant. It emerges that the difficulty in obtaining markers of good quality in polyploids is the first challenge of GS on polyploid plants, because the main tools used were developed for diploid species. In addition to that, there is a big challenge of mastering genetic interactions such as dominance and epistasis effects as well as the genotype by environment interaction, which are very common in polyploid plants. To get around these challenges, we have presented bioinformatics tools, as well as artificial intelligence approaches, including machine learning. Furthermore, a scheme for applying GS to banana for BSD and drought has been proposed. This review is of paramount impact for breeding programs that seek to reduce the selection cycle of polyploids despite the complexity of their genome.
Collapse
Affiliation(s)
- Luther Fort Mbo Nkoulou
- Genetics, Biotechnology, and Seed Science Unit (GBioS), Department of Plant Sciences, Faculty of Agronomic Sciences, University of Abomey Calavi, Cotonou, Benin
- Unit of Genetics and Plant Breeding (UGAP), Department of Plant Biology, Faculty of Sciences, University of Yaoundé 1, Yaoundé, Cameroon
- Institute of Agricultural Research for Development, Centre de Recherche Agricole de Mbalmayo (CRAM), Mbalmayo, Cameroon
| | - Hermine Bille Ngalle
- Unit of Genetics and Plant Breeding (UGAP), Department of Plant Biology, Faculty of Sciences, University of Yaoundé 1, Yaoundé, Cameroon
| | - David Cros
- Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), Unité Mixte de Recherche (UMR) Amélioration Génétique et Adaptation des Plantes méditerranéennes et tropicales (AGAP) Institut, Montpellier, France
- Unité Mixte de Recherche (UMR) Amélioration Génétique et Adaptation des Plantes méditerranéennes et tropicales (AGAP) Institut, University of Montpellier, Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), Institut Agro, Montpellier, France
| | - Charlotte O. A. Adje
- Genetics, Biotechnology, and Seed Science Unit (GBioS), Department of Plant Sciences, Faculty of Agronomic Sciences, University of Abomey Calavi, Cotonou, Benin
| | - Nicodeme V. H. Fassinou
- Genetics, Biotechnology, and Seed Science Unit (GBioS), Department of Plant Sciences, Faculty of Agronomic Sciences, University of Abomey Calavi, Cotonou, Benin
| | - Joseph Bell
- Unit of Genetics and Plant Breeding (UGAP), Department of Plant Biology, Faculty of Sciences, University of Yaoundé 1, Yaoundé, Cameroon
| | - Enoch G. Achigan-Dako
- Genetics, Biotechnology, and Seed Science Unit (GBioS), Department of Plant Sciences, Faculty of Agronomic Sciences, University of Abomey Calavi, Cotonou, Benin
| |
Collapse
|
8
|
Strategies to Increase Prediction Accuracy in Genomic Selection of Complex Traits in Alfalfa ( Medicago sativa L.). Cells 2021; 10:cells10123372. [PMID: 34943880 PMCID: PMC8699225 DOI: 10.3390/cells10123372] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 11/19/2021] [Accepted: 11/24/2021] [Indexed: 12/27/2022] Open
Abstract
Agronomic traits such as biomass yield and abiotic stress tolerance are genetically complex and challenging to improve through conventional breeding approaches. Genomic selection (GS) is an alternative approach in which genome-wide markers are used to determine the genomic estimated breeding value (GEBV) of individuals in a population. In alfalfa (Medicago sativa L.), previous results indicated that low to moderate prediction accuracy values (<70%) were obtained in complex traits, such as yield and abiotic stress resistance. There is a need to increase the prediction value in order to employ GS in breeding programs. In this paper we reviewed different statistic models and their applications in polyploid crops, such as alfalfa and potato. Specifically, we used empirical data affiliated with alfalfa yield under salt stress to investigate approaches that use DNA marker importance values derived from machine learning models, and genome-wide association studies (GWAS) of marker-trait association scores based on different GWASpoly models, in weighted GBLUP analyses. This approach increased prediction accuracies from 50% to more than 80% for alfalfa yield under salt stress. Finally, we expended the weighted GBLUP approach to potato and analyzed 13 phenotypic traits and obtained similar results. This is the first report on alfalfa to use variable importance and GWAS-assisted approaches to increase the prediction accuracy of GS, thus helping to select superior alfalfa lines based on their GEBVs.
Collapse
|