1
|
Xie Z, Xu X, Li L, Wu C, Ma Y, He J, Wei S, Wang J, Feng X. Residual networks without pooling layers improve the accuracy of genomic predictions. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:138. [PMID: 38771334 DOI: 10.1007/s00122-024-04649-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 05/10/2024] [Indexed: 05/22/2024]
Abstract
KEY MESSAGE Residual neural network genomic selection is the first GS algorithm to reach 35 layers, and its prediction accuracy surpasses previous algorithms. With the decrease in DNA sequencing costs and the development of deep learning, phenotype prediction accuracy by genomic selection (GS) continues to improve. Residual networks, a widely validated deep learning technique, are introduced to deep learning for GS. Since each locus has a different weighted impact on the phenotype, strided convolutions are more suitable for GS problems than pooling layers. Through the above technological innovations, we propose a GS deep learning algorithm, residual neural network for genomic selection (ResGS). ResGS is the first neural network to reach 35 layers in GS. In 15 cases from four public data, the prediction accuracy of ResGS is higher than that of ridge-regression best linear unbiased prediction, support vector regression, random forest, gradient boosting regressor, and deep neural network genomic prediction in most cases. ResGS performs well in dealing with gene-environment interaction. Phenotypes from other environments are imported into ResGS along with genetic data. The prediction results are much better than just providing genetic data as input, which demonstrates the effectiveness of GS multi-modal learning. Standard deviation is recommended as an auxiliary GS evaluation metric, which could improve the distribution of predicted results. Deep learning for GS, such as ResGS, is becoming more accurate in phenotype prediction.
Collapse
Affiliation(s)
| | - Xiaogang Xu
- School of Computer Science and Technology, Zhejiang Gongshang University, Hangzhou, 310012, China.
| | - Ling Li
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Cuiling Wu
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Yinxing Ma
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Jingjing He
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Sidi Wei
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Jun Wang
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Xianzhong Feng
- Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, 130102, China
| |
Collapse
|
2
|
Liu S, Jiang Y, Wang Y, Huo H, Cilkiz M, Chen P, Han Y, Li L, Wang K, Zhao M, Zhu L, Lei J, Wang Y, Zhang M. Genetic and molecular dissection of ginseng ( Panax ginseng Mey.) germplasm using high-density genic SNP markers, secondary metabolites, and gene expressions. FRONTIERS IN PLANT SCIENCE 2023; 14:1165349. [PMID: 37575919 PMCID: PMC10416250 DOI: 10.3389/fpls.2023.1165349] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 06/27/2023] [Indexed: 08/15/2023]
Abstract
Genetic and molecular knowledge of a species is crucial to its gene discovery and enhanced breeding. Here, we report the genetic and molecular dissection of ginseng, an important herb for healthy food and medicine. A mini-core collection consisting of 344 cultivars and landraces was developed for ginseng that represents the genetic variation of ginseng existing in its origin and diversity center. We sequenced the transcriptomes of all 344 cultivars and landraces; identified over 1.5 million genic SNPs, thereby revealing the genic diversity of ginseng; and analyzed them with 26,600 high-quality genic SNPs or a selection of them. Ginseng had a wide molecular diversity and was clustered into three subpopulations. Analysis of 16 ginsenosides, the major bioactive components for healthy food and medicine, showed that ginseng had a wide variation in the contents of all 16 ginsenosides and an extensive correlation of their contents, suggesting that they are synthesized through a single or multiple correlated pathways. Furthermore, we pair-wisely examined the relationships between the cultivars and landraces, revealing their relationships in gene expression, gene variation, and ginsenoside biosynthesis. These results provide new knowledge and new genetic and genic resources for advanced research and breeding of ginseng and related species.
Collapse
Affiliation(s)
- Sizhang Liu
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
| | - Yue Jiang
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
| | - Yanfang Wang
- College of Chinese Medicinal Materials, Jilin Agricultural University, Changchun, Jilin, China
| | - Huimin Huo
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
| | - Mustafa Cilkiz
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, United States
| | - Ping Chen
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
- Research Center for Ginseng Genetic Resources Development and Utilization, Jilin Province, Jilin Agricultural University, Changchun, Jilin, China
| | - Yilai Han
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
| | - Li Li
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
| | - Kangyu Wang
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
- Research Center for Ginseng Genetic Resources Development and Utilization, Jilin Province, Jilin Agricultural University, Changchun, Jilin, China
| | - Mingzhu Zhao
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
- Research Center for Ginseng Genetic Resources Development and Utilization, Jilin Province, Jilin Agricultural University, Changchun, Jilin, China
| | - Lei Zhu
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
| | - Jun Lei
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
- Research Center for Ginseng Genetic Resources Development and Utilization, Jilin Province, Jilin Agricultural University, Changchun, Jilin, China
| | - Yi Wang
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
- Research Center for Ginseng Genetic Resources Development and Utilization, Jilin Province, Jilin Agricultural University, Changchun, Jilin, China
| | - Meiping Zhang
- College of Life Science, Jilin Agricultural University, Changchun, Jilin, China
- Research Center for Ginseng Genetic Resources Development and Utilization, Jilin Province, Jilin Agricultural University, Changchun, Jilin, China
| |
Collapse
|
3
|
Zhang M, Liu YH, Wang Y, Sze SH, Scheuring CF, Qi X, Ekinci O, Pekar J, Murray SC, Zhang HB. Genome-wide identification of genes enabling accurate prediction of hybrid performance from parents across environments and populations for gene-based breeding in maize. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2022; 324:111424. [PMID: 35995113 DOI: 10.1016/j.plantsci.2022.111424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 08/07/2022] [Accepted: 08/16/2022] [Indexed: 06/15/2023]
Abstract
Accurate prediction of hybrid offspring complex trait phenotype from parents is paramount to enhanced plant breeding, animal breeding, and human medicine. Here we report genome-wide identification of genes enabling accurate prediction of hybrid offspring complex traits from parents using maize grain yield as the target trait. We identified 181 ZmF1GY genes enabling prediction of maize (Zea mays L.) F1 hybrid grain yield from parents and tested their utility and efficiency for predicting F1 hybrid grain yields from parents using their expressions, genic SNPs, and number of favorable alleles (NFAs), respectively. The ZmF1GY genes predicted hybrid grain yields from parents at an accuracy of 0.86, presented by correlation coefficient between predicted and observed phenotypes, within an environment, 0.74 across environments, and 0.64 across populations, outperforming genomic prediction by 27-406%, 23%, and 40%, respectively. Furthermore, we identified nine of the ZmF1GY genes containing SNPs or InDels in parents that increased or decreased hybrid grain yields by 14-46%. When the NFAs of these nine ZmF1GY genes were used for hybrid grain yield prediction from parents, they predicted hybrid grain yields at an accuracy of 0.79, outperforming genomic prediction by 21% that was based on up to tens of thousands of genome-wide SNPs. These results demonstrate the feasibility of developing a gene toolkit for a species enabling gene-based breeding across environments and populations that is much more powerful and efficient than current breeding, thereby helping secure the world's food production. The methodology is applicable to all crops, livestock, and humans.
Collapse
Affiliation(s)
- Meiping Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Yun-Hua Liu
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Yinglei Wang
- Department of Computer Science, Cornell University, Ithaca, NY 14853, USA.
| | - Sing-Hoi Sze
- Department of Computer Science and Engineering and Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX 77843, USA.
| | - Chantel F Scheuring
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Xiaoli Qi
- College of Life Science, Jiamusi University, Jiamusi, Heilongjiang 154007, China.
| | - Ozge Ekinci
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Jacob Pekar
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Seth C Murray
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Hong-Bin Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| |
Collapse
|
4
|
Liu YH, Zhang M, Sze SH, Smith CW, Zhang HB. Analysis of the genes controlling cotton fiber length reveals the molecular basis of plant breeding and the genetic potential of current cultivars for continued improvement. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2022; 321:111318. [PMID: 35696918 DOI: 10.1016/j.plantsci.2022.111318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Revised: 05/02/2022] [Accepted: 05/08/2022] [Indexed: 06/15/2023]
Abstract
Stagnated crop improvement has raised questions of whether and how current crop cultivars can be further improved. Genes are the core determinants of performance of all cultivars. Here, we report the molecular basis of plant breeding and address these questions by analyzing 226 GFL genes controlling and accurately predicting fiber length, an important breeding objective trait, in cotton (Gossypium sp.). We first identified the favorable allele and the number of favorable alleles (NFAs) of each GFL gene, calculated the total NFAs of the 226 GFL genes accumulated in 198 advanced breeding lines, and analyzed them against fiber lengths. Fiber lengths of the breeding lines were strongly correlated with the total NFAs of the GFL genes (r = 0.85, P < 0.0001), suggesting that accumulation of the favorable alleles of the genes controlling objective traits is the molecular basis of cotton breeding. Surprisingly, a breeding line with a fiber length of present cultivars having the longest fibers contained only about 51% of the total NFAs of the 226 GFL genes. The genetic potentials of current cultivars were then predicted using linear and non-linear models, respectively, revealing that a breeding line or cultivar with a fiber length of 33.8 mm could be further improved in fiber length by up to 118%. Finally, we showed that the genetic potential of such a breeding line can be realized through gene-based breeding. Therefore, these findings shed light on continued crop improvement in general and provide 740 genic biomarkers desirable for enhanced cotton fiber breeding.
Collapse
Affiliation(s)
- Yun-Hua Liu
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Meiping Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Sing-Hoi Sze
- Department of Computer Science and Engineering, and Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX 77843, USA.
| | - C Wayne Smith
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| | - Hong-Bin Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| |
Collapse
|
5
|
Billings GT, Jones MA, Rustgi S, Bridges WC, Holland JB, Hulse-Kemp AM, Campbell BT. Outlook for Implementation of Genomics-Based Selection in Public Cotton Breeding Programs. PLANTS 2022; 11:plants11111446. [PMID: 35684219 PMCID: PMC9182660 DOI: 10.3390/plants11111446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Revised: 05/09/2022] [Accepted: 05/16/2022] [Indexed: 11/16/2022]
Abstract
Researchers have used quantitative genetics to map cotton fiber quality and agronomic performance loci, but many alleles may be population or environment-specific, limiting their usefulness in a pedigree selection, inbreeding-based system. Here, we utilized genotypic and phenotypic data on a panel of 80 important historical Upland cotton (Gossypium hirsutum L.) lines to investigate the potential for genomics-based selection within a cotton breeding program’s relatively closed gene pool. We performed a genome-wide association study (GWAS) to identify alleles correlated to 20 fiber quality, seed composition, and yield traits and looked for a consistent detection of GWAS hits across 14 individual field trials. We also explored the potential for genomic prediction to capture genotypic variation for these quantitative traits and tested the incorporation of GWAS hits into the prediction model. Overall, we found that genomic selection programs for fiber quality can begin immediately, and the prediction ability for most other traits is lower but commensurate with heritability. Stably detected GWAS hits can improve prediction accuracy, although a significance threshold must be carefully chosen to include a marker as a fixed effect. We place these results in the context of modern public cotton line-breeding and highlight the need for a community-based approach to amass the data and expertise necessary to launch US public-sector cotton breeders into the genomics-based selection era.
Collapse
Affiliation(s)
- Grant T. Billings
- Bioinformatics Graduate Program, North Carolina State University, Raleigh, NC 27695, USA; (G.T.B.); (J.B.H.)
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC 27695, USA
| | - Michael A. Jones
- Pee Dee Research and Education Center, Clemson University, Florence, SC 29506, USA; (M.A.J.); (S.R.)
| | - Sachin Rustgi
- Pee Dee Research and Education Center, Clemson University, Florence, SC 29506, USA; (M.A.J.); (S.R.)
| | - William C. Bridges
- Department of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634, USA;
| | - James B. Holland
- Bioinformatics Graduate Program, North Carolina State University, Raleigh, NC 27695, USA; (G.T.B.); (J.B.H.)
- Plant Sciences Research Unit, The Agricultural Research Service of U.S. Department of Agriculture, Raleigh, NC 27695, USA
| | - Amanda M. Hulse-Kemp
- Bioinformatics Graduate Program, North Carolina State University, Raleigh, NC 27695, USA; (G.T.B.); (J.B.H.)
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC 27695, USA
- Genomics and Bioinformatics Research Unit, The Agricultural Research Service of U.S. Department of Agriculture, Raleigh, NC 27965, USA
- Correspondence: (A.M.H.-K.); (B.T.C.)
| | - B. Todd Campbell
- Coastal Plains Soil, Water, and Plant Research Center, The Agricultural Research Service of U.S. Department of Agriculture, Florence, SC 29501, USA
- Correspondence: (A.M.H.-K.); (B.T.C.)
| |
Collapse
|
6
|
Conaty WC, Broughton KJ, Egan LM, Li X, Li Z, Liu S, Llewellyn DJ, MacMillan CP, Moncuquet P, Rolland V, Ross B, Sargent D, Zhu QH, Pettolino FA, Stiller WN. Cotton Breeding in Australia: Meeting the Challenges of the 21st Century. FRONTIERS IN PLANT SCIENCE 2022; 13:904131. [PMID: 35646011 PMCID: PMC9136452 DOI: 10.3389/fpls.2022.904131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 04/08/2022] [Indexed: 06/15/2023]
Abstract
The Commonwealth Scientific and Industrial Research Organisation (CSIRO) cotton breeding program is the sole breeding effort for cotton in Australia, developing high performing cultivars for the local industry which is worth∼AU$3 billion per annum. The program is supported by Cotton Breeding Australia, a Joint Venture between CSIRO and the program's commercial partner, Cotton Seed Distributors Ltd. (CSD). While the Australian industry is the focus, CSIRO cultivars have global impact in North America, South America, and Europe. The program is unique compared with many other public and commercial breeding programs because it focuses on diverse and integrated research with commercial outcomes. It represents the full research pipeline, supporting extensive long-term fundamental molecular research; native and genetically modified (GM) trait development; germplasm enhancement focused on yield and fiber quality improvements; integration of third-party GM traits; all culminating in the release of new commercial cultivars. This review presents evidence of past breeding successes and outlines current breeding efforts, in the areas of yield and fiber quality improvement, as well as the development of germplasm that is resistant to pests, diseases and abiotic stressors. The success of the program is based on the development of superior germplasm largely through field phenotyping, together with strong commercial partnerships with CSD and Bayer CropScience. These relationships assist in having a shared focus and ensuring commercial impact is maintained, while also providing access to markets, traits, and technology. The historical successes, current foci and future requirements of the CSIRO cotton breeding program have been used to develop a framework designed to augment our breeding system for the future. This will focus on utilizing emerging technologies from the genome to phenome, as well as a panomics approach with data management and integration to develop, test and incorporate new technologies into a breeding program. In addition to streamlining the breeding pipeline for increased genetic gain, this technology will increase the speed of trait and marker identification for use in genome editing, genomic selection and molecular assisted breeding, ultimately producing novel germplasm that will meet the coming challenges of the 21st Century.
Collapse
Affiliation(s)
| | | | - Lucy M. Egan
- CSIRO Agriculture and Food, Narrabri, NSW, Australia
| | - Xiaoqing Li
- CSIRO Agriculture and Food, Canberra, ACT, Australia
| | - Zitong Li
- CSIRO Agriculture and Food, Canberra, ACT, Australia
| | - Shiming Liu
- CSIRO Agriculture and Food, Narrabri, NSW, Australia
| | | | | | | | | | - Brett Ross
- Cotton Seed Distributors Ltd., Wee Waa, NSW, Australia
| | - Demi Sargent
- CSIRO Agriculture and Food, Narrabri, NSW, Australia
- Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW, Australia
| | - Qian-Hao Zhu
- CSIRO Agriculture and Food, Canberra, ACT, Australia
| | | | | |
Collapse
|
7
|
Genomic prediction of cotton fibre quality and yield traits using Bayesian regression methods. Heredity (Edinb) 2022; 129:103-112. [PMID: 35523950 PMCID: PMC9338257 DOI: 10.1038/s41437-022-00537-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 04/05/2022] [Accepted: 04/07/2022] [Indexed: 01/26/2023] Open
Abstract
Genomic selection or genomic prediction (GP) has increasingly become an important molecular breeding technology for crop improvement. GP aims to utilise genome-wide marker data to predict genomic breeding value for traits of economic importance. Though GP studies have been widely conducted in various crop species such as wheat and maize, its application in cotton, an essential renewable textile fibre crop, is still significantly underdeveloped. We aim to develop a new GP-based breeding system that can improve the efficiency of our cotton breeding program. This article presents a GP study on cotton fibre quality and yield traits using 1385 breeding lines from the Commonwealth Scientific and Industrial Research Organisation (CSIRO, Australia) cotton breeding program which were genotyped using a high-density SNP chip that generated 12,296 informative SNPs. The aim of this study was twofold: (1) to identify the models and data sources (i.e. genomic and pedigree) that produce the highest prediction accuracies; and (2) to assess the effectiveness of GP as a selection tool in the CSIRO cotton breeding program. The prediction analyses were conducted under various scenarios using different Bayesian predictive models. Results highlighted that the model combining genomic and pedigree information resulted in the best cross validated prediction accuracies: 0.76 for fibre length, 0.65 for fibre strength, and 0.64 for lint yield. Overall, this work represents the largest scale genomic selection studies based on cotton breeding trial data. Prediction accuracies reported in our study indicate the potential of GP as a breeding tool for cotton. The study highlighted the importance of incorporating pedigree and environmental factors in GP models to optimise the prediction performance.
Collapse
|
8
|
Song C, Acuña T, Adler-Agmon M, Rachmilevitch S, Barak S, Fait A. Leveraging a graft collection to develop metabolome-based trait prediction for the selection of tomato rootstocks with enhanced salt tolerance. HORTICULTURE RESEARCH 2022; 9:uhac061. [PMID: 35531316 PMCID: PMC9071376 DOI: 10.1093/hr/uhac061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Accepted: 02/27/2022] [Indexed: 06/14/2023]
Abstract
Grafting has been demonstrated to significantly enhance the salt tolerance of crops. However, breeding efforts to develop enhanced graft combinations are hindered by knowledge-gaps as to how rootstocks mediate scion-response to salt stress. We grafted the scion of cultivated M82 onto rootstocks of 254 tomato accessions and explored the morphological and metabolic responses of grafts under saline conditions (EC = 20 dS m-1) as compared to self-grafted M82 (SG-M82). Correlation analysis and Least Absolute Shrinkage and Selection Operator were performed to address the association between morphological diversification and metabolic perturbation. We demonstrate that grafting the same variety onto different rootstocks resulted in scion phenotypic heterogeneity and emphasized the productivity efficiency of M82 irrespective of the rootstock. Spectrophotometric analysis to test lipid oxidation showed largest variability of malondialdehyde (MDA) equivalents across the population, while the least responsive trait was the ratio of fruit fresh weight to total fresh weight (FFW/TFW). Generally, grafts showed greater values for the traits measured than SG-M82, except for branch number and wild race-originated rootstocks; the latter were associated with smaller scion growth parameters. Highly responsive and correlated metabolites were identified across the graft collection including malate, citrate, and aspartate, and their variance was partly related to rootstock origin. A group of six metabolites that consistently characterized exceptional graft response was observed, consisting of sorbose, galactose, sucrose, fructose, myo-inositol, and proline. The correlation analysis and predictive modelling, integrating phenotype- and leaf metabolite data, suggest a potential predictive relation between a set of leaf metabolites and yield-related traits.
Collapse
Affiliation(s)
- Chao Song
- The Albert Katz International School for Desert Studies, The Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Sede Boqer Campus, 8499000, Israel
| | - Tania Acuña
- Albert Katz Department of Dryland Biotechnologies, French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Sede Boqer Campus, 8499000, Israel
| | | | - Shimon Rachmilevitch
- Albert Katz Department of Dryland Biotechnologies, French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Sede Boqer Campus, 8499000, Israel
| | - Simon Barak
- Albert Katz Department of Dryland Biotechnologies, French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Sede Boqer Campus, 8499000, Israel
| | - Aaron Fait
- Albert Katz Department of Dryland Biotechnologies, French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Sede Boqer Campus, 8499000, Israel
| |
Collapse
|
9
|
Liu YH, Zhang M, Scheuring CF, Cilkiz M, Sze SH, Smith CW, Murray SC, Xu W, Zhang HB. Accurate prediction of complex traits for individuals and offspring from parents using a simple, rapid, and efficient method for gene-based breeding in cotton and maize. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2022; 316:111153. [PMID: 35151437 DOI: 10.1016/j.plantsci.2021.111153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 12/11/2021] [Indexed: 06/14/2023]
Abstract
Accurate, simple, rapid, and inexpensive prediction of complex traits controlled by numerous genes is paramount to enhanced plant breeding, animal breeding, and human medicine. Here we report a novel method that enables accurate, simple, and rapid prediction of complex traits of individuals or offspring from parents based on the number of favorable alleles (NFAs) of the genes controlling the objective traits. The NFAs of 226 cotton fiber length (GFL) genes and nine maize hybrid grain yield related (ZmF1GY) genes were directly used to predict cotton fiber lengths of individual plants and maize grain yields of F1 hybrids from parents, respectively, using prediction model-based methods as controls. The NFAs of the 226 GFL genes predicted cotton fiber lengths at an accuracy of 0.85, as the model methods and outperforming genomic prediction by 82 % - 170 %. The NFAs of the nine ZmF1GY genes predicted grain yields of maize hybrids from parents at an accuracy of 0.80, outperforming genomic prediction by 67 %. Moreover, the prediction accuracies of these traits were consistent across years, environments, and eco-agricultural systems. Importantly, the accurate prediction of these traits directly using the NFAs of the genes allows breeding to be performed in greenhouse, phytotron, or off-season, without the need of the model training and validation steps essential and costly for model-based genomic or genic prediction. Therefore, this new method dramatically outperforms the current model-based genomic methods used for phenotype prediction and streamlines the process of breeding, thus promising to substantially enhance current plant and animal breeding.
Collapse
Affiliation(s)
- Yun-Hua Liu
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA
| | - Meiping Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA
| | - Chantel F Scheuring
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA
| | - Mustafa Cilkiz
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA
| | - Sing-Hoi Sze
- Department of Computer Science and Engineering and Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX 77843, USA
| | - C Wayne Smith
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA
| | - Seth C Murray
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA
| | - Wenwei Xu
- Texas A&M AgriLife Research, Lubbock, TX 79403, USA
| | - Hong-Bin Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA.
| |
Collapse
|
10
|
Song JM, Arif M, Zi Y, Sze SH, Zhang M, Zhang HB. Molecular and genetic dissection of the USDA rice mini-core collection using high-density SNP markers. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2021; 308:110910. [PMID: 34034867 DOI: 10.1016/j.plantsci.2021.110910] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 04/05/2021] [Accepted: 04/10/2021] [Indexed: 06/12/2023]
Abstract
Molecular tools and knowledge of crop germplasm are vital for their effective utilization. In this study, we developed 40,866 high-quality and well distributed SNPs for a rice mini-core collection (RMC) developed by the United States Department of Agriculture (USDA). The high-quality SNPs clustered the USDA-RMC into five subpopulations (Ind, indica; Aus, aus; Afr, African rice; TeJ, temperate japonica; TrJ, tropical japonica) and one admixture (Adm). This classification was further confirmed by phylogenetic and principal component analyses. The rice ARO (aromatic) subpopulation of previous studies was re-assigned with Adm and the WD (wild-type) subpopulation was re-defined to the Afr subpopulation because most of its accessions are African cultivated rice. The Aus and Ind subpopulations had a substantially wider genetic variation than the TrJ and TeJ subpopulations. The genetic diversities were much larger between the Ind or Aus subpopulation and the TrJ or TeJ subpopulation than between the Afr subpopulation and the Ind, Aus, TrJ or TeJ subpopulation. Comparative agronomic trait analysis between the subpopulations also supported the genetic structure and variation of the RMC, and suggested the existence of extensive variation in the genes controlling agronomic traits among them. Furthermore, analysis of ancestral membership of the RMC accessions revealed that reproductive barrier or wide incompatibility existed between the Indica and Japonica groups, while gene flow occurred between them. These results provide high-quality SNPs and knowledge of genetic structure and diversity of the USDA-RMC necessary for enhanced rice research and breeding.
Collapse
Affiliation(s)
- Jian-Min Song
- Crop Research Institute/National Engineering Laboratory for Wheat and Maize, Shandong Academy of Agricultural Sciences (SAAS), Jinan, 250100, PR China; Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 77843-2474, USA.
| | - Muhammad Arif
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 77843-2474, USA; Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering, Faisalabad, Pakistan.
| | - Yan Zi
- Crop Research Institute/National Engineering Laboratory for Wheat and Maize, Shandong Academy of Agricultural Sciences (SAAS), Jinan, 250100, PR China
| | - Sing-Hoi Sze
- Department of Computer Science and Engineering and Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA.
| | - Meiping Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 77843-2474, USA.
| | - Hong-Bin Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 77843-2474, USA.
| |
Collapse
|