1
|
Jung M, Hodel M, Knauf A, Kupper D, Neuditschko M, Bühlmann-Schütz S, Studer B, Patocchi A, Broggini GA. Evaluation of genomic and phenomic prediction for application in apple breeding. BMC PLANT BIOLOGY 2025; 25:103. [PMID: 39856563 PMCID: PMC11759423 DOI: 10.1186/s12870-025-06104-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Accepted: 01/13/2025] [Indexed: 01/27/2025]
Abstract
BACKGROUND Apple breeding schemes can be improved by using genomic prediction models to forecast the performance of breeding material. The predictive ability of these models depends on factors like trait genetic architecture, training set size, relatedness of the selected material to the training set, and the validation method used. Alternative genotyping methods such as RADseq and complementary data from near-infrared spectroscopy could help improve the cost-effectiveness of genomic prediction. However, the impact of these factors and alternative approaches on predictive ability beyond experimental populations still need to be investigated. In this study, we evaluated 137 prediction scenarios varying the described factors and alternative approaches, offering recommendations for implementing genomic selection in apple breeding. RESULTS Our results show that extending the training set with germplasm related to the predicted breeding material can improve average predictive ability across eleven studied traits by up to 0.08. The study emphasizes the usefulness of leave-one-family-out cross-validation, reflecting the application of genomic prediction to a new family, although it reduced average predictive ability across traits by up to 0.24 compared to 10-fold cross-validation. Similar average predictive abilities across traits indicate that imputed RADseq data could be a suitable genotyping alternative to SNP array datasets. The best-performing scenario using near-infrared spectroscopy data for phenomic prediction showed a 0.35 decrease in average predictive ability across traits compared to conventional genomic prediction, suggesting that the tested phenomic prediction approach is impractical. CONCLUSIONS Extending the training set using germplasm related with the target breeding material is crucial to improve the predictive ability of genomic prediction in apple. RADseq is a viable alternative to SNP array genotyping, while phenomic prediction is impractical. These findings offer valuable guidance for applying genomic selection in apple breeding, ultimately leading to the development of breeding material with improved quality.
Collapse
Affiliation(s)
- Michaela Jung
- Agroscope, Mueller-Thurgau-Strasse 29, Waedenswil, 8820, Switzerland.
- Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, Zurich, 8092, Switzerland.
| | - Marius Hodel
- Agroscope, Mueller-Thurgau-Strasse 29, Waedenswil, 8820, Switzerland
| | - Andrea Knauf
- Agroscope, Mueller-Thurgau-Strasse 29, Waedenswil, 8820, Switzerland
- Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, Zurich, 8092, Switzerland
| | - Daniela Kupper
- Agroscope, Mueller-Thurgau-Strasse 29, Waedenswil, 8820, Switzerland
- Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, Zurich, 8092, Switzerland
| | | | | | - Bruno Studer
- Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, Zurich, 8092, Switzerland
| | - Andrea Patocchi
- Agroscope, Mueller-Thurgau-Strasse 29, Waedenswil, 8820, Switzerland
| | - Giovanni Al Broggini
- Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, Zurich, 8092, Switzerland
| |
Collapse
|
2
|
Sweet DD, Tirado SB, Cooper J, Springer NM, Hirsch CD, Hirsch CN. Temporally resolved growth patterns reveal novel information about the polygenic nature of complex quantitative traits. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 120:1969-1986. [PMID: 39462452 PMCID: PMC11629746 DOI: 10.1111/tpj.17092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2024] [Revised: 09/29/2024] [Accepted: 10/08/2024] [Indexed: 10/29/2024]
Abstract
Plant height can be an indicator of plant health across environments and used to identify superior genotypes. Typically plant height is measured at a single timepoint when plants reach terminal height. Evaluating plant height using unoccupied aerial vehicles allows for measurements throughout the growing season, facilitating a better understanding of plant-environment interactions and the genetic basis of this complex trait. To assess variation throughout development, plant height data was collected from planting until terminal height at anthesis (14 flights 2018, 27 in 2019, 12 in 2020, and 11 in 2021) for a panel of ~500 diverse maize inbred lines. The percent variance explained in plant height throughout the season was significantly explained by genotype (9-48%), year (4-52%), and genotype-by-year interactions (14-36%) to varying extents throughout development. Genome-wide association studies revealed 717 significant single nucleotide polymorphisms associated with plant height and growth rate at different parts of the growing season specific to certain phases of vegetative growth. When plant height growth curves were compared to growth curves estimated from canopy cover, greater Fréchet distance stability was observed in plant height growth curves than for canopy cover. This indicated canopy cover may be more useful for understanding environmental modulation of overall plant growth and plant height better for understanding genotypic modulation of overall plant growth. This study demonstrated that substantial information can be gained from high temporal resolution data to understand how plants differentially interact with the environment and can enhance our understanding of the genetic basis of complex polygenic traits.
Collapse
Affiliation(s)
- Dorothy D. Sweet
- Department of Agronomy and Plant GeneticsUniversity of MinnesotaSaint PaulMinnesota55108USA
- Department of Plant PathologyUniversity of MinnesotaSaint PaulMinnesota55108USA
| | - Sara B. Tirado
- Department of Agronomy and Plant GeneticsUniversity of MinnesotaSaint PaulMinnesota55108USA
- Department of Plant and Microbial BiologyUniversity of MinnesotaSaint PaulMinnesota55108USA
| | - Julian Cooper
- Department of Plant PathologyUniversity of MinnesotaSaint PaulMinnesota55108USA
| | - Nathan M. Springer
- Department of Plant and Microbial BiologyUniversity of MinnesotaSaint PaulMinnesota55108USA
| | - Cory D. Hirsch
- Department of Plant PathologyUniversity of MinnesotaSaint PaulMinnesota55108USA
| | - Candice N. Hirsch
- Department of Agronomy and Plant GeneticsUniversity of MinnesotaSaint PaulMinnesota55108USA
| |
Collapse
|
3
|
Adak A, DeSalvio AJ, Arik MA, Murray SC. Field-based high-throughput phenotyping enhances phenomic and genomic predictions for grain yield and plant height across years in maize. G3 (BETHESDA, MD.) 2024; 14:jkae092. [PMID: 38776257 PMCID: PMC11228873 DOI: 10.1093/g3journal/jkae092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2024] [Accepted: 04/24/2024] [Indexed: 05/24/2024]
Abstract
Field-based phenomic prediction employs novel features, like vegetation indices (VIs) from drone images, to predict key agronomic traits in maize, despite challenges in matching biomarker measurement time points across years or environments. This study utilized functional principal component analysis (FPCA) to summarize the variation of temporal VIs, uniquely allowing the integration of this data into phenomic prediction models tested across multiple years (2018-2021) and environments. The models, which included 1 genomic, 2 phenomic, 2 multikernel, and 1 multitrait type, were evaluated in 4 prediction scenarios (CV2, CV1, CV0, and CV00), relevant for plant breeding programs, assessing both tested and untested genotypes in observed and unobserved environments. Two hybrid populations (415 and 220 hybrids) demonstrated the visible atmospherically resistant index's strong temporal correlation with grain yield (up to 0.59) and plant height. The first 2 FPCAs explained 59.3 ± 13.9% and 74.2 ± 9.0% of the temporal variation of temporal data of VIs, respectively, facilitating predictions where flight times varied. Phenomic data, particularly when combined with genomic data, often were comparable to or numerically exceeded the base genomic model in prediction accuracy, particularly for grain yield in untested hybrids, although no significant differences in these models' performance were consistently observed. Overall, this approach underscores the effectiveness of FPCA and combined models in enhancing the prediction of grain yield and plant height across environments and diverse agricultural settings.
Collapse
Affiliation(s)
- Alper Adak
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
| | - Aaron J DeSalvio
- Interdisciplinary Graduate Program in Genetics and Genomics (Department of Biochemistry and Biophysics), Texas A&M University, College Station, TX 77843-2128, USA
| | - Mustafa A Arik
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
| | - Seth C Murray
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
| |
Collapse
|
4
|
Xue W, Ding H, Jin T, Meng J, Wang S, Liu Z, Ma X, Li J. CucumberAI: Cucumber Fruit Morphology Identification System Based on Artificial Intelligence. PLANT PHENOMICS (WASHINGTON, D.C.) 2024; 6:0193. [PMID: 39144674 PMCID: PMC11324094 DOI: 10.34133/plantphenomics.0193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 04/30/2024] [Indexed: 08/16/2024]
Abstract
Cucumber is an important vegetable crop that has high nutritional and economic value and is thus favored by consumers worldwide. Exploring an accurate and fast technique for measuring the morphological traits of cucumber fruit could be helpful for improving its breeding efficiency and further refining the development models for pepo fruits. At present, several sets of measurement schemes and standards have been proposed and applied for the characterization of cucumber fruits; however, these manual methods are time-consuming and inefficient. Therefore, in this paper, we propose a cucumber fruit morphological trait identification framework and software called CucumberAI, which combines image processing techniques with deep learning models to efficiently identify up to 51 cucumber features, including 32 newly defined parameters. The proposed tool introduces an algorithm for performing cucumber contour extraction and fruit segmentation based on image processing techniques. The identification framework comprises 6 deep learning models that combine fruit feature recognition rules with MobileNetV2 to construct a decision tree for fruit shape recognition. Additionally, the framework employs U-Net segmentation models for fruit stripe and endocarp segmentation, a MobileNetV2 model for carpel classification, a ResNet50 model for stripe classification and a YOLOv5 model for tumor identification. The relationships between the image-based manual and algorithmic traits are highly correlated, and validation tests were conducted to perform correlation analyses of fruit surface smoothness and roughness, and a fruit appearance cluster analysis was also performed. In brief, CucumberAI offers an efficient approach for extracting and analyzing cucumber phenotypes and provides valuable information for future cucumber genetic improvements.
Collapse
Affiliation(s)
- Wei Xue
- College of Artificial Intelligence,
Nanjing Agricultural University, Nanjing 210095, China
| | - Haifeng Ding
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Horticulture,
Nanjing Agricultural University, No. 1 Weigang, Nanjing 210095, China
| | - Tao Jin
- College of Artificial Intelligence,
Nanjing Agricultural University, Nanjing 210095, China
| | - Jialing Meng
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Horticulture,
Nanjing Agricultural University, No. 1 Weigang, Nanjing 210095, China
| | - Shiyou Wang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Horticulture,
Nanjing Agricultural University, No. 1 Weigang, Nanjing 210095, China
| | - Zuo Liu
- College of Artificial Intelligence,
Nanjing Agricultural University, Nanjing 210095, China
| | - Xiupeng Ma
- College of Foreign Studies,
Nanjing Agricultural University, Nanjing 210095, China
| | - Ji Li
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Horticulture,
Nanjing Agricultural University, No. 1 Weigang, Nanjing 210095, China
| |
Collapse
|
5
|
Adak A, Murray SC, Washburn JD. Deciphering temporal growth patterns in maize: integrative modeling of phenotype dynamics and underlying genomic variations. THE NEW PHYTOLOGIST 2024; 242:121-136. [PMID: 38348523 DOI: 10.1111/nph.19575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 01/11/2024] [Indexed: 03/08/2024]
Abstract
Quantifying the temporal or longitudinal growth dynamics of crops in diverse environmental conditions is crucial for understanding plant development, requiring further modeling techniques. In this study, we analyzed the growth patterns of two different maize (Zea mays L.) populations using high-throughput phenotyping with a maize population consisting of 515 recombinant inbred lines (RILs) grown in Texas and a hybrid population containing 1090 hybrids grown in Missouri. Two models, Gaussian peak and functional principal component analysis (FPCA), were employed to study the Normalized Green-Red Difference Index (NGRDI) scores. The Gaussian peak model showed strong correlations (c. 0.94 for RILs and c. 0.97 for hybrids) between modeled and non-modeled temporal trajectories. Functional principal component analysis differentiated NGRDI trajectories in RILs under different conditions, capturing substantial variability (75%, 20%, and 5% for RILs; 88% and 12% for hybrids). By comparing these models with conventional BLUP values, common quantitative trait loci (QTLs) were identified, containing candidate genes of brd1, pin11, zcn8 and rap2. The harmony between these loci's additive effects and growing degree days, as well as the differentiation of RIL haplotypes across growth stages, underscores the significant interplay of these loci in driving plant development. These findings contribute to advancing understanding of plant-environment interactions and have implications for crop improvement strategies.
Collapse
Affiliation(s)
- Alper Adak
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 77843, USA
| | - Seth C Murray
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 77843, USA
| | | |
Collapse
|
6
|
Adak A, Kang M, Anderson SL, Murray SC, Jarquin D, Wong RKW, Katzfuß M. Phenomic data-driven biological prediction of maize through field-based high-throughput phenotyping integration with genomic data. JOURNAL OF EXPERIMENTAL BOTANY 2023; 74:5307-5326. [PMID: 37279568 DOI: 10.1093/jxb/erad216] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 06/02/2023] [Indexed: 06/08/2023]
Abstract
High-throughput phenotyping (HTP) has expanded the dimensionality of data in plant research; however, HTP has resulted in few novel biological discoveries to date. Field-based HTP (FHTP), using small unoccupied aerial vehicles (UAVs) equipped with imaging sensors, can be deployed routinely to monitor segregating plant population interactions with the environment under biologically meaningful conditions. Here, flowering dates and plant height, important phenological fitness traits, were collected on 520 segregating maize recombinant inbred lines (RILs) in both irrigated and drought stress trials in 2018. Using UAV phenomic, single nucleotide polymorphism (SNP) genomic, as well as combined data, flowering times were predicted using several scenarios. Untested genotypes were predicted with 0.58, 0.59, and 0.41 prediction ability for anthesis, silking, and terminal plant height, respectively, using genomic data, but prediction ability increased to 0.77, 0.76, and 0.58 when phenomic and genomic data were used together. Using the phenomic data in a genome-wide association study, a heat-related candidate gene (GRMZM2G083810; hsp18f) was discovered using temporal reflectance phenotypes belonging to flowering times (both irrigated and drought) trials where heat stress also peaked. Thus, a relationship between plants and abiotic stresses belonging to a specific time of growth was revealed only through use of temporal phenomic data. Overall, this study showed that (i) it is possible to predict complex traits using high dimensional phenomic data between different environments, and (ii) temporal phenomic data can reveal a time-dependent association between genotypes and abiotic stresses, which can help understand mechanisms to develop resilient plants.
Collapse
Affiliation(s)
- Alper Adak
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
| | - Myeongjong Kang
- Department of Statistics, Texas A&M University, College Station, TX 77843, USA
| | | | - Seth C Murray
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
| | - Diego Jarquin
- Agronomy Department, University of Florida, Gainesville, FL 32611, USA
| | - Raymond K W Wong
- Department of Statistics, Texas A&M University, College Station, TX 77843, USA
| | - Matthias Katzfuß
- Department of Statistics, Texas A&M University, College Station, TX 77843, USA
| |
Collapse
|