1
|
Naake T, Zhu F, Alseekh S, Scossa F, Perez de Souza L, Borghi M, Brotman Y, Mori T, Nakabayashi R, Tohge T, Fernie AR. Genome-wide association studies identify loci controlling specialized seed metabolites in Arabidopsis. PLANT PHYSIOLOGY 2024; 194:1705-1721. [PMID: 37758174 PMCID: PMC10904349 DOI: 10.1093/plphys/kiad511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 08/01/2023] [Accepted: 08/24/2023] [Indexed: 10/03/2023]
Abstract
Plants synthesize specialized metabolites to facilitate environmental and ecological interactions. During evolution, plants diversified in their potential to synthesize these metabolites. Quantitative differences in metabolite levels of natural Arabidopsis (Arabidopsis thaliana) accessions can be employed to unravel the genetic basis for metabolic traits using genome-wide association studies (GWAS). Here, we performed metabolic GWAS on seeds of a panel of 315 A. thaliana natural accessions, including the reference genotypes C24 and Col-0, for polar and semi-polar seed metabolites using untargeted ultra-performance liquid chromatography-mass spectrometry. As a complementary approach, we performed quantitative trait locus (QTL) mapping of near-isogenic introgression lines between C24 and Col-0 for specific seed specialized metabolites. Besides common QTL between seeds and leaves, GWAS revealed seed-specific QTL for specialized metabolites, indicating differences in the genetic architecture of seeds and leaves. In seeds, aliphatic methylsulfinylalkyl and methylthioalkyl glucosinolates associated with the ALKENYL HYDROXYALKYL PRODUCING loci (GS-ALK and GS-OHP) on chromosome 4 containing alkenyl hydroxyalkyl producing 2 (AOP2) and 3 (AOP3) or with the GS-ELONG locus on chromosome 5 containing methylthioalkyl malate synthase (MAM1) and MAM3. We detected two unknown sulfur-containing compounds that were also mapped to these loci. In GWAS, some of the annotated flavonoids (kaempferol 3-O-rhamnoside-7-O-rhamnoside, quercetin 3-O-rhamnoside-7-O-rhamnoside) were mapped to transparent testa 7 (AT5G07990), encoding a cytochrome P450 75B1 monooxygenase. Three additional mass signals corresponding to quercetin-containing flavonols were mapped to UGT78D2 (AT5G17050). The association of the loci and associating metabolic features were functionally verified in knockdown mutant lines. By performing GWAS and QTL mapping, we were able to leverage variation of natural populations and parental lines to study seed specialized metabolism. The GWAS data set generated here is a high-quality resource that can be investigated in further studies.
Collapse
Affiliation(s)
- Thomas Naake
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
| | - Feng Zhu
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
| | - Saleh Alseekh
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
- Center of Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria
| | - Federico Scossa
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
- Research Center for Genomics and Bioinformatics (CREA-GB), Council for Agricultural Research and Economics, Via Ardeatina 546, 00178 Rome, Italy
| | - Leonardo Perez de Souza
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
| | - Monica Borghi
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
- Department of Biology, Utah State University, 5305 Old Main Hill, Logan, UT 84321-5305, USA
| | - Yariv Brotman
- Department of Life Sciences, Ben-Gurion University of the Negev, 8410501 Be’er Sheva, Israel
| | - Tetsuya Mori
- RIKEN Center for Sustainable Resource Science, Tsurumi, 1-7-22 Suehiro, Yokohama, Kanagawa 230-0045, Japan
| | - Ryo Nakabayashi
- RIKEN Center for Sustainable Resource Science, Tsurumi, 1-7-22 Suehiro, Yokohama, Kanagawa 230-0045, Japan
| | - Takayuki Tohge
- Graduate School of Biological Science, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
| | - Alisdair R Fernie
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
- Center of Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria
| |
Collapse
|
3
|
Canella Vieira C, Zhou J, Usovsky M, Vuong T, Howland AD, Lee D, Li Z, Zhou J, Shannon G, Nguyen HT, Chen P. Exploring Machine Learning Algorithms to Unveil Genomic Regions Associated With Resistance to Southern Root-Knot Nematode in Soybeans. FRONTIERS IN PLANT SCIENCE 2022; 13:883280. [PMID: 35592556 PMCID: PMC9111516 DOI: 10.3389/fpls.2022.883280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Accepted: 04/08/2022] [Indexed: 06/15/2023]
Abstract
Southern root-knot nematode [SRKN, Meloidogyne incognita (Kofold & White) Chitwood] is a plant-parasitic nematode challenging to control due to its short life cycle, a wide range of hosts, and limited management options, of which genetic resistance is the main option to efficiently control the damage caused by SRKN. To date, a major quantitative trait locus (QTL) mapped on chromosome (Chr.) 10 plays an essential role in resistance to SRKN in soybean varieties. The confidence of discovered trait-loci associations by traditional methods is often limited by the assumptions of individual single nucleotide polymorphisms (SNPs) always acting independently as well as the phenotype following a Gaussian distribution. Therefore, the objective of this study was to conduct machine learning (ML)-based genome-wide association studies (GWAS) utilizing Random Forest (RF) and Support Vector Machine (SVM) algorithms to unveil novel regions of the soybean genome associated with resistance to SRKN. A total of 717 breeding lines derived from 330 unique bi-parental populations were genotyped with the Illumina Infinium BARCSoySNP6K BeadChip and phenotyped for SRKN resistance in a greenhouse. A GWAS pipeline involving a supervised feature dimension reduction based on Variable Importance in Projection (VIP) and SNP detection based on classification accuracy was proposed. Minor effect SNPs were detected by the proposed ML-GWAS methodology but not identified using Bayesian-information and linkage-disequilibrium Iteratively Nested Keyway (BLINK), Fixed and Random Model Circulating Probability Unification (FarmCPU), and Enriched Compressed Mixed Linear Model (ECMLM) models. Besides the genomic region on Chr. 10 that can explain most of SRKN resistance variance, additional minor effects SNPs were also identified on Chrs. 10 and 11. The findings in this study demonstrated that overfitting in GWAS may lead to lower prediction accuracy, and the detection of significant SNPs based on classification accuracy limited false-positive associations. The expansion of the basis of the genetic resistance to SRKN can potentially reduce the selection pressure over the major QTL on Chr. 10 and achieve higher levels of resistance.
Collapse
Affiliation(s)
- Caio Canella Vieira
- Fisher Delta Research, Extension, and Education Center, Division of Plant Science and Technology, University of Missouri, Portageville, MO, United States
| | - Jing Zhou
- Biological Systems Engineering, University of Wisconsin–Madison, Madison, WI, United States
| | - Mariola Usovsky
- Division of Plant Science and Technology, University of Missouri, Columbia, MO, United States
| | - Tri Vuong
- Division of Plant Science and Technology, University of Missouri, Columbia, MO, United States
| | - Amanda D. Howland
- Department of Entomology, College of Agriculture and Natural Resources, Michigan State University, East Lansing, MI, United States
| | - Dongho Lee
- Fisher Delta Research, Extension, and Education Center, Division of Plant Science and Technology, University of Missouri, Portageville, MO, United States
| | - Zenglu Li
- Institute of Plant Breeding, Genetics, and Genomics, College of Agricultural and Environmental Sciences, University of Georgia, Athens, GA, United States
| | - Jianfeng Zhou
- Division of Plant Science and Technology, University of Missouri, Columbia, MO, United States
| | - Grover Shannon
- Fisher Delta Research, Extension, and Education Center, Division of Plant Science and Technology, University of Missouri, Portageville, MO, United States
| | - Henry T. Nguyen
- Division of Plant Science and Technology, University of Missouri, Columbia, MO, United States
| | - Pengyin Chen
- Fisher Delta Research, Extension, and Education Center, Division of Plant Science and Technology, University of Missouri, Portageville, MO, United States
| |
Collapse
|
6
|
Arkin Y, Rahmani E, Kleber ME, Laaksonen R, März W, Halperin E. EPIQ-efficient detection of SNP-SNP epistatic interactions for quantitative traits. Bioinformatics 2014; 30:i19-25. [PMID: 24931983 PMCID: PMC4229902 DOI: 10.1093/bioinformatics/btu261] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Gene-gene interactions are of potential biological and medical interest, as they can shed light on both the inheritance mechanism of a trait and on the underlying biological mechanisms. Evidence of epistatic interactions has been reported in both humans and other organisms. Unlike single-locus genome-wide association studies (GWAS), which proved efficient in detecting numerous genetic loci related with various traits, interaction-based GWAS have so far produced very few reproducible discoveries. Such studies introduce a great computational and statistical burden by necessitating a large number of hypotheses to be tested including all pairs of single nucleotide polymorphisms (SNPs). Thus, many software tools have been developed for interaction-based case-control studies, some leading to reliable discoveries. For quantitative data, on the other hand, only a handful of tools exist, and the computational burden is still substantial. RESULTS We present an efficient algorithm for detecting epistasis in quantitative GWAS, achieving a substantial runtime speedup by avoiding the need to exhaustively test all SNP pairs using metric embedding and random projections. Unlike previous metric embedding methods for case-control studies, we introduce a new embedding, where each SNP is mapped to two Euclidean spaces. We implemented our method in a tool named EPIQ (EPIstasis detection for Quantitative GWAS), and we show by simulations that EPIQ requires hours of processing time where other methods require days and sometimes weeks. Applying our method to a dataset from the Ludwigshafen risk and cardiovascular health study, we discovered a pair of SNPs with a near-significant interaction (P = 2.2 × 10(-13)), in only 1.5 h on 10 processors. AVAILABILITY https://github.com/yaarasegre/EPIQ
Collapse
Affiliation(s)
- Ya'ara Arkin
- The Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USA
| | - Elior Rahmani
- The Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USA
| | - Marcus E Kleber
- The Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USA
| | - Reijo Laaksonen
- The Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USAThe Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USA
| | - Winfried März
- The Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USAThe Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USAThe Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-803
| | - Eran Halperin
- The Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USAThe Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-8036, Austria, Synlab Academy, Synlab Services GmbH, Mannheim D-68165, Germany, Department of Molecular Microbiology and Biotechnology, George Wise Faculty of Life Science, Tel-Aviv University, Tel-Aviv 69978, Israel and International Computer Science Institute, Berkeley, CA 94704, USAThe Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv 69978, Israel, V Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Mannheim D-68167, Germany, Zora Biosciences Oy, Espoo 02150, Finland, Medical School, University of Tampere, Tampere 33104, Finland, Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz A-803
| |
Collapse
|