1
|
Peleke FF, Zumkeller SM, Gültas M, Schmitt A, Szymański J. Deep learning the cis-regulatory code for gene expression in selected model plants. Nat Commun 2024; 15:3488. [PMID: 38664394 PMCID: PMC11045779 DOI: 10.1038/s41467-024-47744-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 04/09/2024] [Indexed: 04/28/2024] Open
Abstract
Elucidating the relationship between non-coding regulatory element sequences and gene expression is crucial for understanding gene regulation and genetic variation. We explored this link with the training of interpretable deep learning models predicting gene expression profiles from gene flanking regions of the plant species Arabidopsis thaliana, Solanum lycopersicum, Sorghum bicolor, and Zea mays. With over 80% accuracy, our models enabled predictive feature selection, highlighting e.g. the significant role of UTR regions in determining gene expression levels. The models demonstrated remarkable cross-species performance, effectively identifying both conserved and species-specific regulatory sequence features and their predictive power for gene expression. We illustrated the application of our approach by revealing causal links between genetic variation and gene expression changes across fourteen tomato genomes. Lastly, our models efficiently predicted genotype-specific expression of key functional gene groups, exemplified by underscoring known phenotypic and metabolic differences between Solanum lycopersicum and its wild, drought-resistant relative, Solanum pennellii.
Collapse
Affiliation(s)
- Fritz Forbang Peleke
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraße 3, D-06466 Seeland, OT, Gatersleben, Germany
| | - Simon Maria Zumkeller
- Institute of Bio- and Geosciences, IBG-4: Bioinformatics, Forschungszentrum Jülich, D-52428, Jülich, Germany
- Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich-Heine-Universität Düsseldorf, 40225, Düsseldorf, Germany
| | - Mehmet Gültas
- Faculty of Agriculture, South Westphalia University of Applied Sciences, Soest, 59494, Germany
| | - Armin Schmitt
- Breeding Informatics Group, University of Göttingen, Göttingen, 37075, Germany
- Center of Integrated Breeding Research (CiBreed), Göttingen, 37075, Germany
| | - Jędrzej Szymański
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraße 3, D-06466 Seeland, OT, Gatersleben, Germany.
- Institute of Bio- and Geosciences, IBG-4: Bioinformatics, Forschungszentrum Jülich, D-52428, Jülich, Germany.
- Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich-Heine-Universität Düsseldorf, 40225, Düsseldorf, Germany.
| |
Collapse
|
2
|
Xu L, Hao J, Lv M, Liu P, Ge Q, Zhang S, Yang J, Niu H, Wang Y, Xue Y, Lu X, Tang J, Zheng J, Gou M. A genome-wide association study identifies genes associated with cuticular wax metabolism in maize. Plant Physiol 2024; 194:2616-2630. [PMID: 38206190 DOI: 10.1093/plphys/kiae007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 11/20/2023] [Accepted: 12/11/2023] [Indexed: 01/12/2024]
Abstract
The plant cuticle is essential in plant defense against biotic and abiotic stresses. To systematically elucidate the genetic architecture of maize (Zea mays L.) cuticular wax metabolism, 2 cuticular wax-related traits, the chlorophyll extraction rate (CER) and water loss rate (WLR) of 389 maize inbred lines, were investigated and a genome-wide association study (GWAS) was performed using 1.25 million single nucleotide polymorphisms (SNPs). In total, 57 nonredundant quantitative trait loci (QTL) explaining 5.57% to 15.07% of the phenotypic variation for each QTL were identified. These QTLs contained 183 genes, among which 21 strong candidates were identified based on functional annotations and previous publications. Remarkably, 3 candidate genes that express differentially during cuticle development encode β-ketoacyl-CoA synthase (KCS). While ZmKCS19 was known to be involved in cuticle wax metabolism, ZmKCS12 and ZmKCS3 functions were not reported. The association between ZmKCS12 and WLR was confirmed by resequencing 106 inbred lines, and the variation of WLR was significant between different haplotypes of ZmKCS12. In this study, the loss-of-function mutant of ZmKCS12 exhibited wrinkled leaf morphology, altered wax crystal morphology, and decreased C32 wax monomer levels, causing an increased WLR and sensitivity to drought. These results confirm that ZmKCS12 plays a vital role in maize C32 wax monomer synthesis and is critical for drought tolerance. In sum, through GWAS of 2 cuticular wax-associated traits, this study reveals comprehensively the genetic architecture in maize cuticular wax metabolism and provides a valuable reference for the genetic improvement of stress tolerance in maize.
Collapse
Affiliation(s)
- Liping Xu
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
- The Shennong Laboratory, Zhengzhou 450002, China
| | - Jiaxin Hao
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
| | - Mengfan Lv
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
| | - Peipei Liu
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
| | - Qidong Ge
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
| | - Sainan Zhang
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
| | - Jianping Yang
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
| | - Hongbin Niu
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
| | - Yiru Wang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yadong Xue
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
| | - Xiaoduo Lu
- Institute of Advanced Agricultural Technology, Qilu Normal University, Jinan 250200, China
| | - Jihua Tang
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
- The Shennong Laboratory, Zhengzhou 450002, China
| | - Jun Zheng
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Mingyue Gou
- State Key Laboratory of Wheat and Maize Crops Science, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450002, China
- The Shennong Laboratory, Zhengzhou 450002, China
| |
Collapse
|
3
|
Kim RJ, Han S, Kim HJ, Hur JH, Suh MC. Tetracosanoic acids produced by 3-ketoacyl-CoA synthase 17 are required for synthesizing seed coat suberin in Arabidopsis. J Exp Bot 2024; 75:1767-1780. [PMID: 37769208 DOI: 10.1093/jxb/erad381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 09/27/2023] [Indexed: 09/30/2023]
Abstract
Very long-chain fatty acids (VLCFAs) are precursors for the synthesis of membrane lipids, cuticular waxes, suberins, and storage oils in plants. 3-Ketoacyl CoA synthase (KCS) catalyzes the condensation of C2 units from malonyl-CoA to acyl-CoA, the first rate-limiting step in VLCFA synthesis. In this study, we revealed that Arabidopsis KCS17 catalyzes the elongation of C22-C24 VLCFAs required for synthesizing seed coat suberin. Histochemical analysis of Arabidopsis plants expressing GUS (β-glucuronidase) under the control of the KCS17 promoter revealed predominant GUS expression in seed coats, petals, stigma, and developing pollen. The expression of KCS17:eYFP (enhanced yellow fluorescent protein) driven by the KCS17 promoter was observed in the outer integument1 of Arabidopsis seed coats. The KCS17:eYFP signal was detected in the endoplasmic reticulum of tobacco epidermal cells. The levels of C22 VLCFAs and their derivatives, primary alcohols, α,ω-alkane diols, ω-hydroxy fatty acids, and α,ω-dicarboxylic acids increased by ~2-fold, but those of C24 VLCFAs, ω-hydroxy fatty acids, and α,ω-dicarboxylic acids were reduced by half in kcs17-1 and kcs17-2 seed coats relative to the wild type (WT). The seed coat of kcs17 displayed decreased autofluorescence under UV and increased permeability to tetrazolium salt compared with the WT. Seed germination and seedling establishment of kcs17 were more delayed by salt and osmotic stress treatments than the WT. KCS17 formed homo- and hetero-interactions with KCR1, PAS2, and ECR, but not with PAS1. Therefore, KCS17-mediated VLCFA synthesis is required for suberin layer formation in Arabidopsis seed coats.
Collapse
Affiliation(s)
- Ryeo Jin Kim
- Department of Life Sciences, Sogang University, Seoul 04107, Republic of Korea
| | - Sol Han
- Department of Life Sciences, Sogang University, Seoul 04107, Republic of Korea
| | - Hyeon Jun Kim
- Department of Life Sciences, Sogang University, Seoul 04107, Republic of Korea
| | - Ji Hyun Hur
- Department of Life Sciences, Sogang University, Seoul 04107, Republic of Korea
| | - Mi Chung Suh
- Department of Life Sciences, Sogang University, Seoul 04107, Republic of Korea
| |
Collapse
|
4
|
Knoch D, Meyer RC, Heuermann MC, Riewe D, Peleke FF, Szymański J, Abbadi A, Snowdon RJ, Altmann T. Integrated multi-omics analyses and genome-wide association studies reveal prime candidate genes of metabolic and vegetative growth variation in canola. Plant J 2024; 117:713-728. [PMID: 37964699 DOI: 10.1111/tpj.16524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 10/17/2023] [Accepted: 10/23/2023] [Indexed: 11/16/2023]
Abstract
Genome-wide association studies (GWAS) identified thousands of genetic loci associated with complex plant traits, including many traits of agronomical importance. However, functional interpretation of GWAS results remains challenging because of large candidate regions due to linkage disequilibrium. High-throughput omics technologies, such as genomics, transcriptomics, proteomics and metabolomics open new avenues for integrative systems biological analyses and help to nominate systems information supported (prime) candidate genes. In the present study, we capitalise on a diverse canola population with 477 spring-type lines which was previously analysed by high-throughput phenotyping of growth-related traits and by RNA sequencing and metabolite profiling for multi-omics-based hybrid performance prediction. We deepened the phenotypic data analysis, now providing 123 time-resolved image-based traits, to gain insight into the complex relations during early vegetative growth and reanalysed the transcriptome data based on the latest Darmor-bzh v10 genome assembly. Genome-wide association testing revealed 61 298 robust quantitative trait loci (QTL) including 187 metabolite QTL, 56814 expression QTL and 4297 phenotypic QTL, many clustered in pronounced hotspots. Combining information about QTL colocalisation across omics layers and correlations between omics features allowed us to discover prime candidate genes for metabolic and vegetative growth variation. Prioritised candidate genes for early biomass accumulation include A06p05760.1_BnaDAR (PIAL1), A10p16280.1_BnaDAR, C07p48260.1_BnaDAR (PRL1) and C07p48510.1_BnaDAR (CLPR4). Moreover, we observed unequal effects of the Brassica A and C subgenomes on early biomass production.
Collapse
Affiliation(s)
- Dominic Knoch
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Corrensstrasse 3, Seeland OT, Gatersleben, Germany
| | - Rhonda C Meyer
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Corrensstrasse 3, Seeland OT, Gatersleben, Germany
| | - Marc C Heuermann
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Corrensstrasse 3, Seeland OT, Gatersleben, Germany
| | - David Riewe
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Corrensstrasse 3, Seeland OT, Gatersleben, Germany
- Julius Kühn Institute (JKI) - Federal Research Centre for Cultivated Plants, Institute for Ecological Chemistry, Plant Analysis and Stored Product Protection, 14195, Berlin, Germany
| | - Fritz F Peleke
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Corrensstrasse 3, Seeland OT, Gatersleben, Germany
| | - Jędrzej Szymański
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Corrensstrasse 3, Seeland OT, Gatersleben, Germany
- Institute of Bio- and Geosciences IBG-4: Bioinformatics, Forschungszentrum Jülich, 52428, Jülich, Germany
| | - Amine Abbadi
- NPZ Innovation GmbH, Hohenlieth, 24363, Holtsee, Germany
- Norddeutsche Pflanzenzucht Hans-Georg Lembke KG, Hohenlieth, 24363, Holtsee, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, Research Centre for Biosystems, Land Use and Nutrition (iFZ), Justus-Liebig-University Giessen, 35392, Giessen, Germany
| | - Thomas Altmann
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Corrensstrasse 3, Seeland OT, Gatersleben, Germany
| |
Collapse
|
5
|
Gong Y, Wang D, Xie H, Zhao Z, Chen Y, Zhang D, Jiao Y, Shi M, Lv P, Sha Q, Yang J, Chu P, Sun Y. Genome-wide identification and expression analysis of the KCS gene family in soybean ( Glycine max) reveal their potential roles in response to abiotic stress. Front Plant Sci 2023; 14:1291731. [PMID: 38116151 PMCID: PMC10728876 DOI: 10.3389/fpls.2023.1291731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Accepted: 11/01/2023] [Indexed: 12/21/2023]
Abstract
Very long chain fatty acids (VLCFAs) are fatty acids with chain lengths of 20 or more carbon atoms, which are the building blocks of various lipids that regulate developmental processes and plant stress responses. 3-ketoacyl-CoA synthase encoded by the KCS gene is the key rate-limiting enzyme in VLCFA biosynthesis, but the KCS gene family in soybean (Glycine max) has not been adequately studied thus far. In this study, 31 KCS genes (namely GmKCS1 - GmKCS31) were identified in the soybean genome, which are unevenly distributed on 14 chromosomes. These GmKCS genes could be phylogenetically classified into seven groups. A total of 27 paralogous GmKCS gene pairs were identified with their Ka/Ks ratios indicating that they had undergone purifying selection during soybean genome expansion. Cis-acting element analysis revealed that GmKCS promoters contained multiple hormone- and stress-responsive elements, indicating that GmKCS gene expression levels may be regulated by various developmental and environmental stimuli. Expression profiles derived from RNA-seq data and qRT-PCR experiments indicated that GmKCS genes were diversely expressed in different organs/tissues, and many GmKCS genes were found to be differentially expressed in the leaves under cold, heat, salt, and drought stresses, suggesting their critical role in soybean resistance to abiotic stress. These results provide fundamental information about the soybean KCS genes and will aid in their further functional elucidation and exploitation.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Pengfei Chu
- School of Agricultural Science and Engineering, Liaocheng University, Liaocheng, China
| | - Yongwang Sun
- School of Agricultural Science and Engineering, Liaocheng University, Liaocheng, China
| |
Collapse
|
6
|
Yang L, Fang J, Wang J, Hui S, Zhou L, Xu B, Chen Y, Zhang Y, Lai C, Jiao G, Sheng Z, Wei X, Shao G, Xie L, Wang L, Chen Y, Zhao F, Hu S, Hu P, Tang S. Genome-wide identification and expression analysis of 3-ketoacyl-CoA synthase gene family in rice ( Oryza sativa L.) under cadmium stress. Front Plant Sci 2023; 14:1222288. [PMID: 37554558 PMCID: PMC10406525 DOI: 10.3389/fpls.2023.1222288] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Accepted: 07/03/2023] [Indexed: 08/10/2023]
Abstract
3-Ketoacyl-CoA synthase (KCS) is the key rate-limiting enzyme for the synthesis of very long-chain fatty acids (VLCFAs) in plants, which determines the carbon chain length of VLCFAs. However, a comprehensive study of KCSs in Oryza sativa has not been reported yet. In this study, we identified 22 OsKCS genes in rice, which are unevenly distributed on nine chromosomes. The OsKCS gene family is divided into six subclasses. Many cis-acting elements related to plant growth, light, hormone, and stress response were enriched in the promoters of OsKCS genes. Gene duplication played a crucial role in the expansion of the OsKCS gene family and underwent a strong purifying selection. Quantitative Real-time polymerase chain reaction (qRT-PCR) results revealed that most KCS genes are constitutively expressed. We also revealed that KCS genes responded differently to exogenous cadmium stress in japonica and indica background, and the KCS genes with higher expression in leaves and seeds may have functions under cadmium stress. This study provides a basis for further understanding the functions of KCS genes and the biosynthesis of VLCFA in rice.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Shikai Hu
- State Key Laboratory of Rice Biology and Breeding, China National Rice Research Institute, Hangzhou, China
| | - Peisong Hu
- State Key Laboratory of Rice Biology and Breeding, China National Rice Research Institute, Hangzhou, China
| | - Shaoqing Tang
- State Key Laboratory of Rice Biology and Breeding, China National Rice Research Institute, Hangzhou, China
| |
Collapse
|