1
|
Poudel S, Hyun J, Hefner Y, Monk J, Nizet V, Palsson BO. Interpreting roles of mutations associated with the emergence of S. aureus USA300 strains using transcriptional regulatory network reconstruction. eLife 2025; 12:RP90668. [PMID: 40305082 PMCID: PMC12043316 DOI: 10.7554/elife.90668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2025] Open
Abstract
The Staphylococcus aureus clonal complex 8 (CC8) is made up of several subtypes with varying levels of clinical burden; from community-associated methicillin-resistant S. aureus USA300 strains to hospital-associated (HA-MRSA) USA500 strains and ancestral methicillin-susceptible (MSSA) strains. This phenotypic distribution within a single clonal complex makes CC8 an ideal clade to study the emergence of mutations important for antibiotic resistance and community spread. Gene-level analysis comparing USA300 against MSSA and HA-MRSA strains have revealed key horizontally acquired genes important for its rapid spread in the community. However, efforts to define the contributions of point mutations and indels have been confounded by strong linkage disequilibrium resulting from clonal propagation. To break down this confounding effect, we combined genetic association testing with a model of the transcriptional regulatory network (TRN) to find candidate mutations that may have led to changes in gene regulation. First, we used a De Bruijn graph genome-wide association study to enrich mutations unique to the USA300 lineages within CC8. Next, we reconstructed the TRN by using independent component analysis on 670 RNA-sequencing samples from USA300 and non-USA300 CC8 strains which predicted several genes with strain-specific altered expression patterns. Examination of the regulatory region of one of the genes enriched by both approaches, isdH, revealed a 38-bp deletion containing a Fur-binding site and a conserved single-nucleotide polymorphism which likely led to the altered expression levels in USA300 strains. Taken together, our results demonstrate the utility of reconstructed TRNs to address the limits of genetic approaches when studying emerging pathogenic strains.
Collapse
Affiliation(s)
- Saugat Poudel
- University of California, San DiegoLa JollaUnited States
| | - Jason Hyun
- University of California, San DiegoLa JollaUnited States
| | - Ying Hefner
- University of California, San DiegoLa JollaUnited States
| | - Jon Monk
- Palmona PathogenomicsMenlo ParkUnited States
| | - Victor Nizet
- Collaborative to Halt Antibiotic-Resistant Microbes (CHARM), Department of Pediatrics, University of California San DiegoLa JollaUnited States
- Department of Pediatrics, University of California San DiegoLa JollaUnited States
| | - Bernhard O Palsson
- University of California, San DiegoLa JollaUnited States
- Collaborative to Halt Antibiotic-Resistant Microbes (CHARM), Department of Pediatrics, University of California San DiegoLa JollaUnited States
- Department of Pediatrics, University of California San DiegoLa JollaUnited States
| |
Collapse
|
2
|
Jönsson M, Sigrist R, Gren T, Semenov Petrov M, Marcussen NEJ, Svetlova A, Charusanti P, Gockel P, Palsson BO, Yang L, Özdemir E. Machine learning uncovers the transcriptional regulatory network for the production host Streptomyces albidoflavus. Cell Rep 2025; 44:115392. [PMID: 40057950 DOI: 10.1016/j.celrep.2025.115392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2024] [Revised: 01/16/2025] [Accepted: 02/12/2025] [Indexed: 03/29/2025] Open
Abstract
Streptomyces albidoflavus is a widely used strain for natural product discovery and production through heterologous biosynthetic gene clusters (BGCs). However, the transcriptional regulatory network (TRN) and its impact on secondary metabolism remain poorly understood. Here, we characterize the TRN using independent component analysis on 218 RNA sequencing (RNA-seq) transcriptomes across 88 unique growth conditions. We identify 78 independently modulated sets of genes (iModulons) that quantitatively describe the TRN across diverse conditions. Our analyses reveal (1) TRN adaptation to different growth conditions, (2) conserved and unique characteristics of the TRN across diverse lineages, (3) transcriptional activation of several endogenous BGCs, including surugamide, minimycin, and paulomycin, and (4) inferred functions of 40% of uncharacterized genes in the S. albidoflavus genome. These findings provide a comprehensive and quantitative understanding of the S. albidoflavus TRN, offering a knowledge base for further exploration and experimental validation.
Collapse
Affiliation(s)
- Mathias Jönsson
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Renata Sigrist
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Tetiana Gren
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Mykhaylo Semenov Petrov
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Nils Emil Junge Marcussen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Anna Svetlova
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Pep Charusanti
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Peter Gockel
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Bernhard O Palsson
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Lei Yang
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark.
| | - Emre Özdemir
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark.
| |
Collapse
|
3
|
Lim HG, Gao Y, Rychel K, Lamoureux C, Lou XA, Palsson BO. Revealing systematic changes in the transcriptome during the transition from exponential growth to stationary phase. mSystems 2025; 10:e0131524. [PMID: 39714213 PMCID: PMC11748552 DOI: 10.1128/msystems.01315-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2024] [Accepted: 12/04/2024] [Indexed: 12/24/2024] Open
Abstract
The composition of bacterial transcriptomes is determined by the transcriptional regulatory network (TRN). The TRN regulates the transition from one physiological state to another. Here, we use independent component analysis to monitor the composition of the transcriptome during the transition from the exponential growth phase to the stationary phase. With Escherichia coli K-12 MG1655 as a model strain, we trigger the transition using carbon, nitrogen, and sulfur starvation. We find that (i) the transition to the stationary phase accompanies common transcriptome changes, including increased stringent responses and reduced production of cellular building blocks and energy regardless of the limiting element; (ii) condition-specific changes are strongly associated with transcriptional regulators (e.g., Crp, NtrC, CysB, Cbl) responsible for metabolizing the limiting element; and (iii) the shortage of each limiting element differentially affects the production of amino acids and extracellular polymers. This study demonstrates how the combination of genome-scale datasets and new data analytics reveals the fundamental characteristics of a key transition in the life cycle of bacteria. IMPORTANCE Nutrient limitations are critical environmental perturbations in bacterial physiology. Despite its importance, a detailed understanding of how bacterial transcriptomes are adjusted has been limited. By utilizing independent component analysis (ICA) to decompose transcriptome data, this study reveals key regulatory events that enable bacteria to adapt to nutrient limitations. The findings not only highlight common responses, such as the stringent response, but also condition-specific regulatory shifts associated with carbon, nitrogen, and sulfur starvation. The insights gained from this work advance our knowledge of bacterial physiology, gene regulation, and metabolic adaptation.
Collapse
Affiliation(s)
- Hyun Gyu Lim
- Department of Biological Sciences and Bioengineering, Inha University, Incheon, South Korea
- Department of Bioengineering, University of California, San Diego, California, USA
- Joint BioEnergy Institute, Emeryville, California, USA
| | - Ye Gao
- Department of Bioengineering, University of California, San Diego, California, USA
- The Second Hospital of Shandong University, Jinan, Shandong, China
| | - Kevin Rychel
- Department of Bioengineering, University of California, San Diego, California, USA
| | - Cameron Lamoureux
- Department of Bioengineering, University of California, San Diego, California, USA
| | - Xuwen A. Lou
- Department of Bioengineering, University of California, San Diego, California, USA
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California, San Diego, California, USA
- Joint BioEnergy Institute, Emeryville, California, USA
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| |
Collapse
|
4
|
Zhang Y, Zhao J, Sun X, Zheng Y, Chen T, Wang Z. Leveraging independent component analysis to unravel transcriptional regulatory networks: A critical review and future directions. Biotechnol Adv 2025; 78:108479. [PMID: 39577573 DOI: 10.1016/j.biotechadv.2024.108479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Revised: 11/11/2024] [Accepted: 11/14/2024] [Indexed: 11/24/2024]
Abstract
Transcriptional regulatory networks (TRNs) play a crucial role in exploring microbial life activities and complex regulatory mechanisms. The comprehensive reconstruction of TRNs requires the integration of large-scale experimental data, which poses significant challenges due to the complexity of regulatory relationships. The application of machine learning tools, such as clustering analysis, has been employed to investigate TRNs, but these methods have limitations in capturing both global and local co-expression effects. In contrast, Independent Component Analysis (ICA) has emerged as a powerful analysis algorithm for modularizing independently regulated gene sets in TRNs, allowing it to account for both global and local co-expression effects. In this review, we comprehensively summarize the application of ICA in unraveling TRNs and highlight the research progress in three key aspects: (1) extending TRNs with iModulon analysis; (2) elucidating the regulatory mechanisms triggered by environmental perturbation; and (3) exploring the mechanisms of transcriptional regulation triggered by changes in microbial physiological state. At the end of this review, we also address the challenges facing ICA in TRN analysis and outline future research directions to promote the advancement of ICA-based transcriptomics analysis in biotechnology and related fields.
Collapse
Affiliation(s)
- Yuhan Zhang
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Jianxiao Zhao
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Xi Sun
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China; School of Life Science, Ningxia University, Yinchuan 750021, China
| | - Yangyang Zheng
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Tao Chen
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Zhiwen Wang
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China; School of Life Science, Ningxia University, Yinchuan 750021, China.
| |
Collapse
|
5
|
Lee Y, Choe D, Palsson BO, Cho B. Machine-Learning Analysis of Streptomyces coelicolor Transcriptomes Reveals a Transcription Regulatory Network Encompassing Biosynthetic Gene Clusters. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2403912. [PMID: 39264300 PMCID: PMC11538686 DOI: 10.1002/advs.202403912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Revised: 07/26/2024] [Indexed: 09/13/2024]
Abstract
Streptomyces produces diverse secondary metabolites of biopharmaceutical importance, yet the rate of biosynthesis of these metabolites is often hampered by complex transcriptional regulation. Therefore, a fundamental understanding of transcriptional regulation in Streptomyces is key to fully harness its genetic potential. Here, independent component analysis (ICA) of 454 high-quality gene expression profiles of the model species Streptomyces coelicolor is performed, of which 249 profiles are newly generated for S. coelicolor cultivated on 20 different carbon sources and 64 engineered strains with overexpressed sigma factors. ICA of the transcriptome dataset reveals 117 independently modulated groups of genes (iModulons), which account for 81.6% of the variance in the dataset. The genes in each iModulon are involved in specific cellular responses, which are often transcriptionally controlled by specific regulators. Also, iModulons accurately predict 25 secondary metabolite biosynthetic gene clusters encoded in the genome. This systemic analysis leads to reveal the functions of previously uncharacterized genes, putative regulons for 40 transcriptional regulators, including 30 sigma factors, and regulation of secondary metabolism via phosphate- and iron-dependent mechanisms in S. coelicolor. ICA of large transcriptomic datasets thus enlightens a new and fundamental understanding of transcriptional regulation of secondary metabolite synthesis along with interconnected metabolic processes in Streptomyces.
Collapse
Affiliation(s)
- Yongjae Lee
- Department of Biological SciencesKorea Advanced Institute of Science and TechnologyDaejeon34141Republic of Korea
| | - Donghui Choe
- Department of BioengineeringUniversity of California San DiegoLa JollaCA92093USA
| | - Bernhard O. Palsson
- Department of BioengineeringUniversity of California San DiegoLa JollaCA92093USA
- Novo Nordisk Foundation Center for BiosustainabilityTechnical University of DenmarkKemitorvet, KongensLyngby2800Denmark
| | - Byung‐Kwan Cho
- Department of Biological SciencesKorea Advanced Institute of Science and TechnologyDaejeon34141Republic of Korea
- KI for the BioCenturyKorea Advanced Institute of Science and TechnologyDaejeon34141Republic of Korea
- Graduate School of Engineering BiologyKorea Advanced Institute of Science and TechnologyDaejeon34141Republic of Korea
| |
Collapse
|
6
|
Yuan Y, Al Bulushi T, Sastry AV, Sancar C, Szubin R, Golden SS, Palsson BO. Machine learning reveals the transcriptional regulatory network and circadian dynamics of Synechococcus elongatus PCC 7942. Proc Natl Acad Sci U S A 2024; 121:e2410492121. [PMID: 39269777 PMCID: PMC11420160 DOI: 10.1073/pnas.2410492121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Accepted: 08/05/2024] [Indexed: 09/15/2024] Open
Abstract
Synechococcus elongatus is an important cyanobacterium that serves as a versatile and robust model for studying circadian biology and photosynthetic metabolism. Its transcriptional regulatory network (TRN) is of fundamental interest, as it orchestrates the cell's adaptation to the environment, including its response to sunlight. Despite the previous characterization of constituent parts of the S. elongatus TRN, a comprehensive layout of its topology remains to be established. Here, we decomposed a compendium of 300 high-quality RNA sequencing datasets of the model strain PCC 7942 using independent component analysis. We obtained 57 independently modulated gene sets, or iModulons, that explain 67% of the variance in the transcriptional response and 1) accurately reflect the activity of known transcriptional regulations, 2) capture functional components of photosynthesis, 3) provide hypotheses for regulon structures and functional annotations of poorly characterized genes, and 4) describe the transcriptional shifts under dynamic light conditions. This transcriptome-wide analysis of S. elongatus provides a quantitative reconstruction of the TRN and presents a knowledge base that can guide future investigations. Our systems-level analysis also provides a global TRN structure for S. elongatus PCC 7942.
Collapse
Affiliation(s)
- Yuan Yuan
- Shu Chien-Gene Lay Department of Bioengineering, University of California San Diego, La Jolla, CA92093
| | - Tahani Al Bulushi
- Shu Chien-Gene Lay Department of Bioengineering, University of California San Diego, La Jolla, CA92093
| | - Anand V. Sastry
- Shu Chien-Gene Lay Department of Bioengineering, University of California San Diego, La Jolla, CA92093
| | - Cigdem Sancar
- Center for Circadian Biology, University of California, San Diego, La Jolla, CA92093
| | - Richard Szubin
- Shu Chien-Gene Lay Department of Bioengineering, University of California San Diego, La Jolla, CA92093
| | - Susan S. Golden
- Center for Circadian Biology, University of California, San Diego, La Jolla, CA92093
- Department of Molecular Biology, University of California, San Diego, La Jolla, CA92093
| | - Bernhard O. Palsson
- Shu Chien-Gene Lay Department of Bioengineering, University of California San Diego, La Jolla, CA92093
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA92093
- Department of Pediatrics, University of California, San Diego, La Jolla, CA92093
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens, Lyngby2800, Denmark
| |
Collapse
|
7
|
Shin J, Zielinski DC, Palsson BO. Deciphering nutritional stress responses via knowledge-enriched transcriptomics for microbial engineering. Metab Eng 2024; 84:34-47. [PMID: 38825177 DOI: 10.1016/j.ymben.2024.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 03/27/2024] [Accepted: 05/28/2024] [Indexed: 06/04/2024]
Abstract
Understanding diverse bacterial nutritional requirements and responses is foundational in microbial research and biotechnology. In this study, we employed knowledge-enriched transcriptomic analytics to decipher complex stress responses of Vibrio natriegens to supplied nutrients, aiming to enhance microbial engineering efforts. We computed 64 independently modulated gene sets that comprise a quantitative basis for transcriptome dynamics across a comprehensive transcriptomics dataset containing a broad array of nutrient conditions. Our approach led to the i) identification of novel transporter systems for diverse substrates, ii) a detailed understanding of how trace elements affect metabolism and growth, and iii) extensive characterization of nutrient-induced stress responses, including osmotic stress, low glycolytic flux, proteostasis, and altered protein expression. By clarifying the relationship between the acetate-associated regulon and glycolytic flux status of various nutrients, we have showcased its vital role in directing optimal carbon source selection. Our findings offer deep insights into the transcriptional landscape of bacterial nutrition and underscore its significance in tailoring strain engineering strategies, thereby facilitating the development of more efficient and robust microbial systems for biotechnological applications.
Collapse
Affiliation(s)
- Jongoh Shin
- Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
| | - Daniel C Zielinski
- Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, 2800, Denmark; Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
8
|
Patel A, McGrosso D, Hefner Y, Campeau A, Sastry AV, Maurya S, Rychel K, Gonzalez DJ, Palsson BO. Proteome allocation is linked to transcriptional regulation through a modularized transcriptome. Nat Commun 2024; 15:5234. [PMID: 38898010 PMCID: PMC11187210 DOI: 10.1038/s41467-024-49231-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 05/28/2024] [Indexed: 06/21/2024] Open
Abstract
It has proved challenging to quantitatively relate the proteome to the transcriptome on a per-gene basis. Recent advances in data analytics have enabled a biologically meaningful modularization of the bacterial transcriptome. We thus investigate whether matched datasets of transcriptomes and proteomes from bacteria under diverse conditions can be modularized in the same way to reveal novel relationships between their compositions. We find that; (1) the modules of the proteome and the transcriptome are comprised of a similar list of gene products, (2) the modules in the proteome often represent combinations of modules from the transcriptome, (3) known transcriptional and post-translational regulation is reflected in differences between two sets of modules, allowing for knowledge-mapping when interpreting module functions, and (4) through statistical modeling, absolute proteome allocation can be inferred from the transcriptome alone. Quantitative and knowledge-based relationships can thus be found at the genome-scale between the proteome and transcriptome in bacteria.
Collapse
Affiliation(s)
- Arjun Patel
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Dominic McGrosso
- Department of Pharmacology, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Ying Hefner
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Anaamika Campeau
- Department of Pharmacology, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Anand V Sastry
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Svetlana Maurya
- Department of Pharmacology, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA
| | - David J Gonzalez
- Department of Pharmacology, University of California, San Diego, La Jolla, CA, 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA.
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs, Lyngby, Denmark.
| |
Collapse
|
9
|
Josephs-Spaulding J, Rajput A, Hefner Y, Szubin R, Balasubramanian A, Li G, Zielinski DC, Jahn L, Sommer M, Phaneuf P, Palsson BO. Reconstructing the transcriptional regulatory network of probiotic L. reuteri is enabled by transcriptomics and machine learning. mSystems 2024; 9:e0125723. [PMID: 38349131 PMCID: PMC10949432 DOI: 10.1128/msystems.01257-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 01/09/2024] [Indexed: 03/20/2024] Open
Abstract
Limosilactobacillus reuteri, a probiotic microbe instrumental to human health and sustainable food production, adapts to diverse environmental shifts via dynamic gene expression. We applied the independent component analysis (ICA) to 117 RNA-seq data sets to decode its transcriptional regulatory network (TRN), identifying 35 distinct signals that modulate specific gene sets. Our findings indicate that the ICA provides a qualitative advancement and captures nuanced relationships within gene clusters that other methods may miss. This study uncovers the fundamental properties of L. reuteri's TRN and deepens our understanding of its arginine metabolism and the co-regulation of riboflavin metabolism and fatty acid conversion. It also sheds light on conditions that regulate genes within a specific biosynthetic gene cluster and allows for the speculation of the potential role of isoprenoid biosynthesis in L. reuteri's adaptive response to environmental changes. By integrating transcriptomics and machine learning, we provide a system-level understanding of L. reuteri's response mechanism to environmental fluctuations, thus setting the stage for modeling the probiotic transcriptome for applications in microbial food production. IMPORTANCE We have studied Limosilactobacillus reuteri, a beneficial probiotic microbe that plays a significant role in our health and production of sustainable foods, a type of foods that are nutritionally dense and healthier and have low-carbon emissions compared to traditional foods. Similar to how humans adapt their lifestyles to different environments, this microbe adjusts its behavior by modulating the expression of genes. We applied machine learning to analyze large-scale data sets on how these genes behave across diverse conditions. From this, we identified 35 unique patterns demonstrating how L. reuteri adjusts its genes based on 50 unique environmental conditions (such as various sugars, salts, microbial cocultures, human milk, and fruit juice). This research helps us understand better how L. reuteri functions, especially in processes like breaking down certain nutrients and adapting to stressful changes. More importantly, with our findings, we become closer to using this knowledge to improve how we produce more sustainable and healthier foods with the help of microbes.
Collapse
Affiliation(s)
- Jonathan Josephs-Spaulding
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
| | - Akanksha Rajput
- Department of Bioengineering, University of California, San Diego, California, USA
| | - Ying Hefner
- Department of Bioengineering, University of California, San Diego, California, USA
| | - Richard Szubin
- Department of Bioengineering, University of California, San Diego, California, USA
| | | | - Gaoyuan Li
- Department of Bioengineering, University of California, San Diego, California, USA
| | - Daniel C. Zielinski
- Department of Bioengineering, University of California, San Diego, California, USA
| | - Leonie Jahn
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
| | - Morten Sommer
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
| | - Patrick Phaneuf
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
| | - Bernhard O. Palsson
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
- Department of Bioengineering, University of California, San Diego, California, USA
| |
Collapse
|
10
|
Borchert AJ, Bleem AC, Lim HG, Rychel K, Dooley KD, Kellermyer ZA, Hodges TL, Palsson BO, Beckham GT. Machine learning analysis of RB-TnSeq fitness data predicts functional gene modules in Pseudomonas putida KT2440. mSystems 2024; 9:e0094223. [PMID: 38323821 PMCID: PMC10949508 DOI: 10.1128/msystems.00942-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 01/07/2024] [Indexed: 02/08/2024] Open
Abstract
There is growing interest in engineering Pseudomonas putida KT2440 as a microbial chassis for the conversion of renewable and waste-based feedstocks, and metabolic engineering of P. putida relies on the understanding of the functional relationships between genes. In this work, independent component analysis (ICA) was applied to a compendium of existing fitness data from randomly barcoded transposon insertion sequencing (RB-TnSeq) of P. putida KT2440 grown in 179 unique experimental conditions. ICA identified 84 independent groups of genes, which we call fModules ("functional modules"), where gene members displayed shared functional influence in a specific cellular process. This machine learning-based approach both successfully recapitulated previously characterized functional relationships and established hitherto unknown associations between genes. Selected gene members from fModules for hydroxycinnamate metabolism and stress resistance, acetyl coenzyme A assimilation, and nitrogen metabolism were validated with engineered mutants of P. putida. Additionally, functional gene clusters from ICA of RB-TnSeq data sets were compared with regulatory gene clusters from prior ICA of RNAseq data sets to draw connections between gene regulation and function. Because ICA profiles the functional role of several distinct gene networks simultaneously, it can reduce the time required to annotate gene function relative to manual curation of RB-TnSeq data sets. IMPORTANCE This study demonstrates a rapid, automated approach for elucidating functional modules within complex genetic networks. While Pseudomonas putida randomly barcoded transposon insertion sequencing data were used as a proof of concept, this approach is applicable to any organism with existing functional genomics data sets and may serve as a useful tool for many valuable applications, such as guiding metabolic engineering efforts in other microbes or understanding functional relationships between virulence-associated genes in pathogenic microbes. Furthermore, this work demonstrates that comparison of data obtained from independent component analysis of transcriptomics and gene fitness datasets can elucidate regulatory-functional relationships between genes, which may have utility in a variety of applications, such as metabolic modeling, strain engineering, or identification of antimicrobial drug targets.
Collapse
Affiliation(s)
- Andrew J. Borchert
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
| | - Alissa C. Bleem
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
- Agile BioFoundry, Emeryville, California, USA
| | - Hyun Gyu Lim
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
- Joint BioEnergy Institute, Emeryville, California, USA
- Department of Biological Engineering, Inha University, Incheon, Korea
| | - Kevin Rychel
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Keven D. Dooley
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado, USA
- Agile BioFoundry, Emeryville, California, USA
| | - Zoe A. Kellermyer
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
| | - Tracy L. Hodges
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado, USA
- Agile BioFoundry, Emeryville, California, USA
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
- Joint BioEnergy Institute, Emeryville, California, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
- Department of Pediatrics, University of California, San Diego, California, USA
| | - Gregg T. Beckham
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
- Agile BioFoundry, Emeryville, California, USA
| |
Collapse
|
11
|
Qiu S, Huang Y, Liang S, Zeng H, Yang A. Systematic elucidation of independently modulated genes in Lactiplantibacillus plantarum reveals a trade-off between secondary and primary metabolism. Microb Biotechnol 2024; 17:e14425. [PMID: 38393514 PMCID: PMC10886434 DOI: 10.1111/1751-7915.14425] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 02/02/2024] [Indexed: 02/25/2024] Open
Abstract
Lactiplantibacillus plantarum is a probiotic bacterium widely used in food and health industries, but its gene regulatory information is limited in existing databases, which impedes the research of its physiology and its applications. To obtain a better understanding of the transcriptional regulatory network of L. plantarum, independent component analysis of its transcriptomes was used to derive 45 sets of independently modulated genes (iModulons). Those iModulons were annotated for associated transcription factors and functional pathways, and active iModulons in response to different growth conditions were identified and characterized in detail. Eventually, the analysis of iModulon activities reveals a trade-off between regulatory activities of secondary and primary metabolism in L. plantarum.
Collapse
Affiliation(s)
- Sizhe Qiu
- Department of Engineering ScienceUniversity of OxfordOxfordUK
- School of Food and HealthBeijing Technology and Business UniversityBeijingChina
| | - Yidi Huang
- School of Computer Science and EngineeringBeihang UniversityBeijingChina
| | - Shishun Liang
- Department of Life ScienceImperial College LondonLondonUK
| | - Hong Zeng
- School of Food and HealthBeijing Technology and Business UniversityBeijingChina
| | - Aidong Yang
- Department of Engineering ScienceUniversity of OxfordOxfordUK
| |
Collapse
|
12
|
Meimetis N, Pullen KM, Zhu DY, Nilsson A, Hoang TN, Magliacane S, Lauffenburger DA. AutoTransOP: translating omics signatures without orthologue requirements using deep learning. NPJ Syst Biol Appl 2024; 10:13. [PMID: 38287079 PMCID: PMC10825146 DOI: 10.1038/s41540-024-00341-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Accepted: 01/17/2024] [Indexed: 01/31/2024] Open
Abstract
The development of therapeutics and vaccines for human diseases requires a systematic understanding of human biology. Although animal and in vitro culture models can elucidate some disease mechanisms, they typically fail to adequately recapitulate human biology as evidenced by the predominant likelihood of clinical trial failure. To address this problem, we developed AutoTransOP, a neural network autoencoder framework, to map omics profiles from designated species or cellular contexts into a global latent space, from which germane information for different contexts can be identified without the typically imposed requirement of matched orthologues. This approach was found in general to perform at least as well as current alternative methods in identifying animal/culture-specific molecular features predictive of other contexts-most importantly without requiring homology matching. For an especially challenging test case, we successfully applied our framework to a set of inter-species vaccine serology studies, where 1-to-1 mapping between human and non-human primate features does not exist.
Collapse
Affiliation(s)
- Nikolaos Meimetis
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Krista M Pullen
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Daniel Y Zhu
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Avlant Nilsson
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, SE, 41296, Sweden
| | - Trong Nghia Hoang
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, 99164-236, USA
| | - Sara Magliacane
- Institute of Informatics, University of Amsterdam, Amsterdam, The Netherlands
- MIT-IBM Watson AI Lab, Cambridge, MA, 02139, USA
| | - Douglas A Lauffenburger
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
| |
Collapse
|
13
|
Miano A, Rychel K, Lezia A, Sastry A, Palsson B, Hasty J. High-resolution temporal profiling of E. coli transcriptional response. Nat Commun 2023; 14:7606. [PMID: 37993418 PMCID: PMC10665441 DOI: 10.1038/s41467-023-43173-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Accepted: 11/02/2023] [Indexed: 11/24/2023] Open
Abstract
Understanding how cells dynamically adapt to their environment is a primary focus of biology research. Temporal information about cellular behavior is often limited by both small numbers of data time-points and the methods used to analyze this data. Here, we apply unsupervised machine learning to a data set containing the activity of 1805 native promoters in E. coli measured every 10 minutes in a high-throughput microfluidic device via fluorescence time-lapse microscopy. Specifically, this data set reveals E. coli transcriptome dynamics when exposed to different heavy metal ions. We use a bioinformatics pipeline based on Independent Component Analysis (ICA) to generate insights and hypotheses from this data. We discovered three primary, time-dependent stages of promoter activation to heavy metal stress (fast, intermediate, and steady). Furthermore, we uncovered a global strategy E. coli uses to reallocate resources from stress-related promoters to growth-related promoters following exposure to heavy metal stress.
Collapse
Affiliation(s)
- Arianna Miano
- Department of Bioengineering, University of California San Diego, 9500 Gliman Dr, La Jolla, CA, USA.
| | - Kevin Rychel
- Department of Bioengineering, University of California San Diego, 9500 Gliman Dr, La Jolla, CA, USA
| | - Andrew Lezia
- Department of Bioengineering, University of California San Diego, 9500 Gliman Dr, La Jolla, CA, USA
| | - Anand Sastry
- Department of Bioengineering, University of California San Diego, 9500 Gliman Dr, La Jolla, CA, USA
| | - Bernhard Palsson
- Department of Bioengineering, University of California San Diego, 9500 Gliman Dr, La Jolla, CA, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs, Lyngby, Denmark
| | - Jeff Hasty
- Department of Bioengineering, University of California San Diego, 9500 Gliman Dr, La Jolla, CA, USA
- Department of Molecular Biology, School of Biological Sciences, University of California San Diego, 9500 Gliman Dr, La Jolla, CA, USA
- Synthetic Biology Institute, University of California San Diego, 9500 Gliman Dr, La Jolla, CA, USA
| |
Collapse
|
14
|
Zhao J, Sun X, Mao Z, Zheng Y, Geng Z, Zhang Y, Ma H, Wang Z. Independent component analysis of Corynebacterium glutamicum transcriptomes reveals its transcriptional regulatory network. Microbiol Res 2023; 276:127485. [PMID: 37683565 DOI: 10.1016/j.micres.2023.127485] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 08/28/2023] [Accepted: 08/29/2023] [Indexed: 09/10/2023]
Abstract
Gene expression in bacteria is regulated by multiple transcription factors. Clarifying the regulation mechanism of gene expression is necessary to understand bacterial physiological activities. To further understand the structure of the transcriptional regulatory network of Corynebacterium glutamicum, we applied independent component analysis, an unsupervised machine learning algorithm, to the high-quality C. glutamicum gene expression profile which includes 263 samples from 29 independent projects. We obtained 87 robust independent regulatory modules (iModulons). These iModulons explain 76.7% of the variance in the expression profile and constitute the quantitative transcriptional regulatory network of C. glutamicum. By analyzing the constituent genes in iModulons, we identified potential targets for 20 transcription factors. We also captured the changes in iModulon activities under different growth rates and dissolved oxygen concentrations, demonstrating the ability of iModulons to comprehensively interpret transcriptional responses to environmental changes. In summary, this study provides a genome-scale quantitative transcriptional regulatory network for C. glutamicum and informs future research on complex changes in the transcriptome.
Collapse
Affiliation(s)
- Jianxiao Zhao
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China; Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Xi Sun
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Zhitao Mao
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Yangyang Zheng
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Zhouxiao Geng
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Yuhan Zhang
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Hongwu Ma
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China.
| | - Zhiwen Wang
- Frontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China; SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.
| |
Collapse
|
15
|
Lamoureux CR, Decker KT, Sastry AV, Rychel K, Gao Y, McConn J, Zielinski D, Palsson BO. A multi-scale expression and regulation knowledge base for Escherichia coli. Nucleic Acids Res 2023; 51:10176-10193. [PMID: 37713610 PMCID: PMC10602906 DOI: 10.1093/nar/gkad750] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 08/02/2023] [Accepted: 09/05/2023] [Indexed: 09/17/2023] Open
Abstract
Transcriptomic data is accumulating rapidly; thus, scalable methods for extracting knowledge from this data are critical. Here, we assembled a top-down expression and regulation knowledge base for Escherichia coli. The expression component is a 1035-sample, high-quality RNA-seq compendium consisting of data generated in our lab using a single experimental protocol. The compendium contains diverse growth conditions, including: 9 media; 39 supplements, including antibiotics; 42 heterologous proteins; and 76 gene knockouts. Using this resource, we elucidated global expression patterns. We used machine learning to extract 201 modules that account for 86% of known regulatory interactions, creating the regulatory component. With these modules, we identified two novel regulons and quantified systems-level regulatory responses. We also integrated 1675 curated, publicly-available transcriptomes into the resource. We demonstrated workflows for analyzing new data against this knowledge base via deconstruction of regulation during aerobic transition. This resource illuminates the E. coli transcriptome at scale and provides a blueprint for top-down transcriptomic analysis of non-model organisms.
Collapse
Affiliation(s)
- Cameron R Lamoureux
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Katherine T Decker
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Anand V Sastry
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Ye Gao
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - John Luke McConn
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Daniel C Zielinski
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| |
Collapse
|
16
|
Bajpe H, Rychel K, Lamoureux CR, Sastry AV, Palsson BO. Machine learning uncovers the Pseudomonas syringae transcriptome in microbial communities and during infection. mSystems 2023; 8:e0043723. [PMID: 37638727 PMCID: PMC10654099 DOI: 10.1128/msystems.00437-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 07/19/2023] [Indexed: 08/29/2023] Open
Abstract
IMPORTANCE Pseudomonas syringae pv. tomato DC3000 is a model plant pathogen that infects tomatoes and Arabidopsis thaliana. The current understanding of global transcriptional regulation in the pathogen is limited. Here, we applied iModulon analysis to a compendium of RNA-seq data to unravel its transcriptional regulatory network. We characterize each co-regulated gene set, revealing the activity of major regulators across diverse conditions. We provide new insights on the transcriptional dynamics in interactions with the plant immune system and with other bacterial species, such as AlgU-dependent regulation of flagellar genes during plant infection and downregulation of siderophore production in the presence of a siderophore cheater. This study demonstrates the novel application of iModulons in studying temporal dynamics during host-pathogen and microbe-microbe interactions, and reveals specific insights of interest.
Collapse
Affiliation(s)
- Heera Bajpe
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Cameron R. Lamoureux
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Anand V. Sastry
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
- Department of Pediatrics, University of California San Diego, La Jolla, California, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, California, USA
- Center for Microbiome Innovation, University of California San Diego, La Jolla, California, USA
- Novo Nordisk Foundation Center for Biosustainability, Kongens Lyngby, Denmark
| |
Collapse
|
17
|
Rychel K, Tan J, Patel A, Lamoureux C, Hefner Y, Szubin R, Johnsen J, Mohamed ETT, Phaneuf PV, Anand A, Olson CA, Park JH, Sastry AV, Yang L, Feist AM, Palsson BO. Laboratory evolution, transcriptomics, and modeling reveal mechanisms of paraquat tolerance. Cell Rep 2023; 42:113105. [PMID: 37713311 PMCID: PMC10591938 DOI: 10.1016/j.celrep.2023.113105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 07/09/2023] [Accepted: 08/23/2023] [Indexed: 09/17/2023] Open
Abstract
Relationships between the genome, transcriptome, and metabolome underlie all evolved phenotypes. However, it has proved difficult to elucidate these relationships because of the high number of variables measured. A recently developed data analytic method for characterizing the transcriptome can simplify interpretation by grouping genes into independently modulated sets (iModulons). Here, we demonstrate how iModulons reveal deep understanding of the effects of causal mutations and metabolic rewiring. We use adaptive laboratory evolution to generate E. coli strains that tolerate high levels of the redox cycling compound paraquat, which produces reactive oxygen species (ROS). We combine resequencing, iModulons, and metabolic models to elucidate six interacting stress-tolerance mechanisms: (1) modification of transport, (2) activation of ROS stress responses, (3) use of ROS-sensitive iron regulation, (4) motility, (5) broad transcriptional reallocation toward growth, and (6) metabolic rewiring to decrease NADH production. This work thus demonstrates the power of iModulon knowledge mapping for evolution analysis.
Collapse
Affiliation(s)
- Kevin Rychel
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Justin Tan
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Arjun Patel
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Cameron Lamoureux
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Ying Hefner
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Richard Szubin
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Josefin Johnsen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Elsayed Tharwat Tolba Mohamed
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Patrick V Phaneuf
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Amitesh Anand
- Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, Mumbai, Maharashtra, India
| | - Connor A Olson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Joon Ho Park
- Department of Chemical Engineering, Massachusetts Institute of Technology, 500 Main Street, Building 76, Cambridge, MA 02139, USA
| | - Anand V Sastry
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Laurence Yang
- Department of Chemical Engineering, Queen's University, Kingston, ON K7L 3N6, Canada
| | - Adam M Feist
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| | - Bernhard O Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark.
| |
Collapse
|
18
|
Gao ZP, Gu WC, Li J, Qiu QT, Ma BG. Independent Component Analysis Reveals the Transcriptional Regulatory Modules in Bradyrhizobium diazoefficiens USDA110. Int J Mol Sci 2023; 24:12544. [PMID: 37628727 PMCID: PMC10454721 DOI: 10.3390/ijms241612544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 07/30/2023] [Accepted: 08/05/2023] [Indexed: 08/27/2023] Open
Abstract
The dynamic adaptation of bacteria to environmental changes is achieved through the coordinated expression of many genes, which constitutes a transcriptional regulatory network (TRN). Bradyrhizobium diazoefficiens USDA110 is an important model strain for the study of symbiotic nitrogen fixation (SNF), and its SNF ability largely depends on the TRN. In this study, independent component analysis was applied to 226 high-quality gene expression profiles of B. diazoefficiens USDA110 microarray datasets, from which 64 iModulons were identified. Using these iModulons and their condition-specific activity levels, we (1) provided new insights into the connection between the FixLJ-FixK2-FixK1 regulatory cascade and quorum sensing, (2) discovered the independence of the FixLJ-FixK2-FixK1 and NifA/RpoN regulatory cascades in response to oxygen, (3) identified the FixLJ-FixK2 cascade as a mediator connecting the FixK2-2 iModulon and the Phenylalanine iModulon, (4) described the differential activation of iModulons in B. diazoefficiens USDA110 under different environmental conditions, and (5) proposed a notion of active-TRN based on the changes in iModulon activity to better illustrate the relationship between gene regulation and environmental condition. In sum, this research offered an iModulon-based TRN for B. diazoefficiens USDA110, which formed a foundation for comprehensively understanding the intricate transcriptional regulation during SNF.
Collapse
Affiliation(s)
| | | | | | | | - Bin-Guang Ma
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China; (Z.-P.G.); (W.-C.G.); (J.L.); (Q.-T.Q.)
| |
Collapse
|
19
|
Hirose Y, Poudel S, Sastry AV, Rychel K, Lamoureux CR, Szubin R, Zielinski DC, Lim HG, Menon ND, Bergsten H, Uchiyama S, Hanada T, Kawabata S, Palsson BO, Nizet V. Elucidation of independently modulated genes in Streptococcus pyogenes reveals carbon sources that control its expression of hemolytic toxins. mSystems 2023; 8:e0024723. [PMID: 37278526 PMCID: PMC10308926 DOI: 10.1128/msystems.00247-23] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Accepted: 04/02/2023] [Indexed: 06/07/2023] Open
Abstract
Streptococcus pyogenes can cause a wide variety of acute infections throughout the body of its human host. An underlying transcriptional regulatory network (TRN) is responsible for altering the physiological state of the bacterium to adapt to each unique host environment. Consequently, an in-depth understanding of the comprehensive dynamics of the S. pyogenes TRN could inform new therapeutic strategies. Here, we compiled 116 existing high-quality RNA sequencing data sets of invasive S. pyogenes serotype M1 and estimated the TRN structure in a top-down fashion by performing independent component analysis (ICA). The algorithm computed 42 independently modulated sets of genes (iModulons). Four iModulons contained the nga-ifs-slo virulence-related operon, which allowed us to identify carbon sources that control its expression. In particular, dextrin utilization upregulated the nga-ifs-slo operon by activation of two-component regulatory system CovRS-related iModulons, altering bacterial hemolytic activity compared to glucose or maltose utilization. Finally, we show that the iModulon-based TRN structure can be used to simplify the interpretation of noisy bacterial transcriptome data at the infection site. IMPORTANCE S. pyogenes is a pre-eminent human bacterial pathogen that causes a wide variety of acute infections throughout the body of its host. Understanding the comprehensive dynamics of its TRN could inform new therapeutic strategies. Since at least 43 S. pyogenes transcriptional regulators are known, it is often difficult to interpret transcriptomic data from regulon annotations. This study shows the novel ICA-based framework to elucidate the underlying regulatory structure of S. pyogenes allows us to interpret the transcriptome profile using data-driven regulons (iModulons). Additionally, the observations of the iModulon architecture lead us to identify the multiple regulatory inputs governing the expression of a virulence-related operon. The iModulons identified in this study serve as a powerful guidepost to further our understanding of S. pyogenes TRN structure and dynamics.
Collapse
Affiliation(s)
- Yujiro Hirose
- Department of Microbiology, Graduate School of Dentistry, Osaka University, Suita, Osaka, Japan
- Department of Pediatrics, University of California at San Diego School of Medicine, La Jolla, California, USA
| | - Saugat Poudel
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Anand V. Sastry
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Cameron R. Lamoureux
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Richard Szubin
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Daniel C. Zielinski
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Hyun Gyu Lim
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
- Department of Biological Engineering, Inha University, Michuhol-gu, Incheon, South Korea
| | - Nitasha D. Menon
- Department of Pediatrics, University of California at San Diego School of Medicine, La Jolla, California, USA
- School of Biotechnology, Amrita Vishwa Vidyapeetham, Amritapuri, Kerala, India
| | - Helena Bergsten
- Department of Pediatrics, University of California at San Diego School of Medicine, La Jolla, California, USA
| | - Satoshi Uchiyama
- Department of Pediatrics, University of California at San Diego School of Medicine, La Jolla, California, USA
| | - Tomoki Hanada
- Department of Microbiology, Graduate School of Dentistry, Osaka University, Suita, Osaka, Japan
| | - Shigetada Kawabata
- Department of Microbiology, Graduate School of Dentistry, Osaka University, Suita, Osaka, Japan
- Center for Infectious Diseases Education and Research, Osaka University, Suita, Osaka, Japan
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Victor Nizet
- Department of Pediatrics, University of California at San Diego School of Medicine, La Jolla, California, USA
- Skaggs School of Pharmaceutical Sciences, University of California at San Diego, La Jolla, California, USA
| |
Collapse
|
20
|
Shin J, Rychel K, Palsson BO. Systems biology of competency in Vibrio natriegens is revealed by applying novel data analytics to the transcriptome. Cell Rep 2023; 42:112619. [PMID: 37285268 DOI: 10.1016/j.celrep.2023.112619] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 04/27/2023] [Accepted: 05/22/2023] [Indexed: 06/09/2023] Open
Abstract
Vibrio natriegens regulates natural competence through the TfoX and QstR transcription factors, which are involved in external DNA capture and transport. However, the extensive genetic and transcriptional regulatory basis for competency remains unknown. We used a machine-learning approach to decompose Vibrio natriegens's transcriptome into 45 groups of independently modulated sets of genes (iModulons). Our findings show that competency is associated with the repression of two housekeeping iModulons (iron metabolism and translation) and the activation of six iModulons; including TfoX and QstR, a novel iModulon of unknown function, and three housekeeping iModulons (representing motility, polycations, and reactive oxygen species [ROS] responses). Phenotypic screening of 83 gene deletion strains demonstrates that loss of iModulon function reduces or eliminates competency. This database-iModulon-discovery cycle unveils the transcriptomic basis for competency and its relationship to housekeeping functions. These results provide the genetic basis for systems biology of competency in this organism.
Collapse
Affiliation(s)
- Jongoh Shin
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Lyngby, Denmark; Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
21
|
Patel A, McGrosso D, Hefner Y, Campeau A, Sastry AV, Maurya S, Rychel K, Gonzalez DJ, Palsson BO. Proteome allocation is linked to transcriptional regulation through a modularized transcriptome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.20.529291. [PMID: 36865326 PMCID: PMC9980150 DOI: 10.1101/2023.02.20.529291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
It has proved challenging to quantitatively relate the proteome to the transcriptome on a per-gene basis. Recent advances in data analytics have enabled a biologically meaningful modularization of the bacterial transcriptome. We thus investigated whether matched datasets of transcriptomes and proteomes from bacteria under diverse conditions could be modularized in the same way to reveal novel relationships between their compositions. We found that; 1) the modules of the proteome and the transcriptome are comprised of a similar list of gene products, 2) the modules in the proteome often represent combinations of modules from the transcriptome, 3) known transcriptional and post-translational regulation is reflected in differences between two sets of modules, allowing for knowledge-mapping when interpreting module functions, and 4) through statistical modeling, absolute proteome allocation can be inferred from the transcriptome alone. Quantitative and knowledge-based relationships can thus be found at the genome-scale between the proteome and transcriptome in bacteria.
Collapse
Affiliation(s)
- Arjun Patel
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Dominic McGrosso
- Department of Pharmacology, University of California, San Diego, La Jolla, CA 92093, USA
| | - Ying Hefner
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Anaamika Campeau
- Department of Pharmacology, University of California, San Diego, La Jolla, CA 92093, USA
| | - Anand V. Sastry
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Svetlana Maurya
- Department of Pharmacology, University of California, San Diego, La Jolla, CA 92093, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - David J Gonzalez
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kgs. Lyngby, Denmark
| |
Collapse
|
22
|
Pan-Genome Analysis of Transcriptional Regulation in Six Salmonella enterica Serovar Typhimurium Strains Reveals Their Different Regulatory Structures. mSystems 2022; 7:e0046722. [PMID: 36317888 PMCID: PMC9764980 DOI: 10.1128/msystems.00467-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
Establishing transcriptional regulatory networks (TRNs) in bacteria has been limited to well-characterized model strains. Using machine learning methods, we established the transcriptional regulatory networks of six Salmonella enterica serovar Typhimurium strains from their transcriptomes. By decomposing a compendia of RNA sequencing (RNA-seq) data with independent component analysis, we obtained 400 independently modulated sets of genes, called iModulons. We (i) performed pan-genome analysis of the phylogroup structure of S. Typhimurium and analyzed the iModulons against this background, (ii) revealed different genetic signatures in pathogenicity islands that explained phenotypes, (iii) discovered three transport iModulons linked to antibiotic resistance, (iv) described concerted responses to cationic antimicrobial peptides, and (v) uncovered new regulons. Thus, by combining pan-genome and transcriptomic analytics, we revealed variations in TRNs across six strains of serovar Typhimurium. IMPORTANCE Salmonella enterica serovar Typhimurium is a pathogen involved in human nontyphoidal infections. Treating S. Typhimurium infections is difficult due to the species's dynamic adaptation to its environment, which is dictated by a complex transcriptional regulatory network (TRN) that is different across strains. In this study, we describe the use of independent component analysis to characterize the differential TRNs across the S. Typhimurium pan-genome using a compendium of high-quality RNA-seq data. This approach provided unprecedented insights into the differences between regulation of key cellular functions and pathogenicity in the different strains. The study provides an impetus to initiate a large-scale effort to reveal the TRN differences between the major phylogroups of the pathogenic bacteria, which could fundamentally impact personalizing treatments of bacterial pathogens.
Collapse
|
23
|
Poudel S, Hefner Y, Szubin R, Sastry A, Gao Y, Nizet V, Palsson BO. Coordination of CcpA and CodY Regulators in Staphylococcus aureus USA300 Strains. mSystems 2022; 7:e0048022. [PMID: 36321827 PMCID: PMC9765215 DOI: 10.1128/msystems.00480-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 10/10/2022] [Indexed: 11/07/2022] Open
Abstract
The complex cross talk between metabolism and gene regulatory networks makes it difficult to untangle individual constituents and study their precise roles and interactions. To address this issue, we modularized the transcriptional regulatory network (TRN) of the Staphylococcus aureus USA300 strain by applying independent component analysis (ICA) to 385 RNA sequencing samples. We then combined the modular TRN model with a metabolic model to study the regulation of carbon and amino acid metabolism. Our analysis showed that regulation of central carbon metabolism by CcpA and amino acid biosynthesis by CodY are closely coordinated. In general, S. aureus increases the expression of CodY-regulated genes in the presence of preferred carbon sources such as glucose. This transcriptional coordination was corroborated by metabolic model simulations that also showed increased amino acid biosynthesis in the presence of glucose. Further, we found that CodY and CcpA cooperatively regulate the expression of ribosome hibernation-promoting factor, thus linking metabolic cues with translation. In line with this hypothesis, expression of CodY-regulated genes is tightly correlated with expression of genes encoding ribosomal proteins. Together, we propose a coarse-grained model where expression of S. aureus genes encoding enzymes that control carbon flux and nitrogen flux through the system is coregulated with expression of translation machinery to modularly control protein synthesis. While this work focuses on three key regulators, the full TRN model we present contains 76 total independently modulated sets of genes, each with the potential to uncover other complex regulatory structures and interactions. IMPORTANCE Staphylococcus aureus is a versatile pathogen with an expanding antibiotic resistance profile. The biology underlying its clinical success emerges from an interplay of many systems such as metabolism and gene regulatory networks. This work brings together models for these two systems to establish fundamental principles governing the regulation of S. aureus central metabolism and protein synthesis. Studies of these fundamental biological principles are often confined to model organisms such as Escherichia coli. However, expanding these models to pathogens can provide a framework from which complex and clinically important phenotypes such as virulence and antibiotic resistance can be better understood. Additionally, the expanded gene regulatory network model presented here can deconvolute the biology underlying other important phenotypes in this pathogen.
Collapse
Affiliation(s)
- Saugat Poudel
- Department of Bioengineering, University of California San Diego, San Diego, California, USA
| | - Ying Hefner
- Department of Bioengineering, University of California San Diego, San Diego, California, USA
| | - Richard Szubin
- Department of Bioengineering, University of California San Diego, San Diego, California, USA
| | - Anand Sastry
- Department of Bioengineering, University of California San Diego, San Diego, California, USA
| | - Ye Gao
- Department of Bioengineering, University of California San Diego, San Diego, California, USA
- Department of Biological Sciences, University of California San Diego, San Diego, California, USA
| | - Victor Nizet
- Collaborative to Halt Antibiotic-Resistant Microbes (CHARM), Department of Pediatrics, University of California San Diego, San Diego, California, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, California, USA
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California San Diego, San Diego, California, USA
- Collaborative to Halt Antibiotic-Resistant Microbes (CHARM), Department of Pediatrics, University of California San Diego, San Diego, California, USA
| |
Collapse
|
24
|
Rajput A, Tsunemoto H, Sastry AV, Szubin R, Rychel K, Chauhan SM, Pogliano J, Palsson BO. Advanced transcriptomic analysis reveals the role of efflux pumps and media composition in antibiotic responses of Pseudomonas aeruginosa. Nucleic Acids Res 2022; 50:9675-9688. [PMID: 36095122 PMCID: PMC9508857 DOI: 10.1093/nar/gkac743] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 08/06/2022] [Accepted: 09/06/2022] [Indexed: 11/14/2022] Open
Abstract
Pseudomonas aeruginosa is an opportunistic pathogen and major cause of hospital-acquired infections. The virulence of P. aeruginosa is largely determined by its transcriptional regulatory network (TRN). We used 411 transcription profiles of P. aeruginosa from diverse growth conditions to construct a quantitative TRN by identifying independently modulated sets of genes (called iModulons) and their condition-specific activity levels. The current study focused on the use of iModulons to analyze the biofilm production and antibiotic resistance of P. aeruginosa. Our analysis revealed: (i) 116 iModulons, 81 of which show strong association with known regulators; (ii) novel roles of regulators in modulating antibiotics efflux pumps; (iii) substrate-efflux pump associations; (iv) differential iModulon activity in response to beta-lactam antibiotics in bacteriological and physiological media; (v) differential activation of 'Cell Division' iModulon resulting from exposure to different beta-lactam antibiotics and (vi) a role of the PprB iModulon in the stress-induced transition from planktonic to biofilm lifestyle. In light of these results, the construction of an iModulon-based TRN provides a transcriptional regulatory basis for key aspects of P. aeruginosa infection, such as antibiotic stress responses and biofilm formation. Taken together, our results offer a novel mechanistic understanding of P. aeruginosa virulence.
Collapse
Affiliation(s)
- Akanksha Rajput
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Hannah Tsunemoto
- Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Anand V Sastry
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Richard Szubin
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Siddharth M Chauhan
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Joe Pogliano
- Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA.,Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA.,Center for Microbiome Innovation, University of California San Diego, La Jolla, CA 92093, USA.,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kongens, Lyngby, Denmark
| |
Collapse
|
25
|
Yi Y, Lai S, Wang W, Li S, Zhang R, Luo Y, Zhou W, Wang J. SDNMF: Semisupervised discriminative nonnegative matrix factorization for feature learning. INT J INTELL SYST 2022. [DOI: 10.1002/int.23054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Yugen Yi
- School of Software Jiangxi Normal University Nanchang China
| | - Shumin Lai
- School of Software Jiangxi Normal University Nanchang China
| | - Wenle Wang
- School of Software Jiangxi Normal University Nanchang China
| | - Shicheng Li
- School of Software Jiangxi Normal University Nanchang China
| | - Renbo Zhang
- School of Software Jiangxi Normal University Nanchang China
| | - Yong Luo
- School of Software Jiangxi Normal University Nanchang China
| | - Wei Zhou
- College of Computer Science Shenyang Aerospace University Shenyang China
| | - Jianzhong Wang
- College of Information Science and Technology Northeast Normal University Changchun China
| |
Collapse
|
26
|
Lim HG, Rychel K, Sastry AV, Bentley GJ, Mueller J, Schindel HS, Larsen PE, Laible PD, Guss AM, Niu W, Johnson CW, Beckham GT, Feist AM, Palsson BO. Machine-learning from Pseudomonas putida KT2440 transcriptomes reveals its transcriptional regulatory network. Metab Eng 2022; 72:297-310. [PMID: 35489688 DOI: 10.1016/j.ymben.2022.04.004] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 03/23/2022] [Accepted: 04/23/2022] [Indexed: 11/17/2022]
Abstract
Bacterial gene expression is orchestrated by numerous transcription factors (TFs). Elucidating how gene expression is regulated is fundamental to understanding bacterial physiology and engineering it for practical use. In this study, a machine-learning approach was applied to uncover the genome-scale transcriptional regulatory network (TRN) in Pseudomonas putida KT2440, an important organism for bioproduction. We performed independent component analysis of a compendium of 321 high-quality gene expression profiles, which were previously published or newly generated in this study. We identified 84 groups of independently modulated genes (iModulons) that explain 75.7% of the total variance in the compendium. With these iModulons, we (i) expand our understanding of the regulatory functions of 39 iModulon associated TFs (e.g., HexR, Zur) by systematic comparison with 1993 previously reported TF-gene interactions; (ii) outline transcriptional changes after the transition from the exponential growth to stationary phases; (iii) capture group of genes required for utilizing diverse carbon sources and increased stationary response with slower growth rates; (iv) unveil multiple evolutionary strategies of transcriptome reallocation to achieve fast growth rates; and (v) define an osmotic stimulon, which includes the Type VI secretion system, as coordination of multiple iModulon activity changes. Taken together, this study provides the first quantitative genome-scale TRN for P. putida KT2440 and a basis for a comprehensive understanding of its complex transcriptome changes in a variety of physiological states.
Collapse
Affiliation(s)
- Hyun Gyu Lim
- Department of Bioengineering, University of California San Diego, 9500 Gilman Dr., La Jolla, CA, 92093, USA; Joint BioEnergy Institute, 5885 Hollis Street, 4th Floor, Emeryville, CA, 94608, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California San Diego, 9500 Gilman Dr., La Jolla, CA, 92093, USA
| | - Anand V Sastry
- Department of Bioengineering, University of California San Diego, 9500 Gilman Dr., La Jolla, CA, 92093, USA
| | - Gayle J Bentley
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, 15013 Denver West Parkway, Golden, CO, 80401, USA; Agile BioFoundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Emeryville, CA, 94720, USA
| | - Joshua Mueller
- Department of Chemical & Biomolecular Engineering, University of Nebraska-Lincoln, 1400 R St, Lincoln, NE, 68588, USA
| | - Heidi S Schindel
- Biosciences Division, Oak Ridge National Laboratory, 5200 Bethel Valley Rd, Oak Ridge, TN, 37830, USA
| | - Peter E Larsen
- Biosciences Division, Argonne National Laboratory, 9700 South Cass Avenue, Lemont, IL, 60539, USA
| | - Philip D Laible
- Biosciences Division, Argonne National Laboratory, 9700 South Cass Avenue, Lemont, IL, 60539, USA
| | - Adam M Guss
- Agile BioFoundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Emeryville, CA, 94720, USA; Biosciences Division, Oak Ridge National Laboratory, 5200 Bethel Valley Rd, Oak Ridge, TN, 37830, USA
| | - Wei Niu
- Department of Chemical & Biomolecular Engineering, University of Nebraska-Lincoln, 1400 R St, Lincoln, NE, 68588, USA
| | - Christopher W Johnson
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, 15013 Denver West Parkway, Golden, CO, 80401, USA; Agile BioFoundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Emeryville, CA, 94720, USA
| | - Gregg T Beckham
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, 15013 Denver West Parkway, Golden, CO, 80401, USA; Agile BioFoundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Emeryville, CA, 94720, USA
| | - Adam M Feist
- Department of Bioengineering, University of California San Diego, 9500 Gilman Dr., La Jolla, CA, 92093, USA; Joint BioEnergy Institute, 5885 Hollis Street, 4th Floor, Emeryville, CA, 94608, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Kgs, Lyngby, Denmark
| | - Bernhard O Palsson
- Department of Bioengineering, University of California San Diego, 9500 Gilman Dr., La Jolla, CA, 92093, USA; Joint BioEnergy Institute, 5885 Hollis Street, 4th Floor, Emeryville, CA, 94608, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Kgs, Lyngby, Denmark; Department of Pediatrics, University of California, San Diego, CA, 92093, USA.
| |
Collapse
|
27
|
Rajput A, Tsunemoto H, Sastry AV, Szubin R, Rychel K, Sugie J, Pogliano J, Palsson BO. Machine learning from Pseudomonas aeruginosa transcriptomes identifies independently modulated sets of genes associated with known transcriptional regulators. Nucleic Acids Res 2022; 50:3658-3672. [PMID: 35357493 PMCID: PMC9023270 DOI: 10.1093/nar/gkac187] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Revised: 02/28/2022] [Accepted: 03/29/2022] [Indexed: 12/16/2022] Open
Abstract
The transcriptional regulatory network (TRN) of Pseudomonas aeruginosa coordinates cellular processes in response to stimuli. We used 364 transcriptomes (281 publicly available + 83 in-house generated) to reconstruct the TRN of P. aeruginosa using independent component analysis. We identified 104 independently modulated sets of genes (iModulons) among which 81 reflect the effects of known transcriptional regulators. We identified iModulons that (i) play an important role in defining the genomic boundaries of biosynthetic gene clusters (BGCs), (ii) show increased expression of the BGCs and associated secretion systems in nutrient conditions that are important in cystic fibrosis, (iii) show the presence of a novel ribosomally synthesized and post-translationally modified peptide (RiPP) BGC which might have a role in P. aeruginosa virulence, (iv) exhibit interplay of amino acid metabolism regulation and central metabolism across different carbon sources and (v) clustered according to their activity changes to define iron and sulfur stimulons. Finally, we compared the identified iModulons of P. aeruginosa with those previously described in Escherichia coli to observe conserved regulons across two Gram-negative species. This comprehensive TRN framework encompasses the majority of the transcriptional regulatory machinery in P. aeruginosa, and thus should prove foundational for future research into its physiological functions.
Collapse
Affiliation(s)
- Akanksha Rajput
- Department of Bioengineering, University of California, San Diego, La Jolla, USA
| | - Hannah Tsunemoto
- Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Anand V Sastry
- Department of Bioengineering, University of California, San Diego, La Jolla, USA
| | - Richard Szubin
- Department of Bioengineering, University of California, San Diego, La Jolla, USA
| | - Kevin Rychel
- Department of Bioengineering, University of California, San Diego, La Jolla, USA
| | - Joseph Sugie
- Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Joe Pogliano
- Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, USA.,Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA.,Center for Microbiome Innovation, University of California San Diego, La Jolla, CA 92093, USA.,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800 Kongens, Lyngby, Denmark
| |
Collapse
|