1
|
James JS, Dai J, Chew WL, Cai Y. The design and engineering of synthetic genomes. Nat Rev Genet 2025; 26:298-319. [PMID: 39506144 DOI: 10.1038/s41576-024-00786-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/23/2024] [Indexed: 11/08/2024]
Abstract
Synthetic genomics seeks to design and construct entire genomes to mechanistically dissect fundamental questions of genome function and to engineer organisms for diverse applications, including bioproduction of high-value chemicals and biologics, advanced cell therapies, and stress-tolerant crops. Recent progress has been fuelled by advancements in DNA synthesis, assembly, delivery and editing. Computational innovations, such as the use of artificial intelligence to provide prediction of function, also provide increasing capabilities to guide synthetic genome design and construction. However, translating synthetic genome-scale projects from idea to implementation remains highly complex. Here, we aim to streamline this implementation process by comprehensively reviewing the strategies for design, construction, delivery, debugging and tailoring of synthetic genomes as well as their potential applications.
Collapse
Affiliation(s)
- Joshua S James
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Junbiao Dai
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Shenzhen Key Laboratory of Agricultural Synthetic Biology, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Shenzhen Key Laboratory of Synthetic Genomics, Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Wei Leong Chew
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Yizhi Cai
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK.
| |
Collapse
|
2
|
Cantore T, Gasperini P, Bevilacqua R, Ciani Y, Sinha S, Ruppin E, Demichelis F. PRODE recovers essential and context-essential genes through neighborhood-informed scores. Genome Biol 2025; 26:42. [PMID: 40022167 PMCID: PMC11869679 DOI: 10.1186/s13059-025-03501-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 02/05/2025] [Indexed: 03/03/2025] Open
Abstract
Gene context-essentiality assessment supports precision oncology opportunities. The variability of gene effects inference from loss-of-function screenings across models and technologies limits identifying robust hits. We propose a computational framework named PRODE that integrates gene effects with protein-protein interactions to generate neighborhood-informed essential (NIE) and neighborhood-informed context essential (NICE) scores. It outperforms the canonical gene effect approach in recovering missed essential genes in shRNA screens and prioritizing context-essential hits from CRISPR-KO screens, as supported by in vitro validations. Applied to Her2 + breast cancer tumor samples, PRODE identifies oxidative phosphorylation genes as vulnerabilities with prognostic value, highlighting new therapeutic opportunities.
Collapse
Affiliation(s)
- Thomas Cantore
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy
| | - Paola Gasperini
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy
| | - Riccardo Bevilacqua
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy
| | - Yari Ciani
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy
| | - Sanju Sinha
- Cancer Data Science Laboratory (CDSL), Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA
- Currently at Sanford Burnham Prebys Medical Discovery Institute, San Diego, CA, USA
| | - Eytan Ruppin
- Cancer Data Science Laboratory (CDSL), Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Francesca Demichelis
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy.
| |
Collapse
|
3
|
Ngoi NYL, Gallo D, Torrado C, Nardo M, Durocher D, Yap TA. Synthetic lethal strategies for the development of cancer therapeutics. Nat Rev Clin Oncol 2025; 22:46-64. [PMID: 39627502 DOI: 10.1038/s41571-024-00966-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/01/2024] [Indexed: 12/20/2024]
Abstract
Synthetic lethality is a genetic phenomenon whereby the simultaneous presence of two different genetic alterations impairs cellular viability. Importantly, targeting synthetic lethal interactions offers potential therapeutic strategies for cancers with alterations in pathways that might otherwise be considered undruggable. High-throughput screening methods based on modern CRISPR-Cas9 technologies have emerged and become crucial for identifying novel synthetic lethal interactions with the potential for translation into biologically rational cancer therapeutic strategies as well as associated predictive biomarkers of response capable of guiding patient selection. Spurred by the clinical success of PARP inhibitors in patients with BRCA-mutant cancers, novel agents targeting multiple synthetic lethal interactions within DNA damage response pathways are in clinical development, and rational strategies targeting synthetic lethal interactions spanning alterations in epigenetic, metabolic and proliferative pathways have also emerged and are in late preclinical and/or early clinical testing. In this Review, we provide a comprehensive overview of established and emerging technologies for synthetic lethal drug discovery and development and discuss promising therapeutic strategies targeting such interactions.
Collapse
Affiliation(s)
- Natalie Y L Ngoi
- Department of Investigational Cancer Therapeutics (Phase I Clinical Trials Program), Division of Cancer Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Department of Haematology-Oncology, National University Cancer Institute, Singapore, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - David Gallo
- Repare Therapeutics, Inc., Montreal, Quebec, Canada
| | - Carlos Torrado
- Department of Investigational Cancer Therapeutics (Phase I Clinical Trials Program), Division of Cancer Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Mirella Nardo
- Department of Investigational Cancer Therapeutics (Phase I Clinical Trials Program), Division of Cancer Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Daniel Durocher
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Timothy A Yap
- Department of Investigational Cancer Therapeutics (Phase I Clinical Trials Program), Division of Cancer Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- Therapeutics Discovery Division, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- Khalifa Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
| |
Collapse
|
4
|
Bao L, Zhu Z, Ismail A, Zhu B, Anandan V, Whiteley M, Kitten T, Xu P. Experimental evolution of gene essentiality in bacteria. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.16.600122. [PMID: 39071448 PMCID: PMC11275930 DOI: 10.1101/2024.07.16.600122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Essential gene products carry out fundamental cellular activities in interaction with other components. However, the lack of essential gene mutants and appropriate methodologies to link essential gene functions with their partners poses significant challenges. Here, we have generated deletion mutants in 32 genes previously identified as essential, with 23 mutants showing extremely slow growth in the SK36 strain of Streptococcus sanguinis. The 23 genes corresponding to these mutants encode components of diverse pathways, are widely conserved among bacteria, and are essential in many other bacterial species. Whole-genome sequencing of 243 independently evolved populations of these mutants has identified >1000 spontaneous suppressor mutations in experimental evolution. Many of these mutations define new gene and pathway relationships, such as F1Fo-ATPase/V1Vo-ATPase/TrkA1-H1 that were demonstrated across multiple Streptococcus species. Patterns of spontaneous mutations occurring in essential gene mutants differed from those found in wildtype. While gene duplications occurred rarely and appeared most often at later stages of evolution, substitutions, deletions, and insertions were prevalent in evolved populations. These essential gene deletion mutants and spontaneous mutations fixed in the mutant populations during evolution establish a foundation for understanding gene essentiality and the interaction of essential genes in networks.
Collapse
Affiliation(s)
- Liang Bao
- Department of Oral and Craniofacial Molecular Biology, Philips Institute for Oral Health Research, School of Dentistry, Virginia Commonwealth University, Virginia, USA
| | - Zan Zhu
- Department of Oral and Craniofacial Molecular Biology, Philips Institute for Oral Health Research, School of Dentistry, Virginia Commonwealth University, Virginia, USA
| | - Ahmed Ismail
- Department of Oral and Craniofacial Molecular Biology, Philips Institute for Oral Health Research, School of Dentistry, Virginia Commonwealth University, Virginia, USA
| | - Bin Zhu
- Massey Cancer Center, Virginia Commonwealth University, Virginia, USA
| | - Vysakh Anandan
- Department of Oral and Craniofacial Molecular Biology, Philips Institute for Oral Health Research, School of Dentistry, Virginia Commonwealth University, Virginia, USA
| | - Marvin Whiteley
- School of Biological Sciences, Center for Microbial Dynamics and Infection, Georgia Institute of Technology, Georgia, USA
| | - Todd Kitten
- Department of Oral and Craniofacial Molecular Biology, Philips Institute for Oral Health Research, School of Dentistry, Virginia Commonwealth University, Virginia, USA
| | - Ping Xu
- Department of Oral and Craniofacial Molecular Biology, Philips Institute for Oral Health Research, School of Dentistry, Virginia Commonwealth University, Virginia, USA
| |
Collapse
|
5
|
Liang WW, Müller S, Hart SK, Wessels HH, Méndez-Mancilla A, Sookdeo A, Choi O, Caragine CM, Corman A, Lu L, Kolumba O, Williams B, Sanjana NE. Transcriptome-scale RNA-targeting CRISPR screens reveal essential lncRNAs in human cells. Cell 2024; 187:7637-7654.e29. [PMID: 39532094 DOI: 10.1016/j.cell.2024.10.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 07/09/2024] [Accepted: 10/12/2024] [Indexed: 11/16/2024]
Abstract
Mammalian genomes host a diverse array of RNA that includes protein-coding and noncoding transcripts. However, the functional roles of most long noncoding RNAs (lncRNAs) remain elusive. Using RNA-targeting CRISPR-Cas13 screens, we probed how the loss of ∼6,200 lncRNAs impacts cell fitness across five human cell lines and identified 778 lncRNAs with context-specific or broad essentiality. We confirm their essentiality with individual perturbations and find that the majority of essential lncRNAs operate independently of their nearest protein-coding genes. Using transcriptome profiling in single cells, we discover that the loss of essential lncRNAs impairs cell-cycle progression and drives apoptosis. Many essential lncRNAs demonstrate dynamic expression across tissues during development. Using ∼9,000 primary tumors, we pinpoint those lncRNAs whose expression in tumors correlates with survival, yielding new biomarkers and potential therapeutic targets. This transcriptome-wide survey of functional lncRNAs advances our understanding of noncoding transcripts and demonstrates the potential of transcriptome-scale noncoding screens with Cas13.
Collapse
Affiliation(s)
- Wen-Wei Liang
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Simon Müller
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Sydney K Hart
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Hans-Hermann Wessels
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Alejandro Méndez-Mancilla
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Akash Sookdeo
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Olivia Choi
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Christina M Caragine
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Alba Corman
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Lu Lu
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Olena Kolumba
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Breanna Williams
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Neville E Sanjana
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA.
| |
Collapse
|
6
|
Watson IJ, Maranas C, Nemhauser JL, Leydon AR. A Hot-Swappable Genetic Switch: Building an inducible and trackable functional assay for the essential gene MEDIATOR 21. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.16.628800. [PMID: 39763940 PMCID: PMC11702731 DOI: 10.1101/2024.12.16.628800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/19/2025]
Abstract
Essential genes, estimated at approximately 20% of the Arabidopsis genome, are broadly expressed and required for reproductive success. They are difficult to study, as interfering with their function leads to premature death. Transcription is one of the essential functions of life, and the multi-protein Mediator complex coordinates the regulation of gene expression at nearly every eukaryotic promoter. In this study, we focused on a core Mediator component called MEDIATOR21 (MED21), which is required for activation of transcription. Our previous work has also shown a role for MED21 in repression of gene expression through its interaction with a corepressor protein. Here, we sought to differentiate the role MED21 plays in activation versus repression using the model plant Arabidopsis. As mutations in MED21 lead to embryo lethal phenotypes, we constructed a set of synthetic switches using PhiC31 serine integrases to create an "on-to-off" inducible loss of function MED21 in a non-essential tissue. Our technology, which we call Integrase Erasers, made it possible for med21 mutant plants to survive into adulthood by ablating protein expression selectively in lateral root primordia, allowing quantification and characterization of med21 mutant phenotypes in a post-embryonic context. In addition, we engineered chemical induction of the Integrase Eraser to ablate MED21 expression in whole seedlings at a user-specified timepoint. Finally, we extended this technology to build a hot swappable Integrase Isoform Switch where expression of the integrase toggled cells from expressing wildtype MED21 to expressing MED21 sequence variants. Our analysis of the entire set of new integrase-based tools demonstrates that this is a highly efficient and robust approach to the study of essential genes.
Collapse
Affiliation(s)
- Isabella J Watson
- Department of Biology, University of Washington, Seattle, WA 98195-1800 USA
| | - Cassandra Maranas
- Department of Biology, University of Washington, Seattle, WA 98195-1800 USA
| | | | - Alexander R Leydon
- Department of Biology, University of Washington, Seattle, WA 98195-1800 USA
| |
Collapse
|
7
|
Xu T, Wang S, Ma T, Dong Y, Ashby CR, Hao GF. The identification of essential cellular genes is critical for validating drug targets. Drug Discov Today 2024; 29:104215. [PMID: 39428084 DOI: 10.1016/j.drudis.2024.104215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 10/06/2024] [Accepted: 10/15/2024] [Indexed: 10/22/2024]
Abstract
Accurately identifying biological targets is crucial for advancing treatment options. Essential genes, vital for cell or organism survival, hold promise as potential drug targets in disease treatment. Although many studies have sought to identify essential genes as therapeutic targets in medicine and bioinformatics, systematic reviews on their relationship with drug targets are relatively rare. This work presents a comprehensive analysis to aid in identifying essential genes as potential targets for drug discovery, encompassing their relevance, identification methods, successful case studies, and challenges. This work will facilitate the identification of essential genes as therapeutic targets, thereby boosting new drug development.
Collapse
Affiliation(s)
- Ting Xu
- School of Pharmaceutical Sciences, Guizhou Engineering Laboratory for Synthetic Drugs, Guizhou University, Guiyang 550025, China
| | - Shuang Wang
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, China
| | - Tingting Ma
- School of Pharmaceutical Sciences, Guizhou Engineering Laboratory for Synthetic Drugs, Guizhou University, Guiyang 550025, China
| | - Yawen Dong
- School of Pharmaceutical Sciences, Guizhou Engineering Laboratory for Synthetic Drugs, Guizhou University, Guiyang 550025, China.
| | - Charles R Ashby
- Department of Pharmaceutical Sciences, St. John's University, New York, NY, USA.
| | - Ge-Fei Hao
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, China.
| |
Collapse
|
8
|
Oktriani R, Pirona AC, Kalmár L, Rahadian AS, Miao B, Bauer AS, Hoheisel JD, Boettcher M, Du H. Genome-Wide CRISPR Screen Identifies Genes Involved in Metastasis of Pancreatic Ductal Adenocarcinoma. Cancers (Basel) 2024; 16:3684. [PMID: 39518122 PMCID: PMC11545026 DOI: 10.3390/cancers16213684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2024] [Revised: 10/28/2024] [Accepted: 10/30/2024] [Indexed: 11/16/2024] Open
Abstract
Background/Objectives: Early and aggressive metastasis is a major feature of pancreatic ductal adenocarcinoma. Understanding the processes underlying metastasis is crucial for making a difference to disease outcome. Towards these ends, we looked in a comprehensive manner for genes that are metastasis-specific. Methods: A genome-wide CRISPR-Cas9 gene knockout screen with 259,900 single guide RNA constructs was performed on pancreatic cancer cell lines with very high or very low metastatic capacity, respectively. Functional aspects of some of the identified genes were analysed in vitro. The injection of tumour cells with or without a gene knockout into mice was used to confirm the effect on metastasis. Results: The knockout of 590 genes-and, with higher analysis stringency, 67 genes-affected the viability of metastatic cells substantially, while these genes were not vital to non-metastasizing cells. Further evaluations identified different molecular processes related to this observation. One of the genes was MYBL2, encoding for a well-known transcription factor involved in the regulation of cell survival, proliferation, and differentiation in cancer tissues. In our metastasis-focussed study, no novel functional activity was detected for MYBL2, however. Instead, a metastasis-specific transformation of its genetic interaction with FOXM1 was observed. The interaction was synergistic in cells of low metastatic capacity, while there was a strong switch to a buffering mode in metastatic cells. In vivo analyses confirmed the strong effect of MYBL2 on metastasis. Conclusions: The genes found to be critical for the viability of metastatic cells form a basis for further investigations of the processes responsible for triggering and driving metastasis. As shown for MYBL2, unexpected processes of regulating metastasis might also be involved.
Collapse
Affiliation(s)
- Risky Oktriani
- Functional Genome Analysis, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany; (R.O.); (A.C.P.); (L.K.); (A.S.R.); (A.S.B.); (H.D.)
- Faculty of Biosciences, Heidelberg University, Im Neuenheimer Feld 234, 69120 Heidelberg, Germany
- Department of Biochemistry, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, Farmako Sekip Utara, Yogyakarta 55281, Indonesia
| | - Anna Chiara Pirona
- Functional Genome Analysis, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany; (R.O.); (A.C.P.); (L.K.); (A.S.R.); (A.S.B.); (H.D.)
- Faculty of Biosciences, Heidelberg University, Im Neuenheimer Feld 234, 69120 Heidelberg, Germany
| | - Lili Kalmár
- Functional Genome Analysis, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany; (R.O.); (A.C.P.); (L.K.); (A.S.R.); (A.S.B.); (H.D.)
- Mannheim University of Applied Sciences, Paul-Wittsack-Straße 10, 68163 Mannheim, Germany
| | - Ariani S. Rahadian
- Functional Genome Analysis, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany; (R.O.); (A.C.P.); (L.K.); (A.S.R.); (A.S.B.); (H.D.)
- Faculty of Biosciences, Heidelberg University, Im Neuenheimer Feld 234, 69120 Heidelberg, Germany
| | - Beiping Miao
- Immune Regulation in Cancer, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany;
| | - Andrea S. Bauer
- Functional Genome Analysis, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany; (R.O.); (A.C.P.); (L.K.); (A.S.R.); (A.S.B.); (H.D.)
| | - Jörg D. Hoheisel
- Functional Genome Analysis, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany; (R.O.); (A.C.P.); (L.K.); (A.S.R.); (A.S.B.); (H.D.)
| | - Michael Boettcher
- Medical Faculty, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3a, 06120 Halle, Germany;
| | - Haoqi Du
- Functional Genome Analysis, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany; (R.O.); (A.C.P.); (L.K.); (A.S.R.); (A.S.B.); (H.D.)
- School of Medicine, Faculty of Life Sciences and Medicine, Northwest University, 229 Taibai North Road, Xi’an 710069, China
| |
Collapse
|
9
|
A. Ghomi F, Jung JJ, Langridge GC, Cain AK, Boinett CJ, Abd El Ghany M, Pickard DJ, Kingsley RA, Thomson NR, Parkhill J, Gardner PP, Barquist L. High-throughput transposon mutagenesis in the family Enterobacteriaceae reveals core essential genes and rapid turnover of essentiality. mBio 2024; 15:e0179824. [PMID: 39207104 PMCID: PMC11481867 DOI: 10.1128/mbio.01798-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Accepted: 08/05/2024] [Indexed: 09/04/2024] Open
Abstract
The Enterobacteriaceae are a scientifically and medically important clade of bacteria, containing the model organism Escherichia coli, as well as major human pathogens including Salmonella enterica and Klebsiella pneumoniae. Essential gene sets have been determined for several members of the Enterobacteriaceae, with the Keio E. coli single-gene deletion library often regarded as a gold standard. However, it remains unclear how gene essentiality varies between related strains and species. To investigate this, we have assembled a collection of 13 sequenced high-density transposon mutant libraries from five genera within the Enterobacteriaceae. We first assess several gene essentiality prediction approaches, investigate the effects of transposon density on essentiality prediction, and identify biases in transposon insertion sequencing data. Based on these investigations, we develop a new classifier for gene essentiality. Using this new classifier, we define a core essential genome in the Enterobacteriaceae of 201 universally essential genes. Despite the presence of a large cohort of variably essential genes, we find an absence of evidence for genus-specific essential genes. A clear example of this sporadic essentiality is given by the set of genes regulating the σE extracytoplasmic stress response, which appears to have independently acquired essentiality multiple times in the Enterobacteriaceae. Finally, we compare our essential gene sets to the natural experiment of gene loss in obligate insect endosymbionts that have emerged from within the Enterobacteriaceae. This isolates a remarkably small set of genes absolutely required for survival and identifies several instances of essential stress responses masked by redundancy in free-living bacteria.IMPORTANCEThe essential genome, that is the set of genes absolutely required to sustain life, is a core concept in genetics. Essential genes in bacteria serve as drug targets, put constraints on the engineering of biological chassis for technological or industrial purposes, and are key to constructing synthetic life. Despite decades of study, relatively little is known about how gene essentiality varies across related bacteria. In this study, we have collected gene essentiality data for 13 bacteria related to the model organism Escherichia coli, including several human pathogens, and investigated the conservation of essentiality. We find that approximately a third of the genes essential in any particular strain are non-essential in another related strain. Surprisingly, we do not find evidence for essential genes unique to specific genera; rather it appears a substantial fraction of the essential genome rapidly gains or loses essentiality during evolution. This suggests that essentiality is not an immutable characteristic but depends crucially on the genomic context. We illustrate this through a comparison of our essential genes in free-living bacteria to genes conserved in 34 insect endosymbionts with naturally reduced genomes, finding several cases where genes generally regarded as being important for specific stress responses appear to have become essential in endosymbionts due to a loss of functional redundancy in the genome.
Collapse
Affiliation(s)
- Fatemeh A. Ghomi
- Biomolecular Interactions Centre, School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Jakob J. Jung
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Center for Infection Research (HZI), Würzburg, Germany
| | - Gemma C. Langridge
- Microbes in the Food Chain, Quadram Institute Bioscience, Norwich Research Park, Norwich, United Kingdom
| | - Amy K. Cain
- ARC Centre of Excellence in Synthetic Biology, School of Natural Sciences, Macquarie University, Sydney, Australia
| | | | - Moataz Abd El Ghany
- The Westmead Institute for Medical Research, University of Sydney, Sydney, Australia
- Sydney Institute for Infectious Diseases, University of Sydney, Sydney, Australia
- School of Public Health, Faculty of Medicine and Health, University of Sydney, Sydney, Australia
- King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Derek J. Pickard
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Robert A. Kingsley
- Microbes in the Food Chain, Quadram Institute Bioscience, Norwich Research Park, Norwich, United Kingdom
- Department of Biological Sciences, University of East Anglia, Norwich, United Kingdom
| | - Nicholas R. Thomson
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Paul P. Gardner
- Biomolecular Interactions Centre, School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
- Department of Biochemistry, Otago University, Dunedin, New Zealand
| | - Lars Barquist
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Center for Infection Research (HZI), Würzburg, Germany
- Faculty of Medicine, University of Würzburg, Würzburg, Germany
- Department of Biology, University of Toronto, Mississauga, Ontario, Canada
| |
Collapse
|
10
|
Chen J, Nilsen ED, Chitboonthavisuk C, Mo CY, Raman S. Systematic, high-throughput characterization of bacteriophage gene essentiality on diverse hosts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.10.617714. [PMID: 39416107 PMCID: PMC11482910 DOI: 10.1101/2024.10.10.617714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Understanding core and conditional gene essentiality is crucial for decoding genotype-phenotype relationships in organisms. We present PhageMaP, a high-throughput method to create genome-scale phage knockout libraries for systematically assessing gene essentiality in bacteriophages. Using PhageMaP, we generate gene essentiality maps across hundreds of genes in the model phage T7 and the non-model phage Bas63, on diverse hosts. These maps provide fundamental insights into genome organization, gene function, and host-specific conditional essentiality. By applying PhageMaP to a collection of anti-phage defense systems, we uncover phage genes that either inhibit or activate eight defenses and offer novel mechanistic hypotheses. Furthermore, we engineer synthetic phages with enhanced infectivity by modular transfer of a PhageMaP-discovered defense inhibitor from Bas63 to T7. PhageMaP is generalizable, as it leverages homologous recombination, a universal cellular process, for locus-specific barcoding. This versatile tool advances bacteriophage functional genomics and accelerates rational phage design for therapy.
Collapse
Affiliation(s)
- Jackie Chen
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Erick D Nilsen
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
| | | | - Charlie Y Mo
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
| | - Srivatsan Raman
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
11
|
Granata I, Maddalena L, Manzo M, Guarracino MR, Giordano M. HELP: A computational framework for labelling and predicting human common and context-specific essential genes. PLoS Comput Biol 2024; 20:e1012076. [PMID: 39331694 PMCID: PMC11463781 DOI: 10.1371/journal.pcbi.1012076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 10/09/2024] [Accepted: 08/19/2024] [Indexed: 09/29/2024] Open
Abstract
Machine learning-based approaches are particularly suitable for identifying essential genes as they allow the generation of predictive models trained on features from multi-source data. Gene essentiality is neither binary nor static but determined by the context. The databases for essential gene annotation do not permit the personalisation of the context, and their update can be slower than the publication of new experimental data. We propose HELP (Human Gene Essentiality Labelling & Prediction), a computational framework for labelling and predicting essential genes. Its double scope allows for identifying genes based on dependency or not on experimental data. The effectiveness of the labelling method was demonstrated by comparing it with other approaches in overlapping the reference sets of essential gene annotations, where HELP demonstrated the best compromise between false and true positive rates. The gene attributes, including multi-omics and network embedding features, lead to high-performance prediction of essential genes while confirming the existence of essentiality nuances.
Collapse
Affiliation(s)
- Ilaria Granata
- Institute for High-Performance Computing and Networking, National Research Council, Naples, Italy
| | - Lucia Maddalena
- Institute for High-Performance Computing and Networking, National Research Council, Naples, Italy
| | - Mario Manzo
- Information Technology Services, University of Naples “L’Orientale”, Naples, Italy
| | - Mario Rosario Guarracino
- Laboratory of Algorithms and Technologies for Network Analysis, National Research University Higher School of Economics, Nizhny Novgorod, Russia
- Department of Economics and Law, University of Cassino and Southern Lazio, Cassino, Frosinone, Italy
| | - Maurizio Giordano
- Institute for High-Performance Computing and Networking, National Research Council, Naples, Italy
| |
Collapse
|
12
|
Rahiminejad S, De Sanctis B, Pevzner P, Mushegian A. Synthetic lethality and the minimal genome size problem. mSphere 2024; 9:e0013924. [PMID: 38904396 PMCID: PMC11288024 DOI: 10.1128/msphere.00139-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 05/13/2024] [Indexed: 06/22/2024] Open
Abstract
Gene knockout studies suggest that ~300 genes in a bacterial genome and ~1,100 genes in a yeast genome cannot be deleted without loss of viability. These single-gene knockout experiments do not account for negative genetic interactions, when two or more genes can each be deleted without effect, but their joint deletion is lethal. Thus, large-scale single-gene deletion studies underestimate the size of a minimal gene set compatible with cell survival. In yeast Saccharomyces cerevisiae, the viability of all possible deletions of gene pairs (2-tuples), and of some deletions of gene triplets (3-tuples), has been experimentally tested. To estimate the size of a yeast minimal genome from that data, we first established that finding the size of a minimal gene set is equivalent to finding the minimum vertex cover in the lethality (hyper)graph, where the vertices are genes and (hyper)edges connect k-tuples of genes whose joint deletion is lethal. Using the Lovász-Johnson-Chvatal greedy approximation algorithm, we computed the minimum vertex cover of the synthetic-lethal 2-tuples graph to be 1,723 genes. We next simulated the genetic interactions in 3-tuples, extrapolating from the existing triplet sample, and again estimated minimum vertex covers. The size of a minimal gene set in yeast rapidly approaches the size of the entire genome even when considering only synthetic lethalities in k-tuples with small k. In contrast, several studies reported successful experimental reductions of yeast and bacterial genomes by simultaneous deletions of hundreds of genes, without eliciting synthetic lethality. We discuss possible reasons for this apparent contradiction.IMPORTANCEHow can we estimate the smallest number of genes sufficient for a unicellular organism to survive on a rich medium? One approach is to remove genes one at a time and count how many of such deletion strains are unable to grow. However, the single-gene knockout data are insufficient, because joint gene deletions may result in negative genetic interactions, also known as synthetic lethality. We used a technique from graph theory to estimate the size of minimal yeast genome from partial data on synthetic lethality. The number of potential synthetic lethal interactions grows very fast when multiple genes are deleted, revealing a paradoxical contrast with the experimental reductions of yeast genome by ~100 genes, and of bacterial genomes by several hundreds of genes.
Collapse
Affiliation(s)
- Sara Rahiminejad
- Department of Bioengineering, University of California—San Diego, La Jolla, California, USA
| | - Bianca De Sanctis
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Department of Ecology and Evolutionary Biology, University of California—Santa Cruz, Santa Cruz, California, USA
| | - Pavel Pevzner
- Department of Computer Science and Engineering, University of California—San Diego, La Jolla, California, USA
| | - Arcady Mushegian
- Molecular and Cellular Biosciences Division, National Science Foundation, Alexandria, Virginia, USA
- Clare Hall College, Cambridge, United Kingdom
| |
Collapse
|
13
|
Kurmi A, Sen P, Dash M, Ray SK, Satapathy SS. Differentially used codons among essential genes in bacteria identified by machine learning-based analysis. Mol Genet Genomics 2024; 299:72. [PMID: 39060647 DOI: 10.1007/s00438-024-02163-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Accepted: 07/10/2024] [Indexed: 07/28/2024]
Abstract
Codon usage bias (CUB), the uneven usage of synonymous codons encoding the same amino acid, differs among genes within and across bacteria genomes. CUB is known to be influenced by gene expression and accordingly, CUB differs between the high-expression and low-expression genes in several bacteria. In this article, we have extended codon usage study considering gene essentiality as a feature. Using machine learning (ML) based approaches, we have analysed Relative Synonymous Codon Usage (RSCU) values between essential and non-essential genes in Escherichia coli and thirty-four other bacterial genomes whose gene essentiality features were available in public databases. We observed significant differences in codon usage patterns between essential and non-essential genes for majority of the bacterial genomes and accordingly, ML based classifiers achieved high area under curve (AUC) scores, with a minimum score of 70.0 across twenty-eight organisms. Further, importance of the codons towards classifying genes found to differ among the codons in each genome. Arg codon CGT and Gly codon GGT were observed to be the most preferred codons among essential genes in Escherichia coli. Interestingly, some of the codons like CGT, ATA, GGT and GGG observed to be contributing consistently towards classifying essential genes across thirty-five bacteria genomes studied. In other hand, codons TGY and CAY encoding amino acids Cys and His respectively were among the least contributing codons towards classification among all these bacteria. This study demonstrates the gene essentiality based differences in synonymous codon usage in bacteria genomes and presents a common codon usage pattern across bacteria.
Collapse
Affiliation(s)
- Annushree Kurmi
- Department of Computer Science and Engineering, Tezpur University, Napaam, Assam, 784028, India
- Department of Computer Science and Engineering, The Assam Kaziranga University, Jorhat, Assam, 785006, India
| | - Piyali Sen
- Department of Computer Science and Engineering, Tezpur University, Napaam, Assam, 784028, India
| | - Madhusmita Dash
- Department of Electronics and Communication Engineering, NIT, Jote, Arunachal Pradesh, 791113, India
| | - Suvendra Kumar Ray
- Department of Molecular Biology and Biotechnology, Tezpur University, Napaam, Assam, 784028, India
| | | |
Collapse
|
14
|
Delarouzée A, Lopes Ferreira N, Baum C, Wasels F. Gene essentiality in the solventogenic Clostridium acetobutylicum DSM 792. Appl Environ Microbiol 2024; 90:e0028224. [PMID: 38864631 PMCID: PMC11267918 DOI: 10.1128/aem.00282-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 05/20/2024] [Indexed: 06/13/2024] Open
Abstract
Clostridium acetobutylicum is a solventogenic, anaerobic, gram-positive bacterium that is commonly considered the model organism for studying acetone-butanol-ethanol fermentation. The need to produce these chemicals sustainably and with a minimal impact on the environment has revived the interest in research on this bacterium. The recent development of efficient genetic tools allows to better understand the physiology of this micro-organism, aiming at improving its fermentation capacities. Knowledge about gene essentiality would guide the future genetic editing strategies and support the understanding of crucial cellular functions in this bacterium. In this work, we applied a transposon insertion site sequencing method to generate large mutant libraries containing millions of independent mutants that allowed us to identify a core group of 418 essential genes needed for in vitro development. Future research on this significant biocatalyst will be guided by the data provided in this work, which will serve as a valuable resource for the community. IMPORTANCE Clostridium acetobutylicum is a leading candidate to synthesize valuable compounds like three and four carbons alcohols. Its ability to convert carbohydrates into a mixture of acetone, butanol, and ethanol as well as other chemicals of interest upon genetic engineering makes it an advantageous organism for the valorization of lignocellulose-derived sugar mixtures. Since, genetic optimization depends on the fundamental insights supplied by accurate gene function assignment, gene essentiality analysis is of great interest as it can shed light on the function of many genes whose functions are still to be confirmed. The data obtained in this study will be of great value for the research community aiming to develop C. acetobutylicum as a platform organism for the production of chemicals of interest.
Collapse
Affiliation(s)
| | | | - Chloé Baum
- Institut Pasteur, Université Paris Cité, Plate-forme Technologique Biomics, Paris, France
| | | |
Collapse
|
15
|
Xiao MS, Damodaran AP, Kumari B, Dickson E, Xing K, On TA, Parab N, King HE, Perez AR, Guiblet WM, Duncan G, Che A, Chari R, Andresson T, Vidigal JA, Weatheritt RJ, Aregger M, Gonatopoulos-Pournatzis T. Genome-scale exon perturbation screens uncover exons critical for cell fitness. Mol Cell 2024; 84:2553-2572.e19. [PMID: 38917794 PMCID: PMC11246229 DOI: 10.1016/j.molcel.2024.05.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Revised: 04/04/2024] [Accepted: 05/24/2024] [Indexed: 06/27/2024]
Abstract
CRISPR-Cas technology has transformed functional genomics, yet understanding of how individual exons differentially shape cellular phenotypes remains limited. Here, we optimized and conducted massively parallel exon deletion and splice-site mutation screens in human cell lines to identify exons that regulate cellular fitness. Fitness-promoting exons are prevalent in essential and highly expressed genes and commonly overlap with protein domains and interaction interfaces. Conversely, fitness-suppressing exons are enriched in nonessential genes, exhibiting lower inclusion levels, and overlap with intrinsically disordered regions and disease-associated mutations. In-depth mechanistic investigation of the screen-hit TAF5 alternative exon-8 revealed that its inclusion is required for assembly of the TFIID general transcription initiation complex, thereby regulating global gene expression output. Collectively, our orthogonal exon perturbation screens established a comprehensive repository of phenotypically important exons and uncovered regulatory mechanisms governing cellular fitness and gene expression.
Collapse
Affiliation(s)
- Mei-Sheng Xiao
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Arun Prasath Damodaran
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA.
| | - Bandana Kumari
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Ethan Dickson
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Kun Xing
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Tyler A On
- Molecular Targets Program, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Nikhil Parab
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Helen E King
- EMBL Australia and Garvan Institute of Medical Research, Sydney, NSW 2010, Australia
| | - Alexendar R Perez
- Laboratory of Biochemistry and Molecular Biology, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD 20892, USA; Department of Anesthesia and Perioperative Care, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Wilfried M Guiblet
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Gerard Duncan
- Protein Characterization Laboratory, Frederick National Laboratory for Cancer Research (FNLCR), Frederick, MD 21701, USA
| | - Anney Che
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research (FNLCR), Frederick, MD 21701, USA
| | - Raj Chari
- Genome Modification Core, Frederick National Laboratory for Cancer Research (FNLCR), Frederick, MD 21702, USA
| | - Thorkell Andresson
- Protein Characterization Laboratory, Frederick National Laboratory for Cancer Research (FNLCR), Frederick, MD 21701, USA
| | - Joana A Vidigal
- Laboratory of Biochemistry and Molecular Biology, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD 20892, USA
| | - Robert J Weatheritt
- EMBL Australia and Garvan Institute of Medical Research, Sydney, NSW 2010, Australia; School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW 2010, Australia
| | - Michael Aregger
- Molecular Targets Program, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA.
| | - Thomas Gonatopoulos-Pournatzis
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA.
| |
Collapse
|
16
|
Adedeji EO, Beder T, Damiani C, Cappelli A, Accoti A, Tapanelli S, Ogunlana OO, Fatumo S, Favia G, Koenig R, Adebiyi E. Combination of computational techniques and RNAi reveal targets in Anopheles gambiae for malaria vector control. PLoS One 2024; 19:e0305207. [PMID: 38968330 PMCID: PMC11226046 DOI: 10.1371/journal.pone.0305207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 05/25/2024] [Indexed: 07/07/2024] Open
Abstract
Increasing reports of insecticide resistance continue to hamper the gains of vector control strategies in curbing malaria transmission. This makes identifying new insecticide targets or alternative vector control strategies necessary. CLassifier of Essentiality AcRoss EukaRyote (CLEARER), a leave-one-organism-out cross-validation machine learning classifier for essential genes, was used to predict essential genes in Anopheles gambiae and selected predicted genes experimentally validated. The CLEARER algorithm was trained on six model organisms: Caenorhabditis elegans, Drosophila melanogaster, Homo sapiens, Mus musculus, Saccharomyces cerevisiae and Schizosaccharomyces pombe, and employed to identify essential genes in An. gambiae. Of the 10,426 genes in An. gambiae, 1,946 genes (18.7%) were predicted to be Cellular Essential Genes (CEGs), 1716 (16.5%) to be Organism Essential Genes (OEGs), and 852 genes (8.2%) to be essential as both OEGs and CEGs. RNA interference (RNAi) was used to validate the top three highly expressed non-ribosomal predictions as probable vector control targets, by determining the effect of these genes on the survival of An. gambiae G3 mosquitoes. In addition, the effect of knockdown of arginase (AGAP008783) on Plasmodium berghei infection in mosquitoes was evaluated, an enzyme we computationally inferred earlier to be essential based on chokepoint analysis. Arginase and the top three genes, AGAP007406 (Elongation factor 1-alpha, Elf1), AGAP002076 (Heat shock 70kDa protein 1/8, HSP), AGAP009441 (Elongation factor 2, Elf2), had knockdown efficiencies of 91%, 75%, 63%, and 61%, respectively. While knockdown of HSP or Elf2 significantly reduced longevity of the mosquitoes (p<0.0001) compared to control groups, Elf1 or arginase knockdown had no effect on survival. However, arginase knockdown significantly reduced P. berghei oocytes counts in the midgut of mosquitoes when compared to LacZ-injected controls. The study reveals HSP and Elf2 as important contributors to mosquito survival and arginase as important for parasite development, hence placing them as possible targets for vector control.
Collapse
Affiliation(s)
- Eunice O. Adedeji
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
- Department of Biochemistry, Covenant University, Ota, Ogun State, Nigeria
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
- Department of Biology, University of York, York, United Kingdom
| | - Thomas Beder
- Medical Department II, Hematology and Oncology, University Medical Center Schleswig-Holstein, Kiel, Germany
- University Cancer Center Schleswig-Holstein, University Medical Center Schleswig-Holstein, Kiel and Lübeck, Germany
- Institute for Infectious Diseases and Infection Control (IIMK, RG Systemsbiology), Jena University Hospital, Jena, Germany
| | - Claudia Damiani
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
| | - Alessia Cappelli
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
| | - Anastasia Accoti
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
| | - Sofia Tapanelli
- Department of Life Sciences, Imperial College, London, United Kingdom
| | - Olubanke O. Ogunlana
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
- Department of Biochemistry, Covenant University, Ota, Ogun State, Nigeria
- African Center of Excellence in Bioinformatics & Data Intensive Science, Makerere University, Kampala, Uganda
| | - Segun Fatumo
- Department of Non-Communicable Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Guido Favia
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
| | - Rainer Koenig
- Institute for Infectious Diseases and Infection Control (IIMK, RG Systemsbiology), Jena University Hospital, Jena, Germany
| | - Ezekiel Adebiyi
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
- African Center of Excellence in Bioinformatics & Data Intensive Science, Makerere University, Kampala, Uganda
- Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
17
|
Chen XR, Cui YZ, Li BZ, Yuan YJ. Genome engineering on size reduction and complexity simplification: A review. J Adv Res 2024; 60:159-171. [PMID: 37442424 PMCID: PMC11156615 DOI: 10.1016/j.jare.2023.07.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 06/25/2023] [Accepted: 07/10/2023] [Indexed: 07/15/2023] Open
Abstract
BACKGROUND Genome simplification is an important topic in the field of life sciences that has attracted attention from its conception to the present day. It can help uncover the essential components of the genome and, in turn, shed light on the underlying operating principles of complex biological systems. This has made it a central focus of both basic and applied research in the life sciences. With the recent advancements in related technologies and our increasing knowledge of the genome, now is an opportune time to delve into this topic. AIM OF REVIEW Our review investigates the progress of genome simplification from two perspectives: genome size reduction and complexity simplification. In addition, we provide insights into the future development trends of genome simplification. KEY SCIENTIFIC CONCEPTS OF REVIEW Reducing genome size requires eliminating non-essential elements as much as possible. This process has been facilitated by advances in genome manipulation and synthesis techniques. However, we still need a better and clearer understanding of living systems to reduce genome complexity. As there is a lack of quantitative and clearly defined standards for this task, we have opted to approach the topic from various perspectives and present our findings accordingly.
Collapse
Affiliation(s)
- Xiang-Rong Chen
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; Frontiers Research Institute for Synthetic Biology, Tianjin University, Tianjin, China
| | - You-Zhi Cui
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; Frontiers Research Institute for Synthetic Biology, Tianjin University, Tianjin, China
| | - Bing-Zhi Li
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; Frontiers Research Institute for Synthetic Biology, Tianjin University, Tianjin, China.
| | - Ying-Jin Yuan
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; Frontiers Research Institute for Synthetic Biology, Tianjin University, Tianjin, China
| |
Collapse
|
18
|
James C, Trevisan-Herraz M, Juan D, Rico D. Evolutionary analysis of gene ages across TADs associates chromatin topology with whole-genome duplications. Cell Rep 2024; 43:113895. [PMID: 38517894 DOI: 10.1016/j.celrep.2024.113895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Revised: 11/03/2023] [Accepted: 02/16/2024] [Indexed: 03/24/2024] Open
Abstract
Topologically associated domains (TADs) are interaction subnetworks of chromosomal regions in 3D genomes. TAD boundaries frequently coincide with genome breaks while boundary deletion is under negative selection, suggesting that TADs may facilitate genome rearrangements and evolution. We show that genes co-localize by evolutionary age in humans and mice, resulting in TADs having different proportions of younger and older genes. We observe a major transition in the age co-localization patterns between the genes born during vertebrate whole-genome duplications (WGDs) or before and those born afterward. We also find that genes recently duplicated in primates and rodents are more frequently essential when they are located in old-enriched TADs and interact with genes that last duplicated during the WGD. Therefore, the evolutionary relevance of recent genes may increase when located in TADs with established regulatory networks. Our data suggest that TADs could play a role in organizing ancestral functions and evolutionary novelty.
Collapse
Affiliation(s)
- Caelinn James
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Scotland's Rural College (SRUC), The Roslin Institute Building, Easter Bush, Midlothian, UK
| | - Marco Trevisan-Herraz
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Translational and Clinical Research Institute, Newcastle University, Newcastle Upon Tyne, UK
| | - David Juan
- Institut de Biologia Evolutiva, Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain; Systems Biology Department, Spanish National Centre for Biotechnology (CNB-CSIC), Madrid, Spain
| | - Daniel Rico
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Centro Andaluz de Biología Molecular y Medicina Regenerativa (CABIMER), CSIC-Universidad de Sevilla-Universidad Pablo de Olavide-Junta de Andalucía, Seville, Spain.
| |
Collapse
|
19
|
Dalhat MH, Narayan S, Serio H, Arango D. Dissecting the oncogenic properties of essential RNA-modifying enzymes: a focus on NAT10. Oncogene 2024; 43:1077-1086. [PMID: 38409550 PMCID: PMC11092965 DOI: 10.1038/s41388-024-02975-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 02/05/2024] [Accepted: 02/07/2024] [Indexed: 02/28/2024]
Abstract
Chemical modifications of ribonucleotides significantly alter the physicochemical properties and functions of RNA. Initially perceived as static and essential marks in ribosomal RNA (rRNA) and transfer RNA (tRNA), recent discoveries unveiled a dynamic landscape of RNA modifications in messenger RNA (mRNA) and other regulatory RNAs. These findings spurred extensive efforts to map the distribution and function of RNA modifications, aiming to elucidate their distribution and functional significance in normal cellular homeostasis and pathological states. Significant dysregulation of RNA modifications is extensively documented in cancers, accentuating the potential of RNA-modifying enzymes as therapeutic targets. However, the essential role of several RNA-modifying enzymes in normal physiological functions raises concerns about potential side effects. A notable example is N-acetyltransferase 10 (NAT10), which is responsible for acetylating cytidines in RNA. While emerging evidence positions NAT10 as an oncogenic factor and a potential target in various cancer types, its essential role in normal cellular processes complicates the development of targeted therapies. This review aims to comprehensively analyze the essential and oncogenic properties of NAT10. We discuss its crucial role in normal cell biology and aging alongside its contribution to cancer development and progression. We advocate for agnostic approaches to disentangling the intertwined essential and oncogenic functions of RNA-modifying enzymes. Such approaches are crucial for understanding the full spectrum of RNA-modifying enzymes and imperative for designing effective and safe therapeutic strategies.
Collapse
Affiliation(s)
- Mahmood H Dalhat
- Department of Pharmacology, Northwestern University, Chicago, IL, USA
| | - Sharath Narayan
- Department of Pharmacology, Northwestern University, Chicago, IL, USA
- Driskill Graduate Program in Life Sciences, Northwestern University, Chicago, IL, USA
| | - Hannah Serio
- Department of Pharmacology, Northwestern University, Chicago, IL, USA
| | - Daniel Arango
- Department of Pharmacology, Northwestern University, Chicago, IL, USA.
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL, USA.
| |
Collapse
|
20
|
Jerabek T, Weiß L, Fahrion H, Zeh N, Raab N, Lindner B, Fischer S, Otte K. In pursuit of a minimal CHO genome: Establishment of large-scale genome deletions. N Biotechnol 2024; 79:100-110. [PMID: 38154614 DOI: 10.1016/j.nbt.2023.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 11/27/2023] [Accepted: 12/24/2023] [Indexed: 12/30/2023]
Abstract
Chinese hamster ovary (CHO) cells are the most commonly used mammalian cell line for the production of complex therapeutic glycoproteins. As CHO cells have evolved as part of a multicellular organism, they harbor many cellular functions irrelevant for their application as production hosts in industrial bioprocesses. Consequently, CHO cells have been the target for numerous genetic engineering efforts in the past, but a tailored host cell chassis holistically optimized for its specific task in a bioreactor is still missing. While the concept of genome reduction has already been successfully applied to bacterial production cells, attempts to create higher eukaryotic production hosts exhibiting reduced genomes have not been reported yet. Here, we present the establishment and application of a large-scale genome deletion strategy for targeted excision of large genomic regions in CHO cells. We demonstrate the feasibility of genome reduction in CHO cells using optimized CRISPR/Cas9 based experimental protocols targeting large non-essential genomic regions with high efficiency. Achieved genome deletions of non-essential genetic regions did not introduce negative effects on bioprocess relevant parameters, although we conducted the largest reported genomic excision of 864 kilobase pairs in CHO cells so far. The concept presented serves as a directive to accelerate the development of a significantly genome-reduced CHO host cell chassis paving the way for a next generation of CHO cell factories.
Collapse
Affiliation(s)
- Tobias Jerabek
- University of Applied Sciences Biberach, Institute of Applied Biotechnology, Biberach, Germany.
| | - Linus Weiß
- University of Applied Sciences Biberach, Institute of Applied Biotechnology, Biberach, Germany
| | - Hannah Fahrion
- University of Applied Sciences Biberach, Institute of Applied Biotechnology, Biberach, Germany
| | - Nikolas Zeh
- University of Applied Sciences Biberach, Institute of Applied Biotechnology, Biberach, Germany; Boehringer Ingelheim Pharma GmbH & Co KG, Bioprocess Development Biologicals, Cell Line Development, Biberach, Germany
| | - Nadja Raab
- University of Applied Sciences Biberach, Institute of Applied Biotechnology, Biberach, Germany
| | - Benjamin Lindner
- Boehringer Ingelheim Pharma GmbH & Co KG, Bioprocess Development Biologicals, Cell Line Development, Biberach, Germany
| | - Simon Fischer
- Boehringer Ingelheim Pharma GmbH & Co KG, Bioprocess Development Biologicals, Cell Line Development, Biberach, Germany
| | - Kerstin Otte
- University of Applied Sciences Biberach, Institute of Applied Biotechnology, Biberach, Germany
| |
Collapse
|
21
|
Teyssonniere EM, Shichino Y, Mito M, Friedrich A, Iwasaki S, Schacherer J. Translation variation across genetic backgrounds reveals a post-transcriptional buffering signature in yeast. Nucleic Acids Res 2024; 52:2434-2445. [PMID: 38261993 PMCID: PMC10954453 DOI: 10.1093/nar/gkae030] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 12/21/2023] [Accepted: 01/11/2024] [Indexed: 01/25/2024] Open
Abstract
Gene expression is known to vary among individuals, and this variability can impact the phenotypic diversity observed in natural populations. While the transcriptome and proteome have been extensively studied, little is known about the translation process itself. Here, we therefore performed ribosome and transcriptomic profiling on a genetically and ecologically diverse set of natural isolates of the Saccharomyces cerevisiae yeast. Interestingly, we found that the Euclidean distances between each profile and the expression fold changes in each pairwise isolate comparison were higher at the transcriptomic level. This observation clearly indicates that the transcriptional variation observed in the different isolates is buffered through a phenomenon known as post-transcriptional buffering at the translation level. Furthermore, this phenomenon seemed to have a specific signature by preferentially affecting essential genes as well as genes involved in complex-forming proteins, and low transcribed genes. We also explored the translation of the S. cerevisiae pangenome and found that the accessory genes related to introgression events displayed similar transcription and translation levels as the core genome. By contrast, genes acquired through horizontal gene transfer events tended to be less efficiently translated. Together, our results highlight both the extent and signature of the post-transcriptional buffering.
Collapse
Affiliation(s)
| | - Yuichi Shichino
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama 351-0198, Japan
| | - Mari Mito
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama 351-0198, Japan
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France
| | - Shintaro Iwasaki
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama 351-0198, Japan
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8561, Japan
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France
- Institut Universitaire de France (IUF), Paris, France
| |
Collapse
|
22
|
Liang Y, Luo H, Lin Y, Gao F. Recent advances in the characterization of essential genes and development of a database of essential genes. IMETA 2024; 3:e157. [PMID: 38868518 PMCID: PMC10989110 DOI: 10.1002/imt2.157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 10/09/2023] [Indexed: 06/14/2024]
Abstract
Over the past few decades, there has been a significant interest in the study of essential genes, which are crucial for the survival of an organism under specific environmental conditions and thus have practical applications in the fields of synthetic biology and medicine. An increasing amount of experimental data on essential genes has been obtained with the continuous development of technological methods. Meanwhile, various computational prediction methods, related databases and web servers have emerged accordingly. To facilitate the study of essential genes, we have established a database of essential genes (DEG), which has become popular with continuous updates to facilitate essential gene feature analysis and prediction, drug and vaccine development, as well as artificial genome design and construction. In this article, we summarized the studies of essential genes, overviewed the relevant databases, and discussed their practical applications. Furthermore, we provided an overview of the main applications of DEG and conducted comprehensive analyses based on its latest version. However, it should be noted that the essential gene is a dynamic concept instead of a binary one, which presents both opportunities and challenges for their future development.
Collapse
Affiliation(s)
| | - Hao Luo
- Department of PhysicsTianjin UniversityTianjinChina
| | - Yan Lin
- Department of PhysicsTianjin UniversityTianjinChina
| | - Feng Gao
- Department of PhysicsTianjin UniversityTianjinChina
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education)Tianjin UniversityTianjinChina
- SynBio Research PlatformCollaborative Innovation Center of Chemical Science and Engineering (Tianjin)TianjinChina
| |
Collapse
|
23
|
Couce A, Limdi A, Magnan M, Owen SV, Herren CM, Lenski RE, Tenaillon O, Baym M. Changing fitness effects of mutations through long-term bacterial evolution. Science 2024; 383:eadd1417. [PMID: 38271521 DOI: 10.1126/science.add1417] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 12/12/2023] [Indexed: 01/27/2024]
Abstract
The distribution of fitness effects of new mutations shapes evolution, but it is challenging to observe how it changes as organisms adapt. Using Escherichia coli lineages spanning 50,000 generations of evolution, we quantify the fitness effects of insertion mutations in every gene. Macroscopically, the fraction of deleterious mutations changed little over time whereas the beneficial tail declined sharply, approaching an exponential distribution. Microscopically, changes in individual gene essentiality and deleterious effects often occurred in parallel; altered essentiality is only partly explained by structural variation. The identity and effect sizes of beneficial mutations changed rapidly over time, but many targets of selection remained predictable because of the importance of loss-of-function mutations. Taken together, these results reveal the dynamic-but statistically predictable-nature of mutational fitness effects.
Collapse
Affiliation(s)
- Alejandro Couce
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, F-75018 Paris, France
- Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM), 28223 Madrid, Spain
| | - Anurag Limdi
- Department of Biomedical Informatics, and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| | - Melanie Magnan
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, F-75018 Paris, France
| | - Siân V Owen
- Department of Biomedical Informatics, and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| | - Cristina M Herren
- Department of Biomedical Informatics, and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
- Department of Marine and Environmental Sciences, Northeastern University, Boston, MA 02115, USA
| | - Richard E Lenski
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824, USA
- Program in Ecology, Evolution, and Behavior, Michigan State University, East Lansing, MI 48824, USA
| | - Olivier Tenaillon
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, F-75018 Paris, France
- Université Paris Cité, Inserm, Institut Cochin, F-75014 Paris, France
| | - Michael Baym
- Department of Biomedical Informatics, and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
24
|
Hu W, Li M, Xiao H, Guan L. Essential genes identification model based on sequence feature map and graph convolutional neural network. BMC Genomics 2024; 25:47. [PMID: 38200437 PMCID: PMC10777564 DOI: 10.1186/s12864-024-09958-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 01/01/2024] [Indexed: 01/12/2024] Open
Abstract
BACKGROUND Essential genes encode functions that play a vital role in the life activities of organisms, encompassing growth, development, immune system functioning, and cell structure maintenance. Conventional experimental techniques for identifying essential genes are resource-intensive and time-consuming, and the accuracy of current machine learning models needs further enhancement. Therefore, it is crucial to develop a robust computational model to accurately predict essential genes. RESULTS In this study, we introduce GCNN-SFM, a computational model for identifying essential genes in organisms, based on graph convolutional neural networks (GCNN). GCNN-SFM integrates a graph convolutional layer, a convolutional layer, and a fully connected layer to model and extract features from gene sequences of essential genes. Initially, the gene sequence is transformed into a feature map using coding techniques. Subsequently, a multi-layer GCN is employed to perform graph convolution operations, effectively capturing both local and global features of the gene sequence. Further feature extraction is performed, followed by integrating convolution and fully-connected layers to generate prediction results for essential genes. The gradient descent algorithm is utilized to iteratively update the cross-entropy loss function, thereby enhancing the accuracy of the prediction results. Meanwhile, model parameters are tuned to determine the optimal parameter combination that yields the best prediction performance during training. CONCLUSIONS Experimental evaluation demonstrates that GCNN-SFM surpasses various advanced essential gene prediction models and achieves an average accuracy of 94.53%. This study presents a novel and effective approach for identifying essential genes, which has significant implications for biology and genomics research.
Collapse
Affiliation(s)
- Wenxing Hu
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China
| | - Mengshan Li
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China.
| | - Haiyang Xiao
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China
| | - Lixin Guan
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China
| |
Collapse
|
25
|
Pons C, van Leeuwen J. Meta-analysis of dispensable essential genes and their interactions with bypass suppressors. Life Sci Alliance 2024; 7:e202302192. [PMID: 37918966 PMCID: PMC10622647 DOI: 10.26508/lsa.202302192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 10/24/2023] [Accepted: 10/25/2023] [Indexed: 11/04/2023] Open
Abstract
Genes have been historically classified as essential or non-essential based on their requirement for viability. However, genomic mutations can sometimes bypass the requirement for an essential gene, challenging the binary classification of gene essentiality. Such dispensable essential genes represent a valuable model for understanding the incomplete penetrance of loss-of-function mutations often observed in natural populations. Here, we compiled data from multiple studies on essential gene dispensability in Saccharomyces cerevisiae to comprehensively characterize these genes. In analyses spanning different evolutionary timescales, dispensable essential genes exhibited distinct phylogenetic properties compared with other essential and non-essential genes. Integration of interactions with suppressor genes that can bypass the gene essentiality revealed the high functional modularity of the bypass suppression network. Furthermore, dispensable essential and bypass suppressor gene pairs reflected simultaneous changes in the mutational landscape of S. cerevisiae strains. Importantly, species in which dispensable essential genes were non-essential tended to carry bypass suppressor mutations in their genomes. Overall, our study offers a comprehensive view of dispensable essential genes and illustrates how their interactions with bypass suppressors reflect evolutionary outcomes.
Collapse
Affiliation(s)
- Carles Pons
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute for Science and Technology, Barcelona, Spain
| | - Jolanda van Leeuwen
- Center for Integrative Genomics, Bâtiment Génopode, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
26
|
Boob AG, Chen J, Zhao H. Enabling pathway design by multiplex experimentation and machine learning. Metab Eng 2024; 81:70-87. [PMID: 38040110 DOI: 10.1016/j.ymben.2023.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/01/2023] [Accepted: 11/25/2023] [Indexed: 12/03/2023]
Abstract
The remarkable metabolic diversity observed in nature has provided a foundation for sustainable production of a wide array of valuable molecules. However, transferring the biosynthetic pathway to the desired host often runs into inherent failures that arise from intermediate accumulation and reduced flux resulting from competing pathways within the host cell. Moreover, the conventional trial and error methods utilized in pathway optimization struggle to fully grasp the intricacies of installed pathways, leading to time-consuming and labor-intensive experiments, ultimately resulting in suboptimal yields. Considering these obstacles, there is a pressing need to explore the enzyme expression landscape and identify the optimal pathway configuration for enhanced production of molecules. This review delves into recent advancements in pathway engineering, with a focus on multiplex experimentation and machine learning techniques. These approaches play a pivotal role in overcoming the limitations of traditional methods, enabling exploration of a broader design space and increasing the likelihood of discovering optimal pathway configurations for enhanced production of molecules. We discuss several tools and strategies for pathway design, construction, and optimization for sustainable and cost-effective microbial production of molecules ranging from bulk to fine chemicals. We also highlight major successes in academia and industry through compelling case studies.
Collapse
Affiliation(s)
- Aashutosh Girish Boob
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Junyu Chen
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Huimin Zhao
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.
| |
Collapse
|
27
|
Freischem LJ, Oyarzún DA. A Machine Learning Approach for Predicting Essentiality of Metabolic Genes. Methods Mol Biol 2024; 2760:345-369. [PMID: 38468098 DOI: 10.1007/978-1-0716-3658-9_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
The identification of essential genes is a key challenge in systems and synthetic biology, particularly for engineering metabolic pathways that convert feedstocks into valuable products. Assessment of gene essentiality at a genome scale requires large and costly growth assays of knockout strains. Here we describe a strategy to predict the essentiality of metabolic genes using binary classification algorithms. The approach combines elements from genome-scale metabolic models, directed graphs, and machine learning into a predictive model that can be trained on small knockout data. We demonstrate the efficacy of this approach using the most complete metabolic model of Escherichia coli and various machine learning algorithms for binary classification.
Collapse
Affiliation(s)
| | - Diego A Oyarzún
- School of Informatics, University of Edinburgh, Edinburgh, UK.
- School of Biological Sciences, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
28
|
Giordano M, Falbo E, Maddalena L, Piccirillo M, Granata I. Untangling the Context-Specificity of Essential Genes by Means of Machine Learning: A Constructive Experience. Biomolecules 2023; 14:18. [PMID: 38254618 PMCID: PMC10813179 DOI: 10.3390/biom14010018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 11/29/2023] [Accepted: 12/20/2023] [Indexed: 01/24/2024] Open
Abstract
Gene essentiality is a genetic concept crucial for a comprehensive understanding of life and evolution. In the last decade, many essential genes (EGs) have been determined using different experimental and computational approaches, and this information has been used to reduce the genomes of model organisms. A growing amount of evidence highlights that essentiality is a property that depends on the context. Because of their importance in vital biological processes, recognising context-specific EGs (csEGs) could help for identifying new potential pharmacological targets and to improve precision therapeutics. Since most of the computational procedures proposed to identify and predict EGs neglect their context-specificity, we focused on this aspect, providing a theoretical and experimental overview of the literature, data and computational methods dedicated to recognising csEGs. To this end, we adapted existing computational methods to exploit a specific context (the kidney tissue) and experimented with four different prediction methods using the labels provided by four different identification approaches. The considerations derived from the analysis of the obtained results, confirmed and validated also by further experiments for a different tissue context, provide the reader with guidance on exploiting existing tools for achieving csEGs identification and prediction.
Collapse
Affiliation(s)
- Maurizio Giordano
- Institute for High-Performance Computing and Networking (ICAR), National Research Council (CNR), V. Pietro Castellino 111, 80131 Naples, Italy; (E.F.); (L.M.); (M.P.); (I.G.)
| | | | | | | | | |
Collapse
|
29
|
Theuretzbacher U, Blasco B, Duffey M, Piddock LJV. Unrealized targets in the discovery of antibiotics for Gram-negative bacterial infections. Nat Rev Drug Discov 2023; 22:957-975. [PMID: 37833553 DOI: 10.1038/s41573-023-00791-6] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/15/2023] [Indexed: 10/15/2023]
Abstract
Advances in areas that include genomics, systems biology, protein structure determination and artificial intelligence provide new opportunities for target-based antibacterial drug discovery. The selection of a 'good' new target for direct-acting antibacterial compounds is the first decision, for which multiple criteria must be explored, integrated and re-evaluated as drug discovery programmes progress. Criteria include essentiality of the target for bacterial survival, its conservation across different strains of the same species, bacterial species and growth conditions (which determines the spectrum of activity of a potential antibiotic) and the level of homology with human genes (which influences the potential for selective inhibition). Additionally, a bacterial target should have the potential to bind to drug-like molecules, and its subcellular location will govern the need for inhibitors to penetrate one or two bacterial membranes, which is a key challenge in targeting Gram-negative bacteria. The risk of the emergence of target-based drug resistance for drugs with single targets also requires consideration. This Review describes promising but as-yet-unrealized targets for antibacterial drugs against Gram-negative bacteria and examples of cognate inhibitors, and highlights lessons learned from past drug discovery programmes.
Collapse
Affiliation(s)
| | - Benjamin Blasco
- Global Antibiotic Research and Development Partnership (GARDP), Geneva, Switzerland
| | - Maëlle Duffey
- Global Antibiotic Research and Development Partnership (GARDP), Geneva, Switzerland
| | - Laura J V Piddock
- Global Antibiotic Research and Development Partnership (GARDP), Geneva, Switzerland.
| |
Collapse
|
30
|
Ma J, Song J, Young ND, Chang BCH, Korhonen PK, Campos TL, Liu H, Gasser RB. 'Bingo'-a large language model- and graph neural network-based workflow for the prediction of essential genes from protein data. Brief Bioinform 2023; 25:bbad472. [PMID: 38152979 PMCID: PMC10753293 DOI: 10.1093/bib/bbad472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/22/2023] [Accepted: 11/28/2023] [Indexed: 12/29/2023] Open
Abstract
The identification and characterization of essential genes are central to our understanding of the core biological functions in eukaryotic organisms, and has important implications for the treatment of diseases caused by, for example, cancers and pathogens. Given the major constraints in testing the functions of genes of many organisms in the laboratory, due to the absence of in vitro cultures and/or gene perturbation assays for most metazoan species, there has been a need to develop in silico tools for the accurate prediction or inference of essential genes to underpin systems biological investigations. Major advances in machine learning approaches provide unprecedented opportunities to overcome these limitations and accelerate the discovery of essential genes on a genome-wide scale. Here, we developed and evaluated a large language model- and graph neural network (LLM-GNN)-based approach, called 'Bingo', to predict essential protein-coding genes in the metazoan model organisms Caenorhabditis elegans and Drosophila melanogaster as well as in Mus musculus and Homo sapiens (a HepG2 cell line) by integrating LLM and GNNs with adversarial training. Bingo predicts essential genes under two 'zero-shot' scenarios with transfer learning, showing promise to compensate for a lack of high-quality genomic and proteomic data for non-model organisms. In addition, the attention mechanisms and GNNExplainer were employed to manifest the functional sites and structural domain with most contribution to essentiality. In conclusion, Bingo provides the prospect of being able to accurately infer the essential genes of little- or under-studied organisms of interest, and provides a biological explanation for gene essentiality.
Collapse
Affiliation(s)
- Jiani Ma
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Jiangning Song
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Victoria 3800, Australia
| | - Neil D Young
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Bill C H Chang
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Pasi K Korhonen
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Tulio L Campos
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
- Bioinformatics Core Facility, Instituto Aggeu Magalhaes, Fundaçao Oswaldo Cruz (IAM-Fiocruz), Recife, Pernambuco, Brazil
| | - Hui Liu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Robin B Gasser
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| |
Collapse
|
31
|
Rivas-Marin E, Moyano-Palazuelo D, Henriques V, Merino E, Devos DP. Essential gene complement of Planctopirus limnophila from the bacterial phylum Planctomycetes. Nat Commun 2023; 14:7224. [PMID: 37940686 PMCID: PMC10632474 DOI: 10.1038/s41467-023-43096-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Accepted: 10/31/2023] [Indexed: 11/10/2023] Open
Abstract
Planctopirus limnophila belongs to the bacterial phylum Planctomycetes, a relatively understudied lineage with remarkable cell biology features. Here, we report a genome-wide analysis of essential gene content in P. limnophila. We show that certain genes involved in peptidoglycan synthesis or cell division, which are essential in most other studied bacteria, are not essential for growth under laboratory conditions in this species. We identify essential genes likely involved in lipopolysaccharide biosynthesis, consistent with the view of Planctomycetes as diderm bacteria, and highlight other essential genes of unknown functions. Furthermore, we explore potential stages of evolution of the essential gene repertoire in Planctomycetes and the related phyla Verrucomicrobia and Chlamydiae. Our results provide insights into the divergent molecular and cellular biology of Planctomycetes.
Collapse
Affiliation(s)
- Elena Rivas-Marin
- Centro Andaluz de Biología del Desarrollo, CSIC, Universidad Pablo de Olavide, Sevilla, Spain.
| | - David Moyano-Palazuelo
- Centro Andaluz de Biología del Desarrollo, CSIC, Universidad Pablo de Olavide, Sevilla, Spain
| | - Valentina Henriques
- Centro Andaluz de Biología del Desarrollo, CSIC, Universidad Pablo de Olavide, Sevilla, Spain
| | - Enrique Merino
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Damien P Devos
- Centro Andaluz de Biología del Desarrollo, CSIC, Universidad Pablo de Olavide, Sevilla, Spain.
- Institut Pasteur de Lille, Centre d'Infection et d'Immunité de Lille, University of Lille, Lille, France.
| |
Collapse
|
32
|
Wang L, Zheng Y, Sun Y, Mao S, Li H, Bo X, Li C, Chen H. TimeTalk uses single-cell RNA-seq datasets to decipher cell-cell communication during early embryo development. Commun Biol 2023; 6:901. [PMID: 37660148 PMCID: PMC10475079 DOI: 10.1038/s42003-023-05283-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 08/24/2023] [Indexed: 09/04/2023] Open
Abstract
Early embryonic development is a dynamic process that relies on proper cell-cell communication to form a correctly patterned embryo. Early embryo development-related ligand-receptor pairs (eLRs) have been shown to guide cell fate decisions and morphogenesis. However, the scope of eLRs and their influence on early embryo development remain elusive. Here, we developed a computational framework named TimeTalk from integrated public time-course mouse scRNA-seq datasets to decipher the secret of eLRs. Extensive validations and analyses were performed to ensure the involvement of identified eLRs in early embryo development. Process analysis identified that eLRs could be divided into six temporal windows corresponding to sequential events in the early embryo development process. With the interpolation strategy, TimeTalk is powerful in revealing paracrine settings and studying cell-cell communication during early embryo development. Furthermore, by using TimeTalk in the blastocyst and blastoid models, we found that the blastoid models share the core communication pathways with the epiblast and primitive endoderm lineages in the blastocysts. This result suggests that TimeTalk has transferability to other bio-dynamic processes. We also curated eLRs recognized by TimeTalk, which may provide valuable clues for understanding early embryo development and relevant disorders.
Collapse
Affiliation(s)
- Longteng Wang
- Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, School of Life Sciences, Peking University, Beijing, 100871, China
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
| | - Yang Zheng
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Yu Sun
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Shulin Mao
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
- Yuanpei College, Peking University, Beijing, 100871, China
| | - Hao Li
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Xiaochen Bo
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
| | - Cheng Li
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China.
| | - Hebing Chen
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China.
| |
Collapse
|
33
|
Cacheiro P, Smedley D. Essential genes: a cross-species perspective. Mamm Genome 2023; 34:357-363. [PMID: 36897351 PMCID: PMC10382395 DOI: 10.1007/s00335-023-09984-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 02/17/2023] [Indexed: 03/11/2023]
Abstract
Protein coding genes exhibit different degrees of intolerance to loss-of-function variation. The most intolerant genes, whose function is essential for cell or/and organism survival, inform on fundamental biological processes related to cell proliferation and organism development and provide a window on the molecular mechanisms of human disease. Here we present a brief overview of the resources and knowledge gathered around gene essentiality, from cancer cell lines to model organisms to human development. We outline the implications of using different sources of evidence and definitions to determine which genes are essential and highlight how information on the essentiality status of a gene can inform novel disease gene discovery and therapeutic target identification.
Collapse
Affiliation(s)
- Pilar Cacheiro
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Damian Smedley
- William Harvey Research Institute, Queen Mary University of London, London, UK.
| |
Collapse
|
34
|
Winkler KR, Mizrahi V, Warner DF, De Wet TJ. High-throughput functional genomics: A (myco)bacterial perspective. Mol Microbiol 2023; 120:141-158. [PMID: 37278255 PMCID: PMC10953053 DOI: 10.1111/mmi.15103] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 04/06/2023] [Accepted: 05/21/2023] [Indexed: 06/07/2023]
Abstract
Advances in sequencing technologies have enabled unprecedented insights into bacterial genome composition and dynamics. However, the disconnect between the rapid acquisition of genomic data and the (much slower) confirmation of inferred genetic function threatens to widen unless techniques for fast, high-throughput functional validation can be applied at scale. This applies equally to Mycobacterium tuberculosis, the leading infectious cause of death globally and a pathogen whose genome, despite being among the first to be sequenced two decades ago, still contains many genes of unknown function. Here, we summarize the evolution of bacterial high-throughput functional genomics, focusing primarily on transposon (Tn)-based mutagenesis and the construction of arrayed mutant libraries in diverse bacterial systems. We also consider the contributions of CRISPR interference as a transformative technique for probing bacterial gene function at scale. Throughout, we situate our analysis within the context of functional genomics of mycobacteria, focusing specifically on the potential to yield insights into M. tuberculosis pathogenicity and vulnerabilities for new drug and regimen development. Finally, we offer suggestions for future approaches that might be usefully applied in elucidating the complex cellular biology of this major human pathogen.
Collapse
Affiliation(s)
- Kristy R. Winkler
- Molecular Mycobacteriology Research Unit and DSI/NRF Centre of Excellence for Biomedical TB Research, Department of Pathology and Institute of Infectious Disease and Molecular MedicineUniversity of Cape TownRondeboschSouth Africa
| | - Valerie Mizrahi
- Molecular Mycobacteriology Research Unit and DSI/NRF Centre of Excellence for Biomedical TB Research, Department of Pathology and Institute of Infectious Disease and Molecular MedicineUniversity of Cape TownRondeboschSouth Africa
- Wellcome Centre for Infectious Diseases Research in AfricaUniversity of Cape TownRondeboschSouth Africa
| | - Digby F. Warner
- Molecular Mycobacteriology Research Unit and DSI/NRF Centre of Excellence for Biomedical TB Research, Department of Pathology and Institute of Infectious Disease and Molecular MedicineUniversity of Cape TownRondeboschSouth Africa
- Wellcome Centre for Infectious Diseases Research in AfricaUniversity of Cape TownRondeboschSouth Africa
| | - Timothy J. De Wet
- Molecular Mycobacteriology Research Unit and DSI/NRF Centre of Excellence for Biomedical TB Research, Department of Pathology and Institute of Infectious Disease and Molecular MedicineUniversity of Cape TownRondeboschSouth Africa
- Wellcome Centre for Infectious Diseases Research in AfricaUniversity of Cape TownRondeboschSouth Africa
- Department of Integrative Biomedical SciencesUniversity of Cape TownRondeboschSouth Africa
| |
Collapse
|
35
|
Fong WY, Canals R, Predeus AV, Perez-Sepulveda B, Wenner N, Lacharme-Lora L, Feasey N, Wigley P, Hinton JCD. Genome-wide fitness analysis identifies genes required for in vitro growth and macrophage infection by African and global epidemic pathovariants of Salmonella enterica Enteritidis. Microb Genom 2023; 9:mgen001017. [PMID: 37219927 PMCID: PMC10272866 DOI: 10.1099/mgen.0.001017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 03/17/2023] [Indexed: 05/24/2023] Open
Abstract
Salmonella enterica Enteritidis is the second most common serovar associated with invasive non-typhoidal Salmonella (iNTS) disease in sub-Saharan Africa. Previously, genomic and phylogenetic characterization of S . enterica Enteritidis isolates from the human bloodstream led to the discovery of the Central/Eastern African clade (CEAC) and West African clade, which were distinct from the gastroenteritis-associated global epidemic clade (GEC). The African S . enterica Enteritidis clades have unique genetic signatures that include genomic degradation, novel prophage repertoires and multi-drug resistance, but the molecular basis for the enhanced propensity of African S . enterica Enteritidis to cause bloodstream infection is poorly understood. We used transposon insertion sequencing (TIS) to identify the genetic determinants of the GEC representative strain P125109 and the CEAC representative strain D7795 for growth in three in vitro conditions (LB or minimal NonSPI2 and InSPI2 growth media), and for survival and replication in RAW 264.7 murine macrophages. We identified 207 in vitro -required genes that were common to both S . enterica Enteritidis strains and also required by S . enterica Typhimurium, S . enterica Typhi and Escherichia coli , and 63 genes that were only required by individual S . enterica Enteritidis strains. Similar types of genes were required by both P125109 and D7795 for optimal growth in particular media. Screening the transposon libraries during macrophage infection identified 177 P125109 and 201 D7795 genes that contribute to bacterial survival and replication in mammalian cells. The majority of these genes have proven roles in Salmonella virulence. Our analysis uncovered candidate strain-specific macrophage fitness genes that could encode novel Salmonella virulence factors.
Collapse
Affiliation(s)
- Wai Yee Fong
- Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
- Present address: Department of Laboratory Medicine and Pathology, School of Medicine, University of Washington, Seattle, USA
| | - Rocío Canals
- Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
- Present address: GSK Vaccines Institute for Global Health S.R.L., Siena, Italy
| | - Alexander V. Predeus
- Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
- Present address: Wellcome Trust Sanger Institute, Cambridge, UK
| | - Blanca Perez-Sepulveda
- Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Nicolas Wenner
- Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
- Present address: Biozentrum, University of Basel, Basel, Switzerland
| | - Lizeth Lacharme-Lora
- Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Nicholas Feasey
- Department of Clinical Sciences, Liverpool School of Tropical Medicine, Liverpool, UK
- Malawi-Liverpool-Wellcome Research Programme, Kamuzu University of Health Sciences, Blantyre, Malawi
| | - Paul Wigley
- Infection Biology and Microbiomes, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Neston, UK
- Present address: Bristol Veterinary School,University of Bristol, Langford Campus, UK
| | - Jay C. D. Hinton
- Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| |
Collapse
|
36
|
Fontana B, Gallerani G, Salamon I, Pace I, Roncarati R, Ferracin M. ARID1A in cancer: Friend or foe? Front Oncol 2023; 13:1136248. [PMID: 36890819 PMCID: PMC9987588 DOI: 10.3389/fonc.2023.1136248] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Accepted: 02/06/2023] [Indexed: 02/22/2023] Open
Abstract
ARID1A belongs to a class of chromatin regulatory proteins that function by maintaining accessibility at most promoters and enhancers, thereby regulating gene expression. The high frequency of ARID1A alterations in human cancers has highlighted its significance in tumorigenesis. The precise role of ARID1A in cancer is highly variable since ARID1A alterations can have a tumor suppressive or oncogenic role, depending on the tumor type and context. ARID1A is mutated in about 10% of all tumor types including endometrial, bladder, gastric, liver, biliopancreatic cancer, some ovarian cancer subtypes, and the extremely aggressive cancers of unknown primary. Its loss is generally associated with disease progression more often than onset. In some cancers, ARID1A loss is associated with worse prognostic features, thus supporting a major tumor suppressive role. However, some exceptions have been reported. Thus, the association of ARID1A genetic alterations with patient prognosis is controversial. However, ARID1A loss of function is considered conducive for the use of inhibitory drugs which are based on synthetic lethality mechanisms. In this review we summarize the current knowledge on the role of ARID1A as tumor suppressor or oncogene in different tumor types and discuss the strategies for treating ARID1A mutated cancers.
Collapse
Affiliation(s)
- Beatrice Fontana
- Department of Medical and Surgical Sciences (DIMEC), University of Bologna, Bologna, Italy
| | - Giulia Gallerani
- Department of Medical and Surgical Sciences (DIMEC), University of Bologna, Bologna, Italy
| | - Irene Salamon
- Department of Medical and Surgical Sciences (DIMEC), University of Bologna, Bologna, Italy
| | - Ilaria Pace
- Department of Medical and Surgical Sciences (DIMEC), University of Bologna, Bologna, Italy
| | - Roberta Roncarati
- Istituto di Genetica Molecolare ”Luigi Luca Cavalli-Sforza“ – Consiglio Nazionale delle Ricerce (CNR), Bologna, Italy
| | - Manuela Ferracin
- Department of Medical and Surgical Sciences (DIMEC), University of Bologna, Bologna, Italy
- IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| |
Collapse
|
37
|
Kimmel J, Schmitt M, Sinner A, Jansen PWTC, Mainye S, Ramón-Zamorano G, Toenhake CG, Wichers-Misterek JS, Cronshagen J, Sabitzki R, Mesén-Ramírez P, Behrens HM, Bártfai R, Spielmann T. Gene-by-gene screen of the unknown proteins encoded on Plasmodium falciparum chromosome 3. Cell Syst 2023; 14:9-23.e7. [PMID: 36657393 DOI: 10.1016/j.cels.2022.12.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 10/07/2022] [Accepted: 12/08/2022] [Indexed: 01/19/2023]
Abstract
Taxon-specific proteins are key determinants defining the biology of all organisms and represent prime drug targets in pathogens. However, lacking comparability with proteins in other lineages makes them particularly difficult to study. In malaria parasites, this is exacerbated by technical limitations. Here, we analyzed the cellular location, essentiality, function, and, in selected cases, interactome of all unknown non-secretory proteins encoded on an entire P. falciparum chromosome. The nucleus was the most common localization, indicating that it is a hotspot of parasite-specific biology. More in-depth functional studies with four proteins revealed essential roles in DNA replication and mitosis. The mitosis proteins defined a possible orphan complex and a highly diverged complex needed for spindle-kinetochore connection. Structure-function comparisons indicated that the taxon-specific proteins evolved by different mechanisms. This work demonstrates the feasibility of gene-by-gene screens to elucidate the biology of malaria parasites and reveal critical parasite-specific processes of interest as drug targets.
Collapse
Affiliation(s)
- Jessica Kimmel
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | - Marius Schmitt
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | - Alexej Sinner
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | | | - Sheila Mainye
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | - Gala Ramón-Zamorano
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | - Christa Geeke Toenhake
- Department of Molecular Biology, Radboud Institute for Molecular Life Sciences, Radboud University, 6525 GA Nijmegen, the Netherlands
| | | | - Jakob Cronshagen
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | - Ricarda Sabitzki
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | - Paolo Mesén-Ramírez
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | - Hannah Michaela Behrens
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany
| | - Richárd Bártfai
- Department of Molecular Biology, Radboud Institute for Molecular Life Sciences, Radboud University, 6525 GA Nijmegen, the Netherlands
| | - Tobias Spielmann
- Bernhard Nocht Institute for Tropical Medicine, Bernhard Nocht Str. 74, 20359 Hamburg, Germany.
| |
Collapse
|
38
|
Glazenburg MM, Laan L. Complexity and self-organization in the evolution of cell polarization. J Cell Sci 2023; 136:jcs259639. [PMID: 36691920 DOI: 10.1242/jcs.259639] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Cellular life exhibits order and complexity, which typically increase over the course of evolution. Cell polarization is a well-studied example of an ordering process that breaks the internal symmetry of a cell by establishing a preferential axis. Like many cellular processes, polarization is driven by self-organization, meaning that the macroscopic pattern emerges as a consequence of microscopic molecular interactions at the biophysical level. However, the role of self-organization in the evolution of complex protein networks remains obscure. In this Review, we provide an overview of the evolution of polarization as a self-organizing process, focusing on the model species Saccharomyces cerevisiae and its fungal relatives. Moreover, we use this model system to discuss how self-organization might relate to evolutionary change, offering a shift in perspective on evolution at the microscopic scale.
Collapse
Affiliation(s)
- Marieke M Glazenburg
- Department of Bionanoscience, Kavli Institute of Nanoscience, Faculty of Applied Sciences, Delft University of Technology, 2629 HZ Delft, The Netherlands
| | - Liedewij Laan
- Department of Bionanoscience, Kavli Institute of Nanoscience, Faculty of Applied Sciences, Delft University of Technology, 2629 HZ Delft, The Netherlands
| |
Collapse
|
39
|
Li Y, Zeng M, Zhang F, Wu FX, Li M. DeepCellEss: cell line-specific essential protein prediction with attention-based interpretable deep learning. Bioinformatics 2023; 39:btac779. [PMID: 36458923 PMCID: PMC9825760 DOI: 10.1093/bioinformatics/btac779] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 11/25/2022] [Accepted: 12/01/2022] [Indexed: 12/05/2022] Open
Abstract
MOTIVATION Protein essentiality is usually accepted to be a conditional trait and strongly affected by cellular environments. However, existing computational methods often do not take such characteristics into account, preferring to incorporate all available data and train a general model for all cell lines. In addition, the lack of model interpretability limits further exploration and analysis of essential protein predictions. RESULTS In this study, we proposed DeepCellEss, a sequence-based interpretable deep learning framework for cell line-specific essential protein predictions. DeepCellEss utilizes a convolutional neural network and bidirectional long short-term memory to learn short- and long-range latent information from protein sequences. Further, a multi-head self-attention mechanism is used to provide residue-level model interpretability. For model construction, we collected extremely large-scale benchmark datasets across 323 cell lines. Extensive computational experiments demonstrate that DeepCellEss yields effective prediction performance for different cell lines and outperforms existing sequence-based methods as well as network-based centrality measures. Finally, we conducted some case studies to illustrate the necessity of considering specific cell lines and the superiority of DeepCellEss. We believe that DeepCellEss can serve as a useful tool for predicting essential proteins across different cell lines. AVAILABILITY AND IMPLEMENTATION The DeepCellEss web server is available at http://csuligroup.com:8000/DeepCellEss. The source code and data underlying this study can be obtained from https://github.com/CSUBioGroup/DeepCellEss. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yiming Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Min Zeng
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Fuhao Zhang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Fang-Xiang Wu
- Division of Biomedical Engineering, Department of Computer Science, Department of Mechanical Engineering University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| |
Collapse
|
40
|
Saxena P, Rauniyar S, Thakur P, Singh RN, Bomgni A, Alaba MO, Tripathi AK, Gnimpieba EZ, Lushbough C, Sani RK. Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria. Front Microbiol 2023; 14:1086021. [PMID: 37125195 PMCID: PMC10133479 DOI: 10.3389/fmicb.2023.1086021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/23/2023] [Indexed: 05/02/2023] Open
Abstract
The growth and survival of an organism in a particular environment is highly depends on the certain indispensable genes, termed as essential genes. Sulfate-reducing bacteria (SRB) are obligate anaerobes which thrives on sulfate reduction for its energy requirements. The present study used Oleidesulfovibrio alaskensis G20 (OA G20) as a model SRB to categorize the essential genes based on their key metabolic pathways. Herein, we reported a feedback loop framework for gene of interest discovery, from bio-problem to gene set of interest, leveraging expert annotation with computational prediction. Defined bio-problem was applied to retrieve the genes of SRB from literature databases (PubMed, and PubMed Central) and annotated them to the genome of OA G20. Retrieved gene list was further used to enrich protein-protein interaction and was corroborated to the pangenome analysis, to categorize the enriched gene sets and the respective pathways under essential and non-essential. Interestingly, the sat gene (dde_2265) from the sulfur metabolism was the bridging gene between all the enriched pathways. Gene clusters involved in essential pathways were linked with the genes from seleno-compound metabolism, amino acid metabolism, secondary metabolite synthesis, and cofactor biosynthesis. Furthermore, pangenome analysis demonstrated the gene distribution, where 69.83% of the 116 enriched genes were mapped under "persistent," inferring the essentiality of these genes. Likewise, 21.55% of the enriched genes, which involves specially the formate dehydrogenases and metallic hydrogenases, appeared under "shell." Our methodology suggested that semi-automated text mining and network analysis may play a crucial role in deciphering the previously unexplored genes and key mechanisms which can help to generate a baseline prior to perform any experimental studies.
Collapse
Affiliation(s)
- Priya Saxena
- Department of Chemical and Biological Engineering, South Dakota School of Mines and Technology, Rapid City, SD, United States
- Data Driven Material Discovery Center for Bioengineering Innovation, South Dakota School of Mines and Technology, Rapid City, SD, United States
| | - Shailabh Rauniyar
- Department of Chemical and Biological Engineering, South Dakota School of Mines and Technology, Rapid City, SD, United States
- 2-Dimensional Materials for Biofilm Engineering, Science and Technology, South Dakota School of Mines and Technology, Rapid City, SD, United States
| | - Payal Thakur
- Department of Chemical and Biological Engineering, South Dakota School of Mines and Technology, Rapid City, SD, United States
- Data Driven Material Discovery Center for Bioengineering Innovation, South Dakota School of Mines and Technology, Rapid City, SD, United States
| | - Ram Nageena Singh
- Department of Chemical and Biological Engineering, South Dakota School of Mines and Technology, Rapid City, SD, United States
- 2-Dimensional Materials for Biofilm Engineering, Science and Technology, South Dakota School of Mines and Technology, Rapid City, SD, United States
| | - Alain Bomgni
- Department of Biomedical Engineering, University of South Dakota, Sioux Falls, SD, United States
| | - Mathew O. Alaba
- Department of Biomedical Engineering, University of South Dakota, Sioux Falls, SD, United States
| | - Abhilash Kumar Tripathi
- Department of Chemical and Biological Engineering, South Dakota School of Mines and Technology, Rapid City, SD, United States
- 2-Dimensional Materials for Biofilm Engineering, Science and Technology, South Dakota School of Mines and Technology, Rapid City, SD, United States
| | - Etienne Z. Gnimpieba
- Department of Biomedical Engineering, University of South Dakota, Sioux Falls, SD, United States
- *Correspondence: Etienne Z. Gnimpieba,
| | - Carol Lushbough
- Department of Biomedical Engineering, University of South Dakota, Sioux Falls, SD, United States
| | - Rajesh Kumar Sani
- Department of Chemical and Biological Engineering, South Dakota School of Mines and Technology, Rapid City, SD, United States
- Data Driven Material Discovery Center for Bioengineering Innovation, South Dakota School of Mines and Technology, Rapid City, SD, United States
- 2-Dimensional Materials for Biofilm Engineering, Science and Technology, South Dakota School of Mines and Technology, Rapid City, SD, United States
- BuG ReMeDEE Consortium, South Dakota School of Mines and Technology, Rapid City, SD, United States
- Rajesh Kumar Sani,
| |
Collapse
|
41
|
Zhou D, Huang G, Xu G, Xiang L, Huang S, Chen X, Zhang Y, Wang D. CRISPRi-Mediated Gene Suppression Reveals Putative Reverse Transcriptase Gene PA0715 to Be a Global Regulator of Pseudomonas aeruginosa. Infect Drug Resist 2022; 15:7577-7599. [PMID: 36579125 PMCID: PMC9792118 DOI: 10.2147/idr.s384980] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 11/09/2022] [Indexed: 12/24/2022] Open
Abstract
Purpose Pseudomonas aeruginosa is a common pathogen of infection in burn and trauma patients, and multi-drug resistant P. aeruginosa has become an increasingly important pathogen. Essential genes are key to the development of novel antibiotics. The PA0715 gene is a novel unidentified essential gene that has attracted our interest as a potential antibiotic target. Our study aims to determine the exact role of PA0715 in cell physiology and bacterial pathogenicity, providing important clues for antibiotic development. Patients and Methods The shuttle vector pHERD20T containing an arabinose inducible promoter was used to construct the CRISPRi system. Alterations in cellular physiology and bacterial pathogenicity of P. aeruginosa PAO1 after PA0715 inhibition were characterized. High-throughput RNA-seq was performed to gain more insight into the mechanisms by which PA0715 regulates the vital activity of P. aeruginosa. Results We found that down-regulation of PA0715 significantly reduced PAO1 growth rate, motility and chemotaxis, antibiotic resistance, pyocyanin and biofilm production. In addition, PA0715 inhibition reduced the pathogenicity of PAO1 to the greater galleria mellonella larvae. Transcriptional profiling identified 1757 genes including those related to amino acid, carbohydrate, ketone body and organic salt metabolism, whose expression was directly or indirectly controlled by PA0715. Unexpectedly, genes involved in oxidative phosphorylation also varied with PA0715 levels, and these findings support a hitherto unrecognized critical role for PA0715 in oxidative respiration in P. aeruginosa. Conclusion We identified PA0715 as a global regulator of the metabolic network that is indispensable for the survival and reproduction of P. aeruginosa. Our results provide a basis for future studies of potential antibiotic targets for P. aeruginosa and offer new ideas for P. aeruginosa infection control.
Collapse
Affiliation(s)
- Dapeng Zhou
- Department of Burns and Plastic Surgery, Affiliated Hospital of Zunyi Medical University, Zunyi, People’s Republic of China
- The Collaborative Innovation Center of Tissue Damage Repair and Regeneration Medicine of Zunyi Medical University, Zunyi, People’s Republic of China
| | - Guangtao Huang
- Department of Burns and Plastic Surgery, Affiliated Hospital of Zunyi Medical University, Zunyi, People’s Republic of China
- The Collaborative Innovation Center of Tissue Damage Repair and Regeneration Medicine of Zunyi Medical University, Zunyi, People’s Republic of China
- Department of Burn and Plastic Surgery, Department of Wound Repair, Shenzhen Institute of Translational Medicine, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People’s Hospital, Shenzhen, People’s Republic of China
- Department of Plastic and Reconstructive Surgery, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| | - Guangchao Xu
- Department of Burns and Plastic Surgery, Affiliated Hospital of Zunyi Medical University, Zunyi, People’s Republic of China
- The Collaborative Innovation Center of Tissue Damage Repair and Regeneration Medicine of Zunyi Medical University, Zunyi, People’s Republic of China
| | - Lijuan Xiang
- Department of Clinical Laboratory, Shenzhen Institute of Translational Medicine, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People’s Hospital, Shenzhen, People’s Republic of China
| | - Siyi Huang
- Department of Burns and Plastic Surgery, Affiliated Hospital of Zunyi Medical University, Zunyi, People’s Republic of China
- The Collaborative Innovation Center of Tissue Damage Repair and Regeneration Medicine of Zunyi Medical University, Zunyi, People’s Republic of China
| | - Xinchong Chen
- Department of Burns and Plastic Surgery, Affiliated Hospital of Zunyi Medical University, Zunyi, People’s Republic of China
- The Collaborative Innovation Center of Tissue Damage Repair and Regeneration Medicine of Zunyi Medical University, Zunyi, People’s Republic of China
| | - Yixin Zhang
- Department of Plastic and Reconstructive Surgery, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| | - Dali Wang
- Department of Burns and Plastic Surgery, Affiliated Hospital of Zunyi Medical University, Zunyi, People’s Republic of China
- The Collaborative Innovation Center of Tissue Damage Repair and Regeneration Medicine of Zunyi Medical University, Zunyi, People’s Republic of China
| |
Collapse
|
42
|
Hao C, Dewar AE, West SA, Ghoul M. Gene transferability and sociality do not correlate with gene connectivity. Proc Biol Sci 2022; 289:20221819. [PMID: 36448285 PMCID: PMC9709509 DOI: 10.1098/rspb.2022.1819] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
The connectivity of a gene, defined as the number of interactions a gene's product has with other genes' products, is a key characteristic of a gene. In prokaryotes, the complexity hypothesis predicts that genes which undergo more frequent horizontal transfer will be less connected than genes which are only very rarely transferred. We tested the role of horizontal gene transfer, and other potentially important factors, by examining the connectivity of chromosomal and plasmid genes, across 134 diverse prokaryotic species. We found that (i) genes on plasmids were less connected than genes on chromosomes; (ii) connectivity of plasmid genes was not correlated with plasmid mobility; and (iii) the sociality of genes (cooperative or private) was not correlated with gene connectivity.
Collapse
Affiliation(s)
- Chunhui Hao
- Department of Biology, University of Oxford, Oxford OX1 3SZ, UK
| | - Anna E. Dewar
- Department of Biology, University of Oxford, Oxford OX1 3SZ, UK
| | - Stuart A. West
- Department of Biology, University of Oxford, Oxford OX1 3SZ, UK
| | - Melanie Ghoul
- Department of Biology, University of Oxford, Oxford OX1 3SZ, UK
| |
Collapse
|
43
|
Li Y, Zeng M, Wu Y, Li Y, Li M. Accurate Prediction of Human Essential Proteins Using Ensemble Deep Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3263-3271. [PMID: 34699365 DOI: 10.1109/tcbb.2021.3122294] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Essential proteins are considered the foundation of life as they are indispensable for the survival of living organisms. Computational methods for essential protein discovery provide a fast way to identify essential proteins. But most of them heavily rely on various biological information, especially protein-protein interaction networks, which limits their practical applications. With the rapid development of high-throughput sequencing technology, sequencing data has become the most accessible biological data. However, using only protein sequence information to predict essential proteins has limited accuracy. In this paper, we propose EP-EDL, an ensemble deep learning model using only protein sequence information to predict human essential proteins. EP-EDL integrates multiple classifiers to alleviate the class imbalance problem and to improve prediction accuracy and robustness. In each base classifier, we employ multi-scale text convolutional neural networks to extract useful features from protein sequence feature matrices with evolutionary information. Our computational results show that EP-EDL outperforms the state-of-the-art sequence-based methods. Furthermore, EP-EDL provides a more practical and flexible way for biologists to accurately predict essential proteins. The source code and datasets can be downloaded from https://github.com/CSUBioGroup/EP-EDL.
Collapse
|
44
|
Rosconi F, Rudmann E, Li J, Surujon D, Anthony J, Frank M, Jones DS, Rock C, Rosch JW, Johnston CD, van Opijnen T. A bacterial pan-genome makes gene essentiality strain-dependent and evolvable. Nat Microbiol 2022; 7:1580-1592. [PMID: 36097170 PMCID: PMC9519441 DOI: 10.1038/s41564-022-01208-7] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 07/21/2022] [Indexed: 11/09/2022]
Abstract
Many bacterial species are represented by a pan-genome, whose genetic repertoire far outstrips that of any single bacterial genome. Here we investigate how a bacterial pan-genome might influence gene essentiality and whether essential genes that are initially critical for the survival of an organism can evolve to become non-essential. By using Transposon insertion sequencing (Tn-seq), whole-genome sequencing and RNA-seq on a set of 36 clinical Streptococcus pneumoniae strains representative of >68% of the species' pan-genome, we identify a species-wide 'essentialome' that can be subdivided into universal, core strain-specific and accessory essential genes. By employing 'forced-evolution experiments', we show that specific genetic changes allow bacteria to bypass essentiality. Moreover, by untangling several genetic mechanisms, we show that gene essentiality can be highly influenced by and/or be dependent on: (1) the composition of the accessory genome, (2) the accumulation of toxic intermediates, (3) functional redundancy, (4) efficient recycling of critical metabolites and (5) pathway rewiring. While this functional characterization underscores the evolvability potential of many essential genes, we also show that genes with differential essentiality remain important antimicrobial drug target candidates, as their inactivation almost always has a severe fitness cost in vivo.
Collapse
Affiliation(s)
| | - Emily Rudmann
- Biology Department, Boston College, Chestnut Hill, MA, USA
| | - Jien Li
- Biology Department, Boston College, Chestnut Hill, MA, USA
| | - Defne Surujon
- Biology Department, Boston College, Chestnut Hill, MA, USA
| | - Jon Anthony
- Biology Department, Boston College, Chestnut Hill, MA, USA
| | - Matthew Frank
- Department of Infectious Diseases, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Dakota S Jones
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Charles Rock
- Department of Infectious Diseases, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Jason W Rosch
- Department of Infectious Diseases, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Christopher D Johnston
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Tim van Opijnen
- Biology Department, Boston College, Chestnut Hill, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
45
|
Dubois‐Mignon T, Monget P. Gene essentiality and variability: What is the link? A within‐ and between‐species perspective. Bioessays 2022; 44:e2200132. [DOI: 10.1002/bies.202200132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 08/17/2022] [Accepted: 08/30/2022] [Indexed: 11/07/2022]
Affiliation(s)
- Tania Dubois‐Mignon
- Institut de Biologie de l’École Normale Supérieure Université PSL 46 rue d'Ulm Paris 75005 France
| | - Philippe Monget
- Physiologie de la Reproduction et des Comportements, Centre Val de Loire – UMR INRAE, CNRS, IFCE Université de Tours Nouzilly France
| |
Collapse
|
46
|
Affiliation(s)
- Alan J S Beavan
- School of Life Sciences, University of Nottingham, Nottingham, UK.
| | | |
Collapse
|
47
|
Vihinen M. Individual Genetic Heterogeneity. Genes (Basel) 2022; 13:1626. [PMID: 36140794 PMCID: PMC9498725 DOI: 10.3390/genes13091626] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 08/25/2022] [Accepted: 09/08/2022] [Indexed: 11/28/2022] Open
Abstract
Genetic variation has been widely covered in literature, however, not from the perspective of an individual in any species. Here, a synthesis of genetic concepts and variations relevant for individual genetic constitution is provided. All the different levels of genetic information and variation are covered, ranging from whether an organism is unmixed or hybrid, has variations in genome, chromosomes, and more locally in DNA regions, to epigenetic variants or alterations in selfish genetic elements. Genetic constitution and heterogeneity of microbiota are highly relevant for health and wellbeing of an individual. Mutation rates vary widely for variation types, e.g., due to the sequence context. Genetic information guides numerous aspects in organisms. Types of inheritance, whether Mendelian or non-Mendelian, zygosity, sexual reproduction, and sex determination are covered. Functions of DNA and functional effects of variations are introduced, along with mechanism that reduce and modulate functional effects, including TARAR countermeasures and intraindividual genetic conflict. TARAR countermeasures for tolerance, avoidance, repair, attenuation, and resistance are essential for life, integrity of genetic information, and gene expression. The genetic composition, effects of variations, and their expression are considered also in diseases and personalized medicine. The text synthesizes knowledge and insight on individual genetic heterogeneity and organizes and systematizes the central concepts.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22184 Lund, Sweden
| |
Collapse
|
48
|
Kondratyeva L, Alekseenko I, Chernov I, Sverdlov E. Data Incompleteness May form a Hard-to-Overcome Barrier to Decoding Life's Mechanism. BIOLOGY 2022; 11:1208. [PMID: 36009835 PMCID: PMC9404739 DOI: 10.3390/biology11081208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Revised: 08/03/2022] [Accepted: 08/10/2022] [Indexed: 11/23/2022]
Abstract
In this brief review, we attempt to demonstrate that the incompleteness of data, as well as the intrinsic heterogeneity of biological systems, may form very strong and possibly insurmountable barriers for researchers trying to decipher the mechanisms of the functioning of live systems. We illustrate this challenge using the two most studied organisms: E. coli, with 34.6% genes lacking experimental evidence of function, and C. elegans, with identified proteins for approximately 50% of its genes. Another striking example is an artificial unicellular entity named JCVI-syn3.0, with a minimal set of genes. A total of 31.5% of the genes of JCVI-syn3.0 cannot be ascribed a specific biological function. The human interactome mapping project identified only 5-10% of all protein interactions in humans. In addition, most of the available data are static snapshots, and it is barely possible to generate realistic models of the dynamic processes within cells. Moreover, the existing interactomes reflect the de facto interaction but not its functional result, which is an unpredictable emerging property. Perhaps the completeness of molecular data on any living organism is beyond our reach and represents an unsolvable problem in biology.
Collapse
Affiliation(s)
- Liya Kondratyeva
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russia
| | - Irina Alekseenko
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russia
- Institute of Molecular Genetics of National Research Centre “Kurchatov Institute”, Moscow 123182, Russia
| | - Igor Chernov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russia
| | - Eugene Sverdlov
- Institute of Molecular Genetics of National Research Centre “Kurchatov Institute”, Moscow 123182, Russia
- Kurchatov Center for Genome Research, National Research Center “Kurchatov Institute”, Moscow 123182, Russia
| |
Collapse
|
49
|
Oz N, Vayndorf EM, Tsuchiya M, McLean S, Turcios-Hernandez L, Pitt JN, Blue BW, Muir M, Kiflezghi MG, Tyshkovskiy A, Mendenhall A, Kaeberlein M, Kaya A. Evidence that conserved essential genes are enriched for pro-longevity factors. GeroScience 2022; 44:1995-2006. [PMID: 35695982 PMCID: PMC9616985 DOI: 10.1007/s11357-022-00604-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 06/03/2022] [Indexed: 02/02/2023] Open
Abstract
At the cellular level, many aspects of aging are conserved across species. This has been demonstrated by numerous studies in simple model organisms like Saccharomyces cerevisiae, Caenorhabdits elegans, and Drosophila melanogaster. Because most genetic screens examine loss of function mutations or decreased expression of genes through reverse genetics, essential genes have often been overlooked as potential modulators of the aging process. By taking the approach of increasing the expression level of a subset of conserved essential genes, we found that 21% of these genes resulted in increased replicative lifespan in S. cerevisiae. This is greater than the ~ 3.5% of genes found to affect lifespan upon deletion, suggesting that activation of essential genes may have a relatively disproportionate effect on increasing lifespan. The results of our experiments demonstrate that essential gene overexpression is a rich, relatively unexplored means of increasing eukaryotic lifespan.
Collapse
Affiliation(s)
- Naci Oz
- Department of Biology, Virginia Commonwealth University, Richmond, VA, 23284, USA
| | - Elena M Vayndorf
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Mitsuhiro Tsuchiya
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Samantha McLean
- Department of Biology, Virginia Commonwealth University, Richmond, VA, 23284, USA
| | | | - Jason N Pitt
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Benjamin W Blue
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Michael Muir
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Michael G Kiflezghi
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Alexander Tyshkovskiy
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Alexander Mendenhall
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Matt Kaeberlein
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA.
| | - Alaattin Kaya
- Department of Biology, Virginia Commonwealth University, Richmond, VA, 23284, USA.
- Massey Cancer Center, Virginia Commonwealth University, Richmond, VA, 23298, USA.
- Department of Human and Molecular Genetics, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| |
Collapse
|
50
|
Gu X. A Simple Evolutionary Model of Genetic Robustness After Gene Duplication. J Mol Evol 2022; 90:352-361. [PMID: 35913597 DOI: 10.1007/s00239-022-10065-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 06/23/2022] [Indexed: 10/16/2022]
Abstract
When a dispensable gene is duplicated (referred to the ancestral dispensability denoted by O+), genetic buffering and duplicate compensation together maintain the duplicate redundancy, whereas duplicate compensation is the only mechanism when an essential gene is duplicated (referred to the ancestral essentiality denoted by O-). To investigate these evolutionary scenarios of genetic robustness, I formulated a simple mixture model for analyzing duplicate pairs with one of the following states: double dispensable (DD), semi-dispensable (one dispensable one essential, DE), or double essential (EE). This model was applied to the yeast duplicate pairs from a whole-genome duplication (WGD) occurred about 100 million years ago (mya), and the mouse duplicate pairs from a WGD occurred about more than 500 mya. Both case studies revealed that the proportion of essentiality for those duplicates with ancestral essentiality [PE(O-)] was much higher than that for those with ancestral dispensability [PE(O+)]. While it was negligible in the yeast duplicate pairs, PE(O+) (about 20%) was shown statistically significant in the mouse duplicate pairs. These findings, together, support the hypothesis that both sub-functionalization and neo-functionalization may play some roles after gene duplication, though the former may be much faster than the later.
Collapse
Affiliation(s)
- Xun Gu
- The Laurence H. Baker Center in Bioinformatics on Biological Statistics, Department of Genetics, Development and Cell Biology, Program of Ecological and Evolutionary Biology, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|