1
|
Ptok J, Theiss S, Schaal H. Fully haplotyped genome assemblies of healthy individuals reveal variability in 5'ss strength and support by splicing regulatory proteins. NAR Genom Bioinform 2025; 7:lqaf036. [PMID: 40191587 PMCID: PMC11970367 DOI: 10.1093/nargab/lqaf036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 02/18/2025] [Accepted: 03/20/2025] [Indexed: 04/09/2025] Open
Abstract
This work presents a comprehensive investigation of sequence variations at human 5' splice sites (5'ss), exploring their impact on 5'ss strength and predicted splicing regulatory protein (SRP) binding. Leveraging 44 high-quality genomes, with fully haplotyped assemblies, we were able to fully assess homozygous and heterozygous sequence variations around and within 5'ss. Variations showed differing tolerance levels in protein-coding and non-coding transcripts. Around half of 5'ss variations did not alter 5'ss strength (measured by the HBond score, that estimates binding of the 11 nucleotides at the free U1 spliceosomal RNA 5' end). Heterozygous variations resulted in stronger 5'ss strength reductions and less compensatory effects of multiple sequence variations at the same 5'ss, compared to homozygous variations. Additionally, we observed a slight balance between changes in 5'ss strength and predicted SRP binding. Strong 5'ss (HBond score >18.8) showed strength variations in both directions, as theoretically expected for random distributions, whereas weaker 5'ss consistently showed lower strength reductions than expected or achievable. Variations at 5'ss of essential genes were less frequent than in other genes and showed a higher amount of variations that did not alter 5'ss strength. Acceptable changes in predicted SRP binding sites were highly dependent on their respective 5'ss strength.
Collapse
Affiliation(s)
- Johannes Ptok
- Institute of Virology, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
| | - Stephan Theiss
- Institute of Virology, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
| | - Heiner Schaal
- Institute of Virology, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
| |
Collapse
|
2
|
Zerbib J, Bloomberg A, Ben-David U. Targeting vulnerabilities of aneuploid cells for cancer therapy. Trends Cancer 2025:S2405-8033(25)00097-4. [PMID: 40368673 DOI: 10.1016/j.trecan.2025.04.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2025] [Revised: 04/04/2025] [Accepted: 04/04/2025] [Indexed: 05/16/2025]
Abstract
Aneuploidy is a common feature of cancer that drives tumor evolution, but it also creates cellular vulnerabilities that might be exploited therapeutically. Recent advances in genomic technologies and experimental models have uncovered diverse cellular consequences of aneuploidy, revealing dependencies on mitotic regulation, DNA replication and repair, proteostasis, metabolism, and immune interactions. Harnessing aneuploidy for precision oncology requires the combination of genomic, functional, and clinical studies that will enable translation of our improved understanding of aneuploidy to targeted therapies. In this review we discuss approaches to targeting both highly aneuploid cells and cells with specific common aneuploidies, summarize the biological underpinning of these aneuploidy-induced vulnerabilities, and explore their therapeutic implications.
Collapse
Affiliation(s)
- Johanna Zerbib
- Department of Human Molecular Genetics and Biochemistry, Gray Faculty of Medical & Health Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Amit Bloomberg
- Department of Human Molecular Genetics and Biochemistry, Gray Faculty of Medical & Health Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Uri Ben-David
- Department of Human Molecular Genetics and Biochemistry, Gray Faculty of Medical & Health Sciences, Tel Aviv University, Tel Aviv, Israel.
| |
Collapse
|
3
|
Gąsienica P, Toch K, Zając-Garlacz KS, Labocha-Derkowska M. Genetic Background and Gene Essentiality. Genes (Basel) 2025; 16:570. [PMID: 40428392 DOI: 10.3390/genes16050570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2025] [Revised: 05/01/2025] [Accepted: 05/07/2025] [Indexed: 05/29/2025] Open
Abstract
BACKGROUND/OBJECTIVES Essential genes are those required for an organism's survival and reproduction. However, gene essentiality is not absolute; it can be highly context-dependent, varying across genetic and environmental conditions. Most previous studies have assessed gene essentiality in a single genetic background, limiting our understanding of its variability. The objective of this study was to investigate how genetic background influences gene essentiality in the multicellular model organism Caenorhabditis elegans. METHODS We examined gene essentiality in three genetically distinct C. elegans strains: N2, LKC34, and MY16. A total of 294 genes were selected for RNA interference (RNAi) knockdown: 101 previously classified as essential, 175 as nonessential and 18 as conditional (condition-dependent essentiality). Each gene-strain combination was tested in multiple biological and technical replicates, and rigorous quality control and statistical analyses were used to identify strain-specific effects. RESULTS Our results demonstrate substantial variation in gene essentiality across genetic backgrounds. Among the 101 genes previously identified as essential in the N2 strain, only 56% were consistently essential in all three strains. We identified 23 genes that were newly essential across all strains, 13 genes essential in two strains, and 9 genes essential in only one strain. These results reveal that a significant proportion of essential genes exhibit strain-dependent essentiality. CONCLUSIONS This study underscores the importance of genetic context in determining gene essentiality. Our findings suggest that relying on a single genetic background, such as N2, may lead to an incomplete or misleading view of gene essentiality. Understanding context-dependent gene essentiality has important implications for functional genomics, evolutionary biology, and potentially for translational research where genetic background can modulate phenotypic outcomes.
Collapse
Affiliation(s)
- Paulina Gąsienica
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| | - Katarzyna Toch
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| | | | - Marta Labocha-Derkowska
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| |
Collapse
|
4
|
Samowitz P, Radnai L, Vaissiere T, Michaelson SD, Rojas C, Mitchell R, Kilinc M, Edwards A, Shumate J, Hawkins R, Fernandez-Vega V, Spicer TP, Scampavia L, Kamenecka T, Miller CA, Rumbaugh G. The Endo-GeneScreen Platform Identifies Drug-Like Probes that Regulate Endogenous Protein Levels within Physiological Contexts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.13.643156. [PMID: 40161629 PMCID: PMC11952490 DOI: 10.1101/2025.03.13.643156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Traditional phenotypic drug discovery platforms have suffered from poor scalability and a lack of mechanistic understanding of newly discovered phenotypic probes. To address this, we created Endo- GeneScreen (EGS), a high-throughput enabled screening platform that identifies bioactive small molecules capable of regulating endogenous protein expression encoded by any preselected target gene within a biologically appropriate context. As a proof-of-concept, EGS successfully identified drug candidates that up-regulate endogenous expression of neuronal Syngap1, a gene that causes a neurodevelopmental disorder when haploinsufficient. For example, SR-1815, a previously unknown and undescribed kinase inhibitor, alleviated major cellular consequences of Syngap1 loss-of-function by restoring normal SynGAP protein levels and dampening neuronal hyperactivity within haploinsufficient neurons. Moreover, we demonstrate that EGS assays accelerate preclinical development of identified drug candidates and facilitate mode-of-action deconvolution studies. Thus, EGS identifies first-in-class bioactive small molecule probes that promote biological discovery and precision therapeutic development.
Collapse
|
5
|
Schwarzmueller LJ, Adam RS, Moreno LF, Nijman LE, Logiantara A, Eleonora S, Bril O, Vromans S, de Groot NE, Giugliano FP, Stepanova E, Muncan V, Elbers CC, Lenos KJ, Zwijnenburg DA, van Eijndhoven MAJ, Pegtel DM, van Neerven SM, Loayza-Puch F, Dadali T, Broom WJ, Maier MA, Koster J, Vermeulen L, Léveillé N. Identifying colorectal cancer-specific vulnerabilities in the Wnt-driven long non-coding transcriptome. Gut 2025; 74:571-585. [PMID: 39562049 PMCID: PMC12013597 DOI: 10.1136/gutjnl-2024-332752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 10/31/2024] [Indexed: 11/21/2024]
Abstract
BACKGROUND Aberrant Wnt pathway activation is a key driver of colorectal cancer (CRC) and is essential to sustain tumour growth and progression. Although the downstream protein-coding target genes of the Wnt cascade are well known, the long non-coding transcriptome has not yet been fully resolved. OBJECTIVE In this study, we aim to comprehensively reveal the Wnt-regulated long non-coding transcriptome and exploit essential molecules as novel therapeutic targets. DESIGN We used global run-on sequencing to define β-catenin-regulated long non-coding RNAs (lncRNAs) in CRC. CRISPRi dropout screens were subsequently used to establish the functional relevance of a subset of these lncRNAs for long-term expansion of CRC. RESULTS We uncovered that LINC02418 is essential for cancer cell clonogenic outgrowth. Mechanistically, LINC02418 regulates MYC expression levels to promote CRC stem cell functionality and prevent terminal differentiation. Furthermore, we developed effective small interfering RNA (siRNA)-based therapeutics to target LINC02418 RNA in vivo. CONCLUSION We propose that cancer-specific Wnt-regulated lncRNAs provide novel therapeutic opportunities to interfere with the Wnt pathway, which has so far defied effective pharmacological inhibition.
Collapse
Affiliation(s)
- Laura J Schwarzmueller
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Ronja S Adam
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Leandro F Moreno
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Lisanne E Nijman
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Adrian Logiantara
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Steven Eleonora
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Oscar Bril
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Sophie Vromans
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Nina E de Groot
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Francesca Paola Giugliano
- Department of Gastroenterology and Hepatology, Tytgat Institute for Liver and Intestinal Research, Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - Ekaterina Stepanova
- Translational Control and Metabolism, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Vanesa Muncan
- Department of Gastroenterology and Hepatology, Tytgat Institute for Liver and Intestinal Research, Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - Clara C Elbers
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Kristiaan J Lenos
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Danny A Zwijnenburg
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
| | | | - Dirk Michiel Pegtel
- Department of Pathology, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
| | - Sanne M van Neerven
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Fabricio Loayza-Puch
- Translational Control and Metabolism, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Tulin Dadali
- Alnylam Pharmaceuticals Inc, Cambridge, Massachusetts, USA
| | - Wendy J Broom
- Alnylam Pharmaceuticals Inc, Cambridge, Massachusetts, USA
| | - Martin A Maier
- Alnylam Pharmaceuticals Inc, Cambridge, Massachusetts, USA
| | - Jan Koster
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
| | - Louis Vermeulen
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| | - Nicolas Léveillé
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands
- Oncode Institute, Amsterdam, The Netherlands
| |
Collapse
|
6
|
Kang B, Fan R, Cui C, Cui Q. Comprehensive prediction and analysis of human protein essentiality based on a pretrained large language model. NATURE COMPUTATIONAL SCIENCE 2025; 5:196-206. [PMID: 39604646 DOI: 10.1038/s43588-024-00733-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Accepted: 10/31/2024] [Indexed: 11/29/2024]
Abstract
Human essential proteins (HEPs) are indispensable for individual viability and development. However, experimental methods to identify HEPs are often costly, time consuming and labor intensive. In addition, existing computational methods predict HEPs only at the cell line level, but HEPs vary across living human, cell line and animal models. Here we develop a sequence-based deep learning model, Protein Importance Calculator (PIC), by fine-tuning a pretrained protein language model. PIC not only substantially outperforms existing methods for predicting HEPs but also provides comprehensive prediction results across three levels: human, cell line and mouse. Furthermore, we define the protein essential score, derived from PIC, to quantify human protein essentiality and validate its effectiveness by a series of biological analyses. We also demonstrate the biomedical value of the protein essential score by identifying potential prognostic biomarkers for breast cancer and quantifying the essentiality of 617,462 human microproteins.
Collapse
Affiliation(s)
- Boming Kang
- Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Rui Fan
- Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Chunmei Cui
- Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Qinghua Cui
- Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, Beijing, China.
- School of Sports Medicine, Wuhan Institute of Physical Education, Wuhan, China.
| |
Collapse
|
7
|
Juvik B, Falcucci L, Lundegaard PR, Stainier DYR. A new hypothesis to explain disease dominance. Trends Genet 2025; 41:187-193. [PMID: 39788833 DOI: 10.1016/j.tig.2024.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Revised: 11/12/2024] [Accepted: 11/20/2024] [Indexed: 01/12/2025]
Abstract
The onset and progression of dominant diseases are thought to result from haploinsufficiency or dominant negative effects. Here, we propose transcriptional adaptation (TA), a newly identified response to mRNA decay, as an additional cause of some dominant diseases. TA modulates the expression of so-called adapting genes, likely via mRNA decay products, resulting in genetic compensation or a worsening of the phenotype. Recent studies have challenged the current concepts of haploinsufficiency or poison proteins as the mechanisms underlying certain dominant diseases, including Brugada syndrome, hypertrophic cardiomyopathy, and frontotemporal lobar degeneration. We hypothesize that for these and other dominant diseases, when the underlying mutation leads to mRNA decay, the phenotype is due at least partly to the dysregulation of gene expression via TA.
Collapse
Affiliation(s)
- Brian Juvik
- Department of Developmental Genetics, Max Planck Institute for Heart and Lung Research, Bad Nauheim, Hessen, 61231, Germany; German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Hessen, 61231, Germany
| | - Lara Falcucci
- Department of Developmental Genetics, Max Planck Institute for Heart and Lung Research, Bad Nauheim, Hessen, 61231, Germany; German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Hessen, 61231, Germany
| | - Pia R Lundegaard
- Department of Biomedical Sciences, Faculty of Health and Medical sciences, University of Copenhagen, Copenhagen N, Denmark
| | - Didier Y R Stainier
- Department of Developmental Genetics, Max Planck Institute for Heart and Lung Research, Bad Nauheim, Hessen, 61231, Germany; German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Hessen, 61231, Germany; Excellence Cluster Cardio-Pulmonary Institute (CPI), Bad Nauheim, Frankfurt, Giessen, Germany.
| |
Collapse
|
8
|
Cantore T, Gasperini P, Bevilacqua R, Ciani Y, Sinha S, Ruppin E, Demichelis F. PRODE recovers essential and context-essential genes through neighborhood-informed scores. Genome Biol 2025; 26:42. [PMID: 40022167 PMCID: PMC11869679 DOI: 10.1186/s13059-025-03501-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 02/05/2025] [Indexed: 03/03/2025] Open
Abstract
Gene context-essentiality assessment supports precision oncology opportunities. The variability of gene effects inference from loss-of-function screenings across models and technologies limits identifying robust hits. We propose a computational framework named PRODE that integrates gene effects with protein-protein interactions to generate neighborhood-informed essential (NIE) and neighborhood-informed context essential (NICE) scores. It outperforms the canonical gene effect approach in recovering missed essential genes in shRNA screens and prioritizing context-essential hits from CRISPR-KO screens, as supported by in vitro validations. Applied to Her2 + breast cancer tumor samples, PRODE identifies oxidative phosphorylation genes as vulnerabilities with prognostic value, highlighting new therapeutic opportunities.
Collapse
Affiliation(s)
- Thomas Cantore
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy
| | - Paola Gasperini
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy
| | - Riccardo Bevilacqua
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy
| | - Yari Ciani
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy
| | - Sanju Sinha
- Cancer Data Science Laboratory (CDSL), Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA
- Currently at Sanford Burnham Prebys Medical Discovery Institute, San Diego, CA, USA
| | - Eytan Ruppin
- Cancer Data Science Laboratory (CDSL), Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Francesca Demichelis
- Laboratory of Computational and Functional Oncology, Department of Cellular, Computational, and Integrative Biology, University of Trento, Via Sommarive 9, Trento, 38123, Italy.
| |
Collapse
|
9
|
Liang WW, Müller S, Hart SK, Wessels HH, Méndez-Mancilla A, Sookdeo A, Choi O, Caragine CM, Corman A, Lu L, Kolumba O, Williams B, Sanjana NE. Transcriptome-scale RNA-targeting CRISPR screens reveal essential lncRNAs in human cells. Cell 2024; 187:7637-7654.e29. [PMID: 39532094 DOI: 10.1016/j.cell.2024.10.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 07/09/2024] [Accepted: 10/12/2024] [Indexed: 11/16/2024]
Abstract
Mammalian genomes host a diverse array of RNA that includes protein-coding and noncoding transcripts. However, the functional roles of most long noncoding RNAs (lncRNAs) remain elusive. Using RNA-targeting CRISPR-Cas13 screens, we probed how the loss of ∼6,200 lncRNAs impacts cell fitness across five human cell lines and identified 778 lncRNAs with context-specific or broad essentiality. We confirm their essentiality with individual perturbations and find that the majority of essential lncRNAs operate independently of their nearest protein-coding genes. Using transcriptome profiling in single cells, we discover that the loss of essential lncRNAs impairs cell-cycle progression and drives apoptosis. Many essential lncRNAs demonstrate dynamic expression across tissues during development. Using ∼9,000 primary tumors, we pinpoint those lncRNAs whose expression in tumors correlates with survival, yielding new biomarkers and potential therapeutic targets. This transcriptome-wide survey of functional lncRNAs advances our understanding of noncoding transcripts and demonstrates the potential of transcriptome-scale noncoding screens with Cas13.
Collapse
Affiliation(s)
- Wen-Wei Liang
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Simon Müller
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Sydney K Hart
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Hans-Hermann Wessels
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Alejandro Méndez-Mancilla
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Akash Sookdeo
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Olivia Choi
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Christina M Caragine
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Alba Corman
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Lu Lu
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Olena Kolumba
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Breanna Williams
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA
| | - Neville E Sanjana
- New York Genome Center, New York, NY 10013, USA; Department of Biology, New York University, New York, NY 10013, USA.
| |
Collapse
|
10
|
Lopes-Marques M, Peixoto MJ, Cooper DN, Prata MJ, Azevedo L, Castro LFC. Polymorphic pseudogenes in the human genome - a comprehensive assessment. Hum Genet 2024; 143:1465-1479. [PMID: 39488654 DOI: 10.1007/s00439-024-02715-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Accepted: 10/25/2024] [Indexed: 11/04/2024]
Abstract
BACKGROUND Over the past decade, variations of the coding portion of the human genome have become increasingly evident. In this study, we focus on polymorphic pseudogenes, a unique and relatively unexplored type of pseudogene whose inactivating mutations have not yet been fixed in the human genome at the global population level. Thus, polymorphic pseudogenes are characterized by the presence in the population of both coding alleles and non-coding alleles originating from Loss-of-Function (LoF) mutations. These alleles can be found both in heterozygosity and in homozygosity in different human populations and thus represent pseudogenes that have not yet been fixed in the population. RESULTS A methodical cross-population analysis of 232 polymorphic pseudogenes, including 35 new examples, reveals that human olfactory signalling, drug metabolism and immunity are among the systems most impacted by the variable presence of LoF variants at high frequencies. Within this dataset, a total of 179 genes presented polymorphic LoF variants in all analysed populations. Transcriptome and proteome analysis confirmed that although these genes may harbour LoF alleles, when the coding allele is present, the gene remains active and can play a functional role in various metabolic pathways, including drug/xenobiotic metabolism and immunity. The observation that many polymorphic pseudogenes are members of multigene families argues that genetic redundancy may play a key role in compensating for the inactivation of one paralogue. CONCLUSIONS The distribution, expression and integration of cellular/biological networks in relation to human polymorphic pseudogenes, provide novel insights into the architecture of the human genome and the dynamics of gene gain and loss with likely functional impact.
Collapse
Affiliation(s)
- Mónica Lopes-Marques
- CIIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal.
| | - M João Peixoto
- CIIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK
| | - M João Prata
- Instituto de Investigação e Inovação em Saúde, University of Porto, Porto, Portugal
- FCUP- Faculty of Sciences, Biology Department, University of Porto, Porto, Portugal
| | - Luísa Azevedo
- UMIB-Unit for Multidisciplinary Research in Biomedicine, ICBAS - School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal
- ITR - Laboratory for Integrative and Translational Research in Population Health, Porto, Portugal
| | - L Filipe C Castro
- CIIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
- FCUP- Faculty of Sciences, Biology Department, University of Porto, Porto, Portugal
| |
Collapse
|
11
|
Xu T, Wang S, Ma T, Dong Y, Ashby CR, Hao GF. The identification of essential cellular genes is critical for validating drug targets. Drug Discov Today 2024; 29:104215. [PMID: 39428084 DOI: 10.1016/j.drudis.2024.104215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 10/06/2024] [Accepted: 10/15/2024] [Indexed: 10/22/2024]
Abstract
Accurately identifying biological targets is crucial for advancing treatment options. Essential genes, vital for cell or organism survival, hold promise as potential drug targets in disease treatment. Although many studies have sought to identify essential genes as therapeutic targets in medicine and bioinformatics, systematic reviews on their relationship with drug targets are relatively rare. This work presents a comprehensive analysis to aid in identifying essential genes as potential targets for drug discovery, encompassing their relevance, identification methods, successful case studies, and challenges. This work will facilitate the identification of essential genes as therapeutic targets, thereby boosting new drug development.
Collapse
Affiliation(s)
- Ting Xu
- School of Pharmaceutical Sciences, Guizhou Engineering Laboratory for Synthetic Drugs, Guizhou University, Guiyang 550025, China
| | - Shuang Wang
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, China
| | - Tingting Ma
- School of Pharmaceutical Sciences, Guizhou Engineering Laboratory for Synthetic Drugs, Guizhou University, Guiyang 550025, China
| | - Yawen Dong
- School of Pharmaceutical Sciences, Guizhou Engineering Laboratory for Synthetic Drugs, Guizhou University, Guiyang 550025, China.
| | - Charles R Ashby
- Department of Pharmaceutical Sciences, St. John's University, New York, NY, USA.
| | - Ge-Fei Hao
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, China.
| |
Collapse
|
12
|
Jia W, Zhang C, Luo Y, Gao J, Yuan C, Zhang D, Zhou X, Tan Y, Wang S, Chen Z, Li G, Zhang X. GBF1 deficiency causes cataracts in human and mouse. Hum Genet 2024; 143:1281-1291. [PMID: 39110251 DOI: 10.1007/s00439-024-02697-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 07/29/2024] [Indexed: 10/30/2024]
Abstract
Any opacification of the lens can be defined as cataracts, and lens epithelium cells play a crucial role in guaranteeing lens transparency by maintaining its homeostasis. Although several causative genes of congenital cataracts have been reported, the mechanisms underlying lens opacity remain unclear. In this study, a large family with congenital cataracts was collected and genetic analysis revealed a pathological mutation (c.3857 C > T, p.T1287I) in the GBF1 gene; all affected individuals in the family carried this heterozygous mutation, while unaffected family members did not. Functional studies in human lens epithelium cell line revealed that this mutation led to a reduction in GBF1 protein levels. Knockdown of endogenous GBF1 activated XBP1s in the unfolded protein response signal pathway, and enhances autophagy in an mTOR-independent manner. Heterozygous Gbf1 knockout mice also displayed typic cataract phenotype. Together, our study identified GBF1 as a novel causative gene for congenital cataracts. Additionally, we found that GBF1 deficiency activates the unfolded protein response and leads to enhanced autophagy, which may contribute to lens opacity.
Collapse
Affiliation(s)
- Weimin Jia
- Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Center for Human Genome Research, Huazhong University of Science and Technology, Wuhan, China
| | | | - Yalin Luo
- Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Center for Human Genome Research, Huazhong University of Science and Technology, Wuhan, China
| | - Jing Gao
- Department of Ophthalmology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Chao Yuan
- Scientific Research and Experiment Center, Zhaoqing Medical College, Zhaoqing, China
| | - Dazhi Zhang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Center for Human Genome Research, Huazhong University of Science and Technology, Wuhan, China
| | - Xiaopei Zhou
- Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Center for Human Genome Research, Huazhong University of Science and Technology, Wuhan, China
| | - Yongyao Tan
- Department of Ophthalmology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Shuang Wang
- Department of Ophthalmology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Zhuo Chen
- Department of Ophthalmology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Guigang Li
- Department of Ophthalmology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Xianqin Zhang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Center for Human Genome Research, Huazhong University of Science and Technology, Wuhan, China.
| |
Collapse
|
13
|
Xiao MS, Damodaran AP, Kumari B, Dickson E, Xing K, On TA, Parab N, King HE, Perez AR, Guiblet WM, Duncan G, Che A, Chari R, Andresson T, Vidigal JA, Weatheritt RJ, Aregger M, Gonatopoulos-Pournatzis T. Genome-scale exon perturbation screens uncover exons critical for cell fitness. Mol Cell 2024; 84:2553-2572.e19. [PMID: 38917794 PMCID: PMC11246229 DOI: 10.1016/j.molcel.2024.05.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Revised: 04/04/2024] [Accepted: 05/24/2024] [Indexed: 06/27/2024]
Abstract
CRISPR-Cas technology has transformed functional genomics, yet understanding of how individual exons differentially shape cellular phenotypes remains limited. Here, we optimized and conducted massively parallel exon deletion and splice-site mutation screens in human cell lines to identify exons that regulate cellular fitness. Fitness-promoting exons are prevalent in essential and highly expressed genes and commonly overlap with protein domains and interaction interfaces. Conversely, fitness-suppressing exons are enriched in nonessential genes, exhibiting lower inclusion levels, and overlap with intrinsically disordered regions and disease-associated mutations. In-depth mechanistic investigation of the screen-hit TAF5 alternative exon-8 revealed that its inclusion is required for assembly of the TFIID general transcription initiation complex, thereby regulating global gene expression output. Collectively, our orthogonal exon perturbation screens established a comprehensive repository of phenotypically important exons and uncovered regulatory mechanisms governing cellular fitness and gene expression.
Collapse
Affiliation(s)
- Mei-Sheng Xiao
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Arun Prasath Damodaran
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA.
| | - Bandana Kumari
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Ethan Dickson
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Kun Xing
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Tyler A On
- Molecular Targets Program, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Nikhil Parab
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Helen E King
- EMBL Australia and Garvan Institute of Medical Research, Sydney, NSW 2010, Australia
| | - Alexendar R Perez
- Laboratory of Biochemistry and Molecular Biology, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD 20892, USA; Department of Anesthesia and Perioperative Care, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Wilfried M Guiblet
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA
| | - Gerard Duncan
- Protein Characterization Laboratory, Frederick National Laboratory for Cancer Research (FNLCR), Frederick, MD 21701, USA
| | - Anney Che
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research (FNLCR), Frederick, MD 21701, USA
| | - Raj Chari
- Genome Modification Core, Frederick National Laboratory for Cancer Research (FNLCR), Frederick, MD 21702, USA
| | - Thorkell Andresson
- Protein Characterization Laboratory, Frederick National Laboratory for Cancer Research (FNLCR), Frederick, MD 21701, USA
| | - Joana A Vidigal
- Laboratory of Biochemistry and Molecular Biology, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD 20892, USA
| | - Robert J Weatheritt
- EMBL Australia and Garvan Institute of Medical Research, Sydney, NSW 2010, Australia; School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW 2010, Australia
| | - Michael Aregger
- Molecular Targets Program, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA.
| | - Thomas Gonatopoulos-Pournatzis
- RNA Biology Laboratory, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Frederick, MD 21702, USA.
| |
Collapse
|
14
|
Sun KY, Bai X, Chen S, Bao S, Zhang C, Kapoor M, Backman J, Joseph T, Maxwell E, Mitra G, Gorovits A, Mansfield A, Boutkov B, Gokhale S, Habegger L, Marcketta A, Locke AE, Ganel L, Hawes A, Kessler MD, Sharma D, Staples J, Bovijn J, Gelfman S, Di Gioia A, Rajagopal VM, Lopez A, Varela JR, Alegre-Díaz J, Berumen J, Tapia-Conyer R, Kuri-Morales P, Torres J, Emberson J, Collins R, Cantor M, Thornton T, Kang HM, Overton JD, Shuldiner AR, Cremona ML, Nafde M, Baras A, Abecasis G, Marchini J, Reid JG, Salerno W, Balasubramanian S. A deep catalogue of protein-coding variation in 983,578 individuals. Nature 2024; 631:583-592. [PMID: 38768635 PMCID: PMC11254753 DOI: 10.1038/s41586-024-07556-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 05/10/2024] [Indexed: 05/22/2024]
Abstract
Rare coding variants that substantially affect function provide insights into the biology of a gene1-3. However, ascertaining the frequency of such variants requires large sample sizes4-8. Here we present a catalogue of human protein-coding variation, derived from exome sequencing of 983,578 individuals across diverse populations. In total, 23% of the Regeneron Genetics Center Million Exome (RGC-ME) data come from individuals of African, East Asian, Indigenous American, Middle Eastern and South Asian ancestry. The catalogue includes more than 10.4 million missense and 1.1 million predicted loss-of-function (pLOF) variants. We identify individuals with rare biallelic pLOF variants in 4,848 genes, 1,751 of which have not been previously reported. From precise quantitative estimates of selection against heterozygous loss of function (LOF), we identify 3,988 LOF-intolerant genes, including 86 that were previously assessed as tolerant and 1,153 that lack established disease annotation. We also define regions of missense depletion at high resolution. Notably, 1,482 genes have regions that are depleted of missense variants despite being tolerant of pLOF variants. Finally, we estimate that 3% of individuals have a clinically actionable genetic variant, and that 11,773 variants reported in ClinVar with unknown significance are likely to be deleterious cryptic splice sites. To facilitate variant interpretation and genetics-informed precision medicine, we make this resource of coding variation from the RGC-ME dataset publicly accessible through a variant allele frequency browser.
Collapse
Affiliation(s)
| | | | - Siying Chen
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Suying Bao
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Liron Ganel
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | | | | | | | | | | | - Jesús Alegre-Díaz
- Faculty of Medicine, National Autonomous University of Mexico (UNAM), Mexico City, Mexico
| | - Jaime Berumen
- Faculty of Medicine, National Autonomous University of Mexico (UNAM), Mexico City, Mexico
| | - Roberto Tapia-Conyer
- Faculty of Medicine, National Autonomous University of Mexico (UNAM), Mexico City, Mexico
| | - Pablo Kuri-Morales
- Faculty of Medicine, National Autonomous University of Mexico (UNAM), Mexico City, Mexico
- Instituto Tecnológico y de Estudios Superiores de Monterrey, Monterrey, Mexico
| | - Jason Torres
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Jonathan Emberson
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Rory Collins
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | | | | | | | | | | | | | - Mona Nafde
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Aris Baras
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | |
Collapse
|
15
|
Ma K, Gauthier LO, Cheung F, Huang S, Lek M. High-throughput assays to assess variant effects on disease. Dis Model Mech 2024; 17:dmm050573. [PMID: 38940340 PMCID: PMC11225591 DOI: 10.1242/dmm.050573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
Interpreting the wealth of rare genetic variants discovered in population-scale sequencing efforts and deciphering their associations with human health and disease present a critical challenge due to the lack of sufficient clinical case reports. One promising avenue to overcome this problem is deep mutational scanning (DMS), a method of introducing and evaluating large-scale genetic variants in model cell lines. DMS allows unbiased investigation of variants, including those that are not found in clinical reports, thus improving rare disease diagnostics. Currently, the main obstacle limiting the full potential of DMS is the availability of functional assays that are specific to disease mechanisms. Thus, we explore high-throughput functional methodologies suitable to examine broad disease mechanisms. We specifically focus on methods that do not require robotics or automation but instead use well-designed molecular tools to transform biological mechanisms into easily detectable signals, such as cell survival rate, fluorescence or drug resistance. Here, we aim to bridge the gap between disease-relevant assays and their integration into the DMS framework.
Collapse
Affiliation(s)
- Kaiyue Ma
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Logan O. Gauthier
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Frances Cheung
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Shushu Huang
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Monkol Lek
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| |
Collapse
|
16
|
Plyler ZE, McAtee CW, Hill AE, Crowley MR, Tindall JM, Tindall SR, Joshi D, Sorscher EJ. Relationships between genomic dissipation and de novo SNP evolution. PLoS One 2024; 19:e0303257. [PMID: 38753830 PMCID: PMC11098520 DOI: 10.1371/journal.pone.0303257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Accepted: 04/23/2024] [Indexed: 05/18/2024] Open
Abstract
Patterns of single nucleotide polymorphisms (SNPs) in eukaryotic DNA are traditionally attributed to selective pressure, drift, identity descent, or related factors-without accounting for ways in which bias during de novo SNP formation, itself, might contribute. A functional and phenotypic analysis based on evolutionary resilience of DNA points to decreased numbers of non-synonymous SNPs in human and other genomes, with a predominant component of SNP depletion in the human gene pool caused by robust preferences during de novo SNP formation (rather than selective constraint). Ramifications of these findings are broad, belie a number of concepts regarding human evolution, and point to a novel interpretation of evolving DNA across diverse species.
Collapse
Affiliation(s)
- Zackery E. Plyler
- Department of Biology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Christopher W. McAtee
- Department of Biology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Aubrey E. Hill
- Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Department of Computer and Information Sciences, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Michael R. Crowley
- Department of Genetics, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | | | | | - Disha Joshi
- Emory University, Atlanta, Georgia, United States of America
| | | |
Collapse
|
17
|
Peleke FF, Zumkeller SM, Gültas M, Schmitt A, Szymański J. Deep learning the cis-regulatory code for gene expression in selected model plants. Nat Commun 2024; 15:3488. [PMID: 38664394 PMCID: PMC11045779 DOI: 10.1038/s41467-024-47744-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 04/09/2024] [Indexed: 04/28/2024] Open
Abstract
Elucidating the relationship between non-coding regulatory element sequences and gene expression is crucial for understanding gene regulation and genetic variation. We explored this link with the training of interpretable deep learning models predicting gene expression profiles from gene flanking regions of the plant species Arabidopsis thaliana, Solanum lycopersicum, Sorghum bicolor, and Zea mays. With over 80% accuracy, our models enabled predictive feature selection, highlighting e.g. the significant role of UTR regions in determining gene expression levels. The models demonstrated remarkable cross-species performance, effectively identifying both conserved and species-specific regulatory sequence features and their predictive power for gene expression. We illustrated the application of our approach by revealing causal links between genetic variation and gene expression changes across fourteen tomato genomes. Lastly, our models efficiently predicted genotype-specific expression of key functional gene groups, exemplified by underscoring known phenotypic and metabolic differences between Solanum lycopersicum and its wild, drought-resistant relative, Solanum pennellii.
Collapse
Affiliation(s)
- Fritz Forbang Peleke
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraße 3, D-06466 Seeland, OT, Gatersleben, Germany
| | - Simon Maria Zumkeller
- Institute of Bio- and Geosciences, IBG-4: Bioinformatics, Forschungszentrum Jülich, D-52428, Jülich, Germany
- Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich-Heine-Universität Düsseldorf, 40225, Düsseldorf, Germany
| | - Mehmet Gültas
- Faculty of Agriculture, South Westphalia University of Applied Sciences, Soest, 59494, Germany
| | - Armin Schmitt
- Breeding Informatics Group, University of Göttingen, Göttingen, 37075, Germany
- Center of Integrated Breeding Research (CiBreed), Göttingen, 37075, Germany
| | - Jędrzej Szymański
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraße 3, D-06466 Seeland, OT, Gatersleben, Germany.
- Institute of Bio- and Geosciences, IBG-4: Bioinformatics, Forschungszentrum Jülich, D-52428, Jülich, Germany.
- Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich-Heine-Universität Düsseldorf, 40225, Düsseldorf, Germany.
| |
Collapse
|
18
|
James C, Trevisan-Herraz M, Juan D, Rico D. Evolutionary analysis of gene ages across TADs associates chromatin topology with whole-genome duplications. Cell Rep 2024; 43:113895. [PMID: 38517894 DOI: 10.1016/j.celrep.2024.113895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Revised: 11/03/2023] [Accepted: 02/16/2024] [Indexed: 03/24/2024] Open
Abstract
Topologically associated domains (TADs) are interaction subnetworks of chromosomal regions in 3D genomes. TAD boundaries frequently coincide with genome breaks while boundary deletion is under negative selection, suggesting that TADs may facilitate genome rearrangements and evolution. We show that genes co-localize by evolutionary age in humans and mice, resulting in TADs having different proportions of younger and older genes. We observe a major transition in the age co-localization patterns between the genes born during vertebrate whole-genome duplications (WGDs) or before and those born afterward. We also find that genes recently duplicated in primates and rodents are more frequently essential when they are located in old-enriched TADs and interact with genes that last duplicated during the WGD. Therefore, the evolutionary relevance of recent genes may increase when located in TADs with established regulatory networks. Our data suggest that TADs could play a role in organizing ancestral functions and evolutionary novelty.
Collapse
Affiliation(s)
- Caelinn James
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Scotland's Rural College (SRUC), The Roslin Institute Building, Easter Bush, Midlothian, UK
| | - Marco Trevisan-Herraz
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Translational and Clinical Research Institute, Newcastle University, Newcastle Upon Tyne, UK
| | - David Juan
- Institut de Biologia Evolutiva, Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain; Systems Biology Department, Spanish National Centre for Biotechnology (CNB-CSIC), Madrid, Spain
| | - Daniel Rico
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Centro Andaluz de Biología Molecular y Medicina Regenerativa (CABIMER), CSIC-Universidad de Sevilla-Universidad Pablo de Olavide-Junta de Andalucía, Seville, Spain.
| |
Collapse
|
19
|
Safadi A, Lovell SC, Doig AJ. Essentiality, protein-protein interactions and evolutionary properties are key predictors for identifying cancer-associated genes using machine learning. Sci Rep 2024; 14:9199. [PMID: 38649399 PMCID: PMC11035574 DOI: 10.1038/s41598-023-44118-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 10/04/2023] [Indexed: 04/25/2024] Open
Abstract
The distinctive nature of cancer as a disease prompts an exploration of the special characteristics the genes implicated in cancer exhibit. The identification of cancer-associated genes and their characteristics is crucial to further our understanding of this disease and enhanced likelihood of therapeutic drug targets success. However, the rate at which cancer genes are being identified experimentally is slow. Applying predictive analysis techniques, through the building of accurate machine learning models, is potentially a useful approach in enhancing the identification rate of these genes and their characteristics. Here, we investigated gene essentiality scores and found that they tend to be higher for cancer-associated genes compared to other protein-coding human genes. We built a dataset of extended gene properties linked to essentiality and used it to train a machine-learning model; this model reached 89% accuracy and > 0.85 for the Area Under Curve (AUC). The model showed that essentiality, evolutionary-related properties, and properties arising from protein-protein interaction networks are particularly effective in predicting cancer-associated genes. We were able to use the model to identify potential candidate genes that have not been previously linked to cancer. Prioritising genes that score highly by our methods could aid scientists in their cancer genes research.
Collapse
Affiliation(s)
- Amro Safadi
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, M13 9PT, UK
| | - Simon C Lovell
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, M13 9PT, UK
| | - Andrew J Doig
- Division of Neuroscience, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, M13 9BL, UK.
| |
Collapse
|
20
|
Li Y, Gao W, Yang Z, Hu Z, Li J. Multi-omics pan-cancer analyses identify MCM4 as a promising prognostic and diagnostic biomarker. Sci Rep 2024; 14:6517. [PMID: 38499612 PMCID: PMC10948783 DOI: 10.1038/s41598-024-57299-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 03/16/2024] [Indexed: 03/20/2024] Open
Abstract
Minichromosome Maintenance Complex Component 4 (MCM4) is a vital component of the mini-chromosome maintenance complex family, crucial for initiating the replication of eukaryotic genomes. Recently, there has been a growing interest in investigating the significance of MCM4 in different types of cancer. Despite the existing research on this topic, a comprehensive analysis of MCM4 across various cancer types has been lacking. This study aims to bridge this knowledge gap by presenting a thorough pan-cancer analysis of MCM4, shedding light on its functional implications and potential clinical applications. The study utilized multi-omics samples from various databases. Bioinformatic tools were employed to explore the expression profiles, genetic alterations, phosphorylation states, immune cell infiltration patterns, immune subtypes, functional enrichment, disease prognosis, as well as the diagnostic potential of MCM4 and its responsiveness to drugs in a range of cancers. Our research demonstrates that MCM4 is closely associated with the oncogenesis, prognosis and diagnosis of various tumors and proposes that MCM4 may function as a potential biomarker in pan-cancer, providing a deeper understanding of its potential role in cancer development and treatment.
Collapse
Affiliation(s)
- Yanxing Li
- Xi'an Jiaotong University Health Science Center, Xi'an, 710000, Shaanxi, People's Republic of China
| | - Wentao Gao
- Xi'an Jiaotong University Health Science Center, Xi'an, 710000, Shaanxi, People's Republic of China
| | - Zhen Yang
- Xi'an Jiaotong University Health Science Center, Xi'an, 710000, Shaanxi, People's Republic of China
| | - Zhenwei Hu
- Xi'an Jiaotong University Health Science Center, Xi'an, 710000, Shaanxi, People's Republic of China
| | - Jianjun Li
- Department of Cardiology, Jincheng People's Hospital Affiliated to Changzhi Medical College, Jincheng, Shanxi, People's Republic of China.
| |
Collapse
|
21
|
Zhou Q, Meng Y, Li D, Yao L, Le J, Liu Y, Sun Y, Zeng F, Chen X, Deng G. Ferroptosis in cancer: From molecular mechanisms to therapeutic strategies. Signal Transduct Target Ther 2024; 9:55. [PMID: 38453898 PMCID: PMC10920854 DOI: 10.1038/s41392-024-01769-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 01/21/2024] [Accepted: 02/03/2024] [Indexed: 03/09/2024] Open
Abstract
Ferroptosis is a non-apoptotic form of regulated cell death characterized by the lethal accumulation of iron-dependent membrane-localized lipid peroxides. It acts as an innate tumor suppressor mechanism and participates in the biological processes of tumors. Intriguingly, mesenchymal and dedifferentiated cancer cells, which are usually resistant to apoptosis and traditional therapies, are exquisitely vulnerable to ferroptosis, further underscoring its potential as a treatment approach for cancers, especially for refractory cancers. However, the impact of ferroptosis on cancer extends beyond its direct cytotoxic effect on tumor cells. Ferroptosis induction not only inhibits cancer but also promotes cancer development due to its potential negative impact on anticancer immunity. Thus, a comprehensive understanding of the role of ferroptosis in cancer is crucial for the successful translation of ferroptosis therapy from the laboratory to clinical applications. In this review, we provide an overview of the recent advancements in understanding ferroptosis in cancer, covering molecular mechanisms, biological functions, regulatory pathways, and interactions with the tumor microenvironment. We also summarize the potential applications of ferroptosis induction in immunotherapy, radiotherapy, and systemic therapy, as well as ferroptosis inhibition for cancer treatment in various conditions. We finally discuss ferroptosis markers, the current challenges and future directions of ferroptosis in the treatment of cancer.
Collapse
Affiliation(s)
- Qian Zhou
- Department of Dermatology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Engineering Research Center of Personalized Diagnostic and Therapeutic Technology, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Furong Laboratory, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Hunan Key Laboratory of Skin Cancer and Psoriasis, Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
| | - Yu Meng
- Department of Dermatology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Engineering Research Center of Personalized Diagnostic and Therapeutic Technology, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Furong Laboratory, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Hunan Key Laboratory of Skin Cancer and Psoriasis, Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
| | - Daishi Li
- Department of Dermatology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Engineering Research Center of Personalized Diagnostic and Therapeutic Technology, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Furong Laboratory, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Hunan Key Laboratory of Skin Cancer and Psoriasis, Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
| | - Lei Yao
- Department of General Surgery, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
| | - Jiayuan Le
- Department of Dermatology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Engineering Research Center of Personalized Diagnostic and Therapeutic Technology, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Furong Laboratory, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Hunan Key Laboratory of Skin Cancer and Psoriasis, Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
| | - Yihuang Liu
- Department of Dermatology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Engineering Research Center of Personalized Diagnostic and Therapeutic Technology, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Furong Laboratory, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- Hunan Key Laboratory of Skin Cancer and Psoriasis, Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
| | - Yuming Sun
- Department of Plastic and Cosmetic Surgery, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China
| | - Furong Zeng
- Department of Oncology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
| | - Xiang Chen
- Department of Dermatology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
- National Engineering Research Center of Personalized Diagnostic and Therapeutic Technology, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
- Furong Laboratory, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
- Hunan Key Laboratory of Skin Cancer and Psoriasis, Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
| | - Guangtong Deng
- Department of Dermatology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
- National Engineering Research Center of Personalized Diagnostic and Therapeutic Technology, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
- Furong Laboratory, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
- Hunan Key Laboratory of Skin Cancer and Psoriasis, Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, 87 Xiangya Road, Changsha, 410008, Hunan Province, China.
| |
Collapse
|
22
|
Gómez-Márquez C, Morales JA, Romero-Gutiérrez T, Paredes O, Borrayo E. Decoding semiotic minimal genome: a non-genocentric approach. Front Microbiol 2024; 15:1356050. [PMID: 38476952 PMCID: PMC10929006 DOI: 10.3389/fmicb.2024.1356050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 02/02/2024] [Indexed: 03/14/2024] Open
Abstract
The search for the minimum information required for an organism to sustain a cellular system network has rendered both the identification of a fixed number of known genes and those genes whose function remains to be identified. The approaches used in such search generally focus their analysis on coding genomic regions, based on the genome to proteic-product perspective. Such approaches leave other fundamental processes aside, mainly those that include higher-level information management. To cope with this limitation, a non-genocentric approach based on genomic sequence analysis using language processing tools and gene ontology may prove an effective strategy for the identification of those fundamental genomic elements for life autonomy. Additionally, this approach will provide us with an integrative analysis of the information value present in all genomic elements, regardless of their coding status.
Collapse
Affiliation(s)
- Carolina Gómez-Márquez
- Biodigital Innovation Lab, Translational Bioengineering Department, Exact Sciences and Engineering University Center, Universidad de Guadalajara, Guadalajara, Mexico
| | - J. Alejandro Morales
- Biodigital Innovation Lab, Translational Bioengineering Department, Exact Sciences and Engineering University Center, Universidad de Guadalajara, Guadalajara, Mexico
| | - Teresa Romero-Gutiérrez
- Biodigital Innovation Lab, Translational Bioengineering Department, Exact Sciences and Engineering University Center, Universidad de Guadalajara, Guadalajara, Mexico
- Technological Innovation Department, Tlajomulco University Center, Universidad de Guadalajara, Guadalajara, Mexico
| | - Omar Paredes
- Biodigital Innovation Lab, Translational Bioengineering Department, Exact Sciences and Engineering University Center, Universidad de Guadalajara, Guadalajara, Mexico
| | - Ernesto Borrayo
- Biodigital Innovation Lab, Translational Bioengineering Department, Exact Sciences and Engineering University Center, Universidad de Guadalajara, Guadalajara, Mexico
| |
Collapse
|
23
|
Li H, Bartke R, Zhao L, Verma Y, Horacek A, Rechav Ben-Natan A, Pangilinan GR, Krishnappa N, Nielsen R, Hockemeyer D. Functional annotation of variants of the BRCA2 gene via locally haploid human pluripotent stem cells. Nat Biomed Eng 2024; 8:165-176. [PMID: 37488236 PMCID: PMC10878975 DOI: 10.1038/s41551-023-01065-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 06/15/2023] [Indexed: 07/26/2023]
Abstract
Mutations in the BRCA2 gene are associated with sporadic and familial cancer, cause genomic instability and sensitize cancer cells to inhibition by the poly(ADP-ribose) polymerase (PARP). Here we show that human pluripotent stem cells (hPSCs) with one copy of BRCA2 deleted can be used to annotate variants of this gene and to test their sensitivities to PARP inhibition. By using Cas9 to edit the functional BRCA2 allele in the locally haploid hPSCs and in fibroblasts differentiated from them, we characterized essential regions in the gene to identify permissive and loss-of-function mutations. We also used Cas9 to directly test the function of individual amino acids, including amino acids encoded by clinical BRCA2 variants of uncertain significance, and identified alleles that are sensitive to PARP inhibitors used as a standard of care in BRCA2-deficient cancers. Locally haploid human pluripotent stem cells can facilitate detailed structure-function analyses of genes and the rapid functional evaluation of clinically observed mutations.
Collapse
Affiliation(s)
- Hanqin Li
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
- Innovative Genomics Institute, University of California, Berkeley, CA, USA
| | - Rebecca Bartke
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Lei Zhao
- Section for GeoGenetics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Yogendra Verma
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Anna Horacek
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Alma Rechav Ben-Natan
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Gabriella R Pangilinan
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | | | - Rasmus Nielsen
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
- Section for GeoGenetics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Dirk Hockemeyer
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA.
- Innovative Genomics Institute, University of California, Berkeley, CA, USA.
- Chan Zuckerberg Biohub, San Francisco, CA, USA.
| |
Collapse
|
24
|
Deutzmann A, Sullivan DK, Dhanasekaran R, Li W, Chen X, Tong L, Mahauad-Fernandez WD, Bell J, Mosley A, Koehler AN, Li Y, Felsher DW. Nuclear to cytoplasmic transport is a druggable dependency in MYC-driven hepatocellular carcinoma. Nat Commun 2024; 15:963. [PMID: 38302473 PMCID: PMC10834515 DOI: 10.1038/s41467-024-45128-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Accepted: 01/12/2024] [Indexed: 02/03/2024] Open
Abstract
The MYC oncogene is often dysregulated in human cancer, including hepatocellular carcinoma (HCC). MYC is considered undruggable to date. Here, we comprehensively identify genes essential for survival of MYChigh but not MYClow cells by a CRISPR/Cas9 genome-wide screen in a MYC-conditional HCC model. Our screen uncovers novel MYC synthetic lethal (MYC-SL) interactions and identifies most MYC-SL genes described previously. In particular, the screen reveals nucleocytoplasmic transport to be a MYC-SL interaction. We show that the majority of MYC-SL nucleocytoplasmic transport genes are upregulated in MYChigh murine HCC and are associated with poor survival in HCC patients. Inhibiting Exportin-1 (XPO1) in vivo induces marked tumor regression in an autochthonous MYC-transgenic HCC model and inhibits tumor growth in HCC patient-derived xenografts. XPO1 expression is associated with poor prognosis only in HCC patients with high MYC activity. We infer that MYC may generally regulate and require altered expression of nucleocytoplasmic transport genes for tumorigenesis.
Collapse
Affiliation(s)
- Anja Deutzmann
- Division of Oncology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Delaney K Sullivan
- Division of Oncology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Renumathy Dhanasekaran
- Division of Oncology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
- Division of Gastroenterology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Wei Li
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, 20012, USA
- Department of Genomics and Precision Medicine, George Washington University, Washington, DC, 20012, USA
| | - Xinyu Chen
- Division of Oncology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Ling Tong
- Division of Oncology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | | | - John Bell
- Stanford Genome Technology Center, Stanford University, Stanford, CA, 94305, USA
| | - Adriane Mosley
- Division of Oncology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Angela N Koehler
- Koch Institute for Integrative Cancer Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Center for Precision Cancer Medicine, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Yulin Li
- Division of Oncology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA.
- Institute for Academic Medicine, Houston Methodist and Weill Cornell Medical College, Houston, TX, 77030, USA.
| | - Dean W Felsher
- Division of Oncology, Department of Medicine, Stanford University, Stanford, CA, 94305, USA.
- Department of Pathology, Stanford University, Stanford, CA, 94305, USA.
- Stanford Cancer Institute, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
25
|
Liang Y, Luo H, Lin Y, Gao F. Recent advances in the characterization of essential genes and development of a database of essential genes. IMETA 2024; 3:e157. [PMID: 38868518 PMCID: PMC10989110 DOI: 10.1002/imt2.157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 10/09/2023] [Indexed: 06/14/2024]
Abstract
Over the past few decades, there has been a significant interest in the study of essential genes, which are crucial for the survival of an organism under specific environmental conditions and thus have practical applications in the fields of synthetic biology and medicine. An increasing amount of experimental data on essential genes has been obtained with the continuous development of technological methods. Meanwhile, various computational prediction methods, related databases and web servers have emerged accordingly. To facilitate the study of essential genes, we have established a database of essential genes (DEG), which has become popular with continuous updates to facilitate essential gene feature analysis and prediction, drug and vaccine development, as well as artificial genome design and construction. In this article, we summarized the studies of essential genes, overviewed the relevant databases, and discussed their practical applications. Furthermore, we provided an overview of the main applications of DEG and conducted comprehensive analyses based on its latest version. However, it should be noted that the essential gene is a dynamic concept instead of a binary one, which presents both opportunities and challenges for their future development.
Collapse
Affiliation(s)
| | - Hao Luo
- Department of PhysicsTianjin UniversityTianjinChina
| | - Yan Lin
- Department of PhysicsTianjin UniversityTianjinChina
| | - Feng Gao
- Department of PhysicsTianjin UniversityTianjinChina
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education)Tianjin UniversityTianjinChina
- SynBio Research PlatformCollaborative Innovation Center of Chemical Science and Engineering (Tianjin)TianjinChina
| |
Collapse
|
26
|
Martinez TC, McNerney ME. Haploinsufficient Transcription Factors in Myeloid Neoplasms. ANNUAL REVIEW OF PATHOLOGY 2024; 19:571-598. [PMID: 37906947 DOI: 10.1146/annurev-pathmechdis-051222-013421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Many transcription factors (TFs) function as tumor suppressor genes with heterozygous phenotypes, yet haploinsufficiency generally has an underappreciated role in neoplasia. This is no less true in myeloid cells, which are normally regulated by a delicately balanced and interconnected transcriptional network. Detailed understanding of TF dose in this circuitry sheds light on the leukemic transcriptome. In this review, we discuss the emerging features of haploinsufficient transcription factors (HITFs). We posit that: (a) monoallelic and biallelic losses can have distinct cellular outcomes; (b) the activity of a TF exists in a greater range than the traditional Mendelian genetic doses; and (c) how a TF is deleted or mutated impacts the cellular phenotype. The net effect of a HITF is a myeloid differentiation block and increased intercellular heterogeneity in the course of myeloid neoplasia.
Collapse
Affiliation(s)
- Tanner C Martinez
- Department of Pathology, Department of Pediatrics, Section of Hematology/Oncology, The University of Chicago Medicine Comprehensive Cancer Center, The University of Chicago, Chicago, Illinois, USA;
- Medical Scientist Training Program, The University of Chicago, Chicago, Illinois, USA
| | - Megan E McNerney
- Department of Pathology, Department of Pediatrics, Section of Hematology/Oncology, The University of Chicago Medicine Comprehensive Cancer Center, The University of Chicago, Chicago, Illinois, USA;
| |
Collapse
|
27
|
Kumar S, Pauline G, Vindal V. NetVA: an R package for network vulnerability and influence analysis. J Biomol Struct Dyn 2024:1-12. [PMID: 38234040 DOI: 10.1080/07391102.2024.2303607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 01/04/2024] [Indexed: 01/19/2024]
Abstract
In biological network analysis, identifying key molecules plays a decisive role in the development of potential diagnostic and therapeutic candidates. Among various approaches of network analysis, network vulnerability analysis is quite important, as it assesses significant associations between topological properties and the functional essentiality of a network. Similarly, some node centralities are also used to screen out key molecules. Among these node centralities, escape velocity centrality (EVC), and its extended version (EVC+) outperform others, viz., Degree, Betweenness, and Clustering coefficient. Keeping this in mind, we aimed to develop a first-of-its-kind R package named NetVA, which analyzes networks to identify key molecular players (individual proteins and protein pairs/triplets) through network vulnerability and EVC+-based approaches. To demonstrate the application and relevance of our package in network analysis, previously published and publicly available protein-protein interactions (PPIs) data of human breast cancer were analyzed. This resulted in identifying some most important proteins. These included essential proteins, non-essential proteins, hubs, and bottlenecks, which play vital roles in breast cancer development. Thus, the NetVA package, available at https://github.com/kr-swapnil/NetVA with a detailed tutorial to download and use, assists in predicting potential candidates for therapeutic and diagnostic purposes by exploring various topological features of a disease-specific PPIs network.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Swapnil Kumar
- Department of Biotechnology & Bioinformatics, School of Life Sciences, University of Hyderabad, Hyderabad, India
| | - Grace Pauline
- Department of Biotechnology & Bioinformatics, School of Life Sciences, University of Hyderabad, Hyderabad, India
| | - Vaibhav Vindal
- Department of Biotechnology & Bioinformatics, School of Life Sciences, University of Hyderabad, Hyderabad, India
| |
Collapse
|
28
|
Freischem LJ, Oyarzún DA. A Machine Learning Approach for Predicting Essentiality of Metabolic Genes. Methods Mol Biol 2024; 2760:345-369. [PMID: 38468098 DOI: 10.1007/978-1-0716-3658-9_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
The identification of essential genes is a key challenge in systems and synthetic biology, particularly for engineering metabolic pathways that convert feedstocks into valuable products. Assessment of gene essentiality at a genome scale requires large and costly growth assays of knockout strains. Here we describe a strategy to predict the essentiality of metabolic genes using binary classification algorithms. The approach combines elements from genome-scale metabolic models, directed graphs, and machine learning into a predictive model that can be trained on small knockout data. We demonstrate the efficacy of this approach using the most complete metabolic model of Escherichia coli and various machine learning algorithms for binary classification.
Collapse
Affiliation(s)
| | - Diego A Oyarzún
- School of Informatics, University of Edinburgh, Edinburgh, UK.
- School of Biological Sciences, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
29
|
Giordano M, Falbo E, Maddalena L, Piccirillo M, Granata I. Untangling the Context-Specificity of Essential Genes by Means of Machine Learning: A Constructive Experience. Biomolecules 2023; 14:18. [PMID: 38254618 PMCID: PMC10813179 DOI: 10.3390/biom14010018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 11/29/2023] [Accepted: 12/20/2023] [Indexed: 01/24/2024] Open
Abstract
Gene essentiality is a genetic concept crucial for a comprehensive understanding of life and evolution. In the last decade, many essential genes (EGs) have been determined using different experimental and computational approaches, and this information has been used to reduce the genomes of model organisms. A growing amount of evidence highlights that essentiality is a property that depends on the context. Because of their importance in vital biological processes, recognising context-specific EGs (csEGs) could help for identifying new potential pharmacological targets and to improve precision therapeutics. Since most of the computational procedures proposed to identify and predict EGs neglect their context-specificity, we focused on this aspect, providing a theoretical and experimental overview of the literature, data and computational methods dedicated to recognising csEGs. To this end, we adapted existing computational methods to exploit a specific context (the kidney tissue) and experimented with four different prediction methods using the labels provided by four different identification approaches. The considerations derived from the analysis of the obtained results, confirmed and validated also by further experiments for a different tissue context, provide the reader with guidance on exploiting existing tools for achieving csEGs identification and prediction.
Collapse
Affiliation(s)
- Maurizio Giordano
- Institute for High-Performance Computing and Networking (ICAR), National Research Council (CNR), V. Pietro Castellino 111, 80131 Naples, Italy; (E.F.); (L.M.); (M.P.); (I.G.)
| | | | | | | | | |
Collapse
|
30
|
Huang J, Huang W, Yi J, Deng Y, Li R, Chen J, Shi J, Qiu Y, Wang T, Chen X, Zhang X, Xiang AP. Mesenchymal stromal cells alleviate depressive and anxiety-like behaviors via a lung vagal-to-brain axis in male mice. Nat Commun 2023; 14:7406. [PMID: 37973914 PMCID: PMC10654509 DOI: 10.1038/s41467-023-43150-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Accepted: 11/01/2023] [Indexed: 11/19/2023] Open
Abstract
Major depressive disorder (MDD) is one of the most common and disabling mental disorders, and current strategies remain inadequate. Although mesenchymal stromal cells (MSCs) have shown beneficial effects in experimental models of depression, underlying mechanisms remain elusive. Here, using murine depression models, we demonstrated that MSCs could alleviate depressive and anxiety-like behaviors not due to a reduction in proinflammatory cytokines, but rather activation of dorsal raphe nucleus (DRN) 5-hydroxytryptamine (5-HT) neurons. Mechanistically, peripheral delivery of MSCs activated pulmonary innervating vagal sensory neurons, which projected to the nucleus tractus solitarius, inducing the release of 5-HT in DRN. Furthermore, MSC-secreted brain-derived neurotrophic factor activated lung sensory neurons through tropomyosin receptor kinase B (TrkB), and inhalation of a TrkB agonist also achieved significant therapeutic effects in male mice. This study reveals a role of peripheral MSCs in regulating central nervous system function and demonstrates a potential "lung vagal-to-brain axis" strategy for MDD.
Collapse
Affiliation(s)
- Jing Huang
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Weijun Huang
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Junzhe Yi
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Yiwen Deng
- Key Laboratory of Medical Transformation of Jiujiang, Jiujiang University, Jiujiang, Jiangxi, 332005, China
| | - Ruijie Li
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Jieying Chen
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Jiahao Shi
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Yuan Qiu
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- Department of Histoembryology and Cell Biology, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Tao Wang
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- Department of Histoembryology and Cell Biology, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Xiaoyong Chen
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
- Department of Histoembryology and Cell Biology, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China
| | - Xiaoran Zhang
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China.
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China.
- Department of Histoembryology and Cell Biology, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China.
| | - Andy Peng Xiang
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China.
- National-Local Joint Engineering Research Center for Stem Cells and Regenerative Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China.
- Department of Histoembryology and Cell Biology, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, 510080, China.
| |
Collapse
|
31
|
Sun KY, Bai X, Chen S, Bao S, Kapoor M, Zhang C, Backman J, Joseph T, Maxwell E, Mitra G, Gorovits A, Mansfield A, Boutkov B, Gokhale S, Habegger L, Marcketta A, Locke A, Kessler MD, Sharma D, Staples J, Bovijn J, Gelfman S, Gioia AD, Rajagopal V, Lopez A, Varela JR, Alegre J, Berumen J, Tapia-Conyer R, Kuri-Morales P, Torres J, Emberson J, Collins R, Regeneron Genetics Center, RGC-ME Cohort Partners, Cantor M, Thornton T, Kang HM, Overton J, Shuldiner AR, Cremona ML, Nafde M, Baras A, Abecasis G, Marchini J, Reid JG, Salerno W, Balasubramanian S. A deep catalog of protein-coding variation in 985,830 individuals. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.09.539329. [PMID: 37214792 PMCID: PMC10197621 DOI: 10.1101/2023.05.09.539329] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.
Collapse
Affiliation(s)
| | | | - Siying Chen
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Suying Bao
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | - Adam Locke
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | | | | | | | | | - Jesus Alegre
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
| | - Jaime Berumen
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
| | - Roberto Tapia-Conyer
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
| | - Pablo Kuri-Morales
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
| | - Jason Torres
- Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Jonathan Emberson
- Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Rory Collins
- Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | | | | | | | | | | | | | | | | | - Mona Nafde
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Aris Baras
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | |
Collapse
|
32
|
Walton NA, Nguyen HH, Procknow SS, Johnson D, Anzelmi A, Jay PY. Repurposing Normal Chromosomal Microarray Data to Harbor Genetic Insights into Congenital Heart Disease. BIOLOGY 2023; 12:1290. [PMID: 37887000 PMCID: PMC10604103 DOI: 10.3390/biology12101290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 09/08/2023] [Accepted: 09/08/2023] [Indexed: 10/28/2023]
Abstract
About 15% of congenital heart disease (CHD) patients have a known pathogenic copy number variant. The majority of their chromosomal microarray (CMA) tests are deemed normal. Diagnostic interpretation typically ignores microdeletions smaller than 100 kb. We hypothesized that unreported microdeletions are enriched for CHD genes. We analyzed "normal" CMAs of 1762 patients who were evaluated at a pediatric referral center, of which 319 (18%) had CHD. Using CMAs from monozygotic twins or replicates from the same individual, we established a size threshold based on probe count for the reproducible detection of small microdeletions. Genes in the microdeletions were sequentially filtered by their nominal association with a CHD diagnosis, the expression level in the fetal heart, and the deleteriousness of a loss-of-function mutation. The subsequent enrichment for CHD genes was assessed using the presence of known or potentially novel genes implicated by a large whole-exome sequencing study of CHD. The unreported microdeletions were modestly enriched for both known CHD genes and those of unknown significance identified using their de novo mutation in CHD patients. Our results show that readily available "normal" CMA data can be a fruitful resource for genetic discovery and that smaller deletions should receive more attention in clinical evaluation.
Collapse
Affiliation(s)
- Nephi A. Walton
- Department of Pediatrics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Hoang H. Nguyen
- Department of Pediatrics, UT Southwestern Medical Center, Dallas, TX 75390, USA;
| | - Sara S. Procknow
- Department of Pediatrics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Darren Johnson
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA
| | - Alexander Anzelmi
- Department of Medicine, Thomas Jefferson University Hospitals, Philadelphia, PA 19107, USA
| | - Patrick Y. Jay
- Department of Pediatrics, Washington University School of Medicine, St. Louis, MO 63110, USA
| |
Collapse
|
33
|
Powell G, Simon MM, Pulit S, Mallon AM, Lindgren CM. Genic constraint against nonsynonymous variation across the mouse genome. BMC Genomics 2023; 24:562. [PMID: 37736706 PMCID: PMC10514939 DOI: 10.1186/s12864-023-09637-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 08/30/2023] [Indexed: 09/23/2023] Open
Abstract
BACKGROUND Selective constraint, the depletion of variation due to negative selection, provides insights into the functional impact of variants and disease mechanisms. However, its characterization in mice, the most commonly used mammalian model, remains limited. This study aims to quantify mouse gene constraint using a new metric called the nonsynonymous observed expected ratio (NOER) and investigate its relationship with gene function. RESULTS NOER was calculated using whole-genome sequencing data from wild mouse populations (Mus musculus sp and Mus spretus). Positive correlations were observed between mouse gene constraint and the number of associated knockout phenotypes, indicating stronger constraint on pleiotropic genes. Furthermore, mouse gene constraint showed a positive correlation with the number of pathogenic variant sites in their human orthologues, supporting the relevance of mouse models in studying human disease variants. CONCLUSIONS NOER provides a resource for assessing the fitness consequences of genetic variants in mouse genes and understanding the relationship between gene constraint and function. The study's findings highlight the importance of pleiotropy in selective constraint and support the utility of mouse models in investigating human disease variants. Further research with larger sample sizes can refine constraint estimates in mice and enable more comprehensive comparisons of constraint between mouse and human orthologues.
Collapse
Affiliation(s)
- George Powell
- Li Ka Shing Centre for Health Information and Discovery, Big Data Institute, University of Oxford, Oxford, UK.
- Mammalian Genetics Unit, MRC Harwell Institute, Oxfordshire, OX11 0RD, UK.
| | - Michelle M Simon
- Mammalian Genetics Unit, MRC Harwell Institute, Oxfordshire, OX11 0RD, UK
| | - Sara Pulit
- Li Ka Shing Centre for Health Information and Discovery, Big Data Institute, University of Oxford, Oxford, UK
| | - Ann-Marie Mallon
- Mammalian Genetics Unit, MRC Harwell Institute, Oxfordshire, OX11 0RD, UK
| | - Cecilia M Lindgren
- Li Ka Shing Centre for Health Information and Discovery, Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Medical and Population Genetics Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| |
Collapse
|
34
|
LaPolice TM, Huang YF. An unsupervised deep learning framework for predicting human essential genes from population and functional genomic data. BMC Bioinformatics 2023; 24:347. [PMID: 37723435 PMCID: PMC10506225 DOI: 10.1186/s12859-023-05481-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 09/13/2023] [Indexed: 09/20/2023] Open
Abstract
BACKGROUND The ability to accurately predict essential genes intolerant to loss-of-function (LOF) mutations can dramatically improve the identification of disease-associated genes. Recently, there have been numerous computational methods developed to predict human essential genes from population genomic data. While the existing methods are highly predictive of essential genes of long length, they have limited power in pinpointing short essential genes due to the sparsity of polymorphisms in the human genome. RESULTS Motivated by the premise that population and functional genomic data may provide complementary evidence for gene essentiality, here we present an evolution-based deep learning model, DeepLOF, to predict essential genes in an unsupervised manner. Unlike previous population genetic methods, DeepLOF utilizes a novel deep learning framework to integrate both population and functional genomic data, allowing us to pinpoint short essential genes that can hardly be predicted from population genomic data alone. Compared with previous methods, DeepLOF shows unmatched performance in predicting ClinGen haploinsufficient genes, mouse essential genes, and essential genes in human cell lines. Notably, at a false positive rate of 5%, DeepLOF detects 50% more ClinGen haploinsufficient genes than previous methods. Furthermore, DeepLOF discovers 109 novel essential genes that are too short to be identified by previous methods. CONCLUSION The predictive power of DeepLOF shows that it is a compelling computational method to aid in the discovery of essential genes.
Collapse
Affiliation(s)
- Troy M LaPolice
- Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA.
- Bioinformatics and Genomics Graduate Program, Pennsylvania State University, University Park, PA, 16802, USA.
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA.
| | - Yi-Fei Huang
- Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA.
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA.
| |
Collapse
|
35
|
Wang L, Zheng Y, Sun Y, Mao S, Li H, Bo X, Li C, Chen H. TimeTalk uses single-cell RNA-seq datasets to decipher cell-cell communication during early embryo development. Commun Biol 2023; 6:901. [PMID: 37660148 PMCID: PMC10475079 DOI: 10.1038/s42003-023-05283-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 08/24/2023] [Indexed: 09/04/2023] Open
Abstract
Early embryonic development is a dynamic process that relies on proper cell-cell communication to form a correctly patterned embryo. Early embryo development-related ligand-receptor pairs (eLRs) have been shown to guide cell fate decisions and morphogenesis. However, the scope of eLRs and their influence on early embryo development remain elusive. Here, we developed a computational framework named TimeTalk from integrated public time-course mouse scRNA-seq datasets to decipher the secret of eLRs. Extensive validations and analyses were performed to ensure the involvement of identified eLRs in early embryo development. Process analysis identified that eLRs could be divided into six temporal windows corresponding to sequential events in the early embryo development process. With the interpolation strategy, TimeTalk is powerful in revealing paracrine settings and studying cell-cell communication during early embryo development. Furthermore, by using TimeTalk in the blastocyst and blastoid models, we found that the blastoid models share the core communication pathways with the epiblast and primitive endoderm lineages in the blastocysts. This result suggests that TimeTalk has transferability to other bio-dynamic processes. We also curated eLRs recognized by TimeTalk, which may provide valuable clues for understanding early embryo development and relevant disorders.
Collapse
Affiliation(s)
- Longteng Wang
- Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, School of Life Sciences, Peking University, Beijing, 100871, China
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
| | - Yang Zheng
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Yu Sun
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Shulin Mao
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
- Yuanpei College, Peking University, Beijing, 100871, China
| | - Hao Li
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Xiaochen Bo
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
| | - Cheng Li
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China.
| | - Hebing Chen
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China.
| |
Collapse
|
36
|
Cacheiro P, Smedley D. Essential genes: a cross-species perspective. Mamm Genome 2023; 34:357-363. [PMID: 36897351 PMCID: PMC10382395 DOI: 10.1007/s00335-023-09984-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 02/17/2023] [Indexed: 03/11/2023]
Abstract
Protein coding genes exhibit different degrees of intolerance to loss-of-function variation. The most intolerant genes, whose function is essential for cell or/and organism survival, inform on fundamental biological processes related to cell proliferation and organism development and provide a window on the molecular mechanisms of human disease. Here we present a brief overview of the resources and knowledge gathered around gene essentiality, from cancer cell lines to model organisms to human development. We outline the implications of using different sources of evidence and definitions to determine which genes are essential and highlight how information on the essentiality status of a gene can inform novel disease gene discovery and therapeutic target identification.
Collapse
Affiliation(s)
- Pilar Cacheiro
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Damian Smedley
- William Harvey Research Institute, Queen Mary University of London, London, UK.
| |
Collapse
|
37
|
Krill-Burger JM, Dempster JM, Borah AA, Paolella BR, Root DE, Golub TR, Boehm JS, Hahn WC, McFarland JM, Vazquez F, Tsherniak A. Partial gene suppression improves identification of cancer vulnerabilities when CRISPR-Cas9 knockout is pan-lethal. Genome Biol 2023; 24:192. [PMID: 37612728 PMCID: PMC10464129 DOI: 10.1186/s13059-023-03020-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 07/21/2023] [Indexed: 08/25/2023] Open
Abstract
BACKGROUND Hundreds of functional genomic screens have been performed across a diverse set of cancer contexts, as part of efforts such as the Cancer Dependency Map, to identify gene dependencies-genes whose loss of function reduces cell viability or fitness. Recently, large-scale screening efforts have shifted from RNAi to CRISPR-Cas9, due to superior efficacy and specificity. However, many effective oncology drugs only partially inhibit their protein targets, leading us to question whether partial suppression of genes using RNAi could reveal cancer vulnerabilities that are missed by complete knockout using CRISPR-Cas9. Here, we compare CRISPR-Cas9 and RNAi dependency profiles of genes across approximately 400 matched cancer cell lines. RESULTS We find that CRISPR screens accurately identify more gene dependencies per cell line, but the majority of each cell line's dependencies are part of a set of 1867 genes that are shared dependencies across the entire collection (pan-lethals). While RNAi knockdown of about 30% of these genes is also pan-lethal, approximately 50% have selective dependency patterns across cell lines, suggesting they could still be cancer vulnerabilities. The accuracy of the unique RNAi selectivity is supported by associations to multi-omics profiles, drug sensitivity, and other expected co-dependencies. CONCLUSIONS Incorporating RNAi data for genes that are pan-lethal knockouts facilitates the discovery of a wider range of gene targets than could be detected using the CRISPR dataset alone. This can aid in the interpretation of contrasting results obtained from CRISPR and RNAi screens and reinforce the importance of partial gene suppression methods in building a cancer dependency map.
Collapse
Affiliation(s)
| | | | - Ashir A Borah
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | | | - David E Root
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Todd R Golub
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Dana-Farber Cancer Institute, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Jesse S Boehm
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - William C Hahn
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Dana-Farber Cancer Institute, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | | | - Francisca Vazquez
- Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Dana-Farber Cancer Institute, Boston, MA, USA.
| | | |
Collapse
|
38
|
Hu Qian S, Shi MW, Wang DY, Fear JM, Chen L, Tu YX, Liu HS, Zhang Y, Zhang SJ, Yu SS, Oliver B, Chen ZX. Integrating massive RNA-seq data to elucidate transcriptome dynamics in Drosophila melanogaster. Brief Bioinform 2023; 24:bbad177. [PMID: 37232385 PMCID: PMC10505420 DOI: 10.1093/bib/bbad177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 04/19/2023] [Accepted: 04/20/2023] [Indexed: 05/27/2023] Open
Abstract
The volume of ribonucleic acid (RNA)-seq data has increased exponentially, providing numerous new insights into various biological processes. However, due to significant practical challenges, such as data heterogeneity, it is still difficult to ensure the quality of these data when integrated. Although some quality control methods have been developed, sample consistency is rarely considered and these methods are susceptible to artificial factors. Here, we developed MassiveQC, an unsupervised machine learning-based approach, to automatically download and filter large-scale high-throughput data. In addition to the read quality used in other tools, MassiveQC also uses the alignment and expression quality as model features. Meanwhile, it is user-friendly since the cutoff is generated from self-reporting and is applicable to multimodal data. To explore its value, we applied MassiveQC to Drosophila RNA-seq data and generated a comprehensive transcriptome atlas across 28 tissues from embryogenesis to adulthood. We systematically characterized fly gene expression dynamics and found that genes with high expression dynamics were likely to be evolutionarily young and expressed at late developmental stages, exhibiting high nonsynonymous substitution rates and low phenotypic severity, and they were involved in simple regulatory programs. We also discovered that human and Drosophila had strong positive correlations in gene expression in orthologous organs, revealing the great potential of the Drosophila system for studying human development and disease.
Collapse
Affiliation(s)
- Sheng Hu Qian
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Meng-Wei Shi
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Dan-Yang Wang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Justin M Fear
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lu Chen
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Yi-Xuan Tu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Hong-Shan Liu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Yuan Zhang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Shuai-Jie Zhang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Shan-Shan Yu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Brian Oliver
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Zhen-Xia Chen
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Interdisciplinary Sciences Institute, Huazhong Agricultural University, Wuhan 430070, China
- Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, Shenzhen 518000, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518000, China
| |
Collapse
|
39
|
Singh AK, Amar I, Ramadasan H, Kappagantula KS, Chavali S. Proteins with amino acid repeats constitute a rapidly evolvable and human-specific essentialome. Cell Rep 2023; 42:112811. [PMID: 37453061 DOI: 10.1016/j.celrep.2023.112811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 05/30/2023] [Accepted: 06/29/2023] [Indexed: 07/18/2023] Open
Abstract
Protein products of essential genes, indispensable for organismal survival, are highly conserved and bring about fundamental functions. Interestingly, proteins that contain amino acid homorepeats that tend to evolve rapidly are enriched in eukaryotic essentialomes. Why are proteins with hypermutable homorepeats enriched in conserved and functionally vital essential proteins? We solve this functional versus evolutionary paradox by demonstrating that human essential proteins with homorepeats bring about crosstalk across biological processes through high interactability and have distinct regulatory functions affecting expansive global regulation. Importantly, essential proteins with homorepeats rapidly diverge with the amino acid substitutions frequently affecting functional sites, likely facilitating rapid adaptability. Strikingly, essential proteins with homorepeats influence human-specific embryonic and brain development, implying that the presence of homorepeats could contribute to the emergence of human-specific processes. Thus, we propose that homorepeat-containing essential proteins affecting species-specific traits can be potential intervention targets across pathologies, including cancers and neurological disorders.
Collapse
Affiliation(s)
- Anjali K Singh
- Department of Biology, Indian Institute of Science Education and Research (IISER) Tirupati, Tirupati 517507, Andhra Pradesh, India
| | - Ishita Amar
- Department of Biology, Indian Institute of Science Education and Research (IISER) Tirupati, Tirupati 517507, Andhra Pradesh, India
| | - Harikrishnan Ramadasan
- Department of Biology, Indian Institute of Science Education and Research (IISER) Tirupati, Tirupati 517507, Andhra Pradesh, India
| | - Keertana S Kappagantula
- Department of Biology, Indian Institute of Science Education and Research (IISER) Tirupati, Tirupati 517507, Andhra Pradesh, India
| | - Sreenivas Chavali
- Department of Biology, Indian Institute of Science Education and Research (IISER) Tirupati, Tirupati 517507, Andhra Pradesh, India.
| |
Collapse
|
40
|
Heshmatzad K, Naderi N, Maleki M, Abbasi S, Ghasemi S, Ashrafi N, Fazelifar AF, Mahdavi M, Kalayinia S. Role of non-coding variants in cardiovascular disease. J Cell Mol Med 2023; 27:1621-1636. [PMID: 37183561 PMCID: PMC10273088 DOI: 10.1111/jcmm.17762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 03/29/2023] [Accepted: 04/25/2023] [Indexed: 05/16/2023] Open
Abstract
Cardiovascular diseases (CVDs) constitute one of the significant causes of death worldwide. Different pathological states are linked to CVDs, which despite interventions and treatments, still have poor prognoses. The genetic component, as a beneficial tool in the risk stratification of CVD development, plays a role in the pathogenesis of this group of diseases. The emergence of genome-wide association studies (GWAS) have led to the identification of non-coding parts associated with cardiovascular traits and disorders. Variants located in functional non-coding regions, including promoters/enhancers, introns, miRNAs and 5'/3' UTRs, account for 90% of all identified single-nucleotide polymorphisms associated with CVDs. Here, for the first time, we conducted a comprehensive review on the reported non-coding variants for different CVDs, including hypercholesterolemia, cardiomyopathies, congenital heart diseases, thoracic aortic aneurysms/dissections and coronary artery diseases. Additionally, we present the most commonly reported genes involved in each CVD. In total, 1469 non-coding variants constitute most reports on familial hypercholesterolemia, hypertrophic cardiomyopathy and dilated cardiomyopathy. The application and identification of non-coding variants are beneficial for the genetic diagnosis and better therapeutic management of CVDs.
Collapse
Affiliation(s)
- Katayoun Heshmatzad
- Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| | - Niloofar Naderi
- Cardiogenetic Research Center, Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| | - Majid Maleki
- Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| | - Shiva Abbasi
- Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| | - Serwa Ghasemi
- Cardiogenetic Research Center, Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| | - Nooshin Ashrafi
- Cardiogenetic Research Center, Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| | - Amir Farjam Fazelifar
- Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| | - Mohammad Mahdavi
- Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| | - Samira Kalayinia
- Cardiogenetic Research Center, Rajaie Cardiovascular Medical and Research CenterIran University of Medical SciencesTehranIran
| |
Collapse
|
41
|
Hara Y, Kuraku S. The impact of local genomic properties on the evolutionary fate of genes. eLife 2023; 12:82290. [PMID: 37223962 DOI: 10.7554/elife.82290] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 04/25/2023] [Indexed: 05/25/2023] Open
Abstract
Functionally indispensable genes are likely to be retained and otherwise to be lost during evolution. This evolutionary fate of a gene can also be affected by factors independent of gene dispensability, including the mutability of genomic positions, but such features have not been examined well. To uncover the genomic features associated with gene loss, we investigated the characteristics of genomic regions where genes have been independently lost in multiple lineages. With a comprehensive scan of gene phylogenies of vertebrates with a careful inspection of evolutionary gene losses, we identified 813 human genes whose orthologs were lost in multiple mammalian lineages: designated 'elusive genes.' These elusive genes were located in genomic regions with rapid nucleotide substitution, high GC content, and high gene density. A comparison of the orthologous regions of such elusive genes across vertebrates revealed that these features had been established before the radiation of the extant vertebrates approximately 500 million years ago. The association of human elusive genes with transcriptomic and epigenomic characteristics illuminated that the genomic regions containing such genes were subject to repressive transcriptional regulation. Thus, the heterogeneous genomic features driving gene fates toward loss have been in place and may sometimes have relaxed the functional indispensability of such genes. This study sheds light on the complex interplay between gene function and local genomic properties in shaping gene evolution that has persisted since the vertebrate ancestor.
Collapse
Affiliation(s)
- Yuichiro Hara
- Research Center for Genome & Medical Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| | - Shigehiro Kuraku
- Molecular Life History Laboratory, Department of Genomics and Evolutionary Biology, National Institute of Genetics, Mishima, Japan
- Department of Genetics, Sokendai (Graduate University for Advanced Studies), Mishima, Japan
- RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| |
Collapse
|
42
|
Khan SU, Saeed S, Alsuhaibani AM, Fatima S, Ur Rehman K, Zaman U, Ullah M, Refati MS, Lu K. Advances and Challenges for GWAS Analysis in Cardiac Diseases: A Focus on Coronary Artery Disease (CAD). Curr Probl Cardiol 2023:101821. [PMID: 37211304 DOI: 10.1016/j.cpcardiol.2023.101821] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Accepted: 05/16/2023] [Indexed: 05/23/2023]
Abstract
The achievement of genome-wide association studies (GWAS) has rapidly progressed our understanding of the etiology of coronary artery disease (CAD). It unlocks new strategies to strengthen the stalling of CAD drug development. In this review, we highlighted the recent drawbacks, mainly pointing out those involved in identifying causal genes and interpreting the connections between disease pathology and risk variants. We also benchmark the novel insights into the biological mechanism behind the disease primarily based on outcomes of GWAS. Furthermore, we also shed light on the successful discovery of novel treatment targets by introducing various layers of "omics" data and applying systems genetics strategies. Lastly, we discuss in-depth the significance of precision medicine that is helpful to improve through GWAS analysis in cardiovascular research.
Collapse
Affiliation(s)
- Shahid Ullah Khan
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China; Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing 400715, China; Women Medical and Dental College, Khyber Medical University, Peshawar, KPK, Pakistan
| | - Sumbul Saeed
- Centre for Cell Factories and Biopolymers (CCFB), Griffith Institute for Drug Discovery, Griffith University, Nathan, QLD, 4111, Australia
| | - Amnah Mohammed Alsuhaibani
- Department of Physical Sport Science, College of Education, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Sumaya Fatima
- Research Center, The Fourth Hospital of Hebei Medical University, Shijiazhuang, Hebei 050011, China
| | - Khalil Ur Rehman
- Institute of Chemical Sciences, Gomal University, Dera Ismail Khan 29050, Pakistan
| | - Umber Zaman
- Institute of Chemical Sciences, Gomal University, Dera Ismail Khan 29050, Pakistan
| | - Muneeb Ullah
- Department of Pharmacy, Kohat University of Science and Technology, 26000, KPK, Pakistan
| | - Moamen S Refati
- Department of Chemistry, College of Science, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
| | - Kun Lu
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China; Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing 400715, China.
| |
Collapse
|
43
|
Mfarej MG, Hyland CA, Sanchez AC, Falk MM, Iovine MK, Skibbens RV. Cohesin: an emerging master regulator at the heart of cardiac development. Mol Biol Cell 2023; 34:rs2. [PMID: 36947206 PMCID: PMC10162415 DOI: 10.1091/mbc.e22-12-0557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 03/10/2023] [Accepted: 03/17/2023] [Indexed: 03/23/2023] Open
Abstract
Cohesins are ATPase complexes that play central roles in cellular processes such as chromosome division, DNA repair, and gene expression. Cohesinopathies arise from mutations in cohesin proteins or cohesin complex regulators and encompass a family of related developmental disorders that present with a range of severe birth defects, affect many different physiological systems, and often lead to embryonic fatality. Treatments for cohesinopathies are limited, in large part due to the lack of understanding of cohesin biology. Thus, characterizing the signaling networks that lie upstream and downstream of cohesin-dependent pathways remains clinically relevant. Here, we highlight alterations in cohesins and cohesin regulators that result in cohesinopathies, with a focus on cardiac defects. In addition, we suggest a novel and more unifying view regarding the mechanisms through which cohesinopathy-based heart defects may arise.
Collapse
Affiliation(s)
- Michael G. Mfarej
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015
| | - Caitlin A. Hyland
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015
| | - Annie C. Sanchez
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015
| | - Matthias M. Falk
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015
| | - M. Kathryn Iovine
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015
| | - Robert V. Skibbens
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015
| |
Collapse
|
44
|
Bartlett J. Random with Respect to Fitness or External Selection? An Important but Often Overlooked Distinction. Acta Biotheor 2023; 71:12. [PMID: 36933070 DOI: 10.1007/s10441-023-09464-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Accepted: 03/03/2023] [Indexed: 03/19/2023]
Abstract
Mutations are often described as being "random with respect to fitness." Here we show that the experiments used to establish randomness with respect to fitness are only capable of showing that mutations are random with respect to current external selection. Current debates about whether or not mutations are directed may be at least partially resolved by making use of this distinction. Additionally, this distinction has important mathematical, experimental, and inferential implications.
Collapse
|
45
|
Thakur A, Kumar M. Integration of Human and Viral miRNAs in Epstein-Barr Virus-Associated Tumors and Implications for Drug Repurposing. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2023; 27:93-108. [PMID: 36927073 DOI: 10.1089/omi.2023.0005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]
Abstract
Epstein-Barr virus (EBV) is associated with several tumors, and has substantial relevance for public health. Therapeutics innovation for EBV-related disorders is much needed. In this context, miRNAs are noncoding RNA molecules that play vital roles in EBV infection. miRNA-Seq and RNA-Seq data for EBV-associated clinical samples and cell lines have been generated, but their detailed integrative analyses, and exploitation for drug repurposing against EBV are lacking. Hence, we identified and analyzed the differentially expressed miRNAs (DEmiRs) in EBV-infected cell lines (28) and infected (28) and uninfected human tissue (20) samples using an in-house pipeline. We found significantly enriched host miRNAs like hsa-mir-3651, hsa-mir-1248, and hsa-mir-29c-3p in EBV-infected samples from EBV-associated nasopharyngeal carcinoma and Hodgkin's lymphoma, among others. Furthermore, we also identified significantly enriched novel miRNAs such as hsa-mir-29c-3p, hsa-mir-3651, and hsa-mir-98-3p, which were not previously reported in EBV-related tumors. Differentially expressed mRNAs (DEMs) were identified in EBV-infected cell lines (21) and uninfected human tissue (14) samples. We predicted and selected 1572 DEMs (upregulated) that are targeted by 547 DEmiRs (downregulated). These were further classified into essential (870) and nonessential (702) genes. Moreover, a miRNA-mRNA network was developed for the hub miRNAs. Importantly, we used the DEMs during EBV latent infection types I, II, and III to identify the candidate drugs for repurposing: Glyburide, Levodopa, Nateglinide, and Stiripentol, among others. To the best of our knowledge, this is the first integrative analyses that identified DEmiRs and DEMs as potential therapeutic targets and predicted drugs as potential candidates for repurposing against EBV-related tumors.
Collapse
Affiliation(s)
- Anamika Thakur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| |
Collapse
|
46
|
Cruz Marino T, Leblanc J, Pratte A, Tardif J, Thomas MJ, Fortin CA, Girard L, Bouchard L. Portrait of autosomal recessive diseases in the French-Canadian founder population of Saguenay-Lac-Saint-Jean. Am J Med Genet A 2023; 191:1145-1163. [PMID: 36786328 DOI: 10.1002/ajmg.a.63147] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 01/16/2023] [Accepted: 01/30/2023] [Indexed: 02/15/2023]
Abstract
The population of the Saguenay-Lac-Saint-Jean (SLSJ) region, located in the province of Quebec, Canada, is recognized as a founder population, where some rare autosomal recessive diseases show a high prevalence. Through the clinical and molecular study of 82 affected individuals from 60 families, this study outlines 12 diseases identified as recurrent in SLSJ. Their carrier frequency was estimated with the contribution of 1059 healthy individuals, increasing the number of autosomal recessive diseases with known carrier frequency in this region from 14 to 25. We review the main clinical and molecular features previously reported for these disorders. Five of the studied diseases have a potential lethal effect and three are associated with intellectual deficiency. Therefore, we believe that the provincial program for carrier screening should be extended to include these eight disorders. The high-carrier frequency, together with the absence of consanguinity in most of these unrelated families, suggest a founder effect and genetic drift for the 12 recurrent variants. We recommend further studies to validate this hypothesis, as well as to extend the present study to other regions in the province of Quebec, since some of these disorders could also be present in other French-Canadian families.
Collapse
Affiliation(s)
- Tania Cruz Marino
- Department of Laboratory Medicine, CIUSSS Saguenay-Lac-St-Jean, Quebec, Canada
| | - Josianne Leblanc
- Department of Laboratory Medicine, CIUSSS Saguenay-Lac-St-Jean, Quebec, Canada
| | - Annabelle Pratte
- Department of Laboratory Medicine, CIUSSS Saguenay-Lac-St-Jean, Quebec, Canada
| | - Jessica Tardif
- Department of Laboratory Medicine, CIUSSS Saguenay-Lac-St-Jean, Quebec, Canada
| | | | - Carol-Ann Fortin
- Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences (FMHS), Université de Sherbrooke, Quebec, Canada
| | - Lysanne Girard
- Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences (FMHS), Université de Sherbrooke, Quebec, Canada
| | - Luigi Bouchard
- Department of Laboratory Medicine, CIUSSS Saguenay-Lac-St-Jean, Quebec, Canada.,Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences (FMHS), Université de Sherbrooke, Quebec, Canada
| |
Collapse
|
47
|
Molecular Landscape of Tourette's Disorder. Int J Mol Sci 2023; 24:ijms24021428. [PMID: 36674940 PMCID: PMC9865021 DOI: 10.3390/ijms24021428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 12/29/2022] [Accepted: 01/01/2023] [Indexed: 01/12/2023] Open
Abstract
Tourette's disorder (TD) is a highly heritable childhood-onset neurodevelopmental disorder and is caused by a complex interplay of multiple genetic and environmental factors. Yet, the molecular mechanisms underlying the disorder remain largely elusive. In this study, we used the available omics data to compile a list of TD candidate genes, and we subsequently conducted tissue/cell type specificity and functional enrichment analyses of this list. Using genomic data, we also investigated genetic sharing between TD and blood and cerebrospinal fluid (CSF) metabolite levels. Lastly, we built a molecular landscape of TD through integrating the results from these analyses with an extensive literature search to identify the interactions between the TD candidate genes/proteins and metabolites. We found evidence for an enriched expression of the TD candidate genes in four brain regions and the pituitary. The functional enrichment analyses implicated two pathways ('cAMP-mediated signaling' and 'Endocannabinoid Neuronal Synapse Pathway') and multiple biological functions related to brain development and synaptic transmission in TD etiology. Furthermore, we found genetic sharing between TD and the blood and CSF levels of 39 metabolites. The landscape of TD not only provides insights into the (altered) molecular processes that underlie the disease but, through the identification of potential drug targets (such as FLT3, NAALAD2, CX3CL1-CX3CR1, OPRM1, and HRH2), it also yields clues for developing novel TD treatments.
Collapse
|
48
|
Li Y, Zeng M, Zhang F, Wu FX, Li M. DeepCellEss: cell line-specific essential protein prediction with attention-based interpretable deep learning. Bioinformatics 2023; 39:btac779. [PMID: 36458923 PMCID: PMC9825760 DOI: 10.1093/bioinformatics/btac779] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 11/25/2022] [Accepted: 12/01/2022] [Indexed: 12/05/2022] Open
Abstract
MOTIVATION Protein essentiality is usually accepted to be a conditional trait and strongly affected by cellular environments. However, existing computational methods often do not take such characteristics into account, preferring to incorporate all available data and train a general model for all cell lines. In addition, the lack of model interpretability limits further exploration and analysis of essential protein predictions. RESULTS In this study, we proposed DeepCellEss, a sequence-based interpretable deep learning framework for cell line-specific essential protein predictions. DeepCellEss utilizes a convolutional neural network and bidirectional long short-term memory to learn short- and long-range latent information from protein sequences. Further, a multi-head self-attention mechanism is used to provide residue-level model interpretability. For model construction, we collected extremely large-scale benchmark datasets across 323 cell lines. Extensive computational experiments demonstrate that DeepCellEss yields effective prediction performance for different cell lines and outperforms existing sequence-based methods as well as network-based centrality measures. Finally, we conducted some case studies to illustrate the necessity of considering specific cell lines and the superiority of DeepCellEss. We believe that DeepCellEss can serve as a useful tool for predicting essential proteins across different cell lines. AVAILABILITY AND IMPLEMENTATION The DeepCellEss web server is available at http://csuligroup.com:8000/DeepCellEss. The source code and data underlying this study can be obtained from https://github.com/CSUBioGroup/DeepCellEss. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yiming Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Min Zeng
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Fuhao Zhang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Fang-Xiang Wu
- Division of Biomedical Engineering, Department of Computer Science, Department of Mechanical Engineering University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| |
Collapse
|
49
|
Manzo M, Giordano M, Maddalena L, Guarracino MR, Granata I. Novel Data Science Methodologies for Essential Genes Identification Based on Network Analysis. STUDIES IN COMPUTATIONAL INTELLIGENCE 2023:117-145. [DOI: 10.1007/978-3-031-24453-7_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
|
50
|
Zheng C, Zhang J, Jiang F, Li D, Huang C, Guo X, Zhu X, Tan S. Clinical Significance of TUBGCP4 Expression in Hepatocellular Carcinoma. Anal Cell Pathol (Amst) 2022; 2022:9307468. [PMID: 36530949 PMCID: PMC9754849 DOI: 10.1155/2022/9307468] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 11/06/2022] [Accepted: 11/21/2022] [Indexed: 01/04/2025] Open
Abstract
We aim to investigate the expression and clinical significance of the tubulin gamma complex-associated protein 4 (TUBGCP4) in hepatocellular carcinoma (HCC). The mRNA expression of TUBGCP4 in HCC tissues was analyzed using The Cancer Genome Atlas (TCGA) database. Paired HCC and adjacent nontumor tissues were obtained from HCC patients to measure the protein expression of TUBGCP4 by immunohistochemistry (IHC) and to analyze the relationship between TUBGCP4 protein expression and the clinicopathological characteristics and the prognosis of HCC patients. We found that TUBGCP4 mRNA expression was upregulated in HCC tissues from TCGA database. IHC analysis showed that TUBGCP4 was positively expressed in 61.25% (49/80) of HCC tissues and 77.5% (62/80) of adjacent nontumor tissues. The Chi-square analysis indicated that the positive rate of TUBGCP4 expression between HCC tissues and the adjacent nontumor tissues was statistically different (P < 0.05). Furthermore, we found that TUBGCP4 protein expression was correlated with carbohydrate antigen (CA-199) levels of HCC patients (P < 0.05). Further, survival analysis showed that the overall survival time and tumor-free survival time in the TUBGCP4 positive group were significantly higher than those of the negative group (P < 0.05), indicating that the positive expression of TUBGCP4 was related to a better prognosis of HCC patients. COX model showed that TUBGCP4 was an independent prognostic factor for HCC patients. Our study indicates that TUBGCP4 protein expression is downregulated in HCC tissues and has a relationship with the prognosis of HCC patients.
Collapse
Affiliation(s)
- Chuanjun Zheng
- Department of Epidemiology and Health Statistics, School of Public Health, Guilin Medical University, Guilin, 541199 Guangxi, China
| | - Jiaxi Zhang
- Department of Epidemiology and Health Statistics, School of Public Health, Guilin Medical University, Guilin, 541199 Guangxi, China
| | - Fusheng Jiang
- Guilin Center for Disease Control and Prevention, Guilin, 541001 Guangxi, China
| | - Di Li
- Department of Epidemiology and Health Statistics, School of Public Health, Guilin Medical University, Guilin, 541199 Guangxi, China
| | - Caimei Huang
- Department of Epidemiology and Health Statistics, School of Public Health, Guilin Medical University, Guilin, 541199 Guangxi, China
| | - Xuefeng Guo
- Department of Epidemiology and Health Statistics, School of Public Health, Guilin Medical University, Guilin, 541199 Guangxi, China
| | - Xiaonian Zhu
- Department of Epidemiology and Health Statistics, School of Public Health, Guilin Medical University, Guilin, 541199 Guangxi, China
| | - Shengkui Tan
- Department of Epidemiology and Health Statistics, School of Public Health, Guilin Medical University, Guilin, 541199 Guangxi, China
- Department of Epidemiology and Health Statistics, School of Public Health, Central South University, Changsha 410005, China
| |
Collapse
|