1
|
Garcia MF, Retallick-Townsley K, Pruitt A, Davidson E, Dai Y, Fitzpatrick SE, Sen A, Cohen S, Livoti O, Khan S, Dossou G, Cheung J, Deans PJM, Wang Z, Huckins L, Hoffman E, Brennand K. Dynamic convergence of autism disorder risk genes across neurodevelopment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.23.609190. [PMID: 39229156 PMCID: PMC11370590 DOI: 10.1101/2024.08.23.609190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
Over a hundred risk genes underlie risk for autism spectrum disorder (ASD) but the extent to which they converge on shared downstream targets to increase ASD risk is unknown. To test the hypothesis that cellular context impacts the nature of convergence, here we apply a pooled CRISPR approach to target 29 ASD loss-of-function genes in human induced pluripotent stem cell (hiPSC)-derived neural progenitor cells, glutamatergic neurons, and GABAergic neurons. Two distinct approaches (gene-level and network-level analyses) demonstrate that convergence is greatest in mature glutamatergic neurons. Convergent effects are dynamic, varying in strength, composition, and biological role between cell types, increasing with functional similarity of the ASD genes examined, and driven by cell-type-specific gene co-expression patterns. Stratification of ASD genes yield targeted drug predictions capable of reversing gene-specific convergent signatures in human cells and ASD-related behaviors in zebrafish. Altogether, convergent networks downstream of ASD risk genes represent novel points of individualized therapeutic intervention.
Collapse
Affiliation(s)
- Meilin Fernandez Garcia
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
| | - Kayla Retallick-Townsley
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - April Pruitt
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06511
| | - Elizabeth Davidson
- Child Study Center, Yale University School of Medicine, New Haven, CT 06511
| | - Yi Dai
- Child Study Center, Yale University School of Medicine, New Haven, CT 06511
| | - Sarah E Fitzpatrick
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06511
| | - Annabel Sen
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
| | - Sophie Cohen
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
| | - Olivia Livoti
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
| | - Suha Khan
- Child Study Center, Yale University School of Medicine, New Haven, CT 06511
| | - Grace Dossou
- Child Study Center, Yale University School of Medicine, New Haven, CT 06511
| | - Jen Cheung
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
| | - P J Michael Deans
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
| | - Zuoheng Wang
- Child Study Center, Yale University School of Medicine, New Haven, CT 06511
| | - Laura Huckins
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06511
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Ellen Hoffman
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06511
- Child Study Center, Yale University School of Medicine, New Haven, CT 06511
| | - Kristen Brennand
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06511
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| |
Collapse
|
2
|
Yuan H, Mancuso CA, Johnson K, Braasch I, Krishnan A. Computational strategies for cross-species knowledge transfer and translational biomedicine. ARXIV 2024:arXiv:2408.08503v1. [PMID: 39184546 PMCID: PMC11343225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Research organisms provide invaluable insights into human biology and diseases, serving as essential tools for functional experiments, disease modeling, and drug testing. However, evolutionary divergence between humans and research organisms hinders effective knowledge transfer across species. Here, we review state-of-the-art methods for computationally transferring knowledge across species, primarily focusing on methods that utilize transcriptome data and/or molecular networks. We introduce the term "agnology" to describe the functional equivalence of molecular components regardless of evolutionary origin, as this concept is becoming pervasive in integrative data-driven models where the role of evolutionary origin can become unclear. Our review addresses four key areas of information and knowledge transfer across species: (1) transferring disease and gene annotation knowledge, (2) identifying agnologous molecular components, (3) inferring equivalent perturbed genes or gene sets, and (4) identifying agnologous cell types. We conclude with an outlook on future directions and several key challenges that remain in cross-species knowledge transfer.
Collapse
Affiliation(s)
- Hao Yuan
- Genetics and Genome Science Program; Ecology, Evolution, and Behavior Program, Michigan State University
| | - Christopher A. Mancuso
- Department of Biostatistics & Informatics, University of Colorado Anschutz Medical Campus
| | - Kayla Johnson
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus
| | - Ingo Braasch
- Department of Integrative Biology; Genetics and Genome Science Program; Ecology, Evolution, and Behavior Program, Michigan State University
| | - Arjun Krishnan
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus
| |
Collapse
|
3
|
Iruzubieta P, Damborenea A, Ioghen M, Bajew S, Fernandez-Torrón R, Töpf A, Herrero-Reiriz Á, Epure D, Vill K, Hernández-Laín A, Manterola M, Azkargorta M, Pikatza-Menoio O, Pérez-Fernandez L, García-Puga M, Gaina G, Bastian A, Streata I, Walter MC, Müller-Felber W, Thiele S, Moragón S, Bastida-Lertxundi N, López-Cortajarena A, Elortza F, Gereñu G, Alonso-Martin S, Straub V, de Sancho D, Teleanu R, López de Munain A, Blázquez L. Biallelic variants in SNUPN cause a limb girdle muscular dystrophy with myofibrillar-like features. Brain 2024; 147:2867-2883. [PMID: 38366623 PMCID: PMC11292911 DOI: 10.1093/brain/awae046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 01/20/2024] [Accepted: 01/26/2024] [Indexed: 02/18/2024] Open
Abstract
Alterations in RNA-splicing are a molecular hallmark of several neurological diseases, including muscular dystrophies, where mutations in genes involved in RNA metabolism or characterized by alterations in RNA splicing have been described. Here, we present five patients from two unrelated families with a limb-girdle muscular dystrophy (LGMD) phenotype carrying a biallelic variant in SNUPN gene. Snurportin-1, the protein encoded by SNUPN, plays an important role in the nuclear transport of small nuclear ribonucleoproteins (snRNPs), essential components of the spliceosome. We combine deep phenotyping, including clinical features, histopathology and muscle MRI, with functional studies in patient-derived cells and muscle biopsies to demonstrate that variants in SNUPN are the cause of a new type of LGMD according to current definition. Moreover, an in vivo model in Drosophila melanogaster further supports the relevance of Snurportin-1 in muscle. SNUPN patients show a similar phenotype characterized by proximal weakness starting in childhood, restrictive respiratory dysfunction and prominent contractures, although inter-individual variability in terms of severity even in individuals from the same family was found. Muscle biopsy showed myofibrillar-like features consisting of myotilin deposits and Z-disc disorganization. MRI showed predominant impairment of paravertebral, vasti, sartorius, gracilis, peroneal and medial gastrocnemius muscles. Conservation and structural analyses of Snurportin-1 p.Ile309Ser variant suggest an effect in nuclear-cytosol snRNP trafficking. In patient-derived fibroblasts and muscle, cytoplasmic accumulation of snRNP components is observed, while total expression of Snurportin-1 and snRNPs remains unchanged, which demonstrates a functional impact of SNUPN variant in snRNP metabolism. Furthermore, RNA-splicing analysis in patients' muscle showed widespread splicing deregulation, in particular in genes relevant for muscle development and splicing factors that participate in the early steps of spliceosome assembly. In conclusion, we report that SNUPN variants are a new cause of limb girdle muscular dystrophy with specific clinical, histopathological and imaging features, supporting SNUPN as a new gene to be included in genetic testing of myopathies. These results further support the relevance of splicing-related proteins in muscle disorders.
Collapse
Affiliation(s)
- Pablo Iruzubieta
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- Department of Neurology, Donostia University Hospital, Osakidetza Basque Health Service, 20014 San Sebastián, Spain
- CIBERNED, ISCIII (CIBER, Carlos III Institute, Spanish Ministry of Sciences and Innovation), 28031, Madrid, Spain
| | - Alberto Damborenea
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
| | - Mihaela Ioghen
- Clinical Neurosciences Department, Faculty of Medicine, Carol Davila University of Medicine and Pharmacy, Paediatric Neurology, 020021 Bucharest, Romania
| | - Simon Bajew
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
| | - Roberto Fernandez-Torrón
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- Department of Neurology, Donostia University Hospital, Osakidetza Basque Health Service, 20014 San Sebastián, Spain
| | - Ana Töpf
- John Walton Muscular Dystrophy Research Centre, Newcastle University and Newcastle Hospitals NHS Foundation Trust, NE4 5NR Newcastle Upon Tyne, UK
| | - Álvaro Herrero-Reiriz
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
| | - Diana Epure
- Department of Paediatric Neurology, Doctor Victor Gomoiu Children’s Hospital, 022102 Bucharest, Romania
| | - Katharina Vill
- Department of Pediatric Neurology and Developmental Medicine and LMU Center for Children with Medical Complexity, Dr. von Hauner Children’s Hospital, LMU University Hospital, Ludwig-Maximilians-University Munich, 80539 Munich, Germany
- Institute of Human Genetics, School of Medicine, Technical University of Munich, 81675 Munich, Germany
| | - Aurelio Hernández-Laín
- Neuropathology Unit, Department of Pathology, 12 de Octubre University Hospital, 28041 Madrid, Spain
- Department of Neuro-oncology, Instituto de Investigación Sanitaria imas12, Hospital Universitario 12 de Octubre, 28041 Madrid, Spain
- Universidad Complutense de Madrid, Facultad de Medicina, 28040 Madrid, Spain
| | - María Manterola
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
| | - Mikel Azkargorta
- Proteomics Platform, CIC bioGUNE, Basque Research and Technology Alliance (BRTA), 48160 Derio, Spain
- Centre for the Study of Liver and Gastrointestinal Diseases (CIBERehd), Instituto de Salud Carlos III (ISCIII), 28029 Madrid, Spain
| | - Oihane Pikatza-Menoio
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- CIBERNED, ISCIII (CIBER, Carlos III Institute, Spanish Ministry of Sciences and Innovation), 28031, Madrid, Spain
| | - Laura Pérez-Fernandez
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), 20014 San Sebastián, Spain
| | - Mikel García-Puga
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- CIBERNED, ISCIII (CIBER, Carlos III Institute, Spanish Ministry of Sciences and Innovation), 28031, Madrid, Spain
| | - Gisela Gaina
- Department of Cell Biology, Neurosciences and Experimental Myology, Victor Babes National Institute of Pathology, 050096 Bucharest, Romania
| | - Alexandra Bastian
- Department of Pathology, Colentina Clinical Hospital, 020125 Bucharest, Romania
| | - Ioana Streata
- Human Genomics Laboratory, Regional Centre of Medical Genetics, Craiova University of Medicine and Pharmacy, 200349 Dolj, Romania
| | - Maggie C Walter
- Friedrich Baur Institute at the Department of Neurology, LMU University Hospital, Ludwig-Maximilians-University Munich, 80539 Munich, Germany
| | - Wolfgang Müller-Felber
- Institute of Human Genetics, School of Medicine, Technical University of Munich, 81675 Munich, Germany
| | - Simone Thiele
- Friedrich Baur Institute at the Department of Neurology, LMU University Hospital, Ludwig-Maximilians-University Munich, 80539 Munich, Germany
| | - Saioa Moragón
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
| | - Nerea Bastida-Lertxundi
- Department of Clinical Genetics, Donostia University Hospital, Osakidetza Basque Health Service, 20014 San Sebastián, Spain
| | - Aitziber López-Cortajarena
- Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), 20014 San Sebastián, Spain
- Ikerbasque, Basque Foundation for Science, 48009 Bilbao, Spain
| | - Felix Elortza
- Proteomics Platform, CIC bioGUNE, Basque Research and Technology Alliance (BRTA), 48160 Derio, Spain
- Centre for the Study of Liver and Gastrointestinal Diseases (CIBERehd), Instituto de Salud Carlos III (ISCIII), 28029 Madrid, Spain
| | - Gorka Gereñu
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- CIBERNED, ISCIII (CIBER, Carlos III Institute, Spanish Ministry of Sciences and Innovation), 28031, Madrid, Spain
- Ikerbasque, Basque Foundation for Science, 48009 Bilbao, Spain
| | - Sonia Alonso-Martin
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- CIBERNED, ISCIII (CIBER, Carlos III Institute, Spanish Ministry of Sciences and Innovation), 28031, Madrid, Spain
| | - Volker Straub
- John Walton Muscular Dystrophy Research Centre, Newcastle University and Newcastle Hospitals NHS Foundation Trust, NE4 5NR Newcastle Upon Tyne, UK
| | - David de Sancho
- Donostia International Physics Center, 20018 San Sebastián, Spain
- Faculty of Chemistry, University of the Basque Country, 20018 San Sebastián, Spain
| | - Raluca Teleanu
- Clinical Neurosciences Department, Faculty of Medicine, Carol Davila University of Medicine and Pharmacy, Paediatric Neurology, 020021 Bucharest, Romania
| | - Adolfo López de Munain
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- Department of Neurology, Donostia University Hospital, Osakidetza Basque Health Service, 20014 San Sebastián, Spain
- CIBERNED, ISCIII (CIBER, Carlos III Institute, Spanish Ministry of Sciences and Innovation), 28031, Madrid, Spain
- Faculty of Medicine, University of the Basque Country, 20014 San Sebastián, Spain
- Faculty of Medicine, University of Deusto, 48007 Bilbao, Spain
| | - Lorea Blázquez
- Department of Neurosciences, Biogipuzkoa Health Research Institute, 20014 San Sebastián, Spain
- CIBERNED, ISCIII (CIBER, Carlos III Institute, Spanish Ministry of Sciences and Innovation), 28031, Madrid, Spain
- Ikerbasque, Basque Foundation for Science, 48009 Bilbao, Spain
| |
Collapse
|
4
|
Cote AC, Young HE, Huckins LM. Critical reasoning on the co-expression module QTL in the dorsolateral prefrontal cortex. HGG ADVANCES 2024; 5:100311. [PMID: 38773772 PMCID: PMC11214266 DOI: 10.1016/j.xhgg.2024.100311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 05/16/2024] [Accepted: 05/16/2024] [Indexed: 05/24/2024] Open
Abstract
Expression quantitative trait locus (eQTL) analysis is a popular method of gaining insight into the function of regulatory variation. While cis-eQTL resources have been instrumental in linking genome-wide association study variants to gene function, complex trait heritability may be additionally mediated by other forms of gene regulation. Toward this end, novel eQTL methods leverage gene co-expression (module-QTL) to investigate joint regulation of gene modules by single genetic variants. Here we broadly define a "module-QTL" as the association of a genetic variant with a summary measure of gene co-expression. This approach aims to reduce the multiple testing burden of a trans-eQTL search through the consolidation of gene-based testing and provide biological context to eQTLs shared between genes. In this article we provide an in-depth examination of the co-expression module eQTL (module-QTL) through literature review, theoretical investigation, and real-data application of the module-QTL to three large prefrontal cortex genotype-RNA sequencing datasets. We find module-QTLs in our study that are disease associated and reproducible are not additionally informative beyond cis- or trans-eQTLs for module genes. Through comparison to prior studies, we highlight promises and limitations of the module-QTL across study designs and provide recommendations for further investigation of the module-QTL framework.
Collapse
Affiliation(s)
- Alanna C Cote
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| | - Hannah E Young
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Laura M Huckins
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06511, USA.
| |
Collapse
|
5
|
Holm LL, Doktor TK, Flugt KK, Petersen US, Petersen R, Andresen B. All exons are not created equal-exon vulnerability determines the effect of exonic mutations on splicing. Nucleic Acids Res 2024; 52:4588-4603. [PMID: 38324470 PMCID: PMC11077056 DOI: 10.1093/nar/gkae077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 01/05/2024] [Accepted: 01/26/2024] [Indexed: 02/09/2024] Open
Abstract
It is now widely accepted that aberrant splicing of constitutive exons is often caused by mutations affecting cis-acting splicing regulatory elements (SREs), but there is a misconception that all exons have an equal dependency on SREs and thus a similar vulnerability to aberrant splicing. We demonstrate that some exons are more likely to be affected by exonic splicing mutations (ESMs) due to an inherent vulnerability, which is context dependent and influenced by the strength of exon definition. We have developed VulExMap, a tool which is based on empirical data that can designate whether a constitutive exon is vulnerable. Using VulExMap, we find that only 25% of all exons can be categorized as vulnerable, whereas two-thirds of 359 previously reported ESMs in 75 disease genes are located in vulnerable exons. Because VulExMap analysis is based on empirical data on splicing of exons in their endogenous context, it includes all features important in determining the vulnerability. We believe that VulExMap will be an important tool when assessing the effect of exonic mutations by pinpointing whether they are located in exons vulnerable to ESMs.
Collapse
Affiliation(s)
- Lise L Holm
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
- Villum Center for Bioanalytical Sciences, University of Southern Denmark, 5230 Odense M, Denmark
| | - Thomas K Doktor
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
- Villum Center for Bioanalytical Sciences, University of Southern Denmark, 5230 Odense M, Denmark
| | - Katharina K Flugt
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
- Villum Center for Bioanalytical Sciences, University of Southern Denmark, 5230 Odense M, Denmark
| | - Ulrika S S Petersen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
- Villum Center for Bioanalytical Sciences, University of Southern Denmark, 5230 Odense M, Denmark
| | - Rikke Petersen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
- Villum Center for Bioanalytical Sciences, University of Southern Denmark, 5230 Odense M, Denmark
| | - Brage S Andresen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
- Villum Center for Bioanalytical Sciences, University of Southern Denmark, 5230 Odense M, Denmark
| |
Collapse
|
6
|
Wang X, Lu J, Han M, Wang Z, Zhang H, Liu Y, Zhou P, Fu J, Xie Y. Genome-wide expression quantitative trait locus analysis reveals silk-preferential gene regulatory network in maize. PHYSIOLOGIA PLANTARUM 2024; 176:e14386. [PMID: 38887947 DOI: 10.1111/ppl.14386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Revised: 05/27/2024] [Accepted: 05/30/2024] [Indexed: 06/20/2024]
Abstract
Silk of maize (Zea mays L.) contains diverse metabolites with complicated structures and functions, making it a great challenge to explore the mechanisms of metabolic regulation. Genome-wide identification of silk-preferential genes and investigation of their expression regulation provide an opportunity to reveal the regulatory networks of metabolism. Here, we applied the expression quantitative trait locus (eQTL) mapping on a maize natural population to explore the regulation of gene expression in unpollinated silk of maize. We obtained 3,985 silk-preferential genes that were specifically or preferentially expressed in silk using our population. Silk-preferential genes showed more obvious expression variations compared with broadly expressed genes that were ubiquitously expressed in most tissues. We found that trans-eQTL regulation played a more important role for silk-preferential genes compared to the broadly expressed genes. The relationship between 38 transcription factors and 85 target genes, including silk-preferential genes, were detected. Finally, we constructed a transcriptional regulatory network around the silk-preferential gene Bx10, which was proposed to be associated with response to abiotic stress and biotic stress. Taken together, this study deepened our understanding of transcriptome variation in maize silk and the expression regulation of silk-preferential genes, enhancing the investigation of regulatory networks on metabolic pathways.
Collapse
Affiliation(s)
- Xiaoli Wang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jiawen Lu
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Mingfang Han
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Zheyuan Wang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Hongwei Zhang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yunjun Liu
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Peng Zhou
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Junjie Fu
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yuxin Xie
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| |
Collapse
|
7
|
Zhang P, Zhang W, Sun W, Xu J, Hu H, Wang L, Wong L. Identification of gene biomarkers for brain diseases via multi-network topological semantics extraction and graph convolutional network. BMC Genomics 2024; 25:175. [PMID: 38350848 PMCID: PMC10865627 DOI: 10.1186/s12864-024-09967-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 01/03/2024] [Indexed: 02/15/2024] Open
Abstract
BACKGROUND Brain diseases pose a significant threat to human health, and various network-based methods have been proposed for identifying gene biomarkers associated with these diseases. However, the brain is a complex system, and extracting topological semantics from different brain networks is necessary yet challenging to identify pathogenic genes for brain diseases. RESULTS In this study, we present a multi-network representation learning framework called M-GBBD for the identification of gene biomarker in brain diseases. Specifically, we collected multi-omics data to construct eleven networks from different perspectives. M-GBBD extracts the spatial distributions of features from these networks and iteratively optimizes them using Kullback-Leibler divergence to fuse the networks into a common semantic space that represents the gene network for the brain. Subsequently, a graph consisting of both gene and large-scale disease proximity networks learns representations through graph convolution techniques and predicts whether a gene is associated which brain diseases while providing associated scores. Experimental results demonstrate that M-GBBD outperforms several baseline methods. Furthermore, our analysis supported by bioinformatics revealed CAMP as a significantly associated gene with Alzheimer's disease identified by M-GBBD. CONCLUSION Collectively, M-GBBD provides valuable insights into identifying gene biomarkers for brain diseases and serves as a promising framework for brain networks representation learning.
Collapse
Affiliation(s)
- Ping Zhang
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277100, Shandong, China
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Weihan Zhang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Hubei Hongshan Laboratory, Wuhan, 430074, China
| | - Weicheng Sun
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Jinsheng Xu
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Hua Hu
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277100, Shandong, China.
| | - Lei Wang
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277100, Shandong, China.
- Guangxi Key Lab of Human-Machine Interaction and Intelligent Decision, Guangxi Academy of Sciences, Nanning, 530007, China.
| | - Leon Wong
- College of Big Data and Internet, Shenzhen Technology University, Shenzhen, 518118, China.
| |
Collapse
|
8
|
Wang W, Li Y, Cai C, Zhu Q. Auxin response factors fine-tune lignin biosynthesis in response to mechanical bending in bamboo. THE NEW PHYTOLOGIST 2024; 241:1161-1176. [PMID: 37964659 DOI: 10.1111/nph.19398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 10/20/2023] [Indexed: 11/16/2023]
Abstract
Lignin contributes to plant mechanical properties during bending loads. Meanwhile, phytohormone auxin controls various plant biological processes. However, the mechanism of auxin's role in bending-induced lignin biosynthesis was unclear, especially in bamboo, celebrated for its excellent deformation stability. Here, we reported that auxin response factors (ARF) 3 and ARF6 from Moso bamboo (Phyllostachys edulis (Carrière) J. Houz) directly regulate lignin biosynthesis pathway genes, and affect lignin biosynthesis in bamboo. Auxin and lignin exhibited asymmetric distribution patterns, and auxin promoted lignin biosynthesis in response to bending loads in bamboo. Employing transcriptomic and weighted gene co-expression network analysis approach, we discovered that expression patterns of ARF3 and ARF6 strongly correlated with lignin biosynthesis genes. ARF3 and ARF6 directly bind to the promoter regions of 4-coumarate: coenzyme A ligase (4CL3, 4CL7, and 4CL9) or caffeoyl-CoA O-methyltransferase (CCoAOMT2) genes, pivotal to lignin biosynthesis, and activate their expressions. Notably, the efficacy of this binding hinges on auxin levels. Alternation of the expressions of ARF3 and ARF6 substantially altered lignin accumulation in transgenic bamboo. Collectively, our study shed light on bamboo lignification genetics. Auxin signaling could directly modulate lignin biosynthesis genes to impact plant lignin content.
Collapse
Affiliation(s)
- Wenjia Wang
- Basic Forestry and Proteomics Center (BFPC), College of Forestry, Haixia Institute for Science and Technology, Fujian Agriculture and Forestry University, 350002, Fujian, China
| | - Yigang Li
- Basic Forestry and Proteomics Center (BFPC), College of Forestry, Haixia Institute for Science and Technology, Fujian Agriculture and Forestry University, 350002, Fujian, China
| | - Changyang Cai
- Basic Forestry and Proteomics Center (BFPC), College of Forestry, Haixia Institute for Science and Technology, Fujian Agriculture and Forestry University, 350002, Fujian, China
| | - Qiang Zhu
- Basic Forestry and Proteomics Center (BFPC), College of Forestry, Haixia Institute for Science and Technology, Fujian Agriculture and Forestry University, 350002, Fujian, China
| |
Collapse
|
9
|
Mostafavi H, Spence JP, Naqvi S, Pritchard JK. Systematic differences in discovery of genetic effects on gene expression and complex traits. Nat Genet 2023; 55:1866-1875. [PMID: 37857933 DOI: 10.1038/s41588-023-01529-1] [Citation(s) in RCA: 43] [Impact Index Per Article: 43.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2022] [Accepted: 09/14/2023] [Indexed: 10/21/2023]
Abstract
Most signals in genome-wide association studies (GWAS) of complex traits implicate noncoding genetic variants with putative gene regulatory effects. However, currently identified regulatory variants, notably expression quantitative trait loci (eQTLs), explain only a small fraction of GWAS signals. Here, we show that GWAS and cis-eQTL hits are systematically different: eQTLs cluster strongly near transcription start sites, whereas GWAS hits do not. Genes near GWAS hits are enriched in key functional annotations, are under strong selective constraint and have complex regulatory landscapes across different tissue/cell types, whereas genes near eQTLs are depleted of most functional annotations, show relaxed constraint, and have simpler regulatory landscapes. We describe a model to understand these observations, including how natural selection on complex traits hinders discovery of functionally relevant eQTLs. Our results imply that GWAS and eQTL studies are systematically biased toward different types of variant, and support the use of complementary functional approaches alongside the next generation of eQTL studies.
Collapse
Affiliation(s)
| | | | - Sahin Naqvi
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA, USA
| | - Jonathan K Pritchard
- Department of Genetics, Stanford University, Stanford, CA, USA.
- Department of Biology, Stanford University, Stanford, CA, USA.
| |
Collapse
|
10
|
Hsieh PH, Lopes-Ramos CM, Zucknick M, Sandve GK, Glass K, Kuijjer ML. Adjustment of spurious correlations in co-expression measurements from RNA-Sequencing data. Bioinformatics 2023; 39:btad610. [PMID: 37802917 PMCID: PMC10598588 DOI: 10.1093/bioinformatics/btad610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 08/05/2023] [Accepted: 10/05/2023] [Indexed: 10/08/2023] Open
Abstract
MOTIVATION Gene co-expression measurements are widely used in computational biology to identify coordinated expression patterns across a group of samples. Coordinated expression of genes may indicate that they are controlled by the same transcriptional regulatory program, or involved in common biological processes. Gene co-expression is generally estimated from RNA-Sequencing data, which are commonly normalized to remove technical variability. Here, we demonstrate that certain normalization methods, in particular quantile-based methods, can introduce false-positive associations between genes. These false-positive associations can consequently hamper downstream co-expression network analysis. Quantile-based normalization can, however, be extremely powerful. In particular, when preprocessing large-scale heterogeneous data, quantile-based normalization methods such as smooth quantile normalization can be applied to remove technical variability while maintaining global differences in expression for samples with different biological attributes. RESULTS We developed SNAIL (Smooth-quantile Normalization Adaptation for the Inference of co-expression Links), a normalization method based on smooth quantile normalization specifically designed for modeling of co-expression measurements. We show that SNAIL avoids formation of false-positive associations in co-expression as well as in downstream network analyses. Using SNAIL, one can avoid arbitrary gene filtering and retain associations to genes that only express in small subgroups of samples. This highlights the method's potential future impact on network modeling and other association-based approaches in large-scale heterogeneous data. AVAILABILITY AND IMPLEMENTATION The implementation of the SNAIL algorithm and code to reproduce the analyses described in this work can be found in the GitHub repository https://github.com/kuijjerlab/PySNAIL.
Collapse
Affiliation(s)
- Ping-Han Hsieh
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
- Department of Informatics, University of Oslo, Oslo 0316, Norway
| | - Camila Miranda Lopes-Ramos
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, United States
- Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA 02115, United States
| | - Manuela Zucknick
- Oslo Centre for Biostatistics and Epidemiology, Institute of Basic Medical Sciences, University of Oslo, Oslo 0317, Norway
| | | | - Kimberly Glass
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, United States
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA 02115, United States
| | - Marieke Lydia Kuijjer
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
- Department of Pathology, Leiden University Medical Center, Leiden 2300RC, The Netherlands
- Leiden Center of Computational Oncology, Leiden University Medical Center,Leiden 2300RC, The Netherlands
| |
Collapse
|
11
|
Guthrie J, Ko¨stel Bal S, Lombardo SD, Mu¨ller F, Sin C, Hu¨tter CV, Menche J, Boztug K. AutoCore: A network-based definition of the core module of human autoimmunity and autoinflammation. SCIENCE ADVANCES 2023; 9:eadg6375. [PMID: 37656781 PMCID: PMC10848965 DOI: 10.1126/sciadv.adg6375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 08/01/2023] [Indexed: 09/03/2023]
Abstract
Although research on rare autoimmune and autoinflammatory diseases has enabled definition of nonredundant regulators of homeostasis in human immunity, because of the single gene-single disease nature of many of these diseases, contributing factors were mostly unveiled in sequential and noncoordinated individual studies. We used a network-based approach for integrating a set of 186 inborn errors of immunity with predominant autoimmunity/autoinflammation into a comprehensive map of human immune dysregulation, which we termed "AutoCore." The AutoCore is located centrally within the interactome of all protein-protein interactions, connecting and pinpointing multidisease markers for a range of common, polygenic autoimmune/autoinflammatory diseases. The AutoCore can be subdivided into 19 endotypes that correspond to molecularly and phenotypically cohesive disease subgroups, providing a molecular mechanism-based disease classification and rationale toward systematic targeting for therapeutic purposes. Our study provides a proof of concept for using network-based methods to systematically investigate the molecular relationships between individual rare diseases and address a range of conceptual, diagnostic, and therapeutic challenges.
Collapse
Affiliation(s)
- Julia Guthrie
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Zimmermannplatz 10, A-1090 Vienna, Austria
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, A-1090 Vienna, Austria
- Max Perutz Labs, Vienna BioCenter Campus, Dr.-Bohr-Gasse 9, 1030 Vienna, Austria
- Department of Structural and Computational Biology, University of Vienna, Dr.-Bohr-Gasse 9, 1030, Vienna Austria
| | - Sevgi Ko¨stel Bal
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Zimmermannplatz 10, A-1090 Vienna, Austria
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, A-1090 Vienna, Austria
- St. Anna Children’s Cancer Research Institute (CCRI), Zimmermannplatz 10, A-1090 Vienna, Austria
| | - Salvo Danilo Lombardo
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, A-1090 Vienna, Austria
- Max Perutz Labs, Vienna BioCenter Campus, Dr.-Bohr-Gasse 9, 1030 Vienna, Austria
- Department of Structural and Computational Biology, University of Vienna, Dr.-Bohr-Gasse 9, 1030, Vienna Austria
| | - Felix Mu¨ller
- Max Perutz Labs, Vienna BioCenter Campus, Dr.-Bohr-Gasse 9, 1030 Vienna, Austria
- Department of Structural and Computational Biology, University of Vienna, Dr.-Bohr-Gasse 9, 1030, Vienna Austria
| | - Celine Sin
- Max Perutz Labs, Vienna BioCenter Campus, Dr.-Bohr-Gasse 9, 1030 Vienna, Austria
- Department of Structural and Computational Biology, University of Vienna, Dr.-Bohr-Gasse 9, 1030, Vienna Austria
| | - Christiane V. R. Hu¨tter
- Max Perutz Labs, Vienna BioCenter Campus, Dr.-Bohr-Gasse 9, 1030 Vienna, Austria
- Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna BioCenter, A-1030 Vienna, Austria
| | - Jo¨rg Menche
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, A-1090 Vienna, Austria
- Max Perutz Labs, Vienna BioCenter Campus, Dr.-Bohr-Gasse 9, 1030 Vienna, Austria
- Department of Structural and Computational Biology, University of Vienna, Dr.-Bohr-Gasse 9, 1030, Vienna Austria
- Faculty of Mathematics, University of Vienna, Oskar-Morgenstern-Platz 1, A-1090 Vienna, Austria
| | - Kaan Boztug
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Zimmermannplatz 10, A-1090 Vienna, Austria
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, A-1090 Vienna, Austria
- St. Anna Children’s Cancer Research Institute (CCRI), Zimmermannplatz 10, A-1090 Vienna, Austria
- St. Anna Children’s Hospital, Kinderspitalgasse 6, A-1090, Vienna, Austria
- Medical University of Vienna, Department of Pediatrics and Adolescent Medicine, Währinger Gürtel 18-20, A-1090 Vienna, Austria
| |
Collapse
|
12
|
Tretter C, de Andrade Krätzig N, Pecoraro M, Lange S, Seifert P, von Frankenberg C, Untch J, Zuleger G, Wilhelm M, Zolg DP, Dreyer FS, Bräunlein E, Engleitner T, Uhrig S, Boxberg M, Steiger K, Slotta-Huspenina J, Ochsenreither S, von Bubnoff N, Bauer S, Boerries M, Jost PJ, Schenck K, Dresing I, Bassermann F, Friess H, Reim D, Grützmann K, Pfütze K, Klink B, Schröck E, Haller B, Kuster B, Mann M, Weichert W, Fröhling S, Rad R, Hiltensperger M, Krackhardt AM. Proteogenomic analysis reveals RNA as a source for tumor-agnostic neoantigen identification. Nat Commun 2023; 14:4632. [PMID: 37532709 PMCID: PMC10397250 DOI: 10.1038/s41467-023-39570-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 06/19/2023] [Indexed: 08/04/2023] Open
Abstract
Systemic pan-tumor analyses may reveal the significance of common features implicated in cancer immunogenicity and patient survival. Here, we provide a comprehensive multi-omics data set for 32 patients across 25 tumor types for proteogenomic-based discovery of neoantigens. By using an optimized computational approach, we discover a large number of tumor-specific and tumor-associated antigens. To create a pipeline for the identification of neoantigens in our cohort, we combine DNA and RNA sequencing with MS-based immunopeptidomics of tumor specimens, followed by the assessment of their immunogenicity and an in-depth validation process. We detect a broad variety of non-canonical HLA-binding peptides in the majority of patients demonstrating partially immunogenicity. Our validation process allows for the selection of 32 potential neoantigen candidates. The majority of neoantigen candidates originates from variants identified in the RNA data set, illustrating the relevance of RNA as a still understudied source of cancer antigens. This study underlines the importance of RNA-centered variant detection for the identification of shared biomarkers and potentially relevant neoantigen candidates.
Collapse
Affiliation(s)
- Celina Tretter
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Niklas de Andrade Krätzig
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IInd Medical Department, Munich, Germany
- Technical University of Munich, TUM School of Medicine, Center for Translational Cancer Research (TranslaTUM), Munich, Germany
- Technical University of Munich, TUM School of Medicine, Institute of Molecular Oncology and Functional Genomics, Munich, Germany
| | - Matteo Pecoraro
- Department of Proteomics and Signal Transduction, Max Plank Institute of Biochemistry, Munich, Germany
- Institute for Research in Biomedicine, Università della Svizzera italiana, Bellinzona, Switzerland
| | - Sebastian Lange
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IInd Medical Department, Munich, Germany
- Technical University of Munich, TUM School of Medicine, Center for Translational Cancer Research (TranslaTUM), Munich, Germany
- Technical University of Munich, TUM School of Medicine, Institute of Molecular Oncology and Functional Genomics, Munich, Germany
| | - Philipp Seifert
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Clara von Frankenberg
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Johannes Untch
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Gabriela Zuleger
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Mathias Wilhelm
- Technical University of Munich, TUM School of Life Sciences, Chair of Proteomics and Bioanalytics, Freising, Germany
- Technical University of Munich, TUM School of Life Sciences, Computational Mass Spectrometry, Freising, Germany
| | - Daniel P Zolg
- Technical University of Munich, TUM School of Life Sciences, Chair of Proteomics and Bioanalytics, Freising, Germany
| | - Florian S Dreyer
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Eva Bräunlein
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Thomas Engleitner
- Technical University of Munich, TUM School of Medicine, Center for Translational Cancer Research (TranslaTUM), Munich, Germany
- Technical University of Munich, TUM School of Medicine, Institute of Molecular Oncology and Functional Genomics, Munich, Germany
| | - Sebastian Uhrig
- German Cancer Consortium (DKTK), partner site Heidelberg and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Molecular Precision Oncology Program, NCT Heidelberg, Heidelberg, Germany
| | - Melanie Boxberg
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, Institute of Pathology, Munich, Germany
| | - Katja Steiger
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, Institute of Pathology, Munich, Germany
| | - Julia Slotta-Huspenina
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, Institute of Pathology, Munich, Germany
| | - Sebastian Ochsenreither
- German Cancer Consortium (DKTK), partner site Berlin and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Charité Comprehensive Cancer Center, Charité - Universitätsmedizin Berlin, Berlin, Germany
- Department of Hematology, Oncology and Tumor Immunology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Nikolas von Bubnoff
- German Cancer Consortium (DKTK), partner site Freiburg and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Institute of Medical Bioinformatics and Systems Medicine (IBSM), Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
- Department of Hematology and Oncology, Medical Center, University of Schleswig Holstein, Campus Lübeck, Lübeck, Germany
| | - Sebastian Bauer
- German Cancer Consortium (DKTK), partner site Essen and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Medical Oncology and Sarcoma Center, West German Cancer Center, University Hospital Essen, Essen, Germany
| | - Melanie Boerries
- German Cancer Consortium (DKTK), partner site Freiburg and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Institute of Medical Bioinformatics and Systems Medicine (IBSM), Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Philipp J Jost
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
- Clinical Division of Oncology, Department of Internal Medicine, Medical University of Graz, Graz, Austria
- University Comprehensive Cancer Center Graz, Medical University of Graz, Graz, Austria
| | - Kristina Schenck
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Iska Dresing
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Florian Bassermann
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
- Technical University of Munich, TUM School of Medicine, Center for Translational Cancer Research (TranslaTUM), Munich, Germany
| | - Helmut Friess
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, Department of Surgery, Munich, Germany
| | - Daniel Reim
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, Department of Surgery, Munich, Germany
| | - Konrad Grützmann
- German Cancer Consortium (DKTK), partner site Dresden and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Core Unit Molecular Tumor Diagnostics (CMTD), NCT Dresden, Dresden, Germany
- Institute for Medical Informatics and Biometry, Faculty of Medicine, TU Dresden, Dresden, Germany
| | - Katrin Pfütze
- German Cancer Consortium (DKTK), partner site Heidelberg and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Barbara Klink
- German Cancer Consortium (DKTK), partner site Dresden and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Institute for Clinical Genetics, University Hospital Carl Gustav Carus at the Technische Universität Dresden, Dresden, Germany
| | - Evelin Schröck
- German Cancer Consortium (DKTK), partner site Dresden and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Institute for Clinical Genetics, University Hospital Carl Gustav Carus at the Technische Universität Dresden, Dresden, Germany
- ERN GENTURIS, Hereditary Cancer Syndrome Center Dresden, Dresden, Germany
- National Center for Tumor Diseases Dresden (NCT/UCC), Dresden, Germany
- Faculty of Medicine and University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
- Helmholtz-Zentrum Dresden-Rossendorf (HZDR), Dresden, Germany
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Bernhard Haller
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, Institute of AI and Informatics in Medicine, Munich, Germany
| | - Bernhard Kuster
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Life Sciences, Chair of Proteomics and Bioanalytics, Freising, Germany
- Technical University of Munich, TUM School of Life Sciences, Bavarian Biomolecular Mass Spectrometry Center (BayBioMS), Freising, Germany
| | - Matthias Mann
- Department of Proteomics and Signal Transduction, Max Plank Institute of Biochemistry, Munich, Germany
| | - Wilko Weichert
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, Institute of Pathology, Munich, Germany
| | - Stefan Fröhling
- German Cancer Consortium (DKTK), partner site Heidelberg and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Division of Translational Medical Oncology, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Roland Rad
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IInd Medical Department, Munich, Germany
- Technical University of Munich, TUM School of Medicine, Center for Translational Cancer Research (TranslaTUM), Munich, Germany
- Technical University of Munich, TUM School of Medicine, Institute of Molecular Oncology and Functional Genomics, Munich, Germany
| | - Michael Hiltensperger
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany
| | - Angela M Krackhardt
- German Cancer Consortium (DKTK), partner site Munich and German Cancer Research Center (DKFZ), Heidelberg, Germany.
- Technical University of Munich, TUM School of Medicine, Klinikum rechts der Isar, IIIrd Medical Department, Munich, Germany.
- Technical University of Munich, TUM School of Medicine, Center for Translational Cancer Research (TranslaTUM), Munich, Germany.
- Malteser Krankenhaus St. Franziskus-Hospital, Flensburg, Germany.
| |
Collapse
|
13
|
McCabe SD, Nobel AB, Love MI. ACTOR: a latent Dirichlet model to compare expressed isoform proportions to a reference panel. Biostatistics 2023; 24:388-405. [PMID: 33948626 PMCID: PMC10102900 DOI: 10.1093/biostatistics/kxab013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 03/19/2021] [Accepted: 03/23/2021] [Indexed: 11/13/2022] Open
Abstract
The relative proportion of RNA isoforms expressed for a given gene has been associated with disease states in cancer, retinal diseases, and neurological disorders. Examination of relative isoform proportions can help determine biological mechanisms, but such analyses often require a per-gene investigation of splicing patterns. Leveraging large public data sets produced by genomic consortia as a reference, one can compare splicing patterns in a data set of interest with those of a reference panel in which samples are divided into distinct groups, such as tissue of origin, or disease status. We propose A latent Dirichlet model to Compare expressed isoform proportions TO a Reference panel (ACTOR), a latent Dirichlet model with Dirichlet Multinomial observations to compare expressed isoform proportions in a data set to an independent reference panel. We use a variational Bayes procedure to estimate posterior distributions for the group membership of one or more samples. Using the Genotype-Tissue Expression project as a reference data set, we evaluate ACTOR on simulated and real RNA-seq data sets to determine tissue-type classifications of genes. ACTOR is publicly available as an R package at https://github.com/mccabes292/actor.
Collapse
Affiliation(s)
- Sean D McCabe
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599-7400, USA
| | - Andrew B Nobel
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, 318 Hanes Hall, Chapel Hill, NC 27599-3260, USA and Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599-7400, USA
| | - Michael I Love
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599-7400, USA and Department of Genetics, University of North Carolina at Chapel Hill, 120 Mason Farm Rd, Chapel Hill, NC 27514, USA
| |
Collapse
|
14
|
Cai H, Des Marais DL. Revisiting regulatory coherence: accounting for temporal bias in plant gene co-expression analyses. THE NEW PHYTOLOGIST 2023; 238:16-24. [PMID: 36617750 DOI: 10.1111/nph.18720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 12/16/2022] [Indexed: 06/17/2023]
Affiliation(s)
- Haoran Cai
- Department of Civil and Environmental Engineering, MIT, 15 Vassar St., Cambridge, MA, 02139, USA
| | - David L Des Marais
- Department of Civil and Environmental Engineering, MIT, 15 Vassar St., Cambridge, MA, 02139, USA
| |
Collapse
|
15
|
Deans PJM, Seah C, Johnson J, Gonzalez JG, Townsley K, Cao E, Schrode N, Stahl E, O’Reilly P, Huckins LM, Brennand KJ. Non-additive effects of schizophrenia risk genes reflect convergent downstream function. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.03.20.23287497. [PMID: 36993466 PMCID: PMC10055596 DOI: 10.1101/2023.03.20.23287497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]
Abstract
Genetic studies of schizophrenia (SCZ) reveal a complex polygenic risk architecture comprised of hundreds of risk variants, the majority of which are common in the population at-large and confer only modest increases in disorder risk. Precisely how genetic variants with individually small predicted effects on gene expression combine to yield substantial clinical impacts in aggregate is unclear. Towards this, we previously reported that the combinatorial perturbation of four SCZ risk genes ("eGenes", whose expression is regulated by common variants) resulted in gene expression changes that were not predicted by individual perturbations, being most non-additive among genes associated with synaptic function and SCZ risk. Now, across fifteen SCZ eGenes, we demonstrate that non-additive effects are greatest within groups of functionally similar eGenes. Individual eGene perturbations reveal common downstream transcriptomic effects ("convergence"), while combinatorial eGene perturbations result in changes that are smaller than predicted by summing individual eGene effects ("sub-additive effects"). Unexpectedly, these convergent and sub-additive downstream transcriptomic effects overlap and constitute a large proportion of the genome-wide polygenic risk score, suggesting that functional redundancy of eGenes may be a major mechanism underlying non-additivity. Single eGene perturbations likewise fail to predict the magnitude or directionality of cellular phenotypes resulting from combinatorial perturbations. Overall, our results indicate that polygenic risk cannot be extrapolated from experiments testing one risk gene at a time and must instead be empirically measured. By unravelling the interactions between complex risk variants, it may be possible to improve the clinical utility of polygenic risk scores through more powerful prediction of symptom onset, clinical trajectory, and treatment response, or to identify novel targets for therapeutic intervention.
Collapse
Affiliation(s)
- PJ Michael Deans
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
| | - Carina Seah
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Jessica Johnson
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Judit Garcia Gonzalez
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Kayla Townsley
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Evan Cao
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
| | - Nadine Schrode
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Eli Stahl
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Paul O’Reilly
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Laura M. Huckins
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Kristen J. Brennand
- Departments of Psychiatry and Genetics, Division of Molecular Psychiatry, Department of Genetics, Wu Tsai Institute, Yale University School of Medicine, New Haven, CT 06511
- Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| |
Collapse
|
16
|
Chen C, Shen B, Zhang L, Yu T, Wang M, Wu R. A Cartographic Tool to Predict Disease Risk-associated Pseudo-Dynamic Networks from Tissue-specific Gene Expression. Bio Protoc 2023; 13:e4583. [PMID: 36789091 PMCID: PMC9901473 DOI: 10.21769/bioprotoc.4583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 09/05/2022] [Accepted: 12/13/2022] [Indexed: 01/06/2023] Open
Abstract
Understanding how genes are differentially expressed across tissues is key to reveal the etiology of human diseases. Genes are never expressed in isolation, but rather co-expressed in a community; thus, they co-act through intricate but well-orchestrated networks. However, existing approaches cannot coalesce the full properties of gene-gene communication and interactions into networks. In particular, the unavailability of dynamic gene expression data might impair the application of existing network models to unleash the complexity of human diseases. To address this limitation, we developed a statistical pipeline named DRDNetPro to visualize and trace how genes dynamically interact with each other across diverse tissues, to ascertain health risk from static expression data. This protocol contains detailed tutorials designed to learn a series of networks, with the illustration example from the Genotype-Tissue Expression (GTEx) project. The proposed toolbox relies on the method developed in our published paper ( Chen et al., 2022 ), coding all genes into bidirectional, signed, weighted, and feedback looped networks, which will provide profound genomic information enabling medical doctors to design precise medicine. Graphical abstract Flowchart illustrating the use of DRDNetPro. The left panel contains the summarized pipeline of DRDNetPro and the right panel contains one pseudo-illustrative example. See the Equipment and Procedure sections for detailed explanations.
Collapse
Affiliation(s)
- Chixiang Chen
- Division of Biostatistics and Bioinformatics, University of Maryland School of Medicine, Baltimore, MD, USA.,Department of Neurosurgery, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Biyi Shen
- Bristol Myers Squibb, Lawrenceville, NJ, USA
| | - Lijun Zhang
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Tonghui Yu
- School of Mathematics, Hefei University of Technology, Anhui, China
| | - Ming Wang
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Rongling Wu
- Division of Biostatistics and Bioinformatics, College of Medicine, Penn State College of Medicine, Hershey, PA, USA
| |
Collapse
|
17
|
Dovrolis N, Filidou E, Tarapatzi G, Kokkotis G, Spathakis M, Kandilogiannakis L, Drygiannakis I, Valatas V, Arvanitidis K, Karakasiliotis I, Vradelis S, Manolopoulos VG, Paspaliaris V, Bamias G, Kolios G. Co-expression of fibrotic genes in inflammatory bowel disease; A localized event? Front Immunol 2022; 13:1058237. [PMID: 36632136 PMCID: PMC9826764 DOI: 10.3389/fimmu.2022.1058237] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 12/08/2022] [Indexed: 12/27/2022] Open
Abstract
Introduction Extracellular matrix turnover, a ubiquitous dynamic biological process, can be diverted to fibrosis. The latter can affect the intestine as a serious complication of Inflammatory Bowel Diseases (IBD) and is resistant to current pharmacological interventions. It embosses the need for out-of-the-box approaches to identify and target molecular mechanisms of fibrosis. Methods and results In this study, a novel mRNA sequencing dataset of 22 pairs of intestinal biopsies from the terminal ileum (TI) and the sigmoid of 7 patients with Crohn's disease, 6 with ulcerative colitis and 9 control individuals (CI) served as a validation cohort of a core fibrotic transcriptomic signature (FIBSig), This signature, which was identified in publicly available data (839 samples from patients and healthy individuals) of 5 fibrotic disorders affecting different organs (GI tract, lung, skin, liver, kidney), encompasses 241 genes and the functional pathways which derive from their interactome. These genes were used in further bioinformatics co-expression analyses to elucidate the site-specific molecular background of intestinal fibrosis highlighting their involvement, particularly in the terminal ileum. We also confirmed different transcriptomic profiles of the sigmoid and terminal ileum in our validation cohort. Combining the results of these analyses we highlight 21 core hub genes within a larger single co-expression module, highly enriched in the terminal ileum of CD patients. Further pathway analysis revealed known and novel inflammation-regulated, fibrogenic pathways operating in the TI, such as IL-13 signaling and pyroptosis, respectively. Discussion These findings provide a rationale for the increased incidence of fibrosis at the terminal ileum of CD patients and highlight operating pathways in intestinal fibrosis for future evaluation with mechanistic and translational studies.
Collapse
Affiliation(s)
- Nikolas Dovrolis
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Laboratory of Biology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Individualised Medicine & Pharmacological Research Solutions Center (IMPReS), Alexandroupolis, Greece,*Correspondence: George Kolios, ; Nikolas Dovrolis,
| | - Eirini Filidou
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Individualised Medicine & Pharmacological Research Solutions Center (IMPReS), Alexandroupolis, Greece
| | - Gesthimani Tarapatzi
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Individualised Medicine & Pharmacological Research Solutions Center (IMPReS), Alexandroupolis, Greece
| | - Georgios Kokkotis
- Gastrointestinal (GI) Unit, 3 Department of Internal Medicine, Sotiria Hospital, National and Kapodistrian University of Athens, Athens, Greece
| | - Michail Spathakis
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Individualised Medicine & Pharmacological Research Solutions Center (IMPReS), Alexandroupolis, Greece
| | - Leonidas Kandilogiannakis
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Individualised Medicine & Pharmacological Research Solutions Center (IMPReS), Alexandroupolis, Greece
| | - Ioannis Drygiannakis
- Gastroenterology and Hepatology Research Laboratory, Medical School, University of Crete, Heraklion, Greece
| | - Vassilis Valatas
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Gastroenterology and Hepatology Research Laboratory, Medical School, University of Crete, Heraklion, Greece
| | - Konstantinos Arvanitidis
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Individualised Medicine & Pharmacological Research Solutions Center (IMPReS), Alexandroupolis, Greece
| | - Ioannis Karakasiliotis
- Laboratory of Biology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Stergios Vradelis
- Second Department of Internal Medicine, University Hospital of Alexandroupolis, Democritus University of Thrace, Alexandroupolis, Greece
| | - Vangelis G. Manolopoulos
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Individualised Medicine & Pharmacological Research Solutions Center (IMPReS), Alexandroupolis, Greece
| | | | - Giorgos Bamias
- Gastrointestinal (GI) Unit, 3 Department of Internal Medicine, Sotiria Hospital, National and Kapodistrian University of Athens, Athens, Greece
| | - George Kolios
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece,Individualised Medicine & Pharmacological Research Solutions Center (IMPReS), Alexandroupolis, Greece,*Correspondence: George Kolios, ; Nikolas Dovrolis,
| |
Collapse
|
18
|
Castaldi PJ, Abood A, Farber CR, Sheynkman GM. Bridging the splicing gap in human genetics with long-read RNA sequencing: finding the protein isoform drivers of disease. Hum Mol Genet 2022; 31:R123-R136. [PMID: 35960994 PMCID: PMC9585682 DOI: 10.1093/hmg/ddac196] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 08/08/2022] [Accepted: 08/09/2022] [Indexed: 02/04/2023] Open
Abstract
Aberrant splicing underlies many human diseases, including cancer, cardiovascular diseases and neurological disorders. Genome-wide mapping of splicing quantitative trait loci (sQTLs) has shown that genetic regulation of alternative splicing is widespread. However, identification of the corresponding isoform or protein products associated with disease-associated sQTLs is challenging with short-read RNA-seq, which cannot precisely characterize full-length transcript isoforms. Furthermore, contemporary sQTL interpretation often relies on reference transcript annotations, which are incomplete. Solutions to these issues may be found through integration of newly emerging long-read sequencing technologies. Long-read sequencing offers the capability to sequence full-length mRNA transcripts and, in some cases, to link sQTLs to transcript isoforms containing disease-relevant protein alterations. Here, we provide an overview of sQTL mapping approaches, the use of long-read sequencing to characterize sQTL effects on isoforms, the linkage of RNA isoforms to protein-level functions and comment on future directions in the field. Based on recent progress, long-read RNA sequencing promises to be part of the human disease genetics toolkit to discover and treat protein isoforms causing rare and complex diseases.
Collapse
Affiliation(s)
- Peter J Castaldi
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA
- Division of General Medicine and Primary Care, Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA
| | - Abdullah Abood
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
| | - Charles R Farber
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Public Health Sciences, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
| | - Gloria M Sheynkman
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22903, USA
- UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, VA 22903, USA
| |
Collapse
|
19
|
Casteels T, Bajew S, Reiniš J, Enders L, Schuster M, Fontaine F, Müller AC, Wagner BK, Bock C, Kubicek S. SMNDC1 links chromatin remodeling and splicing to regulate pancreatic hormone expression. Cell Rep 2022; 40:111288. [PMID: 36044849 DOI: 10.1016/j.celrep.2022.111288] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 04/06/2022] [Accepted: 08/09/2022] [Indexed: 11/28/2022] Open
Abstract
Insulin expression is primarily restricted to the pancreatic β cells, which are physically or functionally depleted in diabetes. Identifying targetable pathways repressing insulin in non-β cells, particularly in the developmentally related glucagon-secreting α cells, is an important aim of regenerative medicine. Here, we perform an RNA interference screen in a murine α cell line to identify silencers of insulin expression. We discover that knockdown of the splicing factor Smndc1 triggers a global repression of α cell gene-expression programs in favor of increased β cell markers. Mechanistically, Smndc1 knockdown upregulates the β cell transcription factor Pdx1 by modulating the activities of the BAF and Atrx chromatin remodeling complexes. SMNDC1's repressive role is conserved in human pancreatic islets, its loss triggering enhanced insulin secretion and PDX1 expression. Our study identifies Smndc1 as a key factor connecting splicing and chromatin remodeling to the control of insulin expression in human and mouse islet cells.
Collapse
Affiliation(s)
- Tamara Casteels
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, 1090 Vienna, Austria
| | - Simon Bajew
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Carrer del Dr. Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra, Carrer del Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Jiří Reiniš
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, 1090 Vienna, Austria
| | - Lennart Enders
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, 1090 Vienna, Austria
| | - Michael Schuster
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, 1090 Vienna, Austria
| | - Frédéric Fontaine
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, 1090 Vienna, Austria
| | - André C Müller
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, 1090 Vienna, Austria
| | | | - Christoph Bock
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, 1090 Vienna, Austria; Medical University of Vienna, Center for Medical Statistics, Informatics, and Intelligent Systems, Institute of Artificial Intelligence, 1090 Vienna, Austria
| | - Stefan Kubicek
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, 1090 Vienna, Austria.
| |
Collapse
|
20
|
Hu Y, Rehawi G, Moyon L, Gerstner N, Ogris C, Knauer-Arloth J, Bittner F, Marsico A, Mueller NS. Network Embedding Across Multiple Tissues and Data Modalities Elucidates the Context of Host Factors Important for COVID-19 Infection. Front Genet 2022; 13:909714. [PMID: 35903362 PMCID: PMC9315940 DOI: 10.3389/fgene.2022.909714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 06/06/2022] [Indexed: 11/30/2022] Open
Abstract
COVID-19 is a heterogeneous disease caused by SARS-CoV-2. Aside from infections of the lungs, the disease can spread throughout the body and damage many other tissues, leading to multiorgan failure in severe cases. The highly variable symptom severity is influenced by genetic predispositions and preexisting diseases which have not been investigated in a large-scale multimodal manner. We present a holistic analysis framework, setting previously reported COVID-19 genes in context with prepandemic data, such as gene expression patterns across multiple tissues, polygenetic predispositions, and patient diseases, which are putative comorbidities of COVID-19. First, we generate a multimodal network using the prior-based network inference method KiMONo. We then embed the network to generate a meaningful lower-dimensional representation of the data. The input data are obtained via the Genotype-Tissue Expression project (GTEx), containing expression data from a range of tissues with genomic and phenotypic information of over 900 patients and 50 tissues. The generated network consists of nodes, that is, genes and polygenic risk scores (PRS) for several diseases/phenotypes, as well as for COVID-19 severity and hospitalization, and links between them if they are statistically associated in a regularized linear model by feature selection. Applying network embedding on the generated multimodal network allows us to perform efficient network analysis by identifying nodes close by in a lower-dimensional space that correspond to entities which are statistically linked. By determining the similarity between COVID-19 genes and other nodes through embedding, we identify disease associations to tissues, like the brain and gut. We also find strong associations between COVID-19 genes and various diseases such as ischemic heart disease, cerebrovascular disease, and hypertension. Moreover, we find evidence linking PTPN6 to a range of comorbidities along with the genetic predisposition of COVID-19, suggesting that this kinase is a central player in severe cases of COVID-19. In conclusion, our holistic network inference coupled with network embedding of multimodal data enables the contextualization of COVID-19-associated genes with respect to tissues, disease states, and genetic risk factors. Such contextualization can be exploited to further elucidate the biological importance of known and novel genes for severity of the disease in patients.
Collapse
Affiliation(s)
- Yue Hu
- Computational Health Department, Helmholtz Center Munich, Neuherberg, Germany
- Informatics 12 Chair of Bioinformatics, Technical University Munich, Garching, Germany
| | - Ghalia Rehawi
- Computational Health Department, Helmholtz Center Munich, Neuherberg, Germany
- Translational Research in Psychiatry, MaxPlanck Institute of Psychiatry, Munich, Germany
| | - Lambert Moyon
- Computational Health Department, Helmholtz Center Munich, Neuherberg, Germany
| | - Nathalie Gerstner
- Computational Health Department, Helmholtz Center Munich, Neuherberg, Germany
- Translational Research in Psychiatry, MaxPlanck Institute of Psychiatry, Munich, Germany
| | - Christoph Ogris
- Computational Health Department, Helmholtz Center Munich, Neuherberg, Germany
| | - Janine Knauer-Arloth
- Computational Health Department, Helmholtz Center Munich, Neuherberg, Germany
- Translational Research in Psychiatry, MaxPlanck Institute of Psychiatry, Munich, Germany
| | | | - Annalisa Marsico
- Computational Health Department, Helmholtz Center Munich, Neuherberg, Germany
| | - Nikola S. Mueller
- Computational Health Department, Helmholtz Center Munich, Neuherberg, Germany
- knowing01 GmbH, Munich, Germany
| |
Collapse
|
21
|
Lee K, Yu D, Hyung D, Cho SY, Park C. ASpediaFI: Functional Interaction Analysis of Alternative Splicing Events. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:466-482. [PMID: 35085775 PMCID: PMC9801047 DOI: 10.1016/j.gpb.2021.10.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 10/15/2021] [Accepted: 11/01/2021] [Indexed: 01/26/2023]
Abstract
Alternative splicing (AS) regulates biological processes governing phenotypes and diseases. Differential AS (DAS) gene test methods have been developed to investigate important exonic expression from high-throughput datasets. However, the DAS events extracted using statistical tests are insufficient to delineate relevant biological processes. In this study, we developed a novel application, Alternative Splicing Encyclopedia: Functional Interaction (ASpediaFI), to systemically identify DAS events and co-regulated genes and pathways. ASpediaFI establishes a heterogeneous interaction network of genes and their feature nodes (i.e., AS events and pathways) connected by co-expression or pathway gene set knowledge. Next, ASpediaFI explores the interaction network using the random walk with restart algorithm and interrogates the proximity from a query gene set. Finally, ASpediaFI extracts significant AS events, genes, and pathways. To evaluate the performance of our method, we simulated RNA sequencing (RNA-seq) datasets to consider various conditions of sequencing depth and sample size. The performance was compared with that of other methods. Additionally, we analyzed three public datasets of cancer patients or cell lines to evaluate how well ASpediaFI detects biologically relevant candidates. ASpediaFI exhibits strong performance in both simulated and public datasets. Our integrative approach reveals that DAS events that recognize a global co-expression network and relevant pathways determine the functional importance of spliced genes in the subnetwork. ASpediaFI is publicly available at https://bioconductor.org/packages/ASpediaFI.
Collapse
|
22
|
Shi Q, Liu T, Hu W, Chen Z, He X, Li S. SRTdb: an omnibus for human tissue and cancer-specific RNA transcripts. Biomark Res 2022; 10:27. [PMID: 35473935 PMCID: PMC9044872 DOI: 10.1186/s40364-022-00377-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 03/26/2022] [Indexed: 11/10/2022] Open
Abstract
The production of functional mature RNA transcripts from genes undergoes various pre-transcriptional regulation and post-transcriptional modifications. Accumulating studies demonstrated that gene transcription carries out in tissue and cancer type-dependent ways. However, RNA transcript-level specificity analysis in large-scale transcriptomics data across different normal tissue and cancer types is lacking. We applied reference-based de novo transcript assembly and quantification of 27,741 samples across 33 cancer types, 29 tissue types, and 25 cancer cell line types. We totally identified 231,836 specific RNA transcripts (SRTs) across various tissue and cancer types, most of which are found independent of specific genes. Almost half of tumor SRTs are also tissue-specific but in different tissues. Furthermore, we found that 10 ~ 20% of tumor SRTs in most tumor types were testis-specific. The SRT database (SRTdb) was constructed based on these resources. Taking liver cancer as an example, we showed how SRTdb resource is utilized to optimize the identification of RNA transcripts for more precision diagnosis of particular cancers. Our results provide a useful resource for exploring transcript specificity across various cancer and tissue types, and boost the precision medicine for tumor patients.
Collapse
Affiliation(s)
- Qili Shi
- Fudan University Shanghai Cancer Center and Institute of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Teng Liu
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 201620, China.,Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, 318000, China
| | - Wei Hu
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 201620, China
| | - Zhiao Chen
- Fudan University Shanghai Cancer Center and Institute of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Xianghuo He
- Fudan University Shanghai Cancer Center and Institute of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai, 200032, China.
| | - Shengli Li
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 201620, China.
| |
Collapse
|
23
|
Lié O, Renault S, Augé-Gouillou C. SETMAR, a case of primate co-opted genes: towards new perspectives. Mob DNA 2022; 13:9. [PMID: 35395947 PMCID: PMC8994322 DOI: 10.1186/s13100-022-00267-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Accepted: 03/28/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND We carry out a review of the history and biological activities of one domesticated gene in higher primates, SETMAR, by discussing current controversies. Our purpose is to open a new outlook that will serve as a framework for future work about SETMAR, possibly in the field of cognition development. MAIN BODY What is newly important about SETMAR can be summarized as follows: (1) the whole protein sequence is under strong purifying pressure; (2) its role is to strengthen existing biological functions rather than to provide new ones; (3) it displays a tissue-specific pattern of expression, at least for the alternative-splicing it undergoes. Studies reported here demonstrate that SETMAR protein(s) may be involved in essential networks regulating replication, transcription and translation. Moreover, during embryogenesis, SETMAR appears to contribute to brain development. SHORT CONCLUSION Our review underlines for the first time that SETMAR directly interacts with genes involved in brain functions related to vocalization and vocal learning. These findings pave the way for future works regarding SETMAR and the development of cognitive abilities in higher primates.
Collapse
Affiliation(s)
- Oriane Lié
- UMR 1253, iBrain, Université de Tours, Inserm, Tours, France.,iBrain, Team Neurogenomics and Neuronal physiopathology, Faculty of Medicine, 10 Bd Tonnellé, Cedex 1, 37032, Tours, France
| | - Sylvaine Renault
- UMR 1253, iBrain, Université de Tours, Inserm, Tours, France.,iBrain, Team Neurogenomics and Neuronal physiopathology, Faculty of Medicine, 10 Bd Tonnellé, Cedex 1, 37032, Tours, France
| | - Corinne Augé-Gouillou
- UMR 1253, iBrain, Université de Tours, Inserm, Tours, France. .,iBrain, Team Neurogenomics and Neuronal physiopathology, Faculty of Medicine, 10 Bd Tonnellé, Cedex 1, 37032, Tours, France.
| |
Collapse
|
24
|
Arzalluz-Luque A, Salguero P, Tarazona S, Conesa A. acorde unravels functionally interpretable networks of isoform co-usage from single cell data. Nat Commun 2022; 13:1828. [PMID: 35383181 PMCID: PMC8983708 DOI: 10.1038/s41467-022-29497-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 03/16/2022] [Indexed: 12/13/2022] Open
Abstract
Alternative splicing (AS) is a highly-regulated post-transcriptional mechanism known to modulate isoform expression within genes and contribute to cell-type identity. However, the extent to which alternative isoforms establish co-expression networks that may be relevant in cellular function has not been explored yet. Here, we present acorde, a pipeline that successfully leverages bulk long reads and single-cell data to confidently detect alternative isoform co-expression relationships. To achieve this, we develop and validate percentile correlations, an innovative approach that overcomes data sparsity and yields accurate co-expression estimates from single-cell data. Next, acorde uses correlations to cluster co-expressed isoforms into a network, unraveling cell type-specific alternative isoform usage patterns. By selecting same-gene isoforms between these clusters, we subsequently detect and characterize genes with co-differential isoform usage (coDIU) across cell types. Finally, we predict functional elements from long read-defined isoforms and provide insight into biological processes, motifs, and domains potentially controlled by the coordination of post-transcriptional regulation. The code for acorde is available at https://github.com/ConesaLab/acorde .
Collapse
Affiliation(s)
- Angeles Arzalluz-Luque
- Department of Applied Statistics, Operations Research and Quality, Universitat Politècnica de València, Valencia, Spain
- Institute for Integrative Systems Biology (CSIC-UV), Spanish National Research Council, Paterna, Valencia, Spain
| | - Pedro Salguero
- Department of Applied Statistics, Operations Research and Quality, Universitat Politècnica de València, Valencia, Spain
| | - Sonia Tarazona
- Department of Applied Statistics, Operations Research and Quality, Universitat Politècnica de València, Valencia, Spain.
| | - Ana Conesa
- Institute for Integrative Systems Biology (CSIC-UV), Spanish National Research Council, Paterna, Valencia, Spain.
- Microbiology and Cell Sciences Department, Institute for Food and Agricultural Research, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
25
|
Ziv M, Gruber G, Sharon M, Vinogradov E, Yeger-Lotem E. The TissueNet v.3 database: Protein-protein interactions in adult and embryonic human tissue contexts. J Mol Biol 2022; 434:167532. [DOI: 10.1016/j.jmb.2022.167532] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/03/2022] [Accepted: 03/03/2022] [Indexed: 12/28/2022]
|
26
|
Wang Y, Hicks SC, Hansen KD. Addressing the mean-correlation relationship in co-expression analysis. PLoS Comput Biol 2022; 18:e1009954. [PMID: 35353807 PMCID: PMC9009771 DOI: 10.1371/journal.pcbi.1009954] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 04/14/2022] [Accepted: 02/22/2022] [Indexed: 12/13/2022] Open
Abstract
Estimates of correlation between pairs of genes in co-expression analysis are commonly used to construct networks among genes using gene expression data. As previously noted, the distribution of such correlations depends on the observed expression level of the involved genes, which we refer to this as a mean-correlation relationship in RNA-seq data, both bulk and single-cell. This dependence introduces an unwanted technical bias in co-expression analysis whereby highly expressed genes are more likely to be highly correlated. Such a relationship is not observed in protein-protein interaction data, suggesting that it is not reflecting biology. Ignoring this bias can lead to missing potentially biologically relevant pairs of genes that are lowly expressed, such as transcription factors. To address this problem, we introduce spatial quantile normalization (SpQN), a method for normalizing local distributions in a correlation matrix. We show that spatial quantile normalization removes the mean-correlation relationship and corrects the expression bias in network reconstruction.
Collapse
Affiliation(s)
- Yi Wang
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Stephanie C. Hicks
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Kasper D. Hansen
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, Maryland, United States of America
| |
Collapse
|
27
|
Li Y, Huang Y, Ren S, Xiao X, Cao H, He J. A Pan-Cancer Analysis of the Oncogenic Role of Nuclear Transport Factor 2 in Human Cancers. Front Oncol 2022; 12:829389. [PMID: 35155261 PMCID: PMC8831323 DOI: 10.3389/fonc.2022.829389] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Accepted: 01/04/2022] [Indexed: 12/24/2022] Open
Abstract
Nuclear transport factor 2 (NUTF2) is a GDP-binding protein that participates in the nucleocytoplasmic transport process. The role of NUTF2 in cancer development is largely unknown and lacks systemic assessment across human cancers. In this study, we performed a pan-cancer analysis of NUTF2 in human cancers. Out of 33 types of cancers, 19 types had significantly different expression of NUTF2 between tumor and normal tissues. Meanwhile, survival analysis showed that NUTF2 could be an independent prognostic factor in several tumor types. Further analysis suggested that the expression of NUTF2 expression was correlated with the infiltration of immune cells, such as CD8+ T cells, effector memory CD4+ T cells, and cancer-associated fibroblasts in kidney renal clear cell carcinoma. Moreover, co-expression analysis showed the positive association between NUTF2 and cell proliferation biomarkers (MKI67and PCNA) and epithelial–mesenchymal transition markers (VIM, TWIST1, SNAI1, SNAI2, FN1, and CDH2), suggesting that NUTF2 plays important roles in regulating cancer proliferation and metastasis. This pan-cancer analysis of NUTF2 provides a systemic understanding of its oncogenic role across different types of cancers.
Collapse
Affiliation(s)
- Yu Li
- Department of Laboratory Medicine, Peking University Shenzhen Hospital, Shenzhen, China
| | - Yongsheng Huang
- Cellular & Molecular Diagnostics Center, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China.,Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Medical Research Center, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Shuwei Ren
- Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Diseases, Sun Yat-Sen University Sixth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Xing Xiao
- Department of Dermatology, Shenzhen Children's Hospital, Shenzhen, China
| | - Haotian Cao
- Department of Oral and Maxillofacial Surgery, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Juan He
- Department of Rheumatology and Immunology, Peking University Shenzhen Hospital, Shenzhen, China.,Shenzhen Key Laboratory of Immunity and Inflammatory Diseases, Shenzhen, China
| |
Collapse
|
28
|
Yuan K, Zeng T, Chen L. Interpreting Functional Impact of Genetic Variations by Network QTL for Genotype–Phenotype Association Study. Front Cell Dev Biol 2022; 9:720321. [PMID: 35155440 PMCID: PMC8826544 DOI: 10.3389/fcell.2021.720321] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 12/13/2021] [Indexed: 12/18/2022] Open
Abstract
An enormous challenge in the post-genome era is to annotate and resolve the consequences of genetic variation on diverse phenotypes. The genome-wide association study (GWAS) is a well-known method to identify potential genetic loci for complex traits from huge genetic variations, following which it is crucial to identify expression quantitative trait loci (eQTL). However, the conventional eQTL methods usually disregard the systematical role of single-nucleotide polymorphisms (SNPs) or genes, thereby overlooking many network-associated phenotypic determinates. Such a problem motivates us to recognize the network-based quantitative trait loci (QTL), i.e., network QTL (nQTL), which is to detect the cascade association as genotype → network → phenotype rather than conventional genotype → expression → phenotype in eQTL. Specifically, we develop the nQTL framework on the theory and approach of single-sample networks, which can identify not only network traits (e.g., the gene subnetwork associated with genotype) for analyzing complex biological processes but also network signatures (e.g., the interactive gene biomarker candidates screened from network traits) for characterizing targeted phenotype and corresponding subtypes. Our results show that the nQTL framework can efficiently capture associations between SNPs and network traits (i.e., edge traits) in various simulated data scenarios, compared with traditional eQTL methods. Furthermore, we have carried out nQTL analysis on diverse biological and biomedical datasets. Our analysis is effective in detecting network traits for various biological problems and can discover many network signatures for discriminating phenotypes, which can help interpret the influence of nQTL on disease subtyping, disease prognosis, drug response, and pathogen factor association. Particularly, in contrast to the conventional approaches, the nQTL framework could also identify many network traits from human bulk expression data, validated by matched single-cell RNA-seq data in an independent or unsupervised manner. All these results strongly support that nQTL and its detection framework can simultaneously explore the global genotype–network–phenotype associations and the underlying network traits or network signatures with functional impact and importance.
Collapse
Affiliation(s)
- Kai Yuan
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, China
| | - Tao Zeng
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, China
- Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- Guangzhou Laboratory, Guangzhou, China
- *Correspondence: Tao Zeng, ; Luonan Chen,
| | - Luonan Chen
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, China
- Key Laboratory of Systems Health Science of Zhejiang Province, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Hangzhou, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
- *Correspondence: Tao Zeng, ; Luonan Chen,
| |
Collapse
|
29
|
Clark KC, Kwitek AE. Multi-Omic Approaches to Identify Genetic Factors in Metabolic Syndrome. Compr Physiol 2021; 12:3045-3084. [PMID: 34964118 PMCID: PMC9373910 DOI: 10.1002/cphy.c210010] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Metabolic syndrome (MetS) is a highly heritable disease and a major public health burden worldwide. MetS diagnosis criteria are met by the simultaneous presence of any three of the following: high triglycerides, low HDL/high LDL cholesterol, insulin resistance, hypertension, and central obesity. These diseases act synergistically in people suffering from MetS and dramatically increase risk of morbidity and mortality due to stroke and cardiovascular disease, as well as certain cancers. Each of these component features is itself a complex disease, as is MetS. As a genetically complex disease, genetic risk factors for MetS are numerous, but not very powerful individually, often requiring specific environmental stressors for the disease to manifest. When taken together, all sequence variants that contribute to MetS disease risk explain only a fraction of the heritable variance, suggesting additional, novel loci have yet to be discovered. In this article, we will give a brief overview on the genetic concepts needed to interpret genome-wide association studies (GWAS) and quantitative trait locus (QTL) data, summarize the state of the field of MetS physiological genomics, and to introduce tools and resources that can be used by the physiologist to integrate genomics into their own research on MetS and any of its component features. There is a wealth of phenotypic and molecular data in animal models and humans that can be leveraged as outlined in this article. Integrating these multi-omic QTL data for complex diseases such as MetS provides a means to unravel the pathways and mechanisms leading to complex disease and promise for novel treatments. © 2022 American Physiological Society. Compr Physiol 12:1-40, 2022.
Collapse
Affiliation(s)
- Karen C Clark
- Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| | - Anne E Kwitek
- Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| |
Collapse
|
30
|
Verta JP, Jacobs A. The role of alternative splicing in adaptation and evolution. Trends Ecol Evol 2021; 37:299-308. [PMID: 34920907 DOI: 10.1016/j.tree.2021.11.010] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 10/26/2021] [Accepted: 11/19/2021] [Indexed: 01/02/2023]
Abstract
Regulation of gene expression plays a central role in adaptive divergence and evolution. Although the role of gene regulation in microevolutionary processes is gaining wide acceptance, most studies have only investigated the evolution of transcript levels, ignoring the potentially significant role of transcript structures. We argue that variation in alternative splicing plays an important and widely unexplored role in adaptation (e.g., by increasing transcriptome and/or proteome diversity, or buffering potentially deleterious genetic variation). New studies increasingly highlight the potential for independent evolution in alternative splicing and transcript level, providing alternative paths for selection to act upon. We propose that alternative splicing and transcript levels can provide contrasting, nonredundant mechanisms of equal importance for adaptive diversification of gene function and regulation.
Collapse
Affiliation(s)
- Jukka-Pekka Verta
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Viikinkaari 9, 00790, Helsinki, Finland.
| | - Arne Jacobs
- Institute of Biodiversity, Animal Health, and Comparative Medicine, University of Glasgow, G12 8QQ, Glasgow, UK.
| |
Collapse
|
31
|
Gurnari C, Pagliuca S, Visconte V. Alternative Splicing in Myeloid Malignancies. Biomedicines 2021; 9:biomedicines9121844. [PMID: 34944660 PMCID: PMC8698609 DOI: 10.3390/biomedicines9121844] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 11/28/2021] [Accepted: 12/03/2021] [Indexed: 01/02/2023] Open
Abstract
Alternative RNA splicing (AS) is an essential physiologic function that diversifies the human proteome. AS also has a crucial role during cellular development. In fact, perturbations in RNA-splicing have been implicated in the development of several cancers, including myeloid malignancies. Splicing dysfunction can be independent of genetic lesions or appear as a direct consequence of mutations in components of the RNA-splicing machinery, such as in the case of mutations occurring in splicing factor genes (i.e., SF3B1, SRSF2, U2AF1) and their regulators. In addition, cancer cells exhibit marked gene expression alterations, including different usage of AS isoforms, possibly causing tissue-specific effects and perturbations of downstream pathways. This review summarizes several modalities leading to splicing diversity in myeloid malignancies.
Collapse
Affiliation(s)
- Carmelo Gurnari
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH 44195, USA; (C.G.); (S.P.)
- Department of Biomedicine and Prevention, University of Rome Tor Vergata, 00133 Rome, Italy
| | - Simona Pagliuca
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH 44195, USA; (C.G.); (S.P.)
| | - Valeria Visconte
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH 44195, USA; (C.G.); (S.P.)
- Correspondence:
| |
Collapse
|
32
|
Buphamalai P, Kokotovic T, Nagy V, Menche J. Network analysis reveals rare disease signatures across multiple levels of biological organization. Nat Commun 2021; 12:6306. [PMID: 34753928 PMCID: PMC8578255 DOI: 10.1038/s41467-021-26674-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 10/19/2021] [Indexed: 01/26/2023] Open
Abstract
Rare genetic diseases are typically caused by a single gene defect. Despite this clear causal relationship between genotype and phenotype, identifying the pathobiological mechanisms at various levels of biological organization remains a practical and conceptual challenge. Here, we introduce a network approach for evaluating the impact of rare gene defects across biological scales. We construct a multiplex network consisting of over 20 million gene relationships that are organized into 46 network layers spanning six major biological scales between genotype and phenotype. A comprehensive analysis of 3,771 rare diseases reveals distinct phenotypic modules within individual layers. These modules can be exploited to mechanistically dissect the impact of gene defects and accurately predict rare disease gene candidates. Our results show that the disease module formalism can be applied to rare diseases and generalized beyond physical interaction networks. These findings open up new venues to apply network-based tools for cross-scale data integration.
Collapse
Affiliation(s)
- Pisanu Buphamalai
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, 1090, Vienna, Austria
- Department of Structural and Computational Biology, Max Perutz Labs, University of Vienna, Campus Vienna BioCenter 5, 1030, Vienna, Austria
| | - Tomislav Kokotovic
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, 1090, Vienna, Austria
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Lazarettgasse 14, AKH BT 25.3, 1090, Vienna, Austria
- Department of Neurology, Medical University of Vienna, Währinger Gürtel 18-20, 1090, Vienna, Austria
| | - Vanja Nagy
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, 1090, Vienna, Austria
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Lazarettgasse 14, AKH BT 25.3, 1090, Vienna, Austria
- Department of Neurology, Medical University of Vienna, Währinger Gürtel 18-20, 1090, Vienna, Austria
| | - Jörg Menche
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3, 1090, Vienna, Austria.
- Department of Structural and Computational Biology, Max Perutz Labs, University of Vienna, Campus Vienna BioCenter 5, 1030, Vienna, Austria.
- Faculty of Mathematics, University of Vienna, Oskar-Morgenstern-Platz 1, 1090, Vienna, Austria.
| |
Collapse
|
33
|
Filho JAF, Rosolen RR, Almeida DA, de Azevedo PHC, Motta MLL, Aono AH, dos Santos CA, Horta MAC, de Souza AP. Trends in biological data integration for the selection of enzymes and transcription factors related to cellulose and hemicellulose degradation in fungi. 3 Biotech 2021; 11:475. [PMID: 34777932 PMCID: PMC8548487 DOI: 10.1007/s13205-021-03032-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 10/15/2021] [Indexed: 12/13/2022] Open
Abstract
Fungi are key players in biotechnological applications. Although several studies focusing on fungal diversity and genetics have been performed, many details of fungal biology remain unknown, including how cellulolytic enzymes are modulated within these organisms to allow changes in main plant cell wall compounds, cellulose and hemicellulose, and subsequent biomass conversion. With the advent and consolidation of DNA/RNA sequencing technology, different types of information can be generated at the genomic, structural and functional levels, including the gene expression profiles and regulatory mechanisms of these organisms, during degradation-induced conditions. This increase in data generation made rapid computational development necessary to deal with the large amounts of data generated. In this context, the origination of bioinformatics, a hybrid science integrating biological data with various techniques for information storage, distribution and analysis, was a fundamental step toward the current state-of-the-art in the postgenomic era. The possibility of integrating biological big data has facilitated exciting discoveries, including identifying novel mechanisms and more efficient enzymes, increasing yields, reducing costs and expanding opportunities in the bioprocess field. In this review, we summarize the current status and trends of the integration of different types of biological data through bioinformatics approaches for biological data analysis and enzyme selection.
Collapse
Affiliation(s)
- Jaire A. Ferreira Filho
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
| | - Rafaela R. Rosolen
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
| | - Deborah A. Almeida
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
| | - Paulo Henrique C. de Azevedo
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
| | - Maria Lorenza L. Motta
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
| | - Alexandre H. Aono
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
| | - Clelton A. dos Santos
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
- Brazilian Biorenewables National Laboratory (LNBR), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, SP Brazil
| | - Maria Augusta C. Horta
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
- Faculty of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP Brazil
| | - Anete P. de Souza
- Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, SP Brazil
- Department of Plant Biology, Institute of Biology, UNICAMP, Universidade Estadual de Campinas, Campinas, SP 13083-875 Brazil
| |
Collapse
|
34
|
Jacobs A, Elmer KR. Alternative splicing and gene expression play contrasting roles in the parallel phenotypic evolution of a salmonid fish. Mol Ecol 2021; 30:4955-4969. [PMID: 33502030 PMCID: PMC8653899 DOI: 10.1111/mec.15817] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 01/06/2021] [Accepted: 01/18/2021] [Indexed: 12/25/2022]
Abstract
Understanding the contribution of different molecular processes to evolution and development is crucial for identifying the mechanisms of adaptation. Here, we used RNA-sequencing data to test the importance of alternative splicing and differential gene expression in a case of parallel adaptive evolution, the replicated postglacial divergence of the salmonid fish Arctic charr (Salvelinus alpinus) into sympatric benthic and pelagic ecotypes across multiple independent lakes. We found that genes differentially spliced between ecotypes were mostly not differentially expressed (<6% overlap) and were involved in different biological processes. Differentially spliced genes were primarily enriched for muscle development and functioning, while differentially expressed genes were involved in metabolism, immunity and growth. Furthermore, alternative splicing and gene expression were mostly controlled by independent cis-regulatory quantitative trait loci (<3.4% overlap). Cis-regulatory regions were associated with the parallel divergence in splicing (16.5% of intron clusters) and expression (6.7%-10.1% of differentially expressed genes), indicating shared regulatory variation across ecotype pairs. Contrary to theoretical expectation, we found that differentially spliced genes tended to be highly central in regulatory networks ("hub genes") and were annotated to significantly more gene ontology terms compared to nondifferentially spliced genes, consistent with a higher level of pleiotropy. Together, our results suggest that the concerted regulation of alternative splicing and differential gene expression through different regulatory regions leads to the divergence of complementary processes important for local adaptation. This provides novel insights into the importance of contrasting but putatively complementary molecular processes in rapid parallel adaptive evolution.
Collapse
Affiliation(s)
- Arne Jacobs
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary & Life SciencesUniversity of GlasgowGlasgowUK
- Department of Natural ResourcesCornell UniversityIthacaNYUSA
| | - Kathryn R. Elmer
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary & Life SciencesUniversity of GlasgowGlasgowUK
| |
Collapse
|
35
|
Saint-André V. Computational biology approaches for mapping transcriptional regulatory networks. Comput Struct Biotechnol J 2021; 19:4884-4895. [PMID: 34522292 PMCID: PMC8426465 DOI: 10.1016/j.csbj.2021.08.028] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 08/16/2021] [Accepted: 08/16/2021] [Indexed: 12/13/2022] Open
Abstract
Transcriptional Regulatory Networks (TRNs) are mainly responsible for the cell-type- or cell-state-specific expression of gene sets from the same DNA sequence. However, so far there are no precise maps of TRNs available for each cell-type or cell-state, and no ideal tool to map those networks clearly and in full from biological samples. In this review, major approaches and tools to map TRNs from high-throughput data are presented, depending on the type of methods or data used to infer them, and their advantages and limitations are discussed. After summarizing the main principles defining the topology and structure–function relationships in TRNs, an overview of the extensive work done to map TRNs from bulk transcriptomic data will be presented by type of methodological approach. Most recent modellings of TRNs using other types of molecular data or integrating different data types, including single-cell RNA-sequencing and chromatin information, will then be discussed, before briefly concluding with improvements expected to come in the field.
Collapse
Affiliation(s)
- Violaine Saint-André
- Hub de Bioinformatique et Biostatistique - Département Biologie Computationnelle, Institut Pasteur, Paris, France
| |
Collapse
|
36
|
Chen S, Ren C, Zhai J, Yu J, Zhao X, Li Z, Zhang T, Ma W, Han Z, Ma C. CAFU: a Galaxy framework for exploring unmapped RNA-Seq data. Brief Bioinform 2021; 21:676-686. [PMID: 30815667 PMCID: PMC7299299 DOI: 10.1093/bib/bbz018] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Revised: 01/23/2019] [Accepted: 01/27/2019] [Indexed: 12/13/2022] Open
Abstract
A widely used approach in transcriptome analysis is the alignment of short reads to a reference genome. However, owing to the deficiencies of specially designed analytical systems, short reads unmapped to the genome sequence are usually ignored, resulting in the loss of significant biological information and insights. To fill this gap, we present Comprehensive Assembly and Functional annotation of Unmapped RNA-Seq data (CAFU), a Galaxy-based framework that can facilitate the large-scale analysis of unmapped RNA sequencing (RNA-Seq) reads from single- and mixed-species samples. By taking advantage of machine learning techniques, CAFU addresses the issue of accurately identifying the species origin of transcripts assembled using unmapped reads from mixed-species samples. CAFU also represents an innovation in that it provides a comprehensive collection of functions required for transcript confidence evaluation, coding potential calculation, sequence and expression characterization and function annotation. These functions and their dependencies have been integrated into a Galaxy framework that provides access to CAFU via a user-friendly interface, dramatically simplifying complex exploration tasks involving unmapped RNA-Seq reads. CAFU has been validated with RNA-Seq data sets from wheat and Zea mays (maize) samples. CAFU is freely available via GitHub: https://github.com/cma2015/CAFU.
Collapse
Affiliation(s)
- Siyuan Chen
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University
| | - Chengzhi Ren
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University
| | - Jingjing Zhai
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University
| | - Jiantao Yu
- College of Information Engineering, Northwest Agriculture and Forestry University
| | - Xuyang Zhao
- College of Information Engineering, Northwest Agriculture and Forestry University
| | - Zelong Li
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University
| | - Ting Zhang
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University
| | - Wenlong Ma
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University
| | - Zhaoxue Han
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University
| | - Chuang Ma
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University
| |
Collapse
|
37
|
Ovens K, Eames BF, McQuillan I. Comparative Analyses of Gene Co-expression Networks: Implementations and Applications in the Study of Evolution. Front Genet 2021; 12:695399. [PMID: 34484293 PMCID: PMC8414652 DOI: 10.3389/fgene.2021.695399] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 07/19/2021] [Indexed: 11/13/2022] Open
Abstract
Similarities and differences in the associations of biological entities among species can provide us with a better understanding of evolutionary relationships. Often the evolution of new phenotypes results from changes to interactions in pre-existing biological networks and comparing networks across species can identify evidence of conservation or adaptation. Gene co-expression networks (GCNs), constructed from high-throughput gene expression data, can be used to understand evolution and the rise of new phenotypes. The increasing abundance of gene expression data makes GCNs a valuable tool for the study of evolution in non-model organisms. In this paper, we cover motivations for why comparing these networks across species can be valuable for the study of evolution. We also review techniques for comparing GCNs in the context of evolution, including local and global methods of graph alignment. While some protein-protein interaction (PPI) bioinformatic methods can be used to compare co-expression networks, they often disregard highly relevant properties, including the existence of continuous and negative values for edge weights. Also, the lack of comparative datasets in non-model organisms has hindered the study of evolution using PPI networks. We also discuss limitations and challenges associated with cross-species comparison using GCNs, and provide suggestions for utilizing co-expression network alignments as an indispensable tool for evolutionary studies going forward.
Collapse
Affiliation(s)
- Katie Ovens
- Augmented Intelligence & Precision Health Laboratory (AIPHL), Research Institute of the McGill University Health Centre, Montreal, QC, Canada
| | - B. Frank Eames
- Department of Anatomy, Physiology, & Pharmacology, University of Saskatchewan, Saskatoon, SK, Canada
| | - Ian McQuillan
- Department of Computer Science, University of Saskatchewan, Saskatoon, SK, Canada
| |
Collapse
|
38
|
Multi-omics in mesial temporal lobe epilepsy with hippocampal sclerosis: Clues into the underlying mechanisms leading to disease. Seizure 2021; 90:34-50. [DOI: 10.1016/j.seizure.2021.03.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Revised: 02/26/2021] [Accepted: 03/02/2021] [Indexed: 02/07/2023] Open
|
39
|
Skinnider MA, Scott NE, Prudova A, Kerr CH, Stoynov N, Stacey RG, Chan QWT, Rattray D, Gsponer J, Foster LJ. An atlas of protein-protein interactions across mouse tissues. Cell 2021; 184:4073-4089.e17. [PMID: 34214469 DOI: 10.1016/j.cell.2021.06.003] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 04/05/2021] [Accepted: 06/01/2021] [Indexed: 12/20/2022]
Abstract
Cellular processes arise from the dynamic organization of proteins in networks of physical interactions. Mapping the interactome has therefore been a central objective of high-throughput biology. However, the dynamics of protein interactions across physiological contexts remain poorly understood. Here, we develop a quantitative proteomic approach combining protein correlation profiling with stable isotope labeling of mammals (PCP-SILAM) to map the interactomes of seven mouse tissues. The resulting maps provide a proteome-scale survey of interactome rewiring across mammalian tissues, revealing more than 125,000 unique interactions at a quality comparable to the highest-quality human screens. We identify systematic suppression of cross-talk between the evolutionarily ancient housekeeping interactome and younger, tissue-specific modules. Rewired proteins are tightly regulated by multiple cellular mechanisms and are implicated in disease. Our study opens up new avenues to uncover regulatory mechanisms that shape in vivo interactome responses to physiological and pathophysiological stimuli in mammalian systems.
Collapse
Affiliation(s)
- Michael A Skinnider
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Nichollas E Scott
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada; Peter Doherty Institute, Department of Microbiology and Immunology, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Anna Prudova
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Craig H Kerr
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada; Department of Biochemistry & Molecular Biology, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| | - Nikolay Stoynov
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - R Greg Stacey
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Queenie W T Chan
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - David Rattray
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada; Department of Biochemistry & Molecular Biology, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| | - Jörg Gsponer
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada; Department of Biochemistry & Molecular Biology, University of British Columbia, Vancouver, BC V6T 1Z3, Canada.
| | - Leonard J Foster
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada; Department of Biochemistry & Molecular Biology, University of British Columbia, Vancouver, BC V6T 1Z3, Canada.
| |
Collapse
|
40
|
Degtyareva AO, Antontseva EV, Merkulova TI. Regulatory SNPs: Altered Transcription Factor Binding Sites Implicated in Complex Traits and Diseases. Int J Mol Sci 2021; 22:6454. [PMID: 34208629 PMCID: PMC8235176 DOI: 10.3390/ijms22126454] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 06/15/2021] [Accepted: 06/15/2021] [Indexed: 12/19/2022] Open
Abstract
The vast majority of the genetic variants (mainly SNPs) associated with various human traits and diseases map to a noncoding part of the genome and are enriched in its regulatory compartment, suggesting that many causal variants may affect gene expression. The leading mechanism of action of these SNPs consists in the alterations in the transcription factor binding via creation or disruption of transcription factor binding sites (TFBSs) or some change in the affinity of these regulatory proteins to their cognate sites. In this review, we first focus on the history of the discovery of regulatory SNPs (rSNPs) and systematized description of the existing methodical approaches to their study. Then, we brief the recent comprehensive examples of rSNPs studied from the discovery of the changes in the TFBS sequence as a result of a nucleotide substitution to identification of its effect on the target gene expression and, eventually, to phenotype. We also describe state-of-the-art genome-wide approaches to identification of regulatory variants, including both making molecular sense of genome-wide association studies (GWAS) and the alternative approaches the primary goal of which is to determine the functionality of genetic variants. Among these approaches, special attention is paid to expression quantitative trait loci (eQTLs) analysis and the search for allele-specific events in RNA-seq (ASE events) as well as in ChIP-seq, DNase-seq, and ATAC-seq (ASB events) data.
Collapse
Affiliation(s)
- Arina O. Degtyareva
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
| | - Elena V. Antontseva
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
| | - Tatiana I. Merkulova
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
| |
Collapse
|
41
|
Xiao L, Zou G, Cheng R, Wang P, Ma K, Cao H, Zhou W, Jin X, Xu Z, Huang Y, Lin X, Nie H, Jiang Q. Alternative splicing associated with cancer stemness in kidney renal clear cell carcinoma. BMC Cancer 2021; 21:703. [PMID: 34130646 PMCID: PMC8204412 DOI: 10.1186/s12885-021-08470-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 06/03/2021] [Indexed: 12/20/2022] Open
Abstract
Backgroud Cancer stemness is associated with metastases in kidney renal clear cell carcinoma (KIRC) and negatively correlates with immune infiltrates. Recent stemness evaluation methods based on the absolute expression have been proposed to reveal the relationship between stemness and cancer. However, we found that existing methods do not perform well in assessing the stemness of KIRC patients, and they overlooked the impact of alternative splicing. Alternative splicing not only progresses during the differentiation of stem cells, but also changes during the acquisition of the stemness features of cancer stem cells. There is an urgent need for a new method to predict KIRC-specific stemness more accurately, so as to provide help in selecting treatment options. Methods The corresponding RNA-Seq data were obtained from the The Cancer Genome Atlas (TCGA) data portal. We also downloaded stem cell RNA sequence data from the Progenitor Cell Biology Consortium (PCBC) Synapse Portal. Independent validation sets with large sample size and common clinic pathological characteristics were obtained from the Gene Expression Omnibus (GEO) database. we constructed a KIRC-specific stemness prediction model using an algorithm called one-class logistic regression based on the expression and alternative splicing data to predict stemness indices of KIRC patients, and the model was externally validated. We identify stemness-associated alternative splicing events (SASEs) by analyzing different alternative splicing event between high- and low- stemness groups. Univariate Cox and multivariable logistic regression analysisw as carried out to detect the prognosis-related SASEs respectively. The area under curve (AUC) of receiver operating characteristic (ROC) was performed to evaluate the predictive values of our model. Results Here, we constructed a KIRC-specific stemness prediction model with an AUC of 0.968,and to provide a user-friendly interface of our model for KIRC stemness analysis, we have developed KIRC Stemness Calculator and Visualization (KSCV), hosted on the Shiny server, can most easily be accessed via web browser and the url https://jiang-lab.shinyapps.io/kscv/. When applied to 605 KIRC patients, our stemness indices had a higher correlation with the gender, smoking history and metastasis of the patients than the previous stemness indices, and revealed intratumor heterogeneity at the stemness level. We identified 77 novel SASEs by dividing patients into high- and low- stemness groups with significantly different outcome and they had significant correlations with expression of 17 experimentally validated splicing factors. Both univariate and multivariate survival analysis demonstrated that SASEs closely correlated with the overall survival of patients. Conclusions Basing on the stemness indices, we found that not only immune infiltration but also alternative splicing events showed significant different at the stemness level. More importantly, we highlight the critical role of these differential alternative splicing events in poor prognosis, and we believe in the potential for their further translation into targets for immunotherapy. Supplementary Information The online version contains supplementary material available at 10.1186/s12885-021-08470-8.
Collapse
Affiliation(s)
- Lixing Xiao
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Guoying Zou
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Rui Cheng
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Pingping Wang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Kexin Ma
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Huimin Cao
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Wenyang Zhou
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Xiyun Jin
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Zhaochun Xu
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Yan Huang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Xiaoyu Lin
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Huan Nie
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China.
| | - Qinghua Jiang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China. .,Key Laboratory of Biological Big Data (Harbin Institute of Technology), Ministry of Education, Harbin, China.
| |
Collapse
|
42
|
Mao W, Rahimikollu J, Hausler R, Chikina M. DataRemix: a universal data transformation for optimal inference from gene expression datasets. Bioinformatics 2021; 37:984-991. [PMID: 32821903 DOI: 10.1093/bioinformatics/btaa745] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/01/2020] [Accepted: 08/17/2020] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION RNA-seq technology provides unprecedented power in the assessment of the transcription abundance and can be used to perform a variety of downstream tasks such as inference of gene-correlation network and eQTL discovery. However, raw gene expression values have to be normalized for nuisance biological variation and technical covariates, and different normalization strategies can lead to dramatically different results in the downstream study. RESULTS We describe a generalization of singular value decomposition-based reconstruction for which the common techniques of whitening, rank-k approximation and removing the top k principal components are special cases. Our simple three-parameter transformation, DataRemix, can be tuned to reweigh the contribution of hidden factors and reveal otherwise hidden biological signals. In particular, we demonstrate that the method can effectively prioritize biological signals over noise without leveraging external dataset-specific knowledge, and can outperform normalization methods that make explicit use of known technical factors. We also show that DataRemix can be efficiently optimized via Thompson sampling approach, which makes it feasible for computationally expensive objectives such as eQTL analysis. Finally, we apply our method to the Religious Orders Study and Memory and Aging Project dataset, and we report what to our knowledge is the first replicable trans-eQTL effect in human brain. AVAILABILITYAND IMPLEMENTATION DataRemix is an R package which is freely available at GitHub (https://github.com/wgmao/DataRemix). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Weiguang Mao
- Joint Carnegie Mellon-University of Pittsburgh Ph.D. Program in Computational Biology, Pittsburgh, PA 15260, USA.,Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Javad Rahimikollu
- Joint Carnegie Mellon-University of Pittsburgh Ph.D. Program in Computational Biology, Pittsburgh, PA 15260, USA.,Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Ryan Hausler
- Department of Medicine, Division of Hematology/Oncology,, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Maria Chikina
- Joint Carnegie Mellon-University of Pittsburgh Ph.D. Program in Computational Biology, Pittsburgh, PA 15260, USA.,Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| |
Collapse
|
43
|
Jones DC, Ruzzo WL. Polee: RNA-Seq analysis using approximate likelihood. NAR Genom Bioinform 2021; 3:lqab046. [PMID: 34056596 PMCID: PMC8152449 DOI: 10.1093/nargab/lqab046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 04/11/2021] [Accepted: 05/11/2021] [Indexed: 12/20/2022] Open
Abstract
The analysis of mRNA transcript abundance with RNA-Seq is a central tool in molecular biology research, but often analyses fail to account for the uncertainty in these estimates, which can be significant, especially when trying to disentangle isoforms or duplicated genes. Preserving uncertainty necessitates a full probabilistic model of the all the sequencing reads, which quickly becomes intractable, as experiments can consist of billions of reads. To overcome these limitations, we propose a new method of approximating the likelihood function of a sparse mixture model, using a technique we call the Pólya tree transformation. We demonstrate that substituting this approximation for the real thing achieves most of the benefits with a fraction of the computational costs, leading to more accurate detection of differential transcript expression and transcript coexpression.
Collapse
Affiliation(s)
- Daniel C Jones
- Paul G. Allen School of Computer Science & Engineering, University of Washington, Box 352350, Seattle, WA 98195-2350, USA
| | - Walter L Ruzzo
- Paul G. Allen School of Computer Science & Engineering, University of Washington, Box 352350, Seattle, WA 98195-2350, USA
- Department of Genome Sciences, University of Washington, Box 355065, Seattle, WA 98195-5065, USA
- Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., P.O. Box 19024, Seattle, WA 98109, USA
| |
Collapse
|
44
|
Multilayer modelling of the human transcriptome and biological mechanisms of complex diseases and traits. NPJ Syst Biol Appl 2021; 7:24. [PMID: 34045472 PMCID: PMC8160250 DOI: 10.1038/s41540-021-00186-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 04/28/2021] [Indexed: 01/03/2023] Open
Abstract
Here, we performed a comprehensive intra-tissue and inter-tissue multilayer network analysis of the human transcriptome. We generated an atlas of communities in gene co-expression networks in 49 tissues (GTEx v8), evaluated their tissue specificity, and investigated their methodological implications. UMAP embeddings of gene expression from the communities (representing nearly 18% of all genes) robustly identified biologically-meaningful clusters. Notably, new gene expression data can be embedded into our algorithmically derived models to accelerate discoveries in high-dimensional molecular datasets and downstream diagnostic or prognostic applications. We demonstrate the generalisability of our approach through systematic testing in external genomic and transcriptomic datasets. Methodologically, prioritisation of the communities in a transcriptome-wide association study of the biomarker C-reactive protein (CRP) in 361,194 individuals in the UK Biobank identified genetically-determined expression changes associated with CRP and led to considerably improved performance. Furthermore, a deep learning framework applied to the communities in nearly 11,000 tumors profiled by The Cancer Genome Atlas across 33 different cancer types learned biologically-meaningful latent spaces, representing metastasis (p < 2.2 × 10−16) and stemness (p < 2.2 × 10−16). Our study provides a rich genomic resource to catalyse research into inter-tissue regulatory mechanisms, and their downstream consequences on human disease.
Collapse
|
45
|
Eales JM, Jiang X, Xu X, Saluja S, Akbarov A, Cano-Gamez E, McNulty MT, Finan C, Guo H, Wystrychowski W, Szulinska M, Thomas HB, Pramanik S, Chopade S, Prestes PR, Wise I, Evangelou E, Salehi M, Shakanti Y, Ekholm M, Denniff M, Nazgiewicz A, Eichinger F, Godfrey B, Antczak A, Glyda M, Król R, Eyre S, Brown J, Berzuini C, Bowes J, Caulfield M, Zukowska-Szczechowska E, Zywiec J, Bogdanski P, Kretzler M, Woolf AS, Talavera D, Keavney B, Maffia P, Guzik TJ, O'Keefe RT, Trynka G, Samani NJ, Hingorani A, Sampson MG, Morris AP, Charchar FJ, Tomaszewski M. Uncovering genetic mechanisms of hypertension through multi-omic analysis of the kidney. Nat Genet 2021; 53:630-637. [PMID: 33958779 DOI: 10.1038/s41588-021-00835-w] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 03/04/2021] [Indexed: 02/02/2023]
Abstract
The kidney is an organ of key relevance to blood pressure (BP) regulation, hypertension and antihypertensive treatment. However, genetically mediated renal mechanisms underlying susceptibility to hypertension remain poorly understood. We integrated genotype, gene expression, alternative splicing and DNA methylation profiles of up to 430 human kidneys to characterize the effects of BP index variants from genome-wide association studies (GWASs) on renal transcriptome and epigenome. We uncovered kidney targets for 479 (58.3%) BP-GWAS variants and paired 49 BP-GWAS kidney genes with 210 licensed drugs. Our colocalization and Mendelian randomization analyses identified 179 unique kidney genes with evidence of putatively causal effects on BP. Through Mendelian randomization, we also uncovered effects of BP on renal outcomes commonly affecting patients with hypertension. Collectively, our studies identified genetic variants, kidney genes, molecular mechanisms and biological pathways of key relevance to the genetic regulation of BP and inherited susceptibility to hypertension.
Collapse
Affiliation(s)
- James M Eales
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Xiao Jiang
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Xiaoguang Xu
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Sushant Saluja
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Artur Akbarov
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Eddie Cano-Gamez
- Department of Human Genetics, Wellcome Sanger Institute, Cambridge, UK
| | - Michelle T McNulty
- Division of Nephrology, Boston Children's Hospital, Boston, MA, USA.,The Broad Institute, Cambridge, MA, USA
| | - Christopher Finan
- Institute of Cardiovascular Science, University College London, London, UK
| | - Hui Guo
- Centre for Biostatistics, School of Health Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Wojciech Wystrychowski
- Department of General, Vascular and Transplant Surgery, Medical University of Silesia, Katowice, Poland
| | - Monika Szulinska
- Department of Obesity, Metabolic Disorders Treatment and Clinical Dietetics, Karol Marcinkowski University of Medical Sciences, Poznan, Poland
| | - Huw B Thomas
- Division of Evolution and Genomic Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Sanjeev Pramanik
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK.,East Lancashire Hospitals NHS Trust, Blackburn, UK
| | - Sandesh Chopade
- Institute of Cardiovascular Science, University College London, London, UK
| | - Priscilla R Prestes
- Health Innovation and Transformation Centre, School of Science, Psychology and Sport, Federation University Australia, Ballarat, Victoria, Australia
| | - Ingrid Wise
- Australian Institute of Tropical Health & Medicine, James Cook University, Cairns, Queensland, Australia
| | - Evangelos Evangelou
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK.,Department of Hygiene and Epidemiology, University of Ioannina Medical School, Ioannina, Greece
| | - Mahan Salehi
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Yusif Shakanti
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Mikael Ekholm
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK.,Department of Clinical Sciences, Danderyd Hospital, Karolinska Institutet, Stockholm, Sweden
| | - Matthew Denniff
- Department of Cardiovascular Sciences, University of Leicester, Leicester, UK
| | - Alicja Nazgiewicz
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Felix Eichinger
- Department of Medicine, University of Michigan, Ann Arbor, MI, USA
| | - Bradley Godfrey
- Department of Urology and Uro-oncology, Karol Marcinkowski University of Medical Sciences, Poznan, Poland
| | - Andrzej Antczak
- Department of Urology and Uro-oncology, Karol Marcinkowski University of Medical Sciences, Poznan, Poland
| | - Maciej Glyda
- Department of Transplantology and General Surgery Poznan, Collegium Medicum, Nicolaus Copernicus University, Bydgoszcz, Poland
| | - Robert Król
- Department of General, Vascular and Transplant Surgery, Medical University of Silesia, Katowice, Poland
| | - Stephen Eyre
- Division of Musculoskeletal and Dermatological Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Jason Brown
- Division of Research and Innovation, Manchester University NHS Foundation Trust, Manchester, UK
| | - Carlo Berzuini
- Centre for Biostatistics, School of Health Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - John Bowes
- Division of Musculoskeletal and Dermatological Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Mark Caulfield
- William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK.,National Institute for Health Research, Barts Cardiovascular Biomedical Research Centre, Queen Mary University of London, London, UK
| | | | - Joanna Zywiec
- Department of Internal Medicine, Diabetology and Nephrology, Zabrze, Medical University of Silesia, Katowice, Poland
| | - Pawel Bogdanski
- Department of Obesity, Metabolic Disorders Treatment and Clinical Dietetics, Karol Marcinkowski University of Medical Sciences, Poznan, Poland
| | | | - Adrian S Woolf
- Division of Cell Matrix Biology and Regenerative Medicine, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK.,Royal Manchester Children's Hospital and Manchester Academic Health Science Centre, Manchester University NHS Foundation Trust, Manchester, UK
| | - David Talavera
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Bernard Keavney
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK.,Division of Cardiology and Manchester Academic Health Science Centre, Manchester University NHS Foundation Trust, Manchester, UK
| | - Pasquale Maffia
- Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK.,Institute of Cardiovascular and Medical Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK.,Department of Pharmacy, University of Naples Federico II, Naples, Italy
| | - Tomasz J Guzik
- Institute of Cardiovascular and Medical Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK.,Department of Internal and Agricultural Medicine, Jagiellonian University College of Medicine, Kraków, Poland
| | - Raymond T O'Keefe
- Division of Evolution and Genomic Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK
| | - Gosia Trynka
- Department of Human Genetics, Wellcome Sanger Institute, Cambridge, UK.,Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - Nilesh J Samani
- Department of Cardiovascular Sciences, University of Leicester, Leicester, UK.,National Institute for Health Research, Leicester Biomedical Research Centre, Leicester, UK
| | - Aroon Hingorani
- Institute of Cardiovascular Science, University College London, London, UK
| | - Matthew G Sampson
- Division of Nephrology, Boston Children's Hospital, Boston, MA, USA.,The Broad Institute, Cambridge, MA, USA.,Harvard Medical School, Boston, MA, USA
| | - Andrew P Morris
- Division of Musculoskeletal and Dermatological Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK.,Department of Biostatistics, Institute of Translational Medicine, University of Liverpool, Liverpool, UK
| | - Fadi J Charchar
- Health Innovation and Transformation Centre, School of Science, Psychology and Sport, Federation University Australia, Ballarat, Victoria, Australia.,Department of Cardiovascular Sciences, University of Leicester, Leicester, UK.,Department of Physiology, University of Melbourne, Parkville, Victoria, Australia
| | - Maciej Tomaszewski
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, UK. .,Manchester Heart Centre and Manchester Academic Health Science Centre, Manchester University NHS Foundation Trust, Manchester, UK.
| |
Collapse
|
46
|
Functional annotation of lncRNA in high-throughput screening. Essays Biochem 2021; 65:761-773. [PMID: 33835127 PMCID: PMC8564734 DOI: 10.1042/ebc20200061] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 02/25/2021] [Accepted: 03/15/2021] [Indexed: 12/17/2022]
Abstract
Recent efforts on the characterization of long non-coding RNAs (lncRNAs) revealed their functional roles in modulating diverse cellular processes. These include pluripotency maintenance, lineage commitment, carcinogenesis, and pathogenesis of various diseases. By interacting with DNA, RNA and protein, lncRNAs mediate multifaceted mechanisms to regulate transcription, RNA processing, RNA interference and translation. Of more than 173000 discovered lncRNAs, the majority remain functionally unknown. The cell type-specific expression and localization of the lncRNA also suggest potential distinct functions of lncRNAs across different cell types. This highlights the niche of identifying functional lncRNAs in different biological processes and diseases through high-throughput (HTP) screening. This review summarizes the current work performed and perspectives on HTP screening of functional lncRNAs where different technologies, platforms, cellular responses and the downstream analyses are discussed. We hope to provide a better picture in applying different technologies to facilitate functional annotation of lncRNA efficiently.
Collapse
|
47
|
Sebastian-Leon P, Devesa-Peiro A, Aleman A, Parraga-Leo A, Arnau V, Pellicer A, Diaz-Gimeno P. Transcriptional changes through menstrual cycle reveal a global transcriptional derepression underlying the molecular mechanism involved in the window of implantation. Mol Hum Reprod 2021; 27:6217366. [PMID: 33830236 DOI: 10.1093/molehr/gaab027] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 03/27/2021] [Indexed: 12/20/2022] Open
Abstract
The human endometrium is a dynamic tissue that only is receptive to host the embryo during a brief time in the middle secretory phase, called the window of implantation (WOI). Despite its importance, regulation of the menstrual cycle remains incompletely understood. The aim of this study was to characterize the gene cooperation and regulation of menstrual cycle progression, to dissect the molecular complexity underlying acquisition of endometrial receptivity for a successful pregnancy, and to provide the scientific community with detailed gene co-expression information throughout the menstrual cycle on a user-friendly web-tool database. A retrospective gene co-expression analysis was performed based on the endometrial receptivity array (ERarray) gene signature from 523 human endometrial samples collected across the menstrual cycle, including during the WOI. Gene co-expression analysis revealed the WOI as having the significantly smallest proportion of negative correlations for transcriptional profiles associated with successful pregnancies compared to other cycle stages, pointing to a global transcriptional derepression being involved in acquisition of endometrial receptivity. Regulation was greatest during the transition between proliferative and secretory endometrial phases. Further, we prioritized nuclear hormone receptors as major regulators of this derepression and proved that some genes and transcription factors involved in this process were dysregulated in patients with recurrent implantation failure. We also compiled the wealth of gene co-expression data to stimulate hypothesis-driven single-molecule endometrial studies in a user-friendly database: Menstrual Cycle Gene Co-expression Network (www.menstrualcyclegcn.com). This study revealed a global transcriptional repression across the menstrual cycle, which relaxes when the WOI opens for transcriptional profiles associated with successful pregnancies. These findings suggest that a global transcriptional derepression is needed for embryo implantation and early development.
Collapse
Affiliation(s)
- P Sebastian-Leon
- Department of Genomic & Systems Reproductive Medicine, IVI-RMA IVI Foundation-Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| | - A Devesa-Peiro
- Department of Genomic & Systems Reproductive Medicine, IVI-RMA IVI Foundation-Instituto de Investigación Sanitaria La Fe, Valencia, Spain.,Department of Pediatrics, Obstetrics and Gynaecology, University of Valencia, Valencia, Spain
| | - A Aleman
- Department of Genomic & Systems Reproductive Medicine, IVI-RMA IVI Foundation-Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| | - A Parraga-Leo
- Department of Genomic & Systems Reproductive Medicine, IVI-RMA IVI Foundation-Instituto de Investigación Sanitaria La Fe, Valencia, Spain.,Department of Pediatrics, Obstetrics and Gynaecology, University of Valencia, Valencia, Spain
| | - V Arnau
- Bioinformatics, Escuela Técnica Superior de Ingeniería, Universidad de Valencia, Burjassot, Spain.,Institute for Integrative Systems Biology (I2SysBio), Universidad de Valencia-Consejo Superior de Investigaciones Científicas (CSIC), C/Catedrático Agustín Escardino Benlloch, Paterna, Spain
| | - A Pellicer
- Department of Genomic & Systems Reproductive Medicine, IVI-RMA IVI Foundation-Instituto de Investigación Sanitaria La Fe, Valencia, Spain.,Department of Pediatrics, Obstetrics and Gynaecology, University of Valencia, Valencia, Spain.,Reproductive Medicine, IVI-RMA IVI Rome, Rome, Italy
| | - P Diaz-Gimeno
- Department of Genomic & Systems Reproductive Medicine, IVI-RMA IVI Foundation-Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| |
Collapse
|
48
|
Zhang X, Cui Y, Wang J, Huang Y, Qi Y. Conserved co-functional network between maize and Arabidopsis aid in the identification of seed defective genes in maize. Genes Genomics 2021; 43:433-446. [PMID: 33651300 DOI: 10.1007/s13258-021-01067-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 02/17/2021] [Indexed: 01/12/2023]
Abstract
BACKGROUND The biological pathways related to Arabidopsis seed development have been well studied and functional genes involved in it have been discovered. However, functional studies about maize seed development were more limited compared to Arabidopsis. OBJECTIVE Therefore, transferring knowledge from Arabidopsis into maize would facilitate functional studies about maize seed development. METHOD In this study, public transcriptome data of the two species related to seed development were obtained. Co-expression network in each species was compared by integrating orthology information. RESULTS This conserved co-functional network contained 4510 maize and 4808 Arabidopsis genes, respectively. Most of these genes were expressed in throughout embryo, early or later endosperm/seed. These conserved co-functional genes were significantly enriched for members of PPR protein family, which was consistent with that PPR proteins play an important role in maize seed development. Spatial-temporally co-functional genes were discovered in the seed coat and embryo. Furthermore, 66 well-studied genes involved in Arabidopsis seed development were co-functional with 319 maize genes and one maize gene (GRMZM2G036050) was further confirmed using an EMS-induced seed defective mutant by bulked segregating RNA sequencing (BSR) analysis. CONCLUSIONS Altogether, these results showed the potential of this approach to support functional studies in maize seed development by transferring knowledge from Arabidopsis.
Collapse
Affiliation(s)
- Xiangbo Zhang
- Guangdong Sugarcane Genetic Improvement Engineering Center, Institute of Bioengineering, Guangdong Academy of Sciences, Guangzhou, 510316, China
| | - Yang Cui
- Sciences Rice and Sorghum Institude, Sichuan Academy of Agricultural, Deyang, 618000, China
| | - Juxuan Wang
- Yunnan Yingmao Sugar Industry (Group) Co. LTD, Kunming, 650228, China
| | - Yonghong Huang
- Guangdong Sugarcane Genetic Improvement Engineering Center, Institute of Bioengineering, Guangdong Academy of Sciences, Guangzhou, 510316, China
| | - Yongwen Qi
- Guangdong Sugarcane Genetic Improvement Engineering Center, Institute of Bioengineering, Guangdong Academy of Sciences, Guangzhou, 510316, China.
| |
Collapse
|
49
|
Zhang P, Southey BR, Sweedler JV, Pradhan A, Rodriguez-Zas SL. Enhanced Understanding of Molecular Interactions and Function Underlying Pain Processes Through Networks of Transcript Isoforms, Genes, and Gene Families. Adv Appl Bioinform Chem 2021; 14:49-69. [PMID: 33633454 PMCID: PMC7901473 DOI: 10.2147/aabc.s284986] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 01/05/2021] [Indexed: 11/23/2022] Open
Abstract
Introduction Molecular networks based on the abundance of mRNA at the gene level and pathway networks that relate families or groups of paralog genes have supported the understanding of interactions between molecules. However, multiple molecular mechanisms underlying health and behavior, such as pain signal processing, are modulated by the abundances of the transcript isoforms that originate from alternative splicing, in addition to gene abundances. Alternative splice variants of growth factors, ion channels, and G-protein-coupled receptors can code for proteoforms that can have different effects on pain and nociception. Therefore, networks inferred using abundance from more agglomerative molecular units (eg, gene family, or gene) have limitations in capturing interactions at a more granular level (eg, gene, or transcript isoform, respectively) do not account for changes in the abundance at the transcript isoform level. Objective The objective of this study was to evaluate the relative benefits of network inference using abundance patterns at various aggregate levels. Methods Sparse networks were inferred using Gaussian Markov random fields and a novel aggregation criterion was used to aggregate network edges. The relative advantages of network aggregation were evaluated on two molecular systems that have different dimensions and connectivity, circadian rhythm and Toll-like receptor pathways, using RNA-sequencing data from mice representing two pain level groups, opioid-induced hyperalgesia and control, and two central nervous system regions, the nucleus accumbens and the trigeminal ganglia. Results The inferred networks were benchmarked against the Kyoto Encyclopedia of Genes and Genomes reference pathways using multiple criteria. Networks inferred using more granular information performed better than networks inferred using more aggregate information. The advantage of granular inference varied with the pathway and data set used. Discussion The differences in inferred network structure between data sets highlight the differences in OIH effect between central nervous system regions. Our findings suggest that inference of networks using alternative splicing variants can offer complementary insights into the relationship between genes and gene paralog groups.
Collapse
Affiliation(s)
- Pan Zhang
- Illinois Informatics Institute, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Bruce R Southey
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Jonathan V Sweedler
- Department of Chemistry and the Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Amynah Pradhan
- Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA
| | - Sandra L Rodriguez-Zas
- Illinois Informatics Institute, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Department of Statistics, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| |
Collapse
|
50
|
Emad A, Sinha S. Inference of phenotype-relevant transcriptional regulatory networks elucidates cancer type-specific regulatory mechanisms in a pan-cancer study. NPJ Syst Biol Appl 2021; 7:9. [PMID: 33558504 PMCID: PMC7870953 DOI: 10.1038/s41540-021-00169-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Accepted: 01/05/2021] [Indexed: 01/30/2023] Open
Abstract
Reconstruction of transcriptional regulatory networks (TRNs) is a powerful approach to unravel the gene expression programs involved in healthy and disease states of a cell. However, these networks are usually reconstructed independent of the phenotypic (or clinical) properties of the samples. Therefore, they may confound regulatory mechanisms that are specifically related to a phenotypic property with more general mechanisms underlying the full complement of the analyzed samples. In this study, we develop a method called InPheRNo to identify "phenotype-relevant" TRNs. This method is based on a probabilistic graphical model that models the simultaneous effects of multiple transcription factors (TFs) on their target genes and the statistical relationship between the target genes' expression and the phenotype. Extensive comparison of InPheRNo with related approaches using primary tumor samples of 18 cancer types from The Cancer Genome Atlas reveals that InPheRNo can accurately reconstruct cancer type-relevant TRNs and identify cancer driver TFs. In addition, survival analysis reveals that the activity level of TFs with many target genes could distinguish patients with poor prognosis from those with better prognosis.
Collapse
Affiliation(s)
- Amin Emad
- Department of Electrical and Computer Engineering, McGill University, Montreal, QC, Canada.
| | - Saurabh Sinha
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
| |
Collapse
|