1
|
Ono K, Fong D, Gao C, Churas C, Pillich R, Lenkiewicz J, Pratt D, Pico AR, Hanspers K, Xin Y, Morris J, Kucera M, Franz M, Lopes C, Bader G, Ideker T, Chen J. Cytoscape Web: bringing network biology to the browser. Nucleic Acids Res 2025:gkaf365. [PMID: 40308211 DOI: 10.1093/nar/gkaf365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2025] [Revised: 04/08/2025] [Accepted: 04/22/2025] [Indexed: 05/02/2025] Open
Abstract
Since its introduction in 2003, Cytoscape has been a de facto standard for visualizing and analyzing biological networks. We now introduce Cytoscape Web (https://web.cytoscape.org), an online implementation that captures the interface and key visualization functionality of the desktop while providing integration with web tools and databases. Cytoscape Web enhances accessibility, simplifying collaboration through online data sharing. It integrates with Cytoscape desktop via the CX2 network exchange format and with the Network Data Exchange for storing and sharing networks. The platform supports extensibility through an App framework for UI components and Service Apps for algorithm integration, fostering community-driven development of new analysis tools. Overall, Cytoscape Web enhances network biology by providing a versatile, accessible, and collaborative online platform that adapts to evolving computational challenges, laying a foundation for future incorporation of advanced network analysis capabilities by the community.
Collapse
Affiliation(s)
- Keiichiro Ono
- Department of Medicine, University of California, San Diego, CA 92093, United States
| | - Dylan Fong
- Department of Medicine, University of California, San Diego, CA 92093, United States
| | - Chengzhan Gao
- Department of Medicine, University of California, San Diego, CA 92093, United States
| | - Christopher Churas
- Department of Medicine, University of California, San Diego, CA 92093, United States
| | - Rudolf Pillich
- Department of Medicine, University of California, San Diego, CA 92093, United States
| | - Joanna Lenkiewicz
- Department of Medicine, University of California, San Diego, CA 92093, United States
| | - Dexter Pratt
- Department of Medicine, University of California, San Diego, CA 92093, United States
| | - Alexander R Pico
- Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA 94158, United States
| | - Kristina Hanspers
- Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA 94158, United States
| | - Yihang Xin
- Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA 94158, United States
| | - John Morris
- Resource for Biocomputing, Visualization and Informatics, University of California, San Francisco, CA 94118, United States
| | - Mike Kucera
- The Donnelly Centre, University of Toronto, Ontario M5S 3E1, Canada
| | - Max Franz
- The Donnelly Centre, University of Toronto, Ontario M5S 3E1, Canada
| | - Christian Lopes
- The Donnelly Centre, University of Toronto, Ontario M5S 3E1, Canada
| | - Gary Bader
- The Donnelly Centre, University of Toronto, Ontario M5S 3E1, Canada
| | - Trey Ideker
- Department of Medicine, University of California, San Diego, CA 92093, United States
| | - Jing Chen
- Department of Medicine, University of California, San Diego, CA 92093, United States
| |
Collapse
|
2
|
Liang X, Zhang J, Kim Y, Ho J, Liu K, Keenum I, Gupta S, Davis B, Hepp SL, Zhang L, Xia K, Knowlton KF, Liao J, Vikesland PJ, Pruden A, Heath LS. ARGem: a new metagenomics pipeline for antibiotic resistance genes: metadata, analysis, and visualization. Front Genet 2023; 14:1219297. [PMID: 37811141 PMCID: PMC10558085 DOI: 10.3389/fgene.2023.1219297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 09/01/2023] [Indexed: 10/10/2023] Open
Abstract
Antibiotic resistance is of crucial interest to both human and animal medicine. It has been recognized that increased environmental monitoring of antibiotic resistance is needed. Metagenomic DNA sequencing is becoming an attractive method to profile antibiotic resistance genes (ARGs), including a special focus on pathogens. A number of computational pipelines are available and under development to support environmental ARG monitoring; the pipeline we present here is promising for general adoption for the purpose of harmonized global monitoring. Specifically, ARGem is a user-friendly pipeline that provides full-service analysis, from the initial DNA short reads to the final visualization of results. The capture of extensive metadata is also facilitated to support comparability across projects and broader monitoring goals. The ARGem pipeline offers efficient analysis of a modest number of samples along with affordable computational components, though the throughput could be increased through cloud resources, based on the user's configuration. The pipeline components were carefully assessed and selected to satisfy tradeoffs, balancing efficiency and flexibility. It was essential to provide a step to perform short read assembly in a reasonable time frame to ensure accurate annotation of identified ARGs. Comprehensive ARG and mobile genetic element databases are included in ARGem for annotation support. ARGem further includes an expandable set of analysis tools that include statistical and network analysis and supports various useful visualization techniques, including Cytoscape visualization of co-occurrence and correlation networks. The performance and flexibility of the ARGem pipeline is demonstrated with analysis of aquatic metagenomes. The pipeline is freely available at https://github.com/xlxlxlx/ARGem.
Collapse
Affiliation(s)
- Xiao Liang
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Jingyi Zhang
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Yoonjin Kim
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Josh Ho
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Kevin Liu
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Ishi Keenum
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Suraj Gupta
- Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Benjamin Davis
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Shannon L. Hepp
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Liqing Zhang
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Kang Xia
- School of Plant and Environmental Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Katharine F. Knowlton
- Department of Dairy Science, Virginia Polytechnic Institute and State University, Blacksburg, VaA, United States
| | - Jingqiu Liao
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Peter J. Vikesland
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Amy Pruden
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Lenwood S. Heath
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| |
Collapse
|
3
|
Köse TB, Li J, Ritz A. Growing Directed Acyclic Graphs: Optimization Functions for Pathway Reconstruction Algorithms. J Comput Biol 2023. [PMID: 36862510 DOI: 10.1089/cmb.2022.0376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/03/2023] Open
Abstract
A major challenge in molecular systems biology is to understand how proteins work to transmit external signals to changes in gene expression. Computationally reconstructing these signaling pathways from protein interaction networks can help understand what is missing from existing pathway databases. We formulate a new pathway reconstruction problem, one that iteratively grows directed acyclic graphs (DAGs) from a set of starting proteins in a protein interaction network. We present an algorithm that provably returns the optimal DAGs for two different cost functions and evaluate the pathway reconstructions when applied to six diverse signaling pathways from the NetPath database. The optimal DAGs outperform an existing k-shortest paths method for pathway reconstruction, and the new reconstructions are enriched for different biological processes. Growing DAGs is a promising step toward reconstructing pathways that provably optimize a specific cost function.
Collapse
Affiliation(s)
- Tunç Başar Köse
- Department of Computer Science and Reed College, Portland, Oregon, USA
| | - Jiarong Li
- Department of Computer Science and Reed College, Portland, Oregon, USA
| | - Anna Ritz
- Department of Biology, Reed College, Portland, Oregon, USA
| |
Collapse
|
4
|
Fusar-Poli P, Manchia M, Koutsouleris N, Leslie D, Woopen C, Calkins ME, Dunn M, Tourneau CL, Mannikko M, Mollema T, Oliver D, Rietschel M, Reininghaus EZ, Squassina A, Valmaggia L, Kessing LV, Vieta E, Correll CU, Arango C, Andreassen OA. Ethical considerations for precision psychiatry: A roadmap for research and clinical practice. Eur Neuropsychopharmacol 2022; 63:17-34. [PMID: 36041245 DOI: 10.1016/j.euroneuro.2022.08.001] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 07/04/2022] [Accepted: 08/05/2022] [Indexed: 12/14/2022]
Abstract
Precision psychiatry is an emerging field with transformative opportunities for mental health. However, the use of clinical prediction models carries unprecedented ethical challenges, which must be addressed before accessing the potential benefits of precision psychiatry. This critical review covers multidisciplinary areas, including psychiatry, ethics, statistics and machine-learning, healthcare and academia, as well as input from people with lived experience of mental disorders, their family, and carers. We aimed to identify core ethical considerations for precision psychiatry and mitigate concerns by designing a roadmap for research and clinical practice. We identified priorities: learning from somatic medicine; identifying precision psychiatry use cases; enhancing transparency and generalizability; fostering implementation; promoting mental health literacy; communicating risk estimates; data protection and privacy; and fostering the equitable distribution of mental health care. We hope this blueprint will advance research and practice and enable people with mental health problems to benefit from precision psychiatry.
Collapse
Affiliation(s)
- Paolo Fusar-Poli
- Early Psychosis: Interventions and Clinical-Detection (EPIC) Lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK; South London and Maudsley NHS Foundation Trust, London, UK; Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy.
| | - Mirko Manchia
- Section of Psychiatry, Department of Medical Sciences and Public Health, University of Cagliari, Cagliari, Italy; Unit of Clinical Psychiatry, University Hospital Agency of Cagliari, Cagliari, Italy; Department of Pharmacology, Dalhousie University, Halifax, Nova Scotia, Canada
| | | | | | | | - Monica E Calkins
- Neurodevelopment and Psychosis Section and Lifespan Brain Institute of Penn/CHOP, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, USA
| | - Michael Dunn
- Centre for Biomedical Ethics, Yong Loo Lin School of Medicine, National University of Singapore
| | - Christophe Le Tourneau
- Institut Curie, Department of Drug Development and Innovation (D3i), INSERM U900 Research unit, Paris-Saclay University, France
| | - Miia Mannikko
- European Federation of Associations of Families of People with Mental Illness (EUFAMI), Leuven, Belgium
| | - Tineke Mollema
- Global Alliance of Mental Illness Advocacy Networks-Europe (GAMIAN), Brussels, Belgium
| | - Dominic Oliver
- Early Psychosis: Interventions and Clinical-Detection (EPIC) Lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
| | - Marcella Rietschel
- Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Eva Z Reininghaus
- Department of Psychiatry and Psychotherapeutic Medicine, Medical University of Graz, Graz, Austria
| | - Alessio Squassina
- Section of Neuroscience and Clinical Pharmacology, Department of Biomedical Sciences, University of Cagliari, Italy
| | - Lucia Valmaggia
- South London and Maudsley NHS Foundation Trust, London, UK; Department of Psychology, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; Department of Psychiatry, KU Leuven, Belgium
| | - Lars Vedel Kessing
- Copenhagen Affective disorder Research Center (CADIC), Psychiatric Center Copenhagen, Denmark; Department of clinical Medicine, University of Copenhagen, Denmark
| | - Eduard Vieta
- Hospital Clinic, Institute of Neuroscience, University of Barcelona, IDIBAPS, CIBERSAM, Barcelona, Catalonia, Spain
| | - Christoph U Correll
- The Zucker Hillside Hospital, Department of Psychiatry, Northwell Health, Glen Oaks, NY, USA; Department of Psychiatry and Molecular Medicine, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Hempstead, NY, USA; Center for Psychiatric Neuroscience; The Feinstein Institutes for Medical Research, Manhasset, NY, USA; Department of Child and Adolescent Psychiatry, Charité Universitätsmedizin, Berlin, Germany
| | - Celso Arango
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Gregorio Marañón; Health Research Institute (IiGSM), School of Medicine, Universidad Complutense de Madrid; Biomedical Research Center for Mental Health (CIBERSAM), Madrid, Spain
| | - Ole A Andreassen
- NORMENT, Institute of Clinical Medicine, University of Oslo and Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
| | | |
Collapse
|
5
|
Zeng H, Zhang J, Preising GA, Rubel T, Singh P, Ritz A. Graphery: interactive tutorials for biological network algorithms. Nucleic Acids Res 2021; 49:W257-W262. [PMID: 34037782 PMCID: PMC8262715 DOI: 10.1093/nar/gkab420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Revised: 04/19/2021] [Accepted: 05/03/2021] [Indexed: 11/14/2022] Open
Abstract
Networks have been an excellent framework for modeling complex biological information, but the methodological details of network-based tools are often described for a technical audience. We have developed Graphery, an interactive tutorial webserver that illustrates foundational graph concepts frequently used in network-based methods. Each tutorial describes a graph concept along with executable Python code that can be interactively run on a graph. Users navigate each tutorial using their choice of real-world biological networks that highlight the diverse applications of network algorithms. Graphery also allows users to modify the code within each tutorial or write new programs, which all can be executed without requiring an account. Graphery accepts ideas for new tutorials and datasets that will be shaped by both computational and biological researchers, growing into a community-contributed learning platform. Graphery is available at https://graphery.reedcompbio.org/.
Collapse
Affiliation(s)
- Heyuan Zeng
- Computer Science Department, Reed College, 3203 SE Woodstock Blvd, Portland, OR 97202, USA.,Biology Department, Reed College, 3203 SE Woodstock Blvd, Portland, OR 97202, USA
| | - Jinbiao Zhang
- Information and Communication Technology Department, Xiamen University Malaysia, Jalan Sunsuria, Bandar Sunsuria, 43900 Sepang, Selangor Darul Ehsan, Malaysia
| | - Gabriel A Preising
- Biology Department, Reed College, 3203 SE Woodstock Blvd, Portland, OR 97202, USA
| | - Tobias Rubel
- Biology Department, Reed College, 3203 SE Woodstock Blvd, Portland, OR 97202, USA
| | - Pramesh Singh
- Biology Department, Reed College, 3203 SE Woodstock Blvd, Portland, OR 97202, USA
| | - Anna Ritz
- Biology Department, Reed College, 3203 SE Woodstock Blvd, Portland, OR 97202, USA
| |
Collapse
|
6
|
Xin S, Zhang W. Construction and analysis of the protein-protein interaction network for the olfactory system of the silkworm Bombyx mori. ARCHIVES OF INSECT BIOCHEMISTRY AND PHYSIOLOGY 2020; 105:e21737. [PMID: 32926465 DOI: 10.1002/arch.21737] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/21/2020] [Accepted: 08/24/2020] [Indexed: 06/11/2023]
Abstract
Olfaction plays an essential role in feeding and information exchange in insects. Previous studies on the olfaction of silkworms have provided a wealth of information about genes and proteins, yet, most studies have only focused on a single gene or protein related to the insect's olfaction. The aim of the current study is to determine key proteins in the olfactory system of the silkworm, and further understand protein-protein interactions (PPIs) in the olfactory system of Lepidoptera. To achieve this goal, we integrated Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and network analyses. Furthermore, we selected 585 olfactory-related proteins and constructed a (PPI) network for the olfactory system of the silkworm. Network analysis led to the identification of several key proteins, including GSTz1, LOC733095, BGIBMGA002169-TA, BGIBMGA010939-TA, GSTs2, GSTd2, Or-2, and BGIBMGA013255-TA. A comprehensive evaluation of the proteins showed that glutathione S-transferases (GSTs) had the highest ranking. GSTs also had the highest enrichment levels in GO and KEGG. In conclusion, our analysis showed that key nodes in the biological network had a significant impact on the network, and the key proteins identified via network analysis could serve as new research targets to determine their functions in olfaction.
Collapse
Affiliation(s)
- Shanghong Xin
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Wenjun Zhang
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
7
|
Nagpal S, Baksi KD, Kuntal BK, Mande SS. NetConfer: a web application for comparative analysis of multiple biological networks. BMC Biol 2020; 18:53. [PMID: 32430035 PMCID: PMC7236966 DOI: 10.1186/s12915-020-00781-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 04/14/2020] [Indexed: 12/29/2022] Open
Abstract
Background Most biological experiments are inherently designed to compare changes or transitions of state between conditions of interest. The advancements in data intensive research have in particular elevated the need for resources and tools enabling comparative analysis of biological data. The complexity of biological systems and the interactions of their various components, such as genes, proteins, taxa, and metabolites, have been inferred, represented, and visualized via graph theory-based networks. Comparisons of multiple networks can help in identifying variations across different biological systems, thereby providing additional insights. However, while a number of online and stand-alone tools exist for generating, analyzing, and visualizing individual biological networks, the utility to batch process and comprehensively compare multiple networks is limited. Results Here, we present a graphical user interface (GUI)-based web application which implements multiple network comparison methodologies and presents them in the form of organized analysis workflows. Dedicated comparative visualization modules are provided to the end-users for obtaining easy to comprehend, insightful, and meaningful comparisons of various biological networks. We demonstrate the utility and power of our tool using publicly available microbial and gene expression data. Conclusion NetConfer tool is developed keeping in mind the requirements of researchers working in the field of biological data analysis with limited programming expertise. It is also expected to be useful for advanced users from biological as well as other domains (working with association networks), benefiting from provided ready-made workflows, as they allow to focus directly on the results without worrying about the implementation. While the web version allows using this application without installation and dependency requirements, a stand-alone version has also been supplemented to accommodate the offline requirement of processing large networks.
Collapse
Affiliation(s)
- Sunil Nagpal
- Bio-Sciences R&D Division, TCS Research, Tata Consultancy Services Ltd., 54-B Hadapsar Industrial Estate, Pune, 411 013, India
| | - Krishanu Das Baksi
- Bio-Sciences R&D Division, TCS Research, Tata Consultancy Services Ltd., 54-B Hadapsar Industrial Estate, Pune, 411 013, India
| | - Bhusan K Kuntal
- Bio-Sciences R&D Division, TCS Research, Tata Consultancy Services Ltd., 54-B Hadapsar Industrial Estate, Pune, 411 013, India. .,Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory, Dr. Homi Bhabha Road, Pune, 411 008, India. .,Academy of Scientific and Innovative Research (AcSIR), CSIR-National Chemical Laboratory Campus, Pune, 411 008, India.
| | - Sharmila S Mande
- Bio-Sciences R&D Division, TCS Research, Tata Consultancy Services Ltd., 54-B Hadapsar Industrial Estate, Pune, 411 013, India.
| |
Collapse
|
8
|
Youssef I, Law J, Ritz A. Integrating protein localization with automated signaling pathway reconstruction. BMC Bioinformatics 2019; 20:505. [PMID: 31787091 PMCID: PMC6886211 DOI: 10.1186/s12859-019-3077-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Background Understanding cellular responses via signal transduction is a core focus in systems biology. Tools to automatically reconstruct signaling pathways from protein-protein interactions (PPIs) can help biologists generate testable hypotheses about signaling. However, automatic reconstruction of signaling pathways suffers from many interactions with the same confidence score leading to many equally good candidates. Further, some reconstructions are biologically misleading due to ignoring protein localization information. Results We propose LocPL, a method to improve the automatic reconstruction of signaling pathways from PPIs by incorporating information about protein localization in the reconstructions. The method relies on a dynamic program to ensure that the proteins in a reconstruction are localized in cellular compartments that are consistent with signal transduction from the membrane to the nucleus. LocPL and existing reconstruction algorithms are applied to two PPI networks and assessed using both global and local definitions of accuracy. LocPL produces more accurate and biologically meaningful reconstructions on a versatile set of signaling pathways. Conclusion LocPL is a powerful tool to automatically reconstruct signaling pathways from PPIs that leverages cellular localization information about proteins. The underlying dynamic program and signaling model are flexible enough to study cellular signaling under different settings of signaling flow across the cellular compartments.
Collapse
Affiliation(s)
- Ibrahim Youssef
- Biomedical Engineering Department, Cairo University, Giza, 12613, Egypt.,Biology Department, Reed College, Portland, OR 97202, USA
| | - Jeffrey Law
- Genetics, Bioinformatics, and Computational Biology, Virginia Tech, Blacksburg, VA 24061, USA
| | - Anna Ritz
- Biology Department, Reed College, Portland, OR 97202, USA.
| |
Collapse
|
9
|
Franzese N, Groce A, Murali TM, Ritz A. Hypergraph-based connectivity measures for signaling pathway topologies. PLoS Comput Biol 2019; 15:e1007384. [PMID: 31652258 PMCID: PMC6834280 DOI: 10.1371/journal.pcbi.1007384] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2019] [Revised: 11/06/2019] [Accepted: 09/09/2019] [Indexed: 12/12/2022] Open
Abstract
Characterizing cellular responses to different extrinsic signals is an active area of research, and curated pathway databases describe these complex signaling reactions. Here, we revisit a fundamental question in signaling pathway analysis: are two molecules “connected” in a network? This question is the first step towards understanding the potential influence of molecules in a pathway, and the answer depends on the choice of modeling framework. We examined the connectivity of Reactome signaling pathways using four different pathway representations. We find that Reactome is very well connected as a graph, moderately well connected as a compound graph or bipartite graph, and poorly connected as a hypergraph (which captures many-to-many relationships in reaction networks). We present a novel relaxation of hypergraph connectivity that iteratively increases connectivity from a node while preserving the hypergraph topology. This measure, B-relaxation distance, provides a parameterized transition between hypergraph connectivity and graph connectivity. B-relaxation distance is sensitive to the presence of small molecules that participate in many functionally unrelated reactions in the network. We also define a score that quantifies one pathway’s downstream influence on another, which can be calculated as B-relaxation distance gradually relaxes the connectivity constraint in hypergraphs. Computing this score across all pairs of 34 Reactome pathways reveals pairs of pathways with statistically significant influence. We present two such case studies, and we describe the specific reactions that contribute to the large influence score. Finally, we investigate the ability for connectivity measures to capture functional relationships among proteins, and use the evidence channels in the STRING database as a benchmark dataset. STRING interactions whose proteins are B-connected in Reactome have statistically significantly higher scores than interactions connected in the bipartite graph representation. Our method lays the groundwork for other generalizations of graph-theoretic concepts to hypergraphs in order to facilitate signaling pathway analysis. Signaling pathways describe how cells respond to external signals through molecular interactions. As we gain a deeper understanding of these signaling reactions, it is important to understand how molecules may influence downstream responses and how pathways may affect each other. As the amount of information in signaling pathway databases continues to grow, we have the opportunity to analyze properties about pathway structure. We pose an intuitive question about signaling pathways: when are two molecules “connected” in a pathway? This answer varies dramatically based on the assumptions we make about how reactions link molecules. Here, examine four approaches for modeling the structural topology of signaling pathways, and present methods to quantify whether two molecules are “connected” in a pathway database. We find that existing approaches are either too permissive (molecules are connected to many others) or restrictive (molecules are connected to a handful of others), and we present a new measure that offers a continuum between these two extremes. We then expand our question to ask when an entire signaling pathway is “downstream” of another pathway, and show two case studies from the Reactome pathway database that uncovers pathway influence. Finally, we show that the strict notion of connectivity can capture functional relationships among proteins using an independent benchmark dataset. Our approach to quantify connectivity in pathways considers a biologically-motivated definition of connectivity, laying the foundation for more sophisticated analyses that leverage the detailed information in pathway databases.
Collapse
Affiliation(s)
- Nicholas Franzese
- Biology Department, Reed College, Portland, Oregon, United States of America
- Computer Science Department, Reed College, Portland, Oregon, United States of America
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Adam Groce
- Computer Science Department, Reed College, Portland, Oregon, United States of America
| | - T. M. Murali
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia, United States of America
- ICTAS Center for Systems Biology of Engineered Tissues, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Anna Ritz
- Biology Department, Reed College, Portland, Oregon, United States of America
- * E-mail:
| |
Collapse
|
10
|
Bern M, King A, Applewhite DA, Ritz A. Network-based prediction of polygenic disease genes involved in cell motility. BMC Bioinformatics 2019; 20:313. [PMID: 31216978 PMCID: PMC6584515 DOI: 10.1186/s12859-019-2834-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Background Schizophrenia and autism are examples of polygenic diseases caused by a multitude of genetic variants, many of which are still poorly understood. Recently, both diseases have been associated with disrupted neuron motility and migration patterns, suggesting that aberrant cell motility is a phenotype for these neurological diseases. Results We formulate the Polygenic Disease Phenotype Problem which seeks to identify candidate disease genes that may be associated with a phenotype such as cell motility. We present a machine learning approach to solve this problem for schizophrenia and autism genes within a brain-specific functional interaction network. Our method outperforms peer semi-supervised learning approaches, achieving better cross-validation accuracy across different sets of gold-standard positives. We identify top candidates for both schizophrenia and autism, and select six genes labeled as schizophrenia positives that are predicted to be associated with cell motility for follow-up experiments. Conclusions Candidate genes predicted by our method suggest testable hypotheses about these genes’ role in cell motility regulation, offering a framework for generating predictions for experimental validation. Electronic supplementary material The online version of this article (10.1186/s12859-019-2834-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Miriam Bern
- Biology Department, Reed College, Portland, OR, USA
| | | | | | - Anna Ritz
- Biology Department, Reed College, Portland, OR, USA.
| |
Collapse
|
11
|
Hoyt CT, Domingo-Fernández D, Hofmann-Apitius M. BEL Commons: an environment for exploration and analysis of networks encoded in Biological Expression Language. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:5255171. [PMID: 30576488 PMCID: PMC6301338 DOI: 10.1093/database/bay126] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Accepted: 11/05/2018] [Indexed: 12/19/2022]
Abstract
The rapid accumulation of knowledge in the field of systems and networks biology during recent years requires complex, but user-friendly and accessible web applications that allow from visualization to complex algorithmic analysis. While several web applications exist with various focuses on creation, revision, curation, storage, integration, collaboration, exploration, visualization and analysis, many of these services remain disjoint and have yet to be packaged into a cohesive environment. Here, we present BEL Commons: an integrative knowledge discovery environment for networks encoded in the Biological Expression Language (BEL). Users can upload files in BEL to be parsed, validated, compiled and stored with fine granular permissions. After, users can summarize, explore and optionally shared their networks with the scientific community. We have implemented a query builder wizard to help users find the relevant portions of increasingly large and complex networks and a visualization interface that allows them to explore their resulting networks. Finally, we have included a dedicated analytical service for performing data-driven analysis of knowledge networks to support hypothesis generation.
Collapse
Affiliation(s)
- Charles Tapley Hoyt
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin, Germany.,Bonn-Aachen International Center for IT, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| | - Daniel Domingo-Fernández
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin, Germany.,Bonn-Aachen International Center for IT, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| | - Martin Hofmann-Apitius
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin, Germany.,Bonn-Aachen International Center for IT, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| |
Collapse
|