1
|
Yin S, Mi X, Shukla D. Leveraging machine learning models for peptide-protein interaction prediction. RSC Chem Biol 2024; 5:401-417. [PMID: 38725911 PMCID: PMC11078210 DOI: 10.1039/d3cb00208j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 02/07/2024] [Indexed: 05/12/2024] Open
Abstract
Peptides play a pivotal role in a wide range of biological activities through participating in up to 40% protein-protein interactions in cellular processes. They also demonstrate remarkable specificity and efficacy, making them promising candidates for drug development. However, predicting peptide-protein complexes by traditional computational approaches, such as docking and molecular dynamics simulations, still remains a challenge due to high computational cost, flexible nature of peptides, and limited structural information of peptide-protein complexes. In recent years, the surge of available biological data has given rise to the development of an increasing number of machine learning models for predicting peptide-protein interactions. These models offer efficient solutions to address the challenges associated with traditional computational approaches. Furthermore, they offer enhanced accuracy, robustness, and interpretability in their predictive outcomes. This review presents a comprehensive overview of machine learning and deep learning models that have emerged in recent years for the prediction of peptide-protein interactions.
Collapse
Affiliation(s)
- Song Yin
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign Urbana 61801 Illinois USA
| | - Xuenan Mi
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign Urbana IL 61801 USA
| | - Diwakar Shukla
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign Urbana 61801 Illinois USA
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign Urbana IL 61801 USA
- Department of Bioengineering, University of Illinois Urbana-Champaign Urbana IL 61801 USA
| |
Collapse
|
2
|
Yin S, Mi X, Shukla D. Leveraging Machine Learning Models for Peptide-Protein Interaction Prediction. ARXIV 2024:arXiv:2310.18249v2. [PMID: 37961736 PMCID: PMC10635286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Peptides play a pivotal role in a wide range of biological activities through participating in up to 40% protein-protein interactions in cellular processes. They also demonstrate remarkable specificity and efficacy, making them promising candidates for drug development. However, predicting peptide-protein complexes by traditional computational approaches, such as Docking and Molecular Dynamics simulations, still remains a challenge due to high computational cost, flexible nature of peptides, and limited structural information of peptide-protein complexes. In recent years, the surge of available biological data has given rise to the development of an increasing number of machine learning models for predicting peptide-protein interactions. These models offer efficient solutions to address the challenges associated with traditional computational approaches. Furthermore, they offer enhanced accuracy, robustness, and interpretability in their predictive outcomes. This review presents a comprehensive overview of machine learning and deep learning models that have emerged in recent years for the prediction of peptide-protein interactions.
Collapse
Affiliation(s)
- Song Yin
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, IL 61801, United States
- These authors contributed to the work equally
| | - Xuenan Mi
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801, United States
- These authors contributed to the work equally
| | - Diwakar Shukla
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, IL 61801, United States
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801, United States
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL 61801, United States
| |
Collapse
|
3
|
Jeyaraman M, Ratna HVK, Jeyaraman N, Venkatesan A, Ramasubramanian S, Yadav S. Leveraging Artificial Intelligence and Machine Learning in Regenerative Orthopedics: A Paradigm Shift in Patient Care. Cureus 2023; 15:e49756. [PMID: 38161806 PMCID: PMC10757680 DOI: 10.7759/cureus.49756] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/28/2023] [Indexed: 01/03/2024] Open
Abstract
The integration of artificial intelligence (AI) and machine learning (ML) into regenerative orthopedics heralds a paradigm shift in clinical methodologies and patient management. This review article scrutinizes AI's role in augmenting diagnostic accuracy, refining predictive models, and customizing patient care in orthopedic medicine. Focusing on innovations such as KeyGene and CellNet, we illustrate AI's adeptness in navigating complex genomic datasets, cellular differentiation, and scaffold biodegradation, which are critical components of tissue engineering. Despite its transformative potential, AI's clinical adoption remains in its infancy, contending with challenges in validation, ethical oversight, and model training for clinical relevance. This review posits AI as a vital complement to human intelligence (HI), advocating for an interdisciplinary approach that merges AI's computational prowess with medical expertise to fulfill precision medicine's promise. By analyzing historical and contemporary developments in AI, from the foundational theories of McCullough and Pitts to sophisticated neural networks, the paper emphasizes the need for a synergistic alliance between AI and HI. This collaboration is imperative for improving surgical outcomes, streamlining therapeutic modalities, and enhancing the quality of patient care. Our article calls for robust interdisciplinary strategies to overcome current obstacles and harness AI's full potential in revolutionizing patient outcomes, thereby significantly contributing to the advancement of regenerative orthopedics and the broader field of scientific research.
Collapse
Affiliation(s)
- Madhan Jeyaraman
- Orthopaedics, ACS Medical College and Hospital, Dr. MGR Educational and Research Institute, Chennai, IND
| | | | - Naveen Jeyaraman
- Orthopaedics, ACS Medical College and Hospital, Dr. MGR Educational and Research Institute, Chennai, IND
| | | | | | - Sankalp Yadav
- Medicine, Shri Madan Lal Khurana Chest Clinic, New Delhi, IND
| |
Collapse
|
4
|
Rogers JR, Nikolényi G, AlQuraishi M. Growing ecosystem of deep learning methods for modeling protein-protein interactions. Protein Eng Des Sel 2023; 36:gzad023. [PMID: 38102755 DOI: 10.1093/protein/gzad023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 12/06/2023] [Accepted: 12/07/2023] [Indexed: 12/17/2023] Open
Abstract
Numerous cellular functions rely on protein-protein interactions. Efforts to comprehensively characterize them remain challenged however by the diversity of molecular recognition mechanisms employed within the proteome. Deep learning has emerged as a promising approach for tackling this problem by exploiting both experimental data and basic biophysical knowledge about protein interactions. Here, we review the growing ecosystem of deep learning methods for modeling protein interactions, highlighting the diversity of these biophysically informed models and their respective trade-offs. We discuss recent successes in using representation learning to capture complex features pertinent to predicting protein interactions and interaction sites, geometric deep learning to reason over protein structures and predict complex structures, and generative modeling to design de novo protein assemblies. We also outline some of the outstanding challenges and promising new directions. Opportunities abound to discover novel interactions, elucidate their physical mechanisms, and engineer binders to modulate their functions using deep learning and, ultimately, unravel how protein interactions orchestrate complex cellular behaviors.
Collapse
Affiliation(s)
- Julia R Rogers
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
| | - Gergő Nikolényi
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
| | | |
Collapse
|
5
|
Valejo ADB, Fabbri R, de Andrade Lopes A, Zhao L, de Oliveira MCF. Multilevel Coarsening for Interactive Visualization of Large Bipartite Networks. Front Res Metr Anal 2022; 7:855165. [PMID: 35782366 PMCID: PMC9244804 DOI: 10.3389/frma.2022.855165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 05/16/2022] [Indexed: 12/03/2022] Open
Abstract
Bipartite networks are pervasive in modeling real-world phenomena and play a fundamental role in graph theory. Interactive exploratory visualization of such networks is an important problem, and particularly challenging when handling large networks. In this paper we present results from an investigation on using a general multilevel method for this purpose. Multilevel methods on networks have been introduced as a general approach to increase scalability of community detection and other complex optimization algorithms. They employ graph coarsening algorithms to create a hierarchy of increasingly coarser (reduced) approximations of an original network. Multilevel coarsening has been applied, e.g., to the problem of drawing simple (“unipartite”) networks. We build on previous work that extended multilevel coarsening to bipartite graphs to propose a visualization interface that uses multilevel coarsening to compute a multi-resolution hierarchical representation of an input bipartite network. From this hierarchy, interactive node-link drawings are displayed following a genuine route of the “overview first, zoom and filter, details on demand” visual information seeking mantra. Analysts may depart from the coarsest representation and select nodes or sub-graphs to be expanded and shown at greater detail. Besides intuitive navigation of large-scale networks, this solution affords great flexibility, as users are free to select different coarsening strategies in different scenarios. We illustrate its potential with case studies involving real networks on distinct domains. The experimental analysis shows our strategy is effective to reveal topological structures, such as communities and holes, that may remain hidden in a conventional node-link layout. It is also useful to highlight connectivity patterns across the bipartite layers, as illustrated in an example that emphasizes the correlation between diseases and genes in genetic disorders, and in a study of a scientific collaboration network of authors and papers.
Collapse
Affiliation(s)
- Alan Demétrius Baria Valejo
- Department of Computing, Federal University of São Carlos, São Carlos, Brazil
- *Correspondence: Alan Demétrius Baria Valejo
| | - Renato Fabbri
- Department of Computer Science, Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, Brazil
| | - Alneu de Andrade Lopes
- Department of Computer Science, Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, Brazil
| | - Liang Zhao
- Department of Computing and Mathematics, Faculdade de Filosofia Ciências e Letras de Ribeirão Preto (FFCLRP), University of São Paulo, Ribeirão Preto, Brazil
| | | |
Collapse
|
6
|
Ronan T, Garnett R, Naegle KM. New analysis pipeline for high-throughput domain-peptide affinity experiments improves SH2 interaction data. J Biol Chem 2020; 295:11346-11363. [PMID: 32540967 DOI: 10.1074/jbc.ra120.012503] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Revised: 06/11/2020] [Indexed: 11/06/2022] Open
Abstract
Protein domain interactions with short linear peptides, such as those of the Src homology 2 (SH2) domain with phosphotyrosine-containing peptide motifs (pTyr), are ubiquitous and important to many biochemical processes of the cell. The desire to map and quantify these interactions has resulted in the development of high-throughput (HTP) quantitative measurement techniques, such as microarray or fluorescence polarization assays. For example, in the last 15 years, experiments have progressed from measuring single interactions to covering 500,000 of the 5.5 million possible SH2-pTyr interactions in the human proteome. However, high variability in affinity measurements and disagreements about positive interactions between published data sets led us here to reevaluate the analysis methods and raw data of published SH2-pTyr HTP experiments. We identified several opportunities for improving the identification of positive and negative interactions and the accuracy of affinity measurements. We implemented model-fitting techniques that are more statistically appropriate for the nonlinear SH2-pTyr interaction data. We also developed a method to account for protein concentration errors due to impurities and degradation or protein inactivity and aggregation. Our revised analysis increases the reported affinity accuracy, reduces the false-negative rate, and increases the amount of useful data by adding reliable true-negative results. We demonstrate improvement in classification of binding versus nonbinding when using machine-learning techniques, suggesting improved coherence in the reanalyzed data sets. We present revised SH2-pTyr affinity results and propose a new analysis pipeline for future HTP measurements of domain-peptide interactions.
Collapse
Affiliation(s)
- Tom Ronan
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Roman Garnett
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Kristen M Naegle
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, Missouri, USA
| |
Collapse
|
7
|
Interplay between HGAL and Grb2 proteins regulates B-cell receptor signaling. Blood Adv 2020; 3:2286-2297. [PMID: 31362927 DOI: 10.1182/bloodadvances.2018016162] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2018] [Accepted: 05/30/2019] [Indexed: 01/30/2023] Open
Abstract
Human germinal center (GC)-associated lymphoma (HGAL) is an adaptor protein expressed in GC B cells. HGAL regulates cell motility and B-cell receptor (BCR) signaling, processes that are central for the successful completion of the GC reaction. Herein, we demonstrate phosphorylation of HGAL by Syk and Lyn kinases at tyrosines Y80, Y86, Y106Y107, Y128, and Y148. The HGAL YEN motif (amino acids 107-109) is similar to the phosphopeptide motif pYXN used as a binding site to the growth factor receptor-bound protein 2 (Grb2). We demonstrate by biochemical and molecular methodologies that HGAL directly interacts with Grb2. Concordantly, microscopy studies demonstrate HGAL-Grb2 colocalization in the membrane central supramolecular activation clusters (cSMAC) following BCR activation. Mutation of the HGAL putative binding site to Grb2 abrogates the interaction between these proteins. Further, this HGAL mutant localizes exclusively in the peripheral SMAC and decreases the rate and intensity of BCR accumulation in the cSMAC. Furthermore, we demonstrate that Grb2, HGAL, and Syk interact in the same complex, but Grb2 does not modulate the effects of HGAL on Syk kinase activity. Overall, the interplay between the HGAL and Grb2 regulates the magnitude of BCR signaling and synapse formation.
Collapse
|
8
|
Biophysical prediction of protein-peptide interactions and signaling networks using machine learning. Nat Methods 2020; 17:175-183. [PMID: 31907444 PMCID: PMC7004877 DOI: 10.1038/s41592-019-0687-1] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Accepted: 11/15/2019] [Indexed: 12/17/2022]
Abstract
In mammalian cells, much of signal transduction is mediated by weak protein-protein interactions between globular peptide-binding domains (PBDs) and unstructured peptidic motifs in partner proteins. The number and diversity of these PBDs (over 1,800 are known), low binding affinities, and sensitivity of binding properties to minor sequence variation represent a substantial challenge to experimental and computational analysis of PBD specificity and the networks PBDs create. Here we introduce a bespoke machine learning approach, hierarchical statistical mechanical modelling (HSM), capable of accurately predicting the affinities of PBD-peptide interactions across multiple protein families. By synthesizing biophysical priors within a modern machine learning framework, HSM outperforms existing computational methods and high-throughput experimental assays. HSM models are interpretable in familiar biophysical terms at three spatial scales: the energetics of protein-peptide binding, the multi-dentate organization of protein-protein interactions, and the global architecture of signaling networks.
Collapse
|
9
|
Abstract
Motivation Network propagation has been widely used to aggregate and amplify the effects of tumor mutations using knowledge of molecular interaction networks. However, propagating mutations through interactions irrelevant to cancer leads to erosion of pathway signals and complicates the identification of cancer subtypes. Results To address this problem we introduce a propagation algorithm, Network-Based Supervised Stratification (NBS2), which learns the mutated subnetworks underlying tumor subtypes using a supervised approach. Given an annotated molecular network and reference tumor mutation profiles for which subtypes have been predefined, NBS2 is trained by adjusting the weights on interaction features such that network propagation best recovers the provided subtypes. After training, weights are fixed such that mutation profiles of new tumors can be accurately classified. We evaluate NBS2 on breast and glioblastoma tumors, demonstrating that it outperforms the best network-based approaches in classifying tumors to known subtypes for these diseases. By interpreting the interaction weights, we highlight characteristic molecular pathways driving selected subtypes. Availability and implementation The NBS2 package is freely available at: https://github.com/wzhang1984/NBSS. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Wei Zhang
- Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Jianzhu Ma
- Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Trey Ideker
- Department of Medicine, University of California, San Diego, La Jolla, CA, USA.,Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| |
Collapse
|
10
|
Dzobo K, Adotey S, Thomford NE, Dzobo W. Integrating Artificial and Human Intelligence: A Partnership for Responsible Innovation in Biomedical Engineering and Medicine. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2019; 24:247-263. [PMID: 31313972 DOI: 10.1089/omi.2019.0038] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Historically, the term "artificial intelligence" dates to 1956 when it was first used in a conference at Dartmouth College in the US. Since then, the development of artificial intelligence has in part been shaped by the field of neuroscience. By understanding the human brain, scientists have attempted to build new intelligent machines capable of performing complex tasks akin to humans. Indeed, future research into artificial intelligence will continue to benefit from the study of the human brain. While the development of artificial intelligence algorithms has been fast paced, the actual use of most artificial intelligence (AI) algorithms in biomedical engineering and clinical practice is still markedly below its conceivably broader potentials. This is partly because for any algorithm to be incorporated into existing workflows it has to stand the test of scientific validation, clinical and personal utility, application context, and is equitable as well. In this context, there is much to be gained by combining AI and human intelligence (HI). Harnessing Big Data, computing power and storage capacities, and addressing societal issues emergent from algorithm applications, demand deploying HI in tandem with AI. Very few countries, even economically developed states, lack adequate and critical governance frames to best understand and steer the AI innovation trajectories in health care. Drug discovery and translational pharmaceutical research stand to gain from AI technology provided they are also informed by HI. In this expert review, we analyze the ways in which AI applications are likely to traverse the continuum of life from birth to death, and encompassing not only humans but also all animal, plant, and other living organisms that are increasingly touched by AI. Examples of AI applications include digital health, diagnosis of diseases in newborns, remote monitoring of health by smart devices, real-time Big Data analytics for prompt diagnosis of heart attacks, and facial analysis software with consequences on civil liberties. While we underscore the need for integration of AI and HI, we note that AI technology does not have to replace medical specialists or scientists and rather, is in need of such expert HI. Altogether, AI and HI offer synergy for responsible innovation and veritable prospects for improving health care from prevention to diagnosis to therapeutics while unintended consequences of automation emergent from AI and algorithms should be borne in mind on scientific cultures, work force, and society at large.
Collapse
Affiliation(s)
- Kevin Dzobo
- International Centre for Genetic Engineering and Biotechnology (ICGEB), Cape Town Component, Wernher and Beit Building (South), UCT Medical Campus, Anzio Road, Observatory 7925, Cape Town, South Africa.,Division of Medical Biochemistry and Institute of Infectious Disease and Molecular Medicine, Department of Integrative Biomedical Sciences, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa
| | - Sampson Adotey
- International Development Innovation Network, D-Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Nicholas E Thomford
- Pharmacogenetics Research Group, Division of Human Genetics, Department of Pathology and Institute of Infectious Diseases and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Observatory 7925, Cape Town, South Africa
| | - Witness Dzobo
- Pathology and Immunology Department, University Hospital Southampton, Mail Point B, Tremona Road, Southampton, UK.,University of Portsmouth, Faculty of Science, St Michael's Building, White Swan Road, Portsmouth, UK
| |
Collapse
|
11
|
Degoot AM, Adabor ES, Chirove F, Ndifon W. Predicting Antigenicity of Influenza A Viruses Using biophysical ideas. Sci Rep 2019; 9:10218. [PMID: 31308446 PMCID: PMC6629677 DOI: 10.1038/s41598-019-46740-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 07/01/2019] [Indexed: 11/18/2022] Open
Abstract
Antigenic variations of influenza A viruses are induced by genomic mutation in their trans-membrane protein HA1, eliciting viral escape from neutralization by antibodies generated in prior infections or vaccinations. Prediction of antigenic relationships among influenza viruses is useful for designing (or updating the existing) influenza vaccines, provides important insights into the evolutionary mechanisms underpinning viral antigenic variations, and helps to understand viral epidemiology. In this study, we present a simple and physically interpretable model that can predict antigenic relationships among influenza A viruses, based on biophysical ideas, using both genomic amino acid sequences and experimental antigenic data. We demonstrate the applicability of the model using a benchmark dataset of four subtypes of influenza A (H1N1, H3N2, H5N1, and H9N2) viruses and report on its performance profiles. Additionally, analysis of the model’s parameters confirms several observations that are consistent with the findings of other previous studies, for which we provide plausible explanations.
Collapse
Affiliation(s)
- Abdoelnaser M Degoot
- Research Department, African Institute for Mathematical Sciences, Next Einstein Initiative, Kigali, Rwanda. .,University of KwaZulu-Natal, School of Mathematics, Statistics and Computer Science, Pietermaritzburg, 3209, South Africa. .,DST-NRF Centre of Excellence in Mathematical and Statistical Sciences (CoE-MaSS), Gauteng, Wits, 2050, South Africa.
| | - Emmanuel S Adabor
- Research Centre, African Institute for Mathematical Sciences, Cape Town, 7945, South Africa
| | - Faraimunashe Chirove
- University of KwaZulu-Natal, School of Mathematics, Statistics and Computer Science, Pietermaritzburg, 3209, South Africa
| | - Wilfred Ndifon
- Research Department, African Institute for Mathematical Sciences, Next Einstein Initiative, Kigali, Rwanda.
| |
Collapse
|
12
|
Abstract
Classically, phenotype is what is observed, and genotype is the genetic makeup. Statistical studies aim to project phenotypic likelihoods of genotypic patterns. The traditional genotype-to-phenotype theory embraces the view that the encoded protein shape together with gene expression level largely determines the resulting phenotypic trait. Here, we point out that the molecular biology revolution at the turn of the century explained that the gene encodes not one but ensembles of conformations, which in turn spell all possible gene-associated phenotypes. The significance of a dynamic ensemble view is in understanding the linkage between genetic change and the gained observable physical or biochemical characteristics. Thus, despite the transformative shift in our understanding of the basis of protein structure and function, the literature still commonly relates to the classical genotype-phenotype paradigm. This is important because an ensemble view clarifies how even seemingly small genetic alterations can lead to pleiotropic traits in adaptive evolution and in disease, why cellular pathways can be modified in monogenic and polygenic traits, and how the environment may tweak protein function.
Collapse
Affiliation(s)
- Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute at Frederick, Frederick, Maryland, United States of America
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Chung-Jung Tsai
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute at Frederick, Frederick, Maryland, United States of America
| | - Hyunbum Jang
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute at Frederick, Frederick, Maryland, United States of America
| |
Collapse
|
13
|
Capobianco E. Next Generation Networks: Featuring the Potential Role of Emerging Applications in Translational Oncology. J Clin Med 2019; 8:jcm8050664. [PMID: 31083565 PMCID: PMC6572295 DOI: 10.3390/jcm8050664] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Revised: 05/06/2019] [Accepted: 05/08/2019] [Indexed: 01/24/2023] Open
Abstract
Nowadays, networks are pervasively used as examples of models suitable to mathematically represent and visualize the complexity of systems associated with many diseases, including cancer. In the cancer context, the concept of network entropy has guided many studies focused on comparing equilibrium to disequilibrium (i.e., perturbed) conditions. Since these conditions reflect both structural and dynamic properties of network interaction maps, the derived topological characterizations offer precious support to conduct cancer inference. Recent innovative directions have emerged in network medicine addressing especially experimental omics approaches integrated with a variety of other data, from molecular to clinical and also electronic records, bioimaging etc. This work considers a few theoretically relevant concepts likely to impact the future of applications in personalized/precision/translational oncology. The focus goes to specific properties of networks that are still not commonly utilized or studied in the oncological domain, and they are: controllability, synchronization and symmetry. The examples here provided take inspiration from the consideration of metastatic processes, especially their progression through stages and their hallmark characteristics. Casting these processes into computational frameworks and identifying network states with specific modular configurations may be extremely useful to interpret or even understand dysregulation patterns underlying cancer, and associated events (onset, progression) and disease phenotypes.
Collapse
Affiliation(s)
- Enrico Capobianco
- Center for Computational Science, University of Miami, Miami, FL 33146, USA.
| |
Collapse
|
14
|
Salvatore JE, Han S, Farris SP, Mignogna KM, Miles MF, Agrawal A. Beyond genome-wide significance: integrative approaches to the interpretation and extension of GWAS findings for alcohol use disorder. Addict Biol 2019; 24:275-289. [PMID: 29316088 PMCID: PMC6037617 DOI: 10.1111/adb.12591] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2017] [Revised: 11/20/2017] [Accepted: 11/26/2017] [Indexed: 12/16/2022]
Abstract
Alcohol use disorder (AUD) is a heritable complex behavior. Due to the highly polygenic nature of AUD, identifying genetic variants that comprise this heritable variation has proved to be challenging. With the exception of functional variants in alcohol metabolizing genes (e.g. ADH1B and ALDH2), few other candidate loci have been confidently linked to AUD. Genome-wide association studies (GWAS) of AUD and other alcohol-related phenotypes have either produced few hits with genome-wide significance or have failed to replicate on further study. These issues reinforce the complex nature of the genetic underpinnings for AUD and suggest that both GWAS studies with larger samples and additional analysis approaches that better harness the nominally significant loci in existing GWAS are needed. Here, we review approaches of interest in the post-GWAS era, including in silico functional analyses; functional partitioning of single nucleotide polymorphism heritability; aggregation of signal into genes and gene networks; and validation of identified loci, genes and gene networks in postmortem brain tissue and across species. These integrative approaches hold promise to illuminate our understanding of the biological basis of AUD; however, we recognize that the main challenge continues to be the extremely polygenic nature of AUD, which necessitates large samples to identify multiple loci associated with AUD liability.
Collapse
Affiliation(s)
- Jessica E. Salvatore
- Department of Psychology; Virginia Commonwealth University; Richmond VA USA
- Virginia Institute for Psychiatric and Behavioral Genetics; Virginia Commonwealth University; Richmond VA USA
| | - Shizhong Han
- Department of Psychiatry; University of Iowa; Iowa City IA USA
- Department of Psychiatry and Behavioral Sciences; Johns Hopkins School of Medicine; Baltimore MD USA
| | - Sean P. Farris
- Waggoner Center for Alcohol and Addiction Research; The University of Texas at Austin; Austin TX USA
| | - Kristin M. Mignogna
- Virginia Institute for Psychiatric and Behavioral Genetics; Virginia Commonwealth University; Richmond VA USA
| | - Michael F. Miles
- Department of Pharmacology and Toxicology; Virginia Commonwealth University; Richmond VA USA
| | - Arpana Agrawal
- Department of Psychiatry; Washington University School of Medicine; Saint Louis MO USA
| |
Collapse
|
15
|
Rai A, Shinde P, Jalan S. Network spectra for drug-target identification in complex diseases: new guns against old foes. APPLIED NETWORK SCIENCE 2018; 3:51. [PMID: 30596144 PMCID: PMC6297166 DOI: 10.1007/s41109-018-0107-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Accepted: 10/30/2018] [Indexed: 05/07/2023]
Abstract
The fundamental understanding of altered complex molecular interactions in a diseased condition is the key to its cure. The overall functioning of these molecules is kind of jugglers play in the cell orchestra and to anticipate these relationships among the molecules is one of the greatest challenges in modern biology and medicine. Network science turned out to be providing a successful and simple platform to understand complex interactions among healthy and diseased tissues. Furthermore, much information about the structure and dynamics of a network is concealed in the eigenvalues of its adjacency matrix. In this review, we illustrate rapid advancements in the field of network science in combination with spectral graph theory that enables us to uncover the complexities of various diseases. Interpretations laid by network science approach have solicited insights into molecular relationships and have reported novel drug targets and biomarkers in various complex diseases.
Collapse
Affiliation(s)
- Aparna Rai
- Aushadhi Open Innovation Programme, Indian Institute of Technology Guwahati, Guwahati, 781039 India
| | - Pramod Shinde
- Discipline of Biosciences and Biomedical Engineering, Indian Institute of Technology Indore, Khandwa Road, Simrol, Indore, 453552 India
| | - Sarika Jalan
- Discipline of Biosciences and Biomedical Engineering, Indian Institute of Technology Indore, Khandwa Road, Simrol, Indore, 453552 India
- Complex Systems Lab, Discipline of Physics, Indian Institute of Technology Indore, Khandwa Road, Indore, 453552 India
- Lobachevsky University, Gagarin avenue 23, Nizhny Novgorod, 603950 Russia
| |
Collapse
|
16
|
Typing tumors using pathways selected by somatic evolution. Nat Commun 2018; 9:4159. [PMID: 30297789 PMCID: PMC6175900 DOI: 10.1038/s41467-018-06464-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Accepted: 09/03/2018] [Indexed: 01/01/2023] Open
Abstract
Many recent efforts to analyze cancer genomes involve aggregation of mutations within reference maps of molecular pathways and protein networks. Here, we find these pathway studies are impeded by molecular interactions that are functionally irrelevant to cancer or the patient’s tumor type, as these interactions diminish the contrast of driver pathways relative to individual frequently mutated genes. This problem can be addressed by creating stringent tumor-specific networks of biophysical protein interactions, identified by signatures of epistatic selection during tumor evolution. Using such an evolutionarily selected pathway (ESP) map, we analyze the major cancer genome atlases to derive a hierarchical classification of tumor subtypes linked to characteristic mutated pathways. These pathways are clinically prognostic and predictive, including the TP53-AXIN-ARHGEF17 combination in liver and CYLC2-STK11-STK11IP in lung cancer, which we validate in independent cohorts. This ESP framework substantially improves the definition of cancer pathways and subtypes from tumor genome data. Informative pathways driving cancer pathogenesis and subtypes can be difficult to identify in the presence of many gene interactions irrelevant to cancer. Here, the authors describe an approach for cancer gene pathway analysis based on key molecular interactions that drive cancer in relevant tissue types, and they assemble a focused map of Evolutionarily Selected Pathways (ESP) with interactions supported by both protein–protein binding and genetic epistasis during somatic tumor evolution.
Collapse
|
17
|
Degoot AM, Chirove F, Ndifon W. Trans-Allelic Model for Prediction of Peptide:MHC-II Interactions. Front Immunol 2018; 9:1410. [PMID: 29988560 PMCID: PMC6026802 DOI: 10.3389/fimmu.2018.01410] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Accepted: 06/06/2018] [Indexed: 12/30/2022] Open
Abstract
Major histocompatibility complex class two (MHC-II) molecules are trans-membrane proteins and key components of the cellular immune system. Upon recognition of foreign peptides expressed on the MHC-II binding groove, CD4+ T cells mount an immune response against invading pathogens. Therefore, mechanistic identification and knowledge of physicochemical features that govern interactions between peptides and MHC-II molecules is useful for the design of effective epitope-based vaccines, as well as for understanding of immune responses. In this article, we present a comprehensive trans-allelic prediction model, a generalized version of our previous biophysical model, that can predict peptide interactions for all three human MHC-II loci (HLA-DR, HLA-DP, and HLA-DQ), using both peptide sequence data and structural information of MHC-II molecules. The advantage of this approach over other machine learning models is that it offers a simple and plausible physical explanation for peptide–MHC-II interactions. We train the model using a benchmark experimental dataset and measure its predictive performance using novel data. Despite its relative simplicity, we find that the model has comparable performance to the state-of-the-art method, the NetMHCIIpan method. Focusing on the physical basis of peptide–MHC binding, we find support for previous theoretical predictions about the contributions of certain binding pockets to the binding energy. In addition, we find that binding pocket P5 of HLA-DP, which was not previously considered as a primary anchor, does make strong contribution to the binding energy. Together, the results indicate that our model can serve as a useful complement to alternative approaches to predicting peptide–MHC interactions.
Collapse
Affiliation(s)
- Abdoelnaser M Degoot
- African Institute of Mathematical Sciences (AIMS), Muizenberg, South Africa.,School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, South Africa.,DST-NRF Centre of Excellence in Mathematical and Statistical Sciences (CoE-MaSS), Gauteng, South Africa
| | - Faraimunashe Chirove
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, South Africa
| | - Wilfred Ndifon
- African Institute of Mathematical Sciences (AIMS), Muizenberg, South Africa
| |
Collapse
|
18
|
Affiliation(s)
- Mohammed AlQuraishi
- Mohammed AlQuraishi is a Fellow in Therapeutic Science at Harvard Medical School, Boston, MA 02115, USA.Peter K. Sorger is Professor of Systems Biology, Head of the Harvard Program in Therapeutic Sciences, and Director of the Laboratory of Systems Biology at Harvard Medical School, Boston, MA 02115, USA.
| | - Peter K Sorger
- Mohammed AlQuraishi is a Fellow in Therapeutic Science at Harvard Medical School, Boston, MA 02115, USA.Peter K. Sorger is Professor of Systems Biology, Head of the Harvard Program in Therapeutic Sciences, and Director of the Laboratory of Systems Biology at Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
19
|
Rosell M, Fernández-Recio J. Hot-spot analysis for drug discovery targeting protein-protein interactions. Expert Opin Drug Discov 2018; 13:327-338. [PMID: 29376444 DOI: 10.1080/17460441.2018.1430763] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Protein-protein interactions are important for biological processes and pathological situations, and are attractive targets for drug discovery. However, rational drug design targeting protein-protein interactions is still highly challenging. Hot-spot residues are seen as the best option to target such interactions, but their identification requires detailed structural and energetic characterization, which is only available for a tiny fraction of protein interactions. Areas covered: In this review, the authors cover a variety of computational methods that have been reported for the energetic analysis of protein-protein interfaces in search of hot-spots, and the structural modeling of protein-protein complexes by docking. This can help to rationalize the discovery of small-molecule inhibitors of protein-protein interfaces of therapeutic interest. Computational analysis and docking can help to locate the interface, molecular dynamics can be used to find suitable cavities, and hot-spot predictions can focus the search for inhibitors of protein-protein interactions. Expert opinion: A major difficulty for applying rational drug design methods to protein-protein interactions is that in the majority of cases the complex structure is not available. Fortunately, computational docking can complement experimental data. An interesting aspect to explore in the future is the integration of these strategies for targeting PPIs with large-scale mutational analysis.
Collapse
Affiliation(s)
- Mireia Rosell
- a Department of Life Sciences , Barcelona Supercomputing Center (BSC) , Barcelona , Spain
| | - Juan Fernández-Recio
- a Department of Life Sciences , Barcelona Supercomputing Center (BSC) , Barcelona , Spain.,b Structural Biology Unit , Institut de Biologia Molecular de Barcelona (IBMB), CSIC , Barcelona , Spain
| |
Collapse
|
20
|
Zuck M, Austin LS, Danziger SA, Aitchison JD, Kaushansky A. The Promise of Systems Biology Approaches for Revealing Host Pathogen Interactions in Malaria. Front Microbiol 2017; 8:2183. [PMID: 29201016 PMCID: PMC5696578 DOI: 10.3389/fmicb.2017.02183] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2017] [Accepted: 10/24/2017] [Indexed: 12/18/2022] Open
Abstract
Despite global eradication efforts over the past century, malaria remains a devastating public health burden, causing almost half a million deaths annually (WHO, 2016). A detailed understanding of the mechanisms that control malaria infection has been hindered by technical challenges of studying a complex parasite life cycle in multiple hosts. While many interventions targeting the parasite have been implemented, the complex biology of Plasmodium poses a major challenge, and must be addressed to enable eradication. New approaches for elucidating key host-parasite interactions, and predicting how the parasite will respond in a variety of biological settings, could dramatically enhance the efficacy and longevity of intervention strategies. The field of systems biology has developed methodologies and principles that are well poised to meet these challenges. In this review, we focus our attention on the Liver Stage of the Plasmodium lifecycle and issue a “call to arms” for using systems biology approaches to forge a new era in malaria research. These approaches will reveal insights into the complex interplay between host and pathogen, and could ultimately lead to novel intervention strategies that contribute to malaria eradication.
Collapse
Affiliation(s)
- Meghan Zuck
- Center for Infectious Disease Research, formerly Seattle Biomedical Research Institute, Seattle, WA, United States
| | - Laura S Austin
- Center for Infectious Disease Research, formerly Seattle Biomedical Research Institute, Seattle, WA, United States
| | - Samuel A Danziger
- Center for Infectious Disease Research, formerly Seattle Biomedical Research Institute, Seattle, WA, United States.,Institute for Systems Biology, Seattle, WA, United States
| | - John D Aitchison
- Center for Infectious Disease Research, formerly Seattle Biomedical Research Institute, Seattle, WA, United States.,Institute for Systems Biology, Seattle, WA, United States
| | - Alexis Kaushansky
- Center for Infectious Disease Research, formerly Seattle Biomedical Research Institute, Seattle, WA, United States.,Department of Global Health, University of Washington, Seattle, WA, United States
| |
Collapse
|
21
|
Wilson SJ, Wilkins AD, Lin CH, Lua RC, Lichtarge O. DISCOVERY OF FUNCTIONAL AND DISEASE PATHWAYS BY COMMUNITY DETECTION IN PROTEIN-PROTEIN INTERACTION NETWORKS. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2017; 22:336-347. [PMID: 27896987 DOI: 10.1142/9789813207813_0032] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Advances in cellular, molecular, and disease biology depend on the comprehensive characterization of gene interactions and pathways. Traditionally, these pathways are curated manually, limiting their efficient annotation and, potentially, reinforcing field-specific bias. Here, in order to test objective and automated identification of functionally cooperative genes, we compared a novel algorithm with three established methods to search for communities within gene interaction networks. Communities identified by the novel approach and by one of the established method overlapped significantly (q < 0.1) with control pathways. With respect to disease, these communities were biased to genes with pathogenic variants in ClinVar (p ≪ 0.01), and often genes from the same community were co-expressed, including in breast cancers. The interesting subset of novel communities, defined by poor overlap to control pathways also contained co-expressed genes, consistent with a possible functional role. This work shows that community detection based on topological features of networks suggests new, biologically meaningful groupings of genes that, in turn, point to health and disease relevant hypotheses.
Collapse
Affiliation(s)
- Stephen J Wilson
- Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA,
| | | | | | | | | |
Collapse
|
22
|
Garland J. Unravelling the complexity of signalling networks in cancer: A review of the increasing role for computational modelling. Crit Rev Oncol Hematol 2017; 117:73-113. [PMID: 28807238 DOI: 10.1016/j.critrevonc.2017.06.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2016] [Revised: 06/01/2017] [Accepted: 06/08/2017] [Indexed: 02/06/2023] Open
Abstract
Cancer induction is a highly complex process involving hundreds of different inducers but whose eventual outcome is the same. Clearly, it is essential to understand how signalling pathways and networks generated by these inducers interact to regulate cell behaviour and create the cancer phenotype. While enormous strides have been made in identifying key networking profiles, the amount of data generated far exceeds our ability to understand how it all "fits together". The number of potential interactions is astronomically large and requires novel approaches and extreme computation methods to dissect them out. However, such methodologies have high intrinsic mathematical and conceptual content which is difficult to follow. This review explains how computation modelling is progressively finding solutions and also revealing unexpected and unpredictable nano-scale molecular behaviours extremely relevant to how signalling and networking are coherently integrated. It is divided into linked sections illustrated by numerous figures from the literature describing different approaches and offering visual portrayals of networking and major conceptual advances in the field. First, the problem of signalling complexity and data collection is illustrated for only a small selection of known oncogenes. Next, new concepts from biophysics, molecular behaviours, kinetics, organisation at the nano level and predictive models are presented. These areas include: visual representations of networking, Energy Landscapes and energy transfer/dissemination (entropy); diffusion, percolation; molecular crowding; protein allostery; quinary structure and fractal distributions; energy management, metabolism and re-examination of the Warburg effect. The importance of unravelling complex network interactions is then illustrated for some widely-used drugs in cancer therapy whose interactions are very extensive. Finally, use of computational modelling to develop micro- and nano- functional models ("bottom-up" research) is highlighted. The review concludes that computational modelling is an essential part of cancer research and is vital to understanding network formation and molecular behaviours that are associated with it. Its role is increasingly essential because it is unravelling the huge complexity of cancer induction otherwise unattainable by any other approach.
Collapse
Affiliation(s)
- John Garland
- Manchester Interdisciplinary Biocentre, Manchester University, Manchester, UK.
| |
Collapse
|
23
|
Mathematical and Computational Modeling in Complex Biological Systems. BIOMED RESEARCH INTERNATIONAL 2017; 2017:5958321. [PMID: 28386558 PMCID: PMC5366773 DOI: 10.1155/2017/5958321] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2016] [Revised: 12/20/2016] [Accepted: 01/16/2017] [Indexed: 12/22/2022]
Abstract
The biological process and molecular functions involved in the cancer progression remain difficult to understand for biologists and clinical doctors. Recent developments in high-throughput technologies urge the systems biology to achieve more precise models for complex diseases. Computational and mathematical models are gradually being used to help us understand the omics data produced by high-throughput experimental techniques. The use of computational models in systems biology allows us to explore the pathogenesis of complex diseases, improve our understanding of the latent molecular mechanisms, and promote treatment strategy optimization and new drug discovery. Currently, it is urgent to bridge the gap between the developments of high-throughput technologies and systemic modeling of the biological process in cancer research. In this review, we firstly studied several typical mathematical modeling approaches of biological systems in different scales and deeply analyzed their characteristics, advantages, applications, and limitations. Next, three potential research directions in systems modeling were summarized. To conclude, this review provides an update of important solutions using computational modeling approaches in systems biology.
Collapse
|
24
|
Wei L, Xing P, Zeng J, Chen J, Su R, Guo F. Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier. Artif Intell Med 2017; 83:67-74. [PMID: 28320624 DOI: 10.1016/j.artmed.2017.03.001] [Citation(s) in RCA: 159] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Revised: 02/17/2017] [Accepted: 03/01/2017] [Indexed: 10/20/2022]
Abstract
Computational methods are employed in bioinformatics to predict protein-protein interactions (PPIs). PPIs and protein-protein non-interactions (PPNIs) display different levels of development, and the number of PPIs is considerably greater than that of PPNIs. This significant difference in the number of PPIs and PPNIs increases the cost of constructing a balanced dataset. PPIs can be classified as either physical or genetic. However, ready-made PPNI databases were proven only to have no physical interactions and were not proven to have no genetic interactions. Hence, ready-made PPNI databases contain false negative non-interactions. In this study, two PPNI datasets were artificially generated from a PPI database. In contrast to various traditional PPI feature extraction methods based on sequential information, two types of novel feature extraction methods were proposed. One is based on secondary structure information, and the other is based on the physicochemical properties of proteins. The experimental results of the RandomPairs dataset validate the efficiency and effectiveness of the proposed prediction model. These results reveal the potential of constructing a PPI negative dataset to reduce false negatives. Related datasets, tools, and source codes are accessible at http://lab.malab.cn/soft/PPIPre/PPIPre.html.
Collapse
Affiliation(s)
- Leyi Wei
- School of Computer Science and Technology, Tianjin University, Tianjin, China
| | - Pengwei Xing
- School of Computer Science and Technology, Tianjin University, Tianjin, China
| | - Jiancang Zeng
- School of Information Science and Technology, Xiamen University, Xiamen, China
| | - JinXiu Chen
- School of Information Science and Technology, Xiamen University, Xiamen, China
| | - Ran Su
- School of Software, Tianjin University, Tianjin, China
| | - Fei Guo
- School of Computer Science and Technology, Tianjin University, Tianjin, China.
| |
Collapse
|
25
|
Rai A, Pradhan P, Nagraj J, Lohitesh K, Chowdhury R, Jalan S. Understanding cancer complexome using networks, spectral graph theory and multilayer framework. Sci Rep 2017; 7:41676. [PMID: 28155908 PMCID: PMC5290734 DOI: 10.1038/srep41676] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 12/15/2016] [Indexed: 02/06/2023] Open
Abstract
Cancer complexome comprises a heterogeneous and multifactorial milieu that varies in cytology, physiology, signaling mechanisms and response to therapy. The combined framework of network theory and spectral graph theory along with the multilayer analysis provides a comprehensive approach to analyze the proteomic data of seven different cancers, namely, breast, oral, ovarian, cervical, lung, colon and prostate. Our analysis demonstrates that the protein-protein interaction networks of the normal and the cancerous tissues associated with the seven cancers have overall similar structural and spectral properties. However, few of these properties implicate unsystematic changes from the normal to the disease networks depicting difference in the interactions and highlighting changes in the complexity of different cancers. Importantly, analysis of common proteins of all the cancer networks reveals few proteins namely the sensors, which not only occupy significant position in all the layers but also have direct involvement in causing cancer. The prediction and analysis of miRNAs targeting these sensor proteins hint towards the possible role of these proteins in tumorigenesis. This novel approach helps in understanding cancer at the fundamental level and provides a clue to develop promising and nascent concept of single drug therapy for multiple diseases as well as personalized medicine.
Collapse
Affiliation(s)
- Aparna Rai
- Centre for Biosciences and Biomedical Engineering, Indian Institute of Technology Indore, Simrol, Indore, Madhya Pradesh 453552, India
| | - Priodyuti Pradhan
- Complex Systems Lab, Discipline of Physics, Indian Institute of Technology Indore, Simrol, Indore, Madhya Pradesh 453552, India
| | - Jyothi Nagraj
- Department of Biological Sciences, Birla Institute of Technology and Science, Vidya Vihar, Pilani, Rajasthan 333031, India
| | - K. Lohitesh
- Department of Biological Sciences, Birla Institute of Technology and Science, Vidya Vihar, Pilani, Rajasthan 333031, India
| | - Rajdeep Chowdhury
- Department of Biological Sciences, Birla Institute of Technology and Science, Vidya Vihar, Pilani, Rajasthan 333031, India
| | - Sarika Jalan
- Centre for Biosciences and Biomedical Engineering, Indian Institute of Technology Indore, Simrol, Indore, Madhya Pradesh 453552, India
- Complex Systems Lab, Discipline of Physics, Indian Institute of Technology Indore, Simrol, Indore, Madhya Pradesh 453552, India
| |
Collapse
|
26
|
Abstract
Recent studies across multiple tumour types are starting to reveal a recurrent regulatory architecture in which genomic alterations cluster upstream of functional master regulator (MR) proteins, the aberrant activity of which is both necessary and sufficient to maintain tumour cell state. These proteins form small, hyperconnected and autoregulated modules (termed tumour checkpoints) that are increasingly emerging as optimal biomarkers and therapeutic targets. Crucially, as their activity is mostly dysregulated in a post-translational manner, rather than by mutations in their corresponding genes or by differential expression, the identification of MR proteins by conventional methods is challenging. In this Opinion article, we discuss novel methods for the systematic analysis of MR proteins and of the modular regulatory architecture they implement, including their use as a valuable reductionist framework to study the genetic heterogeneity of human disease and to drive key translational applications.
Collapse
Affiliation(s)
- Andrea Califano
- Department of Systems Biology, Columbia University, and the Departments of Biomedical Informatics, Biochemistry and Molecular Biophysics, JP Sulzberger Columbia Genome Center, Herbert Irving Comprehensive Cancer Center, Columbia University, New York, New York 10032, USA
| | - Mariano J Alvarez
- DarwinHealth, Inc., 3960 Broadway, Suite 540, New York, New York 10032, USA
| |
Collapse
|
27
|
Adlung L, Kar S, Wagner MC, She B, Chakraborty S, Bao J, Lattermann S, Boerries M, Busch H, Wuchter P, Ho AD, Timmer J, Schilling M, Höfer T, Klingmüller U. Protein abundance of AKT and ERK pathway components governs cell type-specific regulation of proliferation. Mol Syst Biol 2017; 13:904. [PMID: 28123004 PMCID: PMC5293153 DOI: 10.15252/msb.20167258] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Signaling through the AKT and ERK pathways controls cell proliferation. However, the integrated regulation of this multistep process, involving signal processing, cell growth and cell cycle progression, is poorly understood. Here, we study different hematopoietic cell types, in which AKT and ERK signaling is triggered by erythropoietin (Epo). Although these cell types share the molecular network topology for pro‐proliferative Epo signaling, they exhibit distinct proliferative responses. Iterating quantitative experiments and mathematical modeling, we identify two molecular sources for cell type‐specific proliferation. First, cell type‐specific protein abundance patterns cause differential signal flow along the AKT and ERK pathways. Second, downstream regulators of both pathways have differential effects on proliferation, suggesting that protein synthesis is rate‐limiting for faster cycling cells while slower cell cycles are controlled at the G1‐S progression. The integrated mathematical model of Epo‐driven proliferation explains cell type‐specific effects of targeted AKT and ERK inhibitors and faithfully predicts, based on the protein abundance, anti‐proliferative effects of inhibitors in primary human erythroid progenitor cells. Our findings suggest that the effectiveness of targeted cancer therapy might become predictable from protein abundance.
Collapse
Affiliation(s)
- Lorenz Adlung
- Division of Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Sandip Kar
- Division of Theoretical Systems Biology, German Cancer Research Center (DKFZ), Heidelberg, Germany.,BioQuant Center, University of Heidelberg, Heidelberg, Germany.,Department of Chemistry, Indian Institute of Technology, Mumbai, India
| | - Marie-Christine Wagner
- Division of Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Bin She
- Division of Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Sajib Chakraborty
- Division of Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jie Bao
- Systems Biology of the Cellular Microenvironment Group, IMMZ, ALU, Freiburg, Germany
| | - Susen Lattermann
- Division of Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Melanie Boerries
- Systems Biology of the Cellular Microenvironment Group, IMMZ, ALU, Freiburg, Germany.,German Cancer Consortium (DKTK), Freiburg, Germany.,German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Hauke Busch
- Systems Biology of the Cellular Microenvironment Group, IMMZ, ALU, Freiburg, Germany.,German Cancer Consortium (DKTK), Freiburg, Germany.,German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Patrick Wuchter
- Department of Medicine V, University of Heidelberg, Heidelberg, Germany.,Institute for Transfusion Medicine and Immunology, University of Heidelberg, Mannheim, Germany
| | - Anthony D Ho
- Department of Medicine V, University of Heidelberg, Heidelberg, Germany
| | - Jens Timmer
- Center for Biological Signaling Studies (BIOSS), Institute of Physics, University of Freiburg, Freiburg, Germany
| | - Marcel Schilling
- Division of Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Thomas Höfer
- Division of Theoretical Systems Biology, German Cancer Research Center (DKFZ), Heidelberg, Germany .,BioQuant Center, University of Heidelberg, Heidelberg, Germany
| | - Ursula Klingmüller
- Division of Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg, Germany .,Translational Lung Research Center (TLRC), Member of the German Center for Lung Research (DZL), Heidelberg, Germany
| |
Collapse
|
28
|
Abstract
The Src Homology 2 (SH2) domain is the prototypical protein interaction module that lies at the heart of phosphotyrosine signaling. Since its serendipitous discovery, there has been a tremendous advancement in technologies and an array of techniques available for studying SH2 domains and phosphotyrosine signaling. In this chapter, we provide a glimpse of the history of SH2 domains and describe many of the tools and techniques that have been developed along the way and discuss future directions for SH2 domain studies. We highlight the gist of each chapter in this volume in the context of: the structural biology and phosphotyrosine binding; characterizing SH2 specificity and generating prediction models; systems biology and proteomics; SH2 domains in signal transduction; and SH2 domains in disease, diagnostics, and therapeutics. Many of the individual chapters provide an in-depth approach that will allow scientists to interrogate the function and role of SH2 domains.
Collapse
Affiliation(s)
- Bernard A Liu
- Broad Institute of Harvard and MIT, 415 Main St., 5175 JJ, Cambridge, MA, 02142, USA.
| | - Kazuya Machida
- Raymond and Beverly Sackler Laboratory of Genetics and Molecular Medicine, Department of Genetics and Genome Sciences, University of Connecticut School of Medicine, 400 Farmington Ave., Farmington, CT, 06030, USA.
| |
Collapse
|
29
|
Rudolph JD, de Graauw M, van de Water B, Geiger T, Sharan R. Elucidation of Signaling Pathways from Large-Scale Phosphoproteomic Data Using Protein Interaction Networks. Cell Syst 2016; 3:585-593.e3. [DOI: 10.1016/j.cels.2016.11.005] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Revised: 08/24/2016] [Accepted: 11/09/2016] [Indexed: 01/01/2023]
|
30
|
Sain N, Mohanty D. modPDZpep: a web resource for structure based analysis of human PDZ-mediated interaction networks. Biol Direct 2016; 11:48. [PMID: 27655048 PMCID: PMC5031328 DOI: 10.1186/s13062-016-0151-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Accepted: 09/14/2016] [Indexed: 11/27/2022] Open
Abstract
Background PDZ domains recognize short sequence stretches usually present in C-terminal of their interaction partners. Because of the involvement of PDZ domains in many important biological processes, several attempts have been made for developing bioinformatics tools for genome-wide identification of PDZ interaction networks. Currently available tools for prediction of interaction partners of PDZ domains utilize machine learning approach. Since, they have been trained using experimental substrate specificity data for specific PDZ families, their applicability is limited to PDZ families closely related to the training set. These tools also do not allow analysis of PDZ-peptide interaction interfaces. Results We have used a structure based approach to develop modPDZpep, a program to predict the interaction partners of human PDZ domains and analyze structural details of PDZ interaction interfaces. modPDZpep predicts interaction partners by using structural models of PDZ-peptide complexes and evaluating binding energy scores using residue based statistical pair potentials. Since, it does not require training using experimental data on peptide binding affinity, it can predict substrates for diverse PDZ families. Because of the use of simple scoring function for binding energy, it is also fast enough for genome scale structure based analysis of PDZ interaction networks. Benchmarking using artificial as well as real negative datasets indicates good predictive power with ROC-AUC values in the range of 0.7 to 0.9 for a large number of human PDZ domains. Another novel feature of modPDZpep is its ability to map novel PDZ mediated interactions in human protein-protein interaction networks, either by utilizing available experimental phage display data or by structure based predictions. Conclusions In summary, we have developed modPDZpep, a web-server for structure based analysis of human PDZ domains. It is freely available at http://www.nii.ac.in/modPDZpep.html or http://202.54.226.235/modPDZpep.html. Reviewers This article was reviewed by Michael Gromiha and Zoltán Gáspári. Electronic supplementary material The online version of this article (doi:10.1186/s13062-016-0151-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Neetu Sain
- Bioinformatics Center, National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Debasisa Mohanty
- Bioinformatics Center, National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi, 110067, India.
| |
Collapse
|
31
|
Krishnan A, Taroni JN, Greene CS. Integrative Networks Illuminate Biological Factors Underlying Gene–Disease Associations. CURRENT GENETIC MEDICINE REPORTS 2016. [DOI: 10.1007/s40142-016-0102-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
32
|
Stern AM, Schurdak ME, Bahar I, Berg JM, Taylor DL. A Perspective on Implementing a Quantitative Systems Pharmacology Platform for Drug Discovery and the Advancement of Personalized Medicine. JOURNAL OF BIOMOLECULAR SCREENING 2016; 21:521-34. [PMID: 26962875 PMCID: PMC4917453 DOI: 10.1177/1087057116635818] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Drug candidates exhibiting well-defined pharmacokinetic and pharmacodynamic profiles that are otherwise safe often fail to demonstrate proof-of-concept in phase II and III trials. Innovation in drug discovery and development has been identified as a critical need for improving the efficiency of drug discovery, especially through collaborations between academia, government agencies, and industry. To address the innovation challenge, we describe a comprehensive, unbiased, integrated, and iterative quantitative systems pharmacology (QSP)-driven drug discovery and development strategy and platform that we have implemented at the University of Pittsburgh Drug Discovery Institute. Intrinsic to QSP is its integrated use of multiscale experimental and computational methods to identify mechanisms of disease progression and to test predicted therapeutic strategies likely to achieve clinical validation for appropriate subpopulations of patients. The QSP platform can address biological heterogeneity and anticipate the evolution of resistance mechanisms, which are major challenges for drug development. The implementation of this platform is dedicated to gaining an understanding of mechanism(s) of disease progression to enable the identification of novel therapeutic strategies as well as repurposing drugs. The QSP platform will help promote the paradigm shift from reactive population-based medicine to proactive personalized medicine by focusing on the patient as the starting and the end point.
Collapse
Affiliation(s)
- Andrew M. Stern
- Department of Computational and Systems Biology, Pittsburgh, PA, USA
- University of Pittsburgh Drug Discovery Institute, Pittsburgh, PA, USA
| | - Mark E. Schurdak
- Department of Computational and Systems Biology, Pittsburgh, PA, USA
- University of Pittsburgh Drug Discovery Institute, Pittsburgh, PA, USA
- The University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA
| | - Ivet Bahar
- Department of Computational and Systems Biology, Pittsburgh, PA, USA
- University of Pittsburgh Drug Discovery Institute, Pittsburgh, PA, USA
- The University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA
| | - Jeremy M. Berg
- Department of Computational and Systems Biology, Pittsburgh, PA, USA
- University of Pittsburgh Drug Discovery Institute, Pittsburgh, PA, USA
- University of Pittsburgh Institute for Personalized Medicine, Pittsburgh, PA, USA
| | - D. Lansing Taylor
- Department of Computational and Systems Biology, Pittsburgh, PA, USA
- University of Pittsburgh Drug Discovery Institute, Pittsburgh, PA, USA
- The University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA
| |
Collapse
|
33
|
Zhao B, Pritchard JR. Inherited Disease Genetics Improves the Identification of Cancer-Associated Genes. PLoS Genet 2016; 12:e1006081. [PMID: 27304678 PMCID: PMC4909226 DOI: 10.1371/journal.pgen.1006081] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 05/04/2016] [Indexed: 01/21/2023] Open
Abstract
The identification of biologically significant variants in cancer genomes is critical to therapeutic discovery, but it is limited by the statistical power needed to discern driver from passenger. Independent biological data can be used to filter cancer exomes and increase statistical power. Large genetic databases for inherited diseases are uniquely suited to this task because they contain specific amino acid alterations with known pathogenicity and molecular mechanisms. However, no rigorous method to overlay this information onto the cancer exome exists. Here, we present a computational methodology that overlays any variant database onto the somatic mutations in all cancer exomes. We validate the computation experimentally and identify novel associations in a re-analysis of 7362 cancer exomes. This analysis identified activating SOS1 mutations associated with Noonan syndrome as significantly altered in melanoma and the first kinase-activating mutations in ACVR1 associated with adult tumors. Beyond a filter, significant variants found in both rare cancers and rare inherited diseases increase the unmet medical need for therapeutics that target these variants and may bootstrap drug discovery efforts in orphan indications.
Collapse
Affiliation(s)
- Boyang Zhao
- Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Discovery/Translational Biology, ARIAD Pharmaceuticals, Cambridge, Massachusetts, United States of America
- * E-mail: (BZ); (JP)
| | - Justin R. Pritchard
- Discovery/Translational Biology, ARIAD Pharmaceuticals, Cambridge, Massachusetts, United States of America
- * E-mail: (BZ); (JP)
| |
Collapse
|
34
|
Kaiser JL, Bland CL, Klinke DJ. Identifying causal networks linking cancer processes and anti-tumor immunity using Bayesian network inference and metagene constructs. Biotechnol Prog 2016; 32:470-9. [PMID: 26785356 DOI: 10.1002/btpr.2230] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2016] [Revised: 01/13/2016] [Indexed: 12/17/2022]
Abstract
Cancer arises from a deregulation of both intracellular and intercellular networks that maintain system homeostasis. Identifying the architecture of these networks and how they are changed in cancer is a pre-requisite for designing drugs to restore homeostasis. Since intercellular networks only appear in intact systems, it is difficult to identify how these networks become altered in human cancer using many of the common experimental models. To overcome this, we used the diversity in normal and malignant human tissue samples from the Cancer Genome Atlas (TCGA) database of human breast cancer to identify the topology associated with intercellular networks in vivo. To improve the underlying biological signals, we constructed Bayesian networks using metagene constructs, which represented groups of genes that are concomitantly associated with different immune and cancer states. We also used bootstrap resampling to establish the significance associated with the inferred networks. In short, we found opposing relationships between cell proliferation and epithelial-to-mesenchymal transformation (EMT) with regards to macrophage polarization. These results were consistent across multiple carcinomas in that proliferation was associated with a type 1 cell-mediated anti-tumor immune response and EMT was associated with a pro-tumor anti-inflammatory response. To address the identifiability of these networks from other datasets, we could identify the relationship between EMT and macrophage polarization with fewer samples when the Bayesian network was generated from malignant samples alone. However, the relationship between proliferation and macrophage polarization was identified with fewer samples when the samples were taken from a combination of the normal and malignant samples. © 2016 American Institute of Chemical Engineers Biotechnol. Prog., 32:470-479, 2016.
Collapse
Affiliation(s)
- Jacob L Kaiser
- Dept. of Microbiology, Immunology, and Cell Biology, West Virginia University, Morgantown, WV, 25606
| | - Cassidy L Bland
- Dept. of Chemical Engineering, West Virginia University, Morgantown, WV, 25606
| | - David J Klinke
- Dept. of Microbiology, Immunology, and Cell Biology, West Virginia University, Morgantown, WV, 25606.,Dept. of Chemical Engineering and Mary Babb Randolph Cancer Center, West Virginia University Morgantown, WV, 25606
| |
Collapse
|
35
|
Hart T, Dider S, Han W, Xu H, Zhao Z, Xie L. Toward Repurposing Metformin as a Precision Anti-Cancer Therapy Using Structural Systems Pharmacology. Sci Rep 2016; 6:20441. [PMID: 26841718 PMCID: PMC4740793 DOI: 10.1038/srep20441] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Accepted: 01/04/2016] [Indexed: 01/12/2023] Open
Abstract
Metformin, a drug prescribed to treat type-2 diabetes, exhibits anti-cancer effects in a portion of patients, but the direct molecular and genetic interactions leading to this pleiotropic effect have not yet been fully explored. To repurpose metformin as a precision anti-cancer therapy, we have developed a novel structural systems pharmacology approach to elucidate metformin's molecular basis and genetic biomarkers of action. We integrated structural proteome-scale drug target identification with network biology analysis by combining structural genomic, functional genomic, and interactomic data. Through searching the human structural proteome, we identified twenty putative metformin binding targets and their interaction models. We experimentally verified the interactions between metformin and our top-ranked kinase targets. Notably, kinases, particularly SGK1 and EGFR were identified as key molecular targets of metformin. Subsequently, we linked these putative binding targets to genes that do not directly bind to metformin but whose expressions are altered by metformin through protein-protein interactions, and identified network biomarkers of phenotypic response of metformin. The molecular targets and the key nodes in genetic networks are largely consistent with the existing experimental evidence. Their interactions can be affected by the observed cancer mutations. This study will shed new light into repurposing metformin for safe, effective, personalized therapies.
Collapse
Affiliation(s)
- Thomas Hart
- The Rockefeller University, New York, New York, United States of America
- Department of Biological Sciences, Hunter College, The City University of New York, New York, New York, United States of America
| | - Shihab Dider
- Department of Biological Sciences, Hunter College, The City University of New York, New York, New York, United States of America
| | - Weiwei Han
- The Key Laboratory for Molecular Enzymology and Engineering, Ministry of Education Jilin University, Changchun, P. R. China
| | - Hua Xu
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| | - Zhongming Zhao
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| | - Lei Xie
- Ph.D. Program in Computer Science, Biology, and Biochemistry, The Graduate Center, The City University of New York, New York, New York, United States of America
- Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
| |
Collapse
|
36
|
Zhao M, Kim P, Mitra R, Zhao J, Zhao Z. TSGene 2.0: an updated literature-based knowledgebase for tumor suppressor genes. Nucleic Acids Res 2015; 44:D1023-31. [PMID: 26590405 PMCID: PMC4702895 DOI: 10.1093/nar/gkv1268] [Citation(s) in RCA: 282] [Impact Index Per Article: 28.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 11/03/2015] [Indexed: 11/14/2022] Open
Abstract
Tumor suppressor genes (TSGs) are a major type of gatekeeper genes in the cell growth. A knowledgebase with the systematic collection and curation of TSGs in multiple cancer types is critically important for further studying their biological functions as well as for developing therapeutic strategies. Since its development in 2012, the Tumor Suppressor Gene database (TSGene), has become a popular resource in the cancer research community. Here, we reported the TSGene version 2.0, which has substantial updates of contents (e.g. up-to-date literature and pan-cancer genomic data collection and curation), data types (noncoding RNAs and protein-coding genes) and content accessibility. Specifically, the current TSGene 2.0 contains 1217 human TSGs (1018 protein-coding and 199 non-coding genes) curated from over 9000 articles. Additionally, TSGene 2.0 provides thousands of expression and mutation patterns derived from pan-cancer data of The Cancer Genome Atlas. A new web interface is available at http://bioinfo.mc.vanderbilt.edu/TSGene/. Systematic analyses of 199 non-coding TSGs provide numerous cancer-specific non-coding mutational events for further screening and clinical use. Intriguingly, we identified 49 protein-coding TSGs that were consistently down-regulated in 11 cancer types. In summary, TSGene 2.0, which is the only available database for TSGs, provides the most updated TSGs and their features in pan-cancer.
Collapse
Affiliation(s)
- Min Zhao
- School of Engineering, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Maroochydore DC, Queensland 4558, Australia
| | - Pora Kim
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Ramkrishna Mitra
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Junfei Zhao
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Zhongming Zhao
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, TN 37232, USA Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, TN 37212, USA School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| |
Collapse
|
37
|
Porta-Pardo E, Garcia-Alonso L, Hrabe T, Dopazo J, Godzik A. A Pan-Cancer Catalogue of Cancer Driver Protein Interaction Interfaces. PLoS Comput Biol 2015; 11:e1004518. [PMID: 26485003 PMCID: PMC4616621 DOI: 10.1371/journal.pcbi.1004518] [Citation(s) in RCA: 84] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2015] [Accepted: 08/21/2015] [Indexed: 12/19/2022] Open
Abstract
Despite their importance in maintaining the integrity of all cellular pathways, the role of mutations on protein-protein interaction (PPI) interfaces as cancer drivers has not been systematically studied. Here we analyzed the mutation patterns of the PPI interfaces from 10,028 proteins in a pan-cancer cohort of 5,989 tumors from 23 projects of The Cancer Genome Atlas (TCGA) to find interfaces enriched in somatic missense mutations. To that end we use e-Driver, an algorithm to analyze the mutation distribution of specific protein functional regions. We identified 103 PPI interfaces enriched in somatic cancer mutations. 32 of these interfaces are found in proteins coded by known cancer driver genes. The remaining 71 interfaces are found in proteins that have not been previously identified as cancer drivers even that, in most cases, there is an extensive literature suggesting they play an important role in cancer. Finally, we integrate these findings with clinical information to show how tumors apparently driven by the same gene have different behaviors, including patient outcomes, depending on which specific interfaces are mutated. Until now, most efforts in cancer genomics have focused on identifying genes and pathways driving tumor development. Although this has been unquestionably a success, as evidenced by the fact that we now have an extensive catalogue of cancer driver genes and pathways, there is still a poor understanding of why patients with the same affected driver genes may have different disease outcomes or drug responses. This is precisely the aim of this work-to show how by considering proteins as multifunctional factories instead of monolithic black boxes, it is possible to identify novel cancer driver genes and propose molecular hypotheses to explain such heterogeneity. To that end we have mapped the mutation profiles of 5,989 cancer patients from TCGA to more than 10,000 protein structures, leading us to identify 103 protein interaction interfaces enriched in somatic mutations. Finally, we have integrated clinical annotations as well as proteomics data to show how tumors apparently driven by the same gene can display different behaviors, including patient outcomes, depending on which specific interfaces are mutated.
Collapse
Affiliation(s)
- Eduard Porta-Pardo
- Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, La Jolla, California, United States of America
| | - Luz Garcia-Alonso
- European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Cambridge, United Kingdom
| | - Thomas Hrabe
- Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, La Jolla, California, United States of America
| | - Joaquin Dopazo
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
- Functional Genomics Node, (INB) at CIPF, Valencia, Spain
- Bioinformatics of Rare Diseases (BIER), CIBER de Enfermedades Raras (CIBERER), Valencia, Spain
- * E-mail: (JD); (AG)
| | - Adam Godzik
- Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, La Jolla, California, United States of America
- * E-mail: (JD); (AG)
| |
Collapse
|
38
|
Abstract
The acquisition of mutations that activate oncogenes or inactivate tumor suppressors is a primary feature of most cancers. Mutations that directly alter protein sequence and structure drive the development of tumors through aberrant expression and modification of proteins, in many cases directly impacting components of signal transduction pathways and cellular architecture. Cancer-associated mutations may have direct or indirect effects on proteins and their interactions and while the effects of mutations on signaling pathways have been widely studied, how mutations alter underlying protein-protein interaction networks is much less well understood. Systematic mapping of oncoprotein protein interactions using proteomics techniques as well as computational network analyses is revealing how oncoprotein mutations perturb protein-protein interaction networks and drive the cancer phenotype.
Collapse
Affiliation(s)
- Emily Bowler
- Centre for Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
| | - Zhenghe Wang
- Department of Genetics and Genome Science, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | - Rob M. Ewing
- Centre for Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
| |
Collapse
|
39
|
Niroula A, Vihinen M. Harmful somatic amino acid substitutions affect key pathways in cancers. BMC Med Genomics 2015; 8:53. [PMID: 26282678 PMCID: PMC4539680 DOI: 10.1186/s12920-015-0125-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 07/30/2015] [Indexed: 12/14/2022] Open
Abstract
Background Cancer is characterized by the accumulation of large numbers of genetic variations and alterations of multiple biological phenomena. Cancer genomics has largely focused on the identification of such genetic alterations and the genes containing them, known as ‘cancer genes’. However, the non-functional somatic variations out-number functional variations and remain as a major challenge. Recurrent somatic variations are thought to be cancer drivers but they are present in only a small fraction of patients. Methods We performed an extensive analysis of amino acid substitutions (AASs) from 6,861 cancer samples (whole genome or exome sequences) classified into 30 cancer types and performed pathway enrichment analysis. We also studied the overlap between the cancers based on proteins containing harmful AASs and pathways affected by them. Results We found that only a fraction of AASs (39.88 %) are harmful even in known cancer genes. In addition, we found that proteins containing harmful AASs in cancers are often centrally located in protein interaction networks. Based on the proteins containing harmful AASs, we identified significantly affected pathways in 28 cancer types and indicate that proteins containing harmful AASs can affect pathways despite the frequency of AASs in them. Our cross-cancer overlap analysis showed that it would be more beneficial to identify affected pathways in cancers rather than individual genes and variations. Conclusion Pathways affected by harmful AASs reveal key processes involved in cancer development. Our approach filters out the putative benign AASs thus reducing the list of cancer variations allowing reliable identification of affected pathways. The pathways identified in individual cancer and overlap between cancer types open avenues for further experimental research and for developing targeted therapies and interventions. Electronic supplementary material The online version of this article (doi:10.1186/s12920-015-0125-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Abhishek Niroula
- Department of Experimental Medical Science, Lund University, BMC B13, SE-22184, Lund, Sweden.
| | - Mauno Vihinen
- Department of Experimental Medical Science, Lund University, BMC B13, SE-22184, Lund, Sweden.
| |
Collapse
|
40
|
Berndt JD, Wong W. 2014: signaling breakthroughs of the year. Sci Signal 2015; 8:eg1. [PMID: 25564676 DOI: 10.1126/scisignal.aaa4696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The 2014 breakthroughs fell into four main areas: innate immunity, host-microbe interactions, cell death signaling, and methodological advances in the study of cell signaling. Nominations included new discoveries about signaling in innate immune cells, innate immune functions for lymphoid and nonhematological cells, and the importance of host-microbe interactions for the regulation of host physiology. Also this year, we received nominations highlighting molecular mechanisms by which p53 contributes to the pathology of chronic inflammation and how signaling pathways mediate programmed necrotic cell death. Finally, 2014 saw the use of new techniques to study cell signaling and identify drug targets, such as the in vivo use of RNA interference to study signaling in T cells and new computational methods to study large datasets of different data types.
Collapse
Affiliation(s)
- Jason D Berndt
- Associate Editor of Science Signaling, American Association for the Advancement of Science, 1200 New York Avenue, N.W., Washington, DC 20005, USA.
| | - Wei Wong
- Senior Editor of Science Signaling, American Association for the Advancement of Science, 1200 New York Avenue, N.W., Washington, DC 20005, USA.
| |
Collapse
|
41
|
Agarwal M, Adhil M, Talukder AK. Multi-omics Multi-scale Big Data Analytics for Cancer Genomics. BIG DATA ANALYTICS 2015. [DOI: 10.1007/978-3-319-27057-9_16] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
42
|
|