1
|
Santra T, Herrero A, Rodriguez J, von Kriegsheim A, Iglesias-Martinez LF, Schwarzl T, Higgins D, Aye TT, Heck AJR, Calvo F, Agudo-Ibáñez L, Crespo P, Matallanas D, Kolch W. An Integrated Global Analysis of Compartmentalized HRAS Signaling. Cell Rep 2020; 26:3100-3115.e7. [PMID: 30865897 DOI: 10.1016/j.celrep.2019.02.038] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 12/16/2018] [Accepted: 02/11/2019] [Indexed: 12/27/2022] Open
Abstract
Modern omics technologies allow us to obtain global information on different types of biological networks. However, integrating these different types of analyses into a coherent framework for a comprehensive biological interpretation remains challenging. Here, we present a conceptual framework that integrates protein interaction, phosphoproteomics, and transcriptomics data. Applying this method to analyze HRAS signaling from different subcellular compartments shows that spatially defined networks contribute specific functions to HRAS signaling. Changes in HRAS protein interactions at different sites lead to different kinase activation patterns that differentially regulate gene transcription. HRAS-mediated signaling is the strongest from the cell membrane, but it regulates the largest number of genes from the endoplasmic reticulum. The integrated networks provide a topologically and functionally resolved view of HRAS signaling. They reveal distinct HRAS functions including the control of cell migration from the endoplasmic reticulum and TP53-dependent cell survival when signaling from the Golgi apparatus.
Collapse
Affiliation(s)
- Tapesh Santra
- Systems Biology Ireland, University College Dublin, Belfield, Dublin, Ireland
| | - Ana Herrero
- Systems Biology Ireland, University College Dublin, Belfield, Dublin, Ireland
| | - Javier Rodriguez
- Systems Biology Ireland, University College Dublin, Belfield, Dublin, Ireland
| | - Alex von Kriegsheim
- Systems Biology Ireland, University College Dublin, Belfield, Dublin, Ireland
| | | | - Thomas Schwarzl
- Systems Biology Ireland, University College Dublin, Belfield, Dublin, Ireland
| | - Des Higgins
- Systems Biology Ireland, University College Dublin, Belfield, Dublin, Ireland; Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Belfield, Ireland; School of Medicine and Medical Science, University College Dublin, Belfield, Ireland
| | - Thin-Thin Aye
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Centre for Biomolecular Research and Utrecht Institute for Pharmaceutical Science, Utrecht University, Padualaan 8, 3584 Utrecht, the Netherlands
| | - Albert J R Heck
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Centre for Biomolecular Research and Utrecht Institute for Pharmaceutical Science, Utrecht University, Padualaan 8, 3584 Utrecht, the Netherlands
| | - Fernando Calvo
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Consejo Superior de Investigaciones Científicas (CSIC) - Universidad de Cantabria, Santander 39011, Spain
| | - Lorena Agudo-Ibáñez
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Consejo Superior de Investigaciones Científicas (CSIC) - Universidad de Cantabria, Santander 39011, Spain
| | - Piero Crespo
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Consejo Superior de Investigaciones Científicas (CSIC) - Universidad de Cantabria, Santander 39011, Spain; Centro de Investigación Biomédica en Red de Cáncer (CIBERONC), Instituto de Salud Carlos III, Madrid, Spain
| | - David Matallanas
- Systems Biology Ireland, University College Dublin, Belfield, Dublin, Ireland.
| | - Walter Kolch
- Systems Biology Ireland, University College Dublin, Belfield, Dublin, Ireland; Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Belfield, Ireland; School of Medicine and Medical Science, University College Dublin, Belfield, Ireland.
| |
Collapse
|
2
|
Kruppa J, Kramer F, Beißbarth T, Jung K. A simulation framework for correlated count data of features subsets in high-throughput sequencing or proteomics experiments. Stat Appl Genet Mol Biol 2016; 15:401-414. [PMID: 27655448 DOI: 10.1515/sagmb-2015-0082] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
As part of the data processing of high-throughput-sequencing experiments count data are produced representing the amount of reads that map to specific genomic regions. Count data also arise in mass spectrometric experiments for the detection of protein-protein interactions. For evaluating new computational methods for the analysis of sequencing count data or spectral count data from proteomics experiments artificial count data is thus required. Although, some methods for the generation of artificial sequencing count data have been proposed, all of them simulate single sequencing runs, omitting thus the correlation structure between the individual genomic features, or they are limited to specific structures. We propose to draw correlated data from the multivariate normal distribution and round these continuous data in order to obtain discrete counts. In our approach, the required distribution parameters can either be constructed in different ways or estimated from real count data. Because rounding affects the correlation structure we evaluate the use of shrinkage estimators that have already been used in the context of artificial expression data from DNA microarrays. Our approach turned out to be useful for the simulation of counts for defined subsets of features such as individual pathways or GO categories.
Collapse
|
3
|
A Bayesian algorithm for detecting differentially expressed proteins and its application in breast cancer research. Sci Rep 2016; 6:30159. [PMID: 27444576 PMCID: PMC4957118 DOI: 10.1038/srep30159] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Accepted: 06/28/2016] [Indexed: 02/07/2023] Open
Abstract
Presence of considerable noise and missing data points make analysis of mass-spectrometry (MS) based proteomic data a challenging task. The missing values in MS data are caused by the inability of MS machines to reliably detect proteins whose abundances fall below the detection limit. We developed a Bayesian algorithm that exploits this knowledge and uses missing data points as a complementary source of information to the observed protein intensities in order to find differentially expressed proteins by analysing MS based proteomic data. We compared its accuracy with many other methods using several simulated datasets. It consistently outperformed other methods. We then used it to analyse proteomic screens of a breast cancer (BC) patient cohort. It revealed large differences between the proteomic landscapes of triple negative and Luminal A, which are the most and least aggressive types of BC. Unexpectedly, majority of these differences could be attributed to the direct transcriptional activity of only seven transcription factors some of which are known to be inactive in triple negative BC. We also identified two new proteins which significantly correlated with the survival of BC patients, and therefore may have potential diagnostic/prognostic values.
Collapse
|
4
|
Jung K. Statistical Aspects in Proteomic Biomarker Discovery. Methods Mol Biol 2016; 1362:293-310. [PMID: 26519185 DOI: 10.1007/978-1-4939-3106-4_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
In the pursuit of a personalized medicine, i.e., the individual treatment of a patient, many medical decision problems are desired to be supported by biomarkers that can help to make a diagnosis, prediction, or prognosis. Proteomic biomarkers are of special interest since they can not only be detected in tissue samples but can also often be easily detected in diverse body fluids. Statistical methods play an important role in the discovery and validation of proteomic biomarkers. They are necessary in the planning of experiments, in the processing of raw signals, and in the final data analysis. This review provides an overview on the most frequent experimental settings including sample size considerations, and focuses on exploratory data analysis and classifier development.
Collapse
Affiliation(s)
- Klaus Jung
- Department of Medical Statistics, Georg-August-University Göttingen, Humboldtallee 32, 37073, Göttingen, Germany.
| |
Collapse
|
5
|
Kruppa J, Jung K. Set-Based Test Procedures for the Functional Analysis of Protein Lists from Differential Analysis. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2015; 1362:143-56. [PMID: 26519175 DOI: 10.1007/978-1-4939-3106-4_9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
The analysis of most high-throughput proteomics experiments involves the selection of differentially expressed proteins or peptides between two different sets of samples, e.g., from two experimental groups. As a result, a large list of selected features is reported, typically sorted by a measure for the expression fold change and a p-value from a statistical test. The biological interpretation of such a list is usually difficult since the features can typically be assigned to a large variety of biological classes. To facilitate the biological interpretation, set-based procedures focus on the analysis of feature subsets that all belong to the same biological class (e.g., same cellular component, biological process, molecular function, or pathway). Set-based procedures can roughly be divided into "enrichment methods" and "global test procedures," where the first involve all features of an experiment and the second only those features of a particular set. In this chapter we detail the working principle of these kind of statistical methods and describe how features can be classified into molecular subsets. We illustrate the use of the methods on a data example from a proteomics Parkinson study.
Collapse
Affiliation(s)
- Jochen Kruppa
- Department of Medical Statistics, Georg-August-University Göttingen, Humboldtallee 32, 37073, Göttingen, Germany
| | - Klaus Jung
- Department of Medical Statistics, Georg-August-University Göttingen, Humboldtallee 32, 37073, Göttingen, Germany.
| |
Collapse
|
6
|
Kammers K, Cole RN, Tiengwe C, Ruczinski I. Detecting Significant Changes in Protein Abundance. EUPA OPEN PROTEOMICS 2015; 7:11-19. [PMID: 25821719 DOI: 10.1016/j.euprot.2015.02.002] [Citation(s) in RCA: 201] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
We review and demonstrate how an empirical Bayes method, shrinking a protein's sample variance towards a pooled estimate, leads to far more powerful and stable inference to detect significant changes in protein abundance compared to ordinary t-tests. Using examples from isobaric mass labeled proteomic experiments we show how to analyze data from multiple experiments simultaneously, and discuss the effects of missing data on the inference. We also present easy to use open source software for normalization of mass spectrometry data and inference based on moderated test statistics.
Collapse
Affiliation(s)
- Kai Kammers
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Robert N Cole
- Mass Spectrometry and Proteomics Core Facility, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Calvin Tiengwe
- Department of Cell Biology, Johns Hopkins University School of Medicine, Baltimore, MD, USA. ; Department of Microbiology and Immunology, School of Medicine and Biomedical Sciences, University at Bu alo, Bu alo, NY, USA
| | - Ingo Ruczinski
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| |
Collapse
|
7
|
Huan T, Li L. Counting Missing Values in a Metabolite-Intensity Data Set for Measuring the Analytical Performance of a Metabolomics Platform. Anal Chem 2014; 87:1306-13. [DOI: 10.1021/ac5039994] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Affiliation(s)
- Tao Huan
- Department of Chemistry, University of Alberta, Edmonton, Alberta T6G2G2, Canada
| | - Liang Li
- Department of Chemistry, University of Alberta, Edmonton, Alberta T6G2G2, Canada
| |
Collapse
|