1
|
Sing JC, Charkow J, AlHigaylan M, Horecka I, Xu L, Röst HL. MassDash: A Web-Based Dashboard for Data-Independent Acquisition Mass Spectrometry Visualization. J Proteome Res 2024. [PMID: 38684072 DOI: 10.1021/acs.jproteome.4c00026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2024]
Abstract
With the increased usage and diversity of methods and instruments being applied to analyze Data-Independent Acquisition (DIA) data, visualization is becoming increasingly important to validate automated software results. Here we present MassDash, a cross-platform DIA mass spectrometry visualization and validation software for comparing features and results across popular tools. MassDash provides a web-based interface and Python package for interactive feature visualizations and summary report plots across multiple automated DIA feature detection tools, including OpenSwath, DIA-NN, and dreamDIA. Furthermore, MassDash processes peptides on the fly, enabling interactive visualization of peptides across dozens of runs simultaneously on a personal computer. MassDash supports various multidimensional visualizations across retention time, ion mobility, m/z, and intensity, providing additional insights into the data. The modular framework is easily extendable, enabling rapid algorithm development of novel peak-picker techniques, such as deep-learning-based approaches and refinement of existing tools. MassDash is open-source under a BSD 3-Clause license and freely available at https://github.com/Roestlab/massdash, and a demo version can be accessed at https://massdash.streamlit.app.
Collapse
Affiliation(s)
- Justin C Sing
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Joshua Charkow
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Mohammed AlHigaylan
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Ira Horecka
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Leon Xu
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Hannes L Röst
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| |
Collapse
|
2
|
MSSort-DIAXMBD: A deep learning classification tool of the peptide precursors quantified by OpenSWATH. J Proteomics 2022; 259:104542. [DOI: 10.1016/j.jprot.2022.104542] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 02/17/2022] [Accepted: 02/18/2022] [Indexed: 11/21/2022]
|
3
|
Zhang F, Yu S, Wu L, Zang Z, Yi X, Zhu J, Lu C, Sun P, Sun Y, Selvarajan S, Chen L, Teng X, Zhao Y, Wang G, Xiao J, Huang S, Kon OL, Iyer NG, Li SZ, Luan Z, Guo T. Phenotype Classification using Proteome Data in a Data-Independent Acquisition Tensor Format. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2020; 31:2296-2304. [PMID: 33104352 DOI: 10.1021/jasms.0c00254] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
A novel approach for phenotype prediction is developed for data-independent acquisition (DIA) mass spectrometric (MS) data without the need for peptide precursor identification using existing DIA software tools. The first step converts the DIA-MS data file into a new file format called DIA tensor (DIAT), which can be used for the convenient visualization of all the ions from peptide precursors and fragments. DIAT files can be fed directly into a deep neural network to predict phenotypes such as appearances of cats, dogs, and microscopic images. As a proof of principle, we applied this approach to 102 hepatocellular carcinoma samples and achieved an accuracy of 96.8% in distinguishing malignant from benign samples. We further applied a refined model to classify thyroid nodules. Deep learning based on 492 training samples achieved an accuracy of 91.7% in an independent cohort of 216 test samples. This approach surpassed the deep-learning model based on peptide and protein matrices generated by OpenSWATH. In summary, we present a new strategy for DIA data analysis based on a novel data format called DIAT, which enables facile two-dimensional visualization of DIA proteomics data. DIAT files can be directly used for deep learning for biological and clinical phenotype classification. Future research will interpret the deep-learning models emerged from DIAT analysis.
Collapse
Affiliation(s)
- Fangfei Zhang
- Zhejiang Provincial Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, 18 Shilongshan Road, Hangzhou 310024, Zhejiang Province, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou 310024, Zhejiang Province, China
| | - Shaoyang Yu
- Zhejiang Provincial Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, 18 Shilongshan Road, Hangzhou 310024, Zhejiang Province, China
- Sino-German Joint Software Institute (JSI), Beihang University, Beijing 100191, China
| | - Lirong Wu
- Center for AI Research and Innovation (CAIRI), School of Engineering, Westlake University, Hangzhou 310024, China
| | - Zelin Zang
- Center for AI Research and Innovation (CAIRI), School of Engineering, Westlake University, Hangzhou 310024, China
| | - Xiao Yi
- Zhejiang Provincial Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, 18 Shilongshan Road, Hangzhou 310024, Zhejiang Province, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou 310024, Zhejiang Province, China
| | - Jiang Zhu
- Center for Stem Cell Research and Application, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, Hubei, China
| | - Cong Lu
- Center for Stem Cell Research and Application, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, Hubei, China
| | - Ping Sun
- Department of Hepatobiliary Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, Hubei, China
| | - Yaoting Sun
- Zhejiang Provincial Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, 18 Shilongshan Road, Hangzhou 310024, Zhejiang Province, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou 310024, Zhejiang Province, China
| | | | - Lirong Chen
- Department of Pathology, The Second Affiliated Hospital of College of Medicine, Zhejiang University, Hangzhou 310009, China
| | - Xiaodong Teng
- Department of Pathology, the First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou 310003, China
| | - Yongfu Zhao
- Department of General Surgery, The Second Hospital of Dalian Medical University, Dalian 116027, China
| | - Guangzhi Wang
- Department of General Surgery, The Second Hospital of Dalian Medical University, Dalian 116027, China
| | - Junhong Xiao
- Division of Surgical Oncology, National Cancer Centre Singapore, Singapore 169610, Republic of Singapore
| | - Shiang Huang
- Center for Stem Cell Research and Application, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, Hubei, China
| | - Oi Lian Kon
- Division of Medical Sciences, National Cancer Centre Singapore, Singapore 169610, Republic of Singapore
| | - N Gopalakrishna Iyer
- Division of Surgical Oncology, National Cancer Centre Singapore, Singapore 169610, Republic of Singapore
- Division of Medical Sciences, National Cancer Centre Singapore, Singapore 169610, Republic of Singapore
| | - Stan Z Li
- Center for AI Research and Innovation (CAIRI), School of Engineering, Westlake University, Hangzhou 310024, China
| | - Zhongzhi Luan
- Sino-German Joint Software Institute (JSI), Beihang University, Beijing 100191, China
| | - Tiannan Guo
- Zhejiang Provincial Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, 18 Shilongshan Road, Hangzhou 310024, Zhejiang Province, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou 310024, Zhejiang Province, China
| |
Collapse
|
4
|
Zhang F, Ge W, Ruan G, Cai X, Guo T. Data‐Independent Acquisition Mass Spectrometry‐Based Proteomics and Software Tools: A Glimpse in 2020. Proteomics 2020; 20:e1900276. [DOI: 10.1002/pmic.201900276] [Citation(s) in RCA: 220] [Impact Index Per Article: 44.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 03/27/2020] [Indexed: 01/02/2023]
Affiliation(s)
- Fangfei Zhang
- Key Laboratory of Structural Biology of Zhejiang ProvinceSchool of Life SciencesWestlake University 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
- Institute of Basic Medical SciencesWestlake Institute for Advanced Study 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
| | - Weigang Ge
- Key Laboratory of Structural Biology of Zhejiang ProvinceSchool of Life SciencesWestlake University 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
- Institute of Basic Medical SciencesWestlake Institute for Advanced Study 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
| | - Guan Ruan
- Key Laboratory of Structural Biology of Zhejiang ProvinceSchool of Life SciencesWestlake University 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
- Institute of Basic Medical SciencesWestlake Institute for Advanced Study 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
| | - Xue Cai
- Key Laboratory of Structural Biology of Zhejiang ProvinceSchool of Life SciencesWestlake University 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
- Institute of Basic Medical SciencesWestlake Institute for Advanced Study 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
| | - Tiannan Guo
- Key Laboratory of Structural Biology of Zhejiang ProvinceSchool of Life SciencesWestlake University 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
- Institute of Basic Medical SciencesWestlake Institute for Advanced Study 18 Shilongshan Road Hangzhou Zhejiang Province 310024 China
| |
Collapse
|
5
|
Gupta S, Sing J, Mahmoodi A, Röst H. DrawAlignR: An Interactive Tool for Across Run Chromatogram Alignment Visualization. Proteomics 2020; 20:e1900353. [PMID: 32418354 DOI: 10.1002/pmic.201900353] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Revised: 04/16/2020] [Indexed: 12/23/2022]
Abstract
Multi-run alignment is widely used in proteomics to establish analyte correspondence across runs. Generally alignment algorithms return a cumulative score, which may not be easily interpretable for each peptide. Here a novel and interactive tool for cross-run chromatogram alignment visualization (DrawAlignR) of data-independent acquisition (DIA) data is presented. Furthermore, a novel C++ based implementation of raw chromatogram alignment which is 35 times faster than the previously published algorithm is developed. This not only enables users to plot alignment interactively by DrawAlignR, but also allows other software platforms to use the algorithm. DrawAlignR is an open-source web application using R Shiny that can be hosted using the source-code available at https://github.com/Roestlab/DrawAlignR.
Collapse
Affiliation(s)
- Shubham Gupta
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5G 1A8, Canada.,Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, M5S 3E1, Canada
| | - Justin Sing
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5G 1A8, Canada.,Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, M5S 3E1, Canada
| | - Arshia Mahmoodi
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, M5S 3E1, Canada.,Department of Cell and Systems Biology, University of Toronto, Toronto, ON, M5G 1A8, Canada
| | - Hannes Röst
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5G 1A8, Canada.,Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, M5S 3E1, Canada
| |
Collapse
|
6
|
Anjo SI, Santa C, Manadas B. SWATH-MS as a tool for biomarker discovery: From basic research to clinical applications. Proteomics 2017; 17. [DOI: 10.1002/pmic.201600278] [Citation(s) in RCA: 102] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Revised: 01/05/2017] [Accepted: 01/23/2017] [Indexed: 12/16/2022]
Affiliation(s)
- Sandra Isabel Anjo
- CNC - Center for Neuroscience and Cell Biology; University of Coimbra; Coimbra Portugal
- Faculty of Sciences and Technology; University of Coimbra; Coimbra Portugal
| | - Cátia Santa
- CNC - Center for Neuroscience and Cell Biology; University of Coimbra; Coimbra Portugal
- Institute for Interdisciplinary Research (III); University of Coimbra; Coimbra Portugal
| | - Bruno Manadas
- CNC - Center for Neuroscience and Cell Biology; University of Coimbra; Coimbra Portugal
| |
Collapse
|
7
|
Röst HL, Aebersold R, Schubert OT. Automated SWATH Data Analysis Using Targeted Extraction of Ion Chromatograms. Methods Mol Biol 2017; 1550:289-307. [PMID: 28188537 DOI: 10.1007/978-1-4939-6747-6_20] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Targeted mass spectrometry comprises a set of methods able to quantify protein analytes in complex mixtures with high accuracy and sensitivity. These methods, e.g., Selected Reaction Monitoring (SRM) and SWATH MS, use specific mass spectrometric coordinates (assays) for reproducible detection and quantification of proteins. In this protocol, we describe how to analyze, in a targeted manner, data from a SWATH MS experiment aimed at monitoring thousands of proteins reproducibly over many samples. We present a standard SWATH MS analysis workflow, including manual data analysis for quality control (based on Skyline) as well as automated data analysis with appropriate control of error rates (based on the OpenSWATH workflow). We also discuss considerations to ensure maximal coverage, reproducibility, and quantitative accuracy.
Collapse
Affiliation(s)
- Hannes L Röst
- Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Zurich, Switzerland.
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA.
| | - Ruedi Aebersold
- Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Zurich, Switzerland.
- Faculty of Science, University of Zurich, CH-8057, Zurich, Switzerland.
| | - Olga T Schubert
- Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Zurich, Switzerland.
- Department of Human Genetics, University of California Los Angeles, Los Angeles, CA, 90095, USA.
| |
Collapse
|
8
|
Röst HL, Malmström L, Aebersold R. Reproducible quantitative proteotype data matrices for systems biology. Mol Biol Cell 2016; 26:3926-31. [PMID: 26543201 PMCID: PMC4710225 DOI: 10.1091/mbc.e15-07-0507] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Historically, many mass spectrometry–based proteomic studies have aimed at compiling an inventory of protein compounds present in a biological sample, with the long-term objective of creating a proteome map of a species. However, to answer fundamental questions about the behavior of biological systems at the protein level, accurate and unbiased quantitative data are required in addition to a list of all protein components. Fueled by advances in mass spectrometry, the proteomics field has thus recently shifted focus toward the reproducible quantification of proteins across a large number of biological samples. This provides the foundation to move away from pure enumeration of identified proteins toward quantitative matrices of many proteins measured across multiple samples. It is argued here that data matrices consisting of highly reproducible, quantitative, and unbiased proteomic measurements across a high number of conditions, referred to here as quantitative proteotype maps, will become the fundamental currency in the field and provide the starting point for downstream biological analysis. Such proteotype data matrices, for example, are generated by the measurement of large patient cohorts, time series, or multiple experimental perturbations. They are expected to have a large effect on systems biology and personalized medicine approaches that investigate the dynamic behavior of biological systems across multiple perturbations, time points, and individuals.
Collapse
Affiliation(s)
- Hannes L Röst
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093 Zurich, Switzerland Department of Genetics, Stanford University, Stanford, CA 94305
| | - Lars Malmström
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093 Zurich, Switzerland S3IT, University of Zurich, CH-8057 Zurich, Switzerland
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093 Zurich, Switzerland Faculty of Science, University of Zurich, CH-8057 Zurich, Switzerland
| |
Collapse
|
9
|
Röst HL, Sachsenberg T, Aiche S, Bielow C, Weisser H, Aicheler F, Andreotti S, Ehrlich HC, Gutenbrunner P, Kenar E, Liang X, Nahnsen S, Nilse L, Pfeuffer J, Rosenberger G, Rurik M, Schmitt U, Veit J, Walzer M, Wojnar D, Wolski WE, Schilling O, Choudhary JS, Malmström L, Aebersold R, Reinert K, Kohlbacher O. OpenMS: a flexible open-source software platform for mass spectrometry data analysis. Nat Methods 2016; 13:741-8. [DOI: 10.1038/nmeth.3959] [Citation(s) in RCA: 461] [Impact Index Per Article: 51.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 06/27/2016] [Indexed: 12/28/2022]
|
10
|
Misra BB, van der Hooft JJJ. Updates in metabolomics tools and resources: 2014-2015. Electrophoresis 2015; 37:86-110. [DOI: 10.1002/elps.201500417] [Citation(s) in RCA: 84] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2015] [Revised: 10/04/2015] [Accepted: 10/05/2015] [Indexed: 12/12/2022]
Affiliation(s)
- Biswapriya B. Misra
- Department of Biology, Genetics Institute; University of Florida; Gainesville FL USA
| | | |
Collapse
|