1
|
van Bentum M, Selbach M. An Introduction to Advanced Targeted Acquisition Methods. Mol Cell Proteomics 2021; 20:100165. [PMID: 34673283 PMCID: PMC8600983 DOI: 10.1016/j.mcpro.2021.100165] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2021] [Revised: 10/11/2021] [Accepted: 10/13/2021] [Indexed: 01/13/2023] Open
Abstract
Targeted proteomics via selected reaction monitoring (SRM) or parallel reaction monitoring (PRM) enables fast and sensitive detection of a preselected set of target peptides. However, the number of peptides that can be monitored in conventional targeting methods is usually rather small. Recently, a series of methods has been described that employ intelligent acquisition strategies to increase the efficiency of mass spectrometers to detect target peptides. These methods are based on one of two strategies. First, retention time adjustment-based methods enable intelligent scheduling of target peptide retention times. These include Picky, iRT, as well as spike-in free real-time adjustment methods such as MaxQuant.Live. Second, in spike-in triggered acquisition methods such as SureQuant, Pseudo-PRM, TOMAHAQ, and Scout-MRM, targeted scans are initiated by abundant labeled synthetic peptides added to samples before the run. Both strategies enable the mass spectrometer to better focus data acquisition time on target peptides. This either enables more sensitive detection or a higher number of targets per run. Here, we provide an overview of available advanced targeting methods and highlight their intrinsic strengths and weaknesses and compatibility with specific experimental setups. Our goal is to provide a basic introduction to advanced targeting methods for people starting to work in this field.
Collapse
Affiliation(s)
- Mirjam van Bentum
- Proteome Dynamics, Max Delbrück Center for Molecular Medicine, Berlin, Germany; Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Matthias Selbach
- Proteome Dynamics, Max Delbrück Center for Molecular Medicine, Berlin, Germany; Charité-Universitätsmedizin Berlin, Berlin, Germany.
| |
Collapse
|
2
|
Wen B, Zeng W, Liao Y, Shi Z, Savage SR, Jiang W, Zhang B. Deep Learning in Proteomics. Proteomics 2020; 20:e1900335. [PMID: 32939979 PMCID: PMC7757195 DOI: 10.1002/pmic.201900335] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 09/14/2020] [Indexed: 12/17/2022]
Abstract
Proteomics, the study of all the proteins in biological systems, is becoming a data-rich science. Protein sequences and structures are comprehensively catalogued in online databases. With recent advancements in tandem mass spectrometry (MS) technology, protein expression and post-translational modifications (PTMs) can be studied in a variety of biological systems at the global scale. Sophisticated computational algorithms are needed to translate the vast amount of data into novel biological insights. Deep learning automatically extracts data representations at high levels of abstraction from data, and it thrives in data-rich scientific research domains. Here, a comprehensive overview of deep learning applications in proteomics, including retention time prediction, MS/MS spectrum prediction, de novo peptide sequencing, PTM prediction, major histocompatibility complex-peptide binding prediction, and protein structure prediction, is provided. Limitations and the future directions of deep learning in proteomics are also discussed. This review will provide readers an overview of deep learning and how it can be used to analyze proteomics data.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Wen‐Feng Zeng
- Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS)Chinese Academy of SciencesInstitute of Computing TechnologyBeijing100190China
| | - Yuxing Liao
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Zhiao Shi
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Sara R. Savage
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Wen Jiang
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Bing Zhang
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| |
Collapse
|
3
|
Wen B, Li K, Zhang Y, Zhang B. Cancer neoantigen prioritization through sensitive and reliable proteogenomics analysis. Nat Commun 2020; 11:1759. [PMID: 32273506 PMCID: PMC7145864 DOI: 10.1038/s41467-020-15456-w] [Citation(s) in RCA: 89] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 03/10/2020] [Indexed: 01/01/2023] Open
Abstract
Genomics-based neoantigen discovery can be enhanced by proteomic evidence, but there remains a lack of consensus on the performance of different quality control methods for variant peptide identification in proteogenomics. We propose to use the difference between accurately predicted and observed retention times for each peptide as a metric to evaluate different quality control methods. To this end, we develop AutoRT, a deep learning algorithm with high accuracy in retention time prediction. Analysis of three cancer data sets with a total of 287 tumor samples using different quality control strategies results in substantially different numbers of identified variant peptides and putative neoantigens. Our systematic evaluation, using the proposed retention time metric, provides insights and practical guidance on the selection of quality control strategies. We implement the recommended strategy in a computational workflow named NeoFlow to support proteogenomics-based neoantigen prioritization, enabling more sensitive discovery of putative neoantigens. Identifying mutation-derived neoantigens by proteogenomics requires robust strategies for quality control. Here, the authors propose peptide retention time as an evaluation metric for proteogenomics quality control methods, and develop a deep learning algorithm for accurate retention time prediction.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Yun Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA. .,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.
| |
Collapse
|
4
|
Noor Z, Ranganathan S. Bioinformatics approaches for improving seminal plasma proteome analysis. Theriogenology 2019; 137:43-49. [PMID: 31186128 DOI: 10.1016/j.theriogenology.2019.05.036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Reproduction efficiency of male animals is one of the key factors influencing the sustainability of livestock. Mass spectrometry (MS) based proteomics has become an important tool for studying seminal plasma proteomes. In this review, we summarize bioinformatics analysis strategies for current proteomics approaches, for identifying novel biomarkers of reproductive robustness.
Collapse
Affiliation(s)
- Zainab Noor
- Department of Molecular Sciences, Macquarie University, Sydney, Australia
| | - Shoba Ranganathan
- Department of Molecular Sciences, Macquarie University, Sydney, Australia.
| |
Collapse
|
5
|
Chen AT, Franks A, Slavov N. DART-ID increases single-cell proteome coverage. PLoS Comput Biol 2019; 15:e1007082. [PMID: 31260443 PMCID: PMC6625733 DOI: 10.1371/journal.pcbi.1007082] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Revised: 07/12/2019] [Accepted: 05/06/2019] [Indexed: 01/09/2023] Open
Abstract
Analysis by liquid chromatography and tandem mass spectrometry (LC-MS/MS) can identify and quantify thousands of proteins in microgram-level samples, such as those comprised of thousands of cells. This process, however, remains challenging for smaller samples, such as the proteomes of single mammalian cells, because reduced protein levels reduce the number of confidently sequenced peptides. To alleviate this reduction, we developed Data-driven Alignment of Retention Times for IDentification (DART-ID). DART-ID implements principled Bayesian frameworks for global retention time (RT) alignment and for incorporating RT estimates towards improved confidence estimates of peptide-spectrum-matches. When applied to bulk or to single-cell samples, DART-ID increased the number of data points by 30-50% at 1% FDR, and thus decreased missing data. Benchmarks indicate excellent quantification of peptides upgraded by DART-ID and support their utility for quantitative analysis, such as identifying cell types and cell-type specific proteins. The additional datapoints provided by DART-ID boost the statistical power and double the number of proteins identified as differentially abundant in monocytes and T-cells. DART-ID can be applied to diverse experimental designs and is freely available at http://dart-id.slavovlab.net.
Collapse
Affiliation(s)
- Albert Tian Chen
- Department of Bioengineering, Northeastern University, Boston, Massachusetts, United States of America
- Barnett Institute, Northeastern University, Boston, Massachusetts, United States of America
| | - Alexander Franks
- Department of Statistics and Applied Probability, University of California Santa Barbara, California, United States of America
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, Massachusetts, United States of America
- Barnett Institute, Northeastern University, Boston, Massachusetts, United States of America
- Department of Biology, Northeastern University, Boston, Massachusetts, United States of America
| |
Collapse
|
6
|
Bouwmeester R, Martens L, Degroeve S. Comprehensive and Empirical Evaluation of Machine Learning Algorithms for Small Molecule LC Retention Time Prediction. Anal Chem 2019; 91:3694-3703. [DOI: 10.1021/acs.analchem.8b05820] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Affiliation(s)
- Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Sven Degroeve
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| |
Collapse
|
7
|
Ma C, Ren Y, Yang J, Ren Z, Yang H, Liu S. Improved Peptide Retention Time Prediction in Liquid Chromatography through Deep Learning. Anal Chem 2018; 90:10881-10888. [PMID: 30114359 DOI: 10.1021/acs.analchem.8b02386] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The accuracy of peptide retention time (RT) prediction model in liquid chromatography (LC) is still not sufficient for wider implementation in proteomics practice. Herein, we propose deep learning as an ideal tool to considerably improve this prediction. A new peptide RT prediction tool, DeepRT, was designed using a capsule network model, and the public data sets containing peptides separated by reverse-phase liquid chromatography were used to evaluate the DeepRT performance. Compared with other prevailing RT predictors, DeepRT attained overall improvement in the prediction of peptide RTs with an R2 of ∼0.994. Moreover, DeepRT was able to accommodate to the peptides that were separated by different types of LC, such as strong cation exchange (SCX) and hydrophilic interaction liquid chromatography (HILIC) and to reach the RT prediction with R2 values of ∼0.996 for SCX and ∼0.993 for HILIC, respectively. If a large peptide data set is available for one type of LC, DeepRT can be promoted to DeepRT(+) using transfer learning. Based on a large peptide data set gained from SWATH, DeepRT(+) further elevated the accuracy of RT prediction for peptides in a small data set and enabled a satisfactory prediction upon limited peptides approximating hundreds. Further, DeepRT automatically learns retention-related properties of amino acids under different separation mechanisms, which are well consistent with retention coefficients (Rc) of the amino acids. DeepRT was thus proven to be an improved RT predictor with high flexibility and efficiency. DeepRT is available at https://github.com/horsepurve/DeepRTplus .
Collapse
Affiliation(s)
- Chunwei Ma
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| | - Yan Ren
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| | - Jiarui Yang
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| | - Zhe Ren
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| | - Huanming Yang
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,James D. Watson Institute of Genome Sciences , Hangzhou 310008 , China
| | - Siqi Liu
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| |
Collapse
|
8
|
A novel strategy for retention prediction of nucleic acids with their sequence information in ion-pair reversed phase liquid chromatography. Talanta 2018; 185:592-601. [DOI: 10.1016/j.talanta.2018.04.030] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Revised: 04/02/2018] [Accepted: 04/07/2018] [Indexed: 12/18/2022]
|
9
|
Yang J, Xie Q, Zhou H, Chang L, Wei W, Wang Y, Li H, Deng Z, Xiao Y, Wu J, Xu P, Hong X. Proteomic Analysis and NIR-II Imaging of MCM2 Protein in Hepatocellular Carcinoma. J Proteome Res 2018; 17:2428-2439. [DOI: 10.1021/acs.jproteome.8b00181] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Jing Yang
- State Key Laboratory of Virology, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (MOE) and Zhongnan Hospital of Wuhan University, Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, Beijing 102206, China
| | - Qi Xie
- State Key Laboratory of Virology, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (MOE) and Zhongnan Hospital of Wuhan University, Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, Beijing 102206, China
- Center for Experimental Basic Medical Education, School of Basic Medical Sciences, Hubei Province Engineering and Technology Research Center for Fluorinated Pharmaceuticals and Hubei Provincial Key Laboratory of Developmentally Originated Disease, Wuhan University, Wuhan 430071, China
| | - Hui Zhou
- State Key Laboratory of Virology, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (MOE) and Zhongnan Hospital of Wuhan University, Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
| | - Lei Chang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, Beijing 102206, China
| | - Wei Wei
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yin Wang
- State Key Laboratory of Virology, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (MOE) and Zhongnan Hospital of Wuhan University, Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, Beijing 102206, China
| | - Hong Li
- Pathology Department, Binzhou Medical University Hospital, Binzhou 256600, China
| | - Zixin Deng
- State Key Laboratory of Virology, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (MOE) and Zhongnan Hospital of Wuhan University, Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
| | - Yuling Xiao
- State Key Laboratory of Virology, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (MOE) and Zhongnan Hospital of Wuhan University, Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
| | - Junzhu Wu
- Center for Experimental Basic Medical Education, School of Basic Medical Sciences, Hubei Province Engineering and Technology Research Center for Fluorinated Pharmaceuticals and Hubei Provincial Key Laboratory of Developmentally Originated Disease, Wuhan University, Wuhan 430071, China
| | - Ping Xu
- State Key Laboratory of Virology, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (MOE) and Zhongnan Hospital of Wuhan University, Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, Beijing 102206, China
- Anhui Medical University, Hefei 230032, China
| | - Xuechuan Hong
- State Key Laboratory of Virology, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (MOE) and Zhongnan Hospital of Wuhan University, Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
- Medical College, Tibet University, Lasa 850000, China
| |
Collapse
|