1
|
Wei Q, Li J, He QY, Chen Y, Zhang G. Identifying PE2 and PE5 Proteins from Existing Mass Spectrometry Data Using pFind. J Proteome Res 2024; 23:2323-2331. [PMID: 38865581 DOI: 10.1021/acs.jproteome.3c00674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2024]
Abstract
The Chromosome-Centric Human Proteome Project (C-HPP) aims to identify all proteins encoded by the human genome. Currently, the human proteome still contains approximately 2000 PE2-PE5 proteins, referring to annotated coding genes that lack sufficient protein-level evidence. During the past 10 years, it has been increasingly difficult to identify PE2-PE5 proteins in C-HPP approaches due to the limited occurrence. Therefore, we proposed that reanalyzing massive MS data sets in repository with newly developed algorithms may increase the occurrence of the peptides of these proteins. In this study, we downloaded 1000 MS data sets via the ProteomeXchange database. Using pFind software, we identified peptides referring to 1788 PE2-PE5 proteins. Among them, 11 PE2 and 16 PE5 proteins were identified with at least 2 peptides, and 12 of them were identified using 2 peptides in a single data set, following the criteria of the HPP guidelines. We found translation evidence for 16 of the 11 PE2 and 16 PE5 proteins in our RNC-seq data, supporting their existence. The properties of the PE2 and PE5 proteins were similar to those of the PE1 proteins. Our approach demonstrated that mining PE2 and PE5 proteins in massive data repository is still worthy, and multidata set peptide identifications may support the presence of PE2 and PE5 proteins or at least prompt additional studies for validation. Extremely high throughput could be a solution to finding more PE2 and PE5 proteins.
Collapse
Affiliation(s)
- Qianzhou Wei
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Jiamin Li
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Qing-Yu He
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Yang Chen
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Gong Zhang
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| |
Collapse
|
2
|
Parkes R, Garcia TX. Bringing proteomics to bear on male fertility: key lessons. Expert Rev Proteomics 2024; 21:181-203. [PMID: 38536015 PMCID: PMC11426281 DOI: 10.1080/14789450.2024.2327553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 02/07/2024] [Indexed: 04/11/2024]
Abstract
INTRODUCTION Male infertility is a major public health concern globally. Proteomics has revolutionized our comprehension of male fertility by identifying potential infertility biomarkers and reproductive defects. Studies comparing sperm proteome with other male reproductive tissues have the potential to refine fertility diagnostics and guide infertility treatment development. AREAS COVERED This review encapsulates literature using proteomic approaches to progress male reproductive biology. Our search methodology included systematic searches of databases such as PubMed, Scopus, and Web of Science for articles up to 2023. Keywords used included 'male fertility proteomics,' 'spermatozoa proteome,' 'testis proteomics,' 'epididymal proteomics,' and 'non-hormonal male contraception.' Inclusion criteria were robust experimental design, significant contributions to male fertility, and novel use of proteomic technologies. EXPERT OPINION Expert analysis shows a shift from traditional research to an integrative approach that clarifies male reproductive health's molecular intricacies. A gap exists between proteomic discoveries and clinical application. The expert opinions consolidated here not only navigate the current findings but also chart the future proteomic applications for scientific and clinical breakthroughs. We underscore the need for continued investment in proteomic research - both in the technological and collaborative arenas - to further unravel the secrets of male fertility, which will be central to resolving fertility issues in the coming era.
Collapse
Affiliation(s)
- Rachel Parkes
- Center for Drug Discovery, Baylor College of Medicine
- Department of Pathology & Immunology, Baylor College of Medicine
| | - Thomas X. Garcia
- Center for Drug Discovery, Baylor College of Medicine
- Department of Pathology & Immunology, Baylor College of Medicine
- Scott Department of Urology, Baylor College of Medicine
| |
Collapse
|
3
|
Kwiatkowski M, Hotze M, Schumacher J, Asif AR, Pittol JMR, Brenig B, Ramljak S, Zischler H, Herlyn H. Protein speciation is likely to increase the chance of proteins to be determined in 2‐DE/MS. Electrophoresis 2022; 43:1203-1214. [DOI: 10.1002/elps.202000393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 11/30/2021] [Accepted: 02/02/2022] [Indexed: 11/08/2022]
Affiliation(s)
- Marcel Kwiatkowski
- Department of Biochemistry and Center for Molecular Biosciences Innsbruck University of Innsbruck Innsbruck Austria
| | - Madlen Hotze
- Department of Biochemistry and Center for Molecular Biosciences Innsbruck University of Innsbruck Innsbruck Austria
| | | | - Abdul R. Asif
- Department of Clinical Chemistry/UMG‐Laboratories University Medical Center Göttingen Germany
| | - Jose Miguel Ramos Pittol
- Department of Biochemistry and Center for Molecular Biosciences Innsbruck University of Innsbruck Innsbruck Austria
| | - Bertram Brenig
- Department of Molecular Biology of Livestock Institute of Veterinary Medicine University of Göttingen Göttingen Germany
| | | | - Hans Zischler
- Institute of Organismic and Molecular Evolution, Anthropology University of Mainz Mainz Germany
| | - Holger Herlyn
- Institute of Organismic and Molecular Evolution, Anthropology University of Mainz Mainz Germany
| |
Collapse
|
4
|
Lautenbacher L, Samaras P, Muller J, Grafberger A, Shraideh M, Rank J, Fuchs ST, Schmidt TK, The M, Dallago C, Wittges H, Rost B, Krcmar H, Kuster B, Wilhelm M. ProteomicsDB: toward a FAIR open-source resource for life-science research. Nucleic Acids Res 2022; 50:D1541-D1552. [PMID: 34791421 PMCID: PMC8728203 DOI: 10.1093/nar/gkab1026] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Revised: 10/12/2021] [Accepted: 10/15/2021] [Indexed: 12/28/2022] Open
Abstract
ProteomicsDB (https://www.ProteomicsDB.org) is a multi-omics and multi-organism resource for life science research. In this update, we present our efforts to continuously develop and expand ProteomicsDB. The major focus over the last two years was improving the findability, accessibility, interoperability and reusability (FAIR) of the data as well as its implementation. For this purpose, we release a new application programming interface (API) that provides systematic access to essentially all data in ProteomicsDB. Second, we release a new open-source user interface (UI) and show the advantages the scientific community gains from such software. With the new interface, two new visualizations of protein primary, secondary and tertiary structure as well an updated spectrum viewer were added. Furthermore, we integrated ProteomicsDB with our deep-neural-network Prosit that can predict the fragmentation characteristics and retention time of peptides. The result is an automatic processing pipeline that can be used to reevaluate database search engine results stored in ProteomicsDB. In addition, we extended the data content with experiments investigating different human biology as well as a newly supported organism.
Collapse
Affiliation(s)
- Ludwig Lautenbacher
- Technical University of Munich, Computational Mass Spectrometry, 85354 Freising, Bavaria, Germany
| | - Patroklos Samaras
- Technical University of Munich, Chair of Proteomics and Bioanalytics, 85354 Freising, Bavaria, Germany
| | - Julian Muller
- Technical University of Munich, Chair of Proteomics and Bioanalytics, 85354 Freising, Bavaria, Germany
| | - Andreas Grafberger
- Technical University of Munich, Chair of Proteomics and Bioanalytics, 85354 Freising, Bavaria, Germany
| | - Marwin Shraideh
- Technical University of Munich, Chair for Information Systems, 85748 Garching, Bavaria, Germany
- Technical University of Munich, SAP University Competence Center, 85748 Garching, Bavaria, Germany
| | - Johannes Rank
- Technical University of Munich, Chair for Information Systems, 85748 Garching, Bavaria, Germany
- Technical University of Munich, SAP University Competence Center, 85748 Garching, Bavaria, Germany
| | - Simon T Fuchs
- Technical University of Munich, Chair for Information Systems, 85748 Garching, Bavaria, Germany
- Technical University of Munich, SAP University Competence Center, 85748 Garching, Bavaria, Germany
| | - Tobias K Schmidt
- Technical University of Munich, Chair of Proteomics and Bioanalytics, 85354 Freising, Bavaria, Germany
| | - Matthew The
- Technical University of Munich, Chair of Proteomics and Bioanalytics, 85354 Freising, Bavaria, Germany
| | - Christian Dallago
- Technical University of Munich, Department for Bioinformatics and Computational Biology, 85748 Garching, Bavaria, Germany
- Technical University of Munich, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), 85748 Garching, Bavaria, Germany
| | - Holger Wittges
- Technical University of Munich, Chair for Information Systems, 85748 Garching, Bavaria, Germany
- Technical University of Munich, SAP University Competence Center, 85748 Garching, Bavaria, Germany
| | - Burkhard Rost
- Technical University of Munich, Department for Bioinformatics and Computational Biology, 85748 Garching, Bavaria, Germany
- Technical University of Munich, Institute for Advanced Study (TUM-IAS), 85748 Freising, Bavaria, Germany
| | - Helmut Krcmar
- Technical University of Munich, Chair for Information Systems, 85748 Garching, Bavaria, Germany
- Technical University of Munich, SAP University Competence Center, 85748 Garching, Bavaria, Germany
| | - Bernhard Kuster
- Technical University of Munich, Chair of Proteomics and Bioanalytics, 85354 Freising, Bavaria, Germany
- Technical University of Munich, Bavarian Biomolecular Mass Spectrometry Center (BayBioMS), 85354 Freising, Bavaria, Germany
| | - Mathias Wilhelm
- Technical University of Munich, Computational Mass Spectrometry, 85354 Freising, Bavaria, Germany
| |
Collapse
|
5
|
Zheng W, Yang P, Sun C, Zhang Y. Comprehensive comparison of sample preparation workflows for proteomics. Mol Omics 2022; 18:555-567. [DOI: 10.1039/d2mo00076h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Mass spectrometry-based proteomics experiments can be subject to a large variability, which forms an obstacle to obtaining deep and accurate protein identification. Here, to obtain an optimal sample preparation workflow...
Collapse
|
6
|
Bu F, Cheng Q, Zhang Y, Zhang X, Yan K, Liu F, Li Z, Lu X, Ren Y, Liu S. Discovery of Missing Proteins from an Aneuploidy Cell Line Using a Proteogenomic Approach. J Proteome Res 2021; 20:5329-5339. [PMID: 34748338 DOI: 10.1021/acs.jproteome.1c00772] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
With the steadfast development of proteomic technology, the number of missing proteins (MPs) has been continuously shrinking, with approximately 1470 MPs that have not been explored yet. Due to this phenomenon, the discovery of MPs has been increasingly more difficult and elusive. In order to face this challenge, we have hypothesized that a stable aneuploid cell line with increased chromosomes serves as a useful material for assisting MP exploration. Ker-CT cell line with trisomy at chromosome 5 and 20 was selected for this purpose. With a combination strategy of RNA-Seq and LC-MS/MS, a total of 22 178 transcripts and 8846 proteins were identified in Ker-CT. Although the transcripts corresponding to 15 and 15 MP genes located at chromosome 5 and 20 were detected, none of the MPs were found in Ker-CT. Surprisingly, 3 MPs containing at least two unique non-nest peptides of length ≥9 amino acids were identified in Ker-CT, whose genes are located on chromosome 3 and 10, respectively. Furthermore, the 3 MPs were verified using the method of parallel reaction monitoring (PRM). These results suggest that the abnormal status of chromosomes may not only impact the expression of the corresponding genes in trisomy chromosomes, but also influence that of other chromosomes, which benefits MP discovery. The data obtained in this study are available via ProteomeXchange (PXD028647) and PeptideAtlas (PASS01700), respectively.
Collapse
Affiliation(s)
- Fanyu Bu
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Qingqiu Cheng
- Clinical Laboratory Center of Dongguan Eighth People's Hospital, Dongguan 523325, China
| | - Yuxing Zhang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Xia Zhang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Keqiang Yan
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Frank Liu
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Zelong Li
- Biological Resource Center of Plants, Animals and Microorganisms, China National Gene Bank, BGI-Shenzhen, Guangdong 518120, China
| | - Xiaomei Lu
- Clinical Laboratory Center of Dongguan Eighth People's Hospital, Dongguan 523325, China
| | - Yan Ren
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Siqi Liu
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| |
Collapse
|
7
|
Digre A, Lindskog C. The Human Protein Atlas-Spatial localization of the human proteome in health and disease. Protein Sci 2021; 30:218-233. [PMID: 33146890 PMCID: PMC7737765 DOI: 10.1002/pro.3987] [Citation(s) in RCA: 123] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 10/29/2020] [Accepted: 10/30/2020] [Indexed: 12/11/2022]
Abstract
For a complete understanding of a system's processes and each protein's role in health and disease, it is essential to study protein expression with a spatial resolution, as the exact location of proteins at tissue, cellular, or subcellular levels is tightly linked to protein function. The Human Protein Atlas (HPA) project is a large-scale initiative aiming at mapping the entire human proteome using antibody-based proteomics and integration of various other omics technologies. The publicly available knowledge resource www.proteinatlas.org is one of the world's most visited biological databases and has been extensively updated during the last few years. The current version is divided into six main sections, each focusing on particular aspects of the human proteome: (a) the Tissue Atlas showing the distribution of proteins across all major tissues and organs in the human body; (b) the Cell Atlas showing the subcellular localization of proteins in single cells; (c) the Pathology Atlas showing the impact of protein levels on survival of patients with cancer; (d) the Blood Atlas showing the expression profiles of blood cells and actively secreted proteins; (e) the Brain Atlas showing the distribution of proteins in human, mouse, and pig brain; and (f) the Metabolic Atlas showing the involvement of proteins in human metabolism. The HPA constitutes an important resource for further understanding of human biology, and the publicly available datasets hold much promise for integration with other emerging efforts focusing on single cell analyses, both at transcriptomic and proteomic level.
Collapse
Affiliation(s)
- Andreas Digre
- Department of Immunology, Genetics and PathologyRudbeck Laboratory, Uppsala UniversityUppsalaSweden
| | - Cecilia Lindskog
- Department of Immunology, Genetics and PathologyRudbeck Laboratory, Uppsala UniversityUppsalaSweden
| |
Collapse
|
8
|
Zhang Y, Zhang K, Bu F, Hao P, Yang H, Liu S, Ren Y. D283 Med, a Cell Line Derived from Peritoneal Metastatic Medulloblastoma: A Good Choice for Missing Protein Discovery. J Proteome Res 2020; 19:4857-4866. [DOI: 10.1021/acs.jproteome.0c00743] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Yuanliang Zhang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- BGI-Genomics, BGI-Shenzhen, Shenzhen, Guangdong 518083, China
| | - Keren Zhang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- BGI-Genomics, BGI-Shenzhen, Shenzhen, Guangdong 518083, China
| | - Fanyu Bu
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Piliang Hao
- School of Life Science and Technology, ShanghaiTech University, 393 Middle Huaxia Road, Shanghai 201210, China
| | - Huanming Yang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- James D. Watson Institute of Genome Sciences, Hangzhou 310058, China
| | - Siqi Liu
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- BGI-Genomics, BGI-Shenzhen, Shenzhen, Guangdong 518083, China
| | - Yan Ren
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- BGI-Genomics, BGI-Shenzhen, Shenzhen, Guangdong 518083, China
| |
Collapse
|
9
|
Wu S, Sun J, Wang X, Xu F, Chi H, Li Y, Zhong B, Xie Y, Yan Z, Chang L, Wang D, He F, Wu J, Zhang Y, Xu P. Open-pFind Verified Four Missing Proteins from Multi-Tissues. J Proteome Res 2020; 19:4808-4814. [PMID: 33172275 DOI: 10.1021/acs.jproteome.0c00370] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The Chromosome-Centric Human Proteome Project (C-HPP) was launched in 2012 to perfect the annotation of human protein existence by identifying stronger evidence of the expression of missing proteins (MPs) at the protein level. After an 8 year effort all over the world, the number of MPs in the neXtProt database significantly decreased from 5511 (2012-02-24) to 1899 (2020-01-17). It is now more difficult to provide confident evidence of the remaining MPs because of their specific characteristics, including low abundance, low molecular weight, unexpected modifications, transmembrane structure, tissue-expression specificity, and so on. A higher resolution mass spectrometry (MS) interpretation engine might provide an opportunity to identify these buried MPs in complex samples by the combination with multi-tissue large-scale proteomics. In this study, open-pFind was used to dig MPs from 20 pairs of healthy human tissues by Wang et al. ( Mol. Syst. Biol. 2019, 15 (2), e8503) combined with our large-scale testis data set digested by three enzymes (Glu-C, Lys-C, and trypsin) with specificity for different amino acid residues ( J. Proteme Res. 2019, 18 (12), 4189-4196). A total of 1 535 536 peptides with 17 283 477 peptide-spectrum matches (PSMs) were mapped to 14 279 protein entries at a false discovery rate of <1% at the PSM, peptide, and protein levels. A total of 103 MP candidates were identified, among which 86 candidates had more unique peptide numbers compared with our single testis tissue. After rigorous screening, manual checks, peptide synthesis, and matching with documented peptides from PeptideAtlas, we validated four MPs, P0C7T8 (duodenum and small intestine), Q8WWZ4 (stomach and rectum), Q8IV35 (fallopian tube), and O14921 (tonsil), at the protein level. All MS raw files have been deposited to the ProteomeXchange with identifier PXD021391.
Collapse
Affiliation(s)
- Shujia Wu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Jinshuai Sun
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Xi Wang
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Beijing 100080, China
| | - Feng Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Hao Chi
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Beijing 100080, China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Bowen Zhong
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Yuping Xie
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Zhonghua Yan
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Dongxue Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Junzhu Wu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China
| | - Yao Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Ping Xu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| |
Collapse
|
10
|
Sivertsson Å, Lindström E, Oksvold P, Katona B, Hikmet F, Vuu J, Gustavsson J, Sjöstedt E, von Feilitzen K, Kampf C, Schwenk JM, Uhlén M, Lindskog C. Enhanced Validation of Antibodies Enables the Discovery of Missing Proteins. J Proteome Res 2020; 19:4766-4781. [PMID: 33170010 PMCID: PMC7723238 DOI: 10.1021/acs.jproteome.0c00486] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
![]()
The localization of proteins at a
tissue- or cell-type-specific
level is tightly linked to the protein function. To better understand
each
protein’s role in cellular systems, spatial information constitutes
an important complement to quantitative data. The standard methods
for determining the spatial distribution of proteins in single cells
of complex tissue samples make use of antibodies. For a stringent
analysis of the human proteome, we used orthogonal methods and independent
antibodies to validate 5981 antibodies that show the expression of
3775 human proteins across all major human tissues. This enhanced
validation uncovered 56 proteins corresponding to the group of “missing
proteins” and 171 proteins of unknown function. The presented
strategy will facilitate further discussions around criteria for evidence
of protein existence based on immunohistochemistry and serves as a
useful guide to identify candidate proteins for integrative studies
with quantitative proteomics methods.
Collapse
Affiliation(s)
- Åsa Sivertsson
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, 17121 Stockholm, Sweden
| | - Emil Lindström
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, 75185 Uppsala, Sweden
| | - Per Oksvold
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, 17121 Stockholm, Sweden
| | - Borbala Katona
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, 75185 Uppsala, Sweden
| | - Feria Hikmet
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, 75185 Uppsala, Sweden
| | - Jimmy Vuu
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, 75185 Uppsala, Sweden
| | - Jonas Gustavsson
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, 75185 Uppsala, Sweden
| | - Evelina Sjöstedt
- Department of Neuroscience, Karolinska Institutet, 17177 Stockholm, Sweden
| | - Kalle von Feilitzen
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, 17121 Stockholm, Sweden
| | - Caroline Kampf
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, 75185 Uppsala, Sweden.,Atlas Antibodies AB, 16869 Bromma, Sweden
| | - Jochen M Schwenk
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, 17121 Stockholm, Sweden
| | - Mathias Uhlén
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, 17121 Stockholm, Sweden.,Department of Neuroscience, Karolinska Institutet, 17177 Stockholm, Sweden
| | - Cecilia Lindskog
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, 75185 Uppsala, Sweden
| |
Collapse
|
11
|
Vandenbrouck Y, Pineau C, Lane L. The Functionally Unannotated Proteome of Human Male Tissues: A Shared Resource to Uncover New Protein Functions Associated with Reproductive Biology. J Proteome Res 2020; 19:4782-4794. [PMID: 33064489 DOI: 10.1021/acs.jproteome.0c00516] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
In the context of the Human Proteome Project, we built an inventory of 412 functionally unannotated human proteins for which experimental evidence at the protein level exists (uPE1) and which are highly expressed in tissues involved in human male reproduction. We implemented a strategy combining literature mining, bioinformatics tools to collate annotation and experimental information from specific molecular public resources, and efficient visualization tools to put these unknown proteins into their biological context (protein complexes, tissue and subcellular location, expression pattern). The gathered knowledge allowed pinpointing five uPE1 for which a function has recently been proposed and which should be updated in protein knowledge bases. Furthermore, this bioinformatics strategy allowed to build new functional hypotheses for five other uPE1s in link with phenotypic traits that are specific to male reproductive function such as ciliogenesis/flagellum formation in germ cells (CCDC112 and TEX9), chromatin remodeling (C3orf62) and spermatozoon maturation (CCDC183). We also discussed the enigmatic case of MAGEB proteins, a poorly documented cancer/testis antigen subtype. Tools used and computational outputs produced during this study are freely accessible via ProteoRE (http://www.proteore.org), a Galaxy-based instance, for reuse purposes. We propose these five uPE1s should be investigated in priority by expert laboratories and hope that this inventory and shared resources will stimulate the interest of the community of reproductive biology.
Collapse
Affiliation(s)
- Yves Vandenbrouck
- Univ. Grenoble Alpes, INSERM, CEA, IRIG-BGE, U1038, F-38000 Grenoble, France
| | - Charles Pineau
- Univ. Rennes, Inserm, EHESP, Irset (Institut de Recherche en Santé, Environnement et Travail) - UMR_S 1085, F-35042 Rennes cedex, France
| | - Lydie Lane
- SIB Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, CMU, Michel Servet 1, 1211 Geneva 4, Switzerland
| |
Collapse
|
12
|
Lin Z, Zhang Y, Pan H, Hao P, Li S, He Y, Yang H, Liu S, Ren Y. Alternative Strategy To Explore Missing Proteins with Low Molecular Weight. J Proteome Res 2019; 18:4180-4188. [PMID: 31592669 DOI: 10.1021/acs.jproteome.9b00353] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Identifying more missing proteins (MPs) is an important mission of C-HPP. With the number of identified MPs being attenuated year by year (2,949 to 2,129 MPs from 2016 to 2019), we have realized that the difficulty of exploring the remaining MPs is a challenge in technique. Herein, we propose a comprehensive strategy to effectively enrich, separate, and identify proteins with low molecular weights, aiming at the discovery of MPs. Basically, a protein extract from human placenta was passed through a C18 SPE column, and the bound proteins that were eluted were further separated with an SDS-PAGE gel or a 50 kDa cutoff filter. The separated proteins were subjected to trypsin digestion, and the MS/MS signals were searched against data sets with two different digestion modes (full-trypsin and semitrypsin). The strategy was adopted, resulting in the identification of 4 MPs with 8 unique peptides (≥2 non-nested unique peptides with ≥9 amino acids). Importantly, the identification of 6 out of 8 of the unique peptides derived from the MPs was further supported by parallel reaction monitoring, which confirmed the identification of 3 MPs from human placenta tissues (Q6NT89: TMF-regulated nuclear protein 1; A0A183: late cornified envelope protein 6A; and Q6UWQ7: insulin growth factor-like family member 2, mapped to chromosomes 1, 1, and 19, respectively). The three proteins ranged in length from 80 aa to 227 aa. The study not only establishes a feasible strategy for analyzing proteins with low molecular weights but also fills a small part of a large gap in the list of MPs. The data obtained in this study are available via ProteomeXchange (PXD014083) and PeptideAtlas (PASS01389).
Collapse
Affiliation(s)
- Zhilong Lin
- BGI-Shenzhen , Beishan Industrial Zone 11th building , Yantian District, Shenzhen , Guangdong 518083 , China.,Clinical laboratory of BGI Health , BGI-Shenzhen , Shenzhen 518083 , China
| | - Yuanliang Zhang
- BGI-Shenzhen , Beishan Industrial Zone 11th building , Yantian District, Shenzhen , Guangdong 518083 , China.,Clinical laboratory of BGI Health , BGI-Shenzhen , Shenzhen 518083 , China
| | - Huozhen Pan
- BGI-Shenzhen , Beishan Industrial Zone 11th building , Yantian District, Shenzhen , Guangdong 518083 , China.,Clinical laboratory of BGI Health , BGI-Shenzhen , Shenzhen 518083 , China
| | - Piliang Hao
- School of Life Science and Technology , ShanghaiTech University , 393 Middle Huaxia Road , Shanghai 201210 , China
| | - Siqi Li
- BGI-Shenzhen , Beishan Industrial Zone 11th building , Yantian District, Shenzhen , Guangdong 518083 , China.,Clinical laboratory of BGI Health , BGI-Shenzhen , Shenzhen 518083 , China
| | - Yanbin He
- BGI-Shenzhen , Beishan Industrial Zone 11th building , Yantian District, Shenzhen , Guangdong 518083 , China.,Clinical laboratory of BGI Health , BGI-Shenzhen , Shenzhen 518083 , China
| | - Huanming Yang
- BGI-Shenzhen , Beishan Industrial Zone 11th building , Yantian District, Shenzhen , Guangdong 518083 , China.,James D. Watson Institute of Genome Sciences , Hangzhou 310058 , China
| | - Siqi Liu
- BGI-Shenzhen , Beishan Industrial Zone 11th building , Yantian District, Shenzhen , Guangdong 518083 , China.,Clinical laboratory of BGI Health , BGI-Shenzhen , Shenzhen 518083 , China
| | - Yan Ren
- BGI-Shenzhen , Beishan Industrial Zone 11th building , Yantian District, Shenzhen , Guangdong 518083 , China.,Clinical laboratory of BGI Health , BGI-Shenzhen , Shenzhen 518083 , China
| |
Collapse
|
13
|
Sun J, Shi J, Wang Y, Wu S, Zhao L, Li Y, Wang H, Chang L, Lyu Z, Wu J, Liu F, Li W, He F, Zhang Y, Xu P. Open-pFind Enhances the Identification of Missing Proteins from Human Testis Tissue. J Proteome Res 2019; 18:4189-4196. [DOI: 10.1021/acs.jproteome.9b00376] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Jinshuai Sun
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Jiahui Shi
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Yihao Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Shujia Wu
- Key Laboratory of Combinational Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, School of Pharmaceutical Science, Wuhan University, Wuhan 430072, China
| | - Liping Zhao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
- Guizhou University School of Medicine, Guiyang 550025, China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Hong Wang
- School of Public Health, North China University Science and Technology, Tangshan 063210, China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Zhitang Lyu
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Junzhu Wu
- Key Laboratory of Combinational Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, School of Pharmaceutical Science, Wuhan University, Wuhan 430072, China
| | - Fengsong Liu
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Wenjun Li
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yao Zhang
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Ping Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
- Key Laboratory of Combinational Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, School of Pharmaceutical Science, Wuhan University, Wuhan 430072, China
- Guizhou University School of Medicine, Guiyang 550025, China
| |
Collapse
|
14
|
Pineau C, Hikmet F, Zhang C, Oksvold P, Chen S, Fagerberg L, Uhlén M, Lindskog C. Cell Type-Specific Expression of Testis Elevated Genes Based on Transcriptomics and Antibody-Based Proteomics. J Proteome Res 2019; 18:4215-4230. [PMID: 31429579 DOI: 10.1021/acs.jproteome.9b00351] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
One of the most complex organs in the human body is the testis, where spermatogenesis takes place. This physiological process involves thousands of genes and proteins that are activated and repressed, making testis the organ with the highest number of tissue-specific genes. However, the function of a large proportion of the corresponding proteins remains unknown and testis harbors many missing proteins (MPs), defined as products of protein-coding genes that lack experimental mass spectrometry evidence. Here, an integrated omics approach was used for exploring the cell type-specific protein expression of genes with an elevated expression in testis. By combining genome-wide transcriptomics analysis with immunohistochemistry, more than 500 proteins with distinct testicular protein expression patterns were identified, and these were selected for in-depth characterization of their in situ expression in eight different testicular cell types. The cell type-specific protein expression patterns allowed us to identify six distinct clusters of expression at different stages of spermatogenesis. The analysis highlighted numerous poorly characterized proteins in each of these clusters whose expression overlapped with that of known proteins involved in spermatogenesis, including 85 proteins with an unknown function and 60 proteins that previously have been classified as MPs. Furthermore, we were able to characterize the in situ distribution of several proteins that previously lacked spatial information and cell type-specific expression within the testis. The testis elevated expression levels both at the RNA and protein levels suggest that these proteins are related to testis-specific functions. In summary, the study demonstrates the power of combining genome-wide transcriptomics analysis with antibody-based protein profiling to explore the cell type-specific expression of both well-known proteins and MPs. The analyzed proteins constitute important targets for further testis-specific research in male reproductive disorders.
Collapse
Affiliation(s)
- Charles Pineau
- Univ Rennes , Inserm, EHESP, Irset (Institut de Recherche en Santé, Environnement et Travail), UMR_S 1085 , 35042 Rennes Cedex, France.,Protim , Univ Rennes , 35042 Rennes Cedex, France
| | - Feria Hikmet
- Uppsala University , Department of Immunology, Genetics and Pathology, Rudbeck Laboratory , 75185 Uppsala , Sweden
| | - Cheng Zhang
- Science for Life Laboratory , School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH-Royal Institute of Technology , 17121 Stockholm , Sweden
| | - Per Oksvold
- Science for Life Laboratory , School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH-Royal Institute of Technology , 17121 Stockholm , Sweden
| | - Shuqi Chen
- Science for Life Laboratory , School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH-Royal Institute of Technology , 17121 Stockholm , Sweden
| | - Linn Fagerberg
- Science for Life Laboratory , School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH-Royal Institute of Technology , 17121 Stockholm , Sweden
| | - Mathias Uhlén
- Science for Life Laboratory , School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH-Royal Institute of Technology , 17121 Stockholm , Sweden
| | - Cecilia Lindskog
- Uppsala University , Department of Immunology, Genetics and Pathology, Rudbeck Laboratory , 75185 Uppsala , Sweden
| |
Collapse
|
15
|
Deng Y, Ren Z, Pan Q, Qi D, Wen B, Ren Y, Yang H, Wu L, Chen F, Liu S. pClean: An Algorithm To Preprocess High-Resolution Tandem Mass Spectra for Database Searching. J Proteome Res 2019; 18:3235-3244. [PMID: 31364357 DOI: 10.1021/acs.jproteome.9b00141] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Database searches of MS/MS spectra are the main approach to peptide/protein identification in proteomics. Since most database search engines only utilize a small portion of the original MS/MS signals for peptide detection, how to improve the quality of MS/MS signals is a primary concern for enhancement of the peptide/protein identification rate. A fundamental issue is that some noise MS signals, informative or uninformative, have to be filtered out prior to database searching. Herein, an integrative preprocessing algorithm was designed, termed pClean, which incorporates three modules to preprocess MS/MS spectra, such as the removal of isobaric-labeling related ions, the reduction in isotopic peaks, the deconvolution of ions with higher charges, and the clearance of uninformative MS/MS signals. In contrast to the currently available approaches to MS/MS data preprocessing, pClean enables treatment of MS/MS spectra with high mass accuracy and favors filtering for the labeling or nonlabeling of peptides. Data sets at various scales gained from mass spectrometers with high resolution were used to assess the quality of peptides identified after pClean treatment and to compare the pClean improvement with those of other software programs. On the basis of the analysis of peptides identified and the Mascot ion score, pClean was proven to be effective in the removal of mass spectral noise and the reduction of random matching. Compared with other software programs, pClean appeared to be beneficial in terms of preprocessing performances for the enhancement of confidence scores and the increase in peptides identified. pClean is available at https://github.com/AimeeD90/pClean_release .
Collapse
Affiliation(s)
- Yamei Deng
- CAS Key Laboratory of Genome Sciences and Information , Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101 , China.,University of the Chinese Academy of Sciences , Beijing 100049 , China.,BGI-Shenzhen , Shenzhen 518083 , China
| | - Zhe Ren
- BGI-Shenzhen , Shenzhen 518083 , China.,China National GeneBank, BGI-Shenzhen , Shenzhen 518120 , China
| | - Qingfei Pan
- CAS Key Laboratory of Genome Sciences and Information , Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101 , China.,University of the Chinese Academy of Sciences , Beijing 100049 , China.,BGI-Shenzhen , Shenzhen 518083 , China
| | - Da Qi
- BGI-Shenzhen , Shenzhen 518083 , China.,China National GeneBank, BGI-Shenzhen , Shenzhen 518120 , China
| | | | - Yan Ren
- BGI-Shenzhen , Shenzhen 518083 , China.,China National GeneBank, BGI-Shenzhen , Shenzhen 518120 , China
| | - Huanming Yang
- BGI-Shenzhen , Shenzhen 518083 , China.,China National GeneBank, BGI-Shenzhen , Shenzhen 518120 , China.,James D. Watson Institute of Genome Sciences , Hangzhou 310058 , China
| | - Lin Wu
- CAS Key Laboratory of Genome Sciences and Information , Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101 , China
| | - Fei Chen
- CAS Key Laboratory of Genome Sciences and Information , Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101 , China
| | - Siqi Liu
- CAS Key Laboratory of Genome Sciences and Information , Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101 , China.,BGI-Shenzhen , Shenzhen 518083 , China.,China National GeneBank, BGI-Shenzhen , Shenzhen 518120 , China
| |
Collapse
|
16
|
Alfano M, Pederzoli F, Locatelli I, Ippolito S, Longhi E, Zerbi P, Ferrari M, Brendolan A, Montorsi F, Drago D, Andolfo A, Nebuloni M, Salonia A. Impaired testicular signaling of vitamin A and vitamin K contributes to the aberrant composition of the extracellular matrix in idiopathic germ cell aplasia. Fertil Steril 2019; 111:687-698. [DOI: 10.1016/j.fertnstert.2018.12.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Revised: 12/04/2018] [Accepted: 12/04/2018] [Indexed: 12/12/2022]
|
17
|
Panner Selvam MK, Baskaran S, Agarwal A. Proteomics of reproduction: Prospects and perspectives. Adv Clin Chem 2019; 92:217-243. [PMID: 31472755 DOI: 10.1016/bs.acc.2019.04.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
In recent years, proteomics has been used widely in reproductive research in order to understand the molecular mechanisms related to gametes at the cellular level and the role of proteins involved in fertilization. Network and pathway analysis using bioinformatic tools have paved way to obtain a wider picture on the possible pathways associated with the key differentially expressed proteins (DEPs) and its implication in various infertility scenarios. A brief overview of advanced techniques and bioinformatic tools used for reproductive proteomics is presented. Key findings of proteomic-based studies on male and female reproduction are also presented. Furthermore, the chapter sheds light on the cellular pathways and potential biomarkers associated with male and female infertility. Proteomics coupled with bioinformatic analysis provides an ideal platform for non-invasive management of infertility in couples.
Collapse
|
18
|
Omenn GS, Lane L, Overall CM, Corrales FJ, Schwenk JM, Paik YK, Van Eyk JE, Liu S, Snyder M, Baker MS, Deutsch EW. Progress on Identifying and Characterizing the Human Proteome: 2018 Metrics from the HUPO Human Proteome Project. J Proteome Res 2018; 17:4031-4041. [PMID: 30099871 PMCID: PMC6387656 DOI: 10.1021/acs.jproteome.8b00441] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The Human Proteome Project (HPP) annually reports on progress throughout the field in credibly identifying and characterizing the human protein parts list and making proteomics an integral part of multiomics studies in medicine and the life sciences. NeXtProt release 2018-01-17, the baseline for this sixth annual HPP special issue of the Journal of Proteome Research, contains 17 470 PE1 proteins, 89% of all neXtProt predicted PE1-4 proteins, up from 17 008 in release 2017-01-23 and 13 975 in release 2012-02-24. Conversely, the number of neXtProt PE2,3,4 missing proteins has been reduced from 2949 to 2579 to 2186 over the past two years. Of the PE1 proteins, 16 092 are based on mass spectrometry results, and 1378 on other kinds of protein studies, notably protein-protein interaction findings. PeptideAtlas has 15 798 canonical proteins, up 625 over the past year, including 269 from SUMOylation studies. The largest reason for missing proteins is low abundance. Meanwhile, the Human Protein Atlas has released its Cell Atlas, Pathology Atlas, and updated Tissue Atlas, and is applying recommendations from the International Working Group on Antibody Validation. Finally, there is progress using the quantitative multiplex organ-specific popular proteins targeted proteomics approach in various disease categories.
Collapse
Affiliation(s)
- Gilbert S. Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, Washington 98109-5263, United States
| | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, CMU, Michel-Servet 1, 1211 Geneva 4, Switzerland
| | - Christopher M. Overall
- Life Sciences Institute, Faculty of Dentistry, University of British Columbia, 2350 Health Sciences Mall, Room 4.401, Vancouver, BC Canada V6T 1Z3
| | | | - Jochen M. Schwenk
- Science for Life Laboratory, KTH Royal Institute of Technology, Tomtebodavägen 23A, 17165 Solna, Sweden
| | - Young-Ki Paik
- Yonsei Proteome Research Center, Room 425, Building #114, Yonsei University,50 Yonsei-ro, Seodaemoon-ku, Seoul 120-749, Korea
| | - Jennifer E. Van Eyk
- Advanced Clinical BioSystems Research Institute, Cedars Sinai Precision Biomarker Laboratories, Barbra Streisand Women’s Heart Center, Cedars-Sinai Medical Center, Los Angeles, CA 90048, United States
| | - Siqi Liu
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, TX 75390-9148, United States
| | - Michael Snyder
- Department of Genetics, Stanford University, Alway Building, 300 Pasteur Drive, 3165 Porter Drive, Palo Alto, 94304, United States
| | - Mark S. Baker
- Department of Biomedical Sciences, Macquarie University, NSW 2109, Australia
| | - Eric W. Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, Washington 98109-5263, United States
| |
Collapse
|
19
|
Paik YK. Toward Completion of the Human Proteome Parts List: Progress Uncovering Proteins That Are Missing or Have Unknown Function and Developing Analytical Methods. J Proteome Res 2018; 17:4023-4030. [PMID: 30985145 PMCID: PMC6288998 DOI: 10.1021/acs.jproteome.8b00885] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Young-Ki Paik
- Yonsei
Proteome Research Center, College of Life Science and
Technology, Yonsei University
| |
Collapse
|
20
|
Ronci M, Pieroni L, Greco V, Scotti L, Marini F, Carregari VC, Cunsolo V, Foti S, Aceto A, Urbani A. Sequential Fractionation Strategy Identifies Three Missing Proteins in the Mitochondrial Proteome of Commonly Used Cell Lines. J Proteome Res 2018; 17:4307-4314. [PMID: 30284448 DOI: 10.1021/acs.jproteome.8b00422] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Mitochondria are undeniably the cell powerhouse, directly affecting cell survival and fate. Growing evidence suggest that mitochondrial protein repertoire affects metabolic activity and plays an important role in determining cell proliferation/differentiation or quiescence shift. Consequently, the bioenergetic status of a cell is associated with the quality and abundance of the mitochondrial populations and proteomes. Mitochondrial morphology changes in the development of different cellular functions associated with metabolic switches. It is therefore reasonable to speculate that different cell lines do contain different mitochondrial-associated proteins, and the investigation of these pools may well represent a source for mining missing proteins (MPs). A very effective approach to increase the number of IDs through mass spectrometry consists of reducing the complexity of the biological samples by fractionation. The present study aims at investigating the mitochondrial proteome of five phenotypically different cell lines, possibly expressing some of the MPs, through an enrichment-fractionation approach at the organelle and protein level. We demonstrate a substantial increase in the proteome coverage, which, in turn, increases the likelihood of detecting low abundant proteins, often falling in the category of MPs, and resulting, for the present study, in the identification of METTL12, FAM163A, and RGS13. All MS data have been deposited to the MassIVE data repository ( https://massive.ucsd.edu ) with the data set identifier MSV000082409 and PXD010446.
Collapse
Affiliation(s)
- Maurizio Ronci
- Department of Medical, Oral and Biotechnological Sciences , University G. D'Annunzio Chieti , Chieti-Pescara, Via dei Vestini 31 , 66100 Chieti , Italy
- Proteomics and Metabonomics Unit , IRCCS Fondazione Santa Lucia , Via del Fosso di Fiorano 64 , 00143 Rome , Italy
| | - Luisa Pieroni
- Proteomics and Metabonomics Unit , IRCCS Fondazione Santa Lucia , Via del Fosso di Fiorano 64 , 00143 Rome , Italy
| | - Viviana Greco
- Institute of Biochemistry and Clinical Biochemistry , Università Cattolica del Sacro Cuore , L.go F. Vito 1 , 00168 Rome , Italy
- Department of Laboratory Diagnostic and Infectious Diseases , Fondazione Policlinico Universitario Agostino Gemelli-IRCCS , L.go A. Gemelli 8 , 00168 Rome , Italy
| | - Luca Scotti
- Department of Medical, Oral and Biotechnological Sciences , University G. D'Annunzio Chieti , Chieti-Pescara, Via dei Vestini 31 , 66100 Chieti , Italy
| | - Federica Marini
- Institute of Biochemistry and Clinical Biochemistry , Università Cattolica del Sacro Cuore , L.go F. Vito 1 , 00168 Rome , Italy
- Department of Laboratory Diagnostic and Infectious Diseases , Fondazione Policlinico Universitario Agostino Gemelli-IRCCS , L.go A. Gemelli 8 , 00168 Rome , Italy
| | - Victor C Carregari
- Institute of Biochemistry and Clinical Biochemistry , Università Cattolica del Sacro Cuore , L.go F. Vito 1 , 00168 Rome , Italy
- Department of Laboratory Diagnostic and Infectious Diseases , Fondazione Policlinico Universitario Agostino Gemelli-IRCCS , L.go A. Gemelli 8 , 00168 Rome , Italy
| | - Vincenzo Cunsolo
- Department of Chemical Sciences , University of Catania , V.le A. Doria 6 , 95125 Catania , Italy
| | - Salvatore Foti
- Department of Chemical Sciences , University of Catania , V.le A. Doria 6 , 95125 Catania , Italy
| | - Antonio Aceto
- Department of Medical, Oral and Biotechnological Sciences , University G. D'Annunzio Chieti , Chieti-Pescara, Via dei Vestini 31 , 66100 Chieti , Italy
| | - Andrea Urbani
- Institute of Biochemistry and Clinical Biochemistry , Università Cattolica del Sacro Cuore , L.go F. Vito 1 , 00168 Rome , Italy
- Department of Laboratory Diagnostic and Infectious Diseases , Fondazione Policlinico Universitario Agostino Gemelli-IRCCS , L.go A. Gemelli 8 , 00168 Rome , Italy
| |
Collapse
|
21
|
Sun J, Shi J, Wang Y, Chen Y, Li Y, Kong D, Chang L, Liu F, Lv Z, Zhou Y, He F, Zhang Y, Xu P. Multiproteases Combined with High-pH Reverse-Phase Separation Strategy Verified Fourteen Missing Proteins in Human Testis Tissue. J Proteome Res 2018; 17:4171-4177. [PMID: 30280576 DOI: 10.1021/acs.jproteome.8b00397] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Subsequent to conducting the Chromosome-Centric Human Proteome Project, we have focused on human testis-enriched missing proteins (MPs) since 2015. For protein coverage to be enhanced, a multiprotease strategy was used for separation of samples by 10% SDS-PAGE. For the separating efficiency to be improved, a high-pH reverse phase (RP) separation strategy was applied to fractionate complex samples in this study. A total of 11,558 proteins was identified, which is the largest proteome data set for single human tissue sample so far. On the basis of this large-scale data set, we verified 14 MPs (PE2) in neXtProt (2018-01) after spectrum quality analysis, isobaric post-translational modification, and single amino acid variant filtering, and synthesized peptide matching. Tissue expression analysis showed that 3 of 14 MPs were testis-specific proteins. Functional analysis showed that 10 of 14 MPs were closely related to liver tumor, liver carcinoma, and hepatocellular carcinoma. Another 100 MPs were listed as candidates but required additional verification information. All MS data sets have been deposited into the ProteomeXchange with the identifier PXD009737.
Collapse
Affiliation(s)
- Jinshuai Sun
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Jiahui Shi
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Yihao Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Yang Chen
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Degang Kong
- Department of Hepatopancreatobiliary Surgery , The Second Affiliated Hospital of Tianjin Medical University , Tianjin 300211 , China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Fengsong Liu
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China
| | - Zhitang Lv
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China
| | - Yue Zhou
- Demo Laboratory of Thermofisher Scientific China , Shanghai 200120 , China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Yao Zhang
- State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences , Sun Yat-Sen University , Guangzhou 510275 , China
| | - Ping Xu
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China.,Key Laboratory of Combinational Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, School of Pharmaceutical Science , Wuhan University , Wuhan 430072 , China
| |
Collapse
|
22
|
Miller SE, Rizzo AI, Waldbauer JR. Postnovo: Postprocessing Enables Accurate and FDR-Controlled de Novo Peptide Sequencing. J Proteome Res 2018; 17:3671-3680. [PMID: 30277077 DOI: 10.1021/acs.jproteome.8b00278] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
De novo sequencing offers an alternative to database search methods for peptide identification from mass spectra. Since it does not rely on a predetermined database of expected or potential sequences in the sample, de novo sequencing is particularly appropriate for samples lacking a well-defined or comprehensive reference database. However, the low accuracy of many de novo sequence predictions has prevented the widespread use of the variety of sequencing tools currently available. Here, we present a new open-source tool, Postnovo, that postprocesses de novo sequence predictions to find high-accuracy results. Postnovo uses a predictive model to rescore and rerank candidate sequences in a manner akin to database search postprocessing tools such as Percolator. Postnovo leverages the output from multiple de novo sequencing tools in its own analyses, producing many times the length of amino acid sequence information (including both full- and partial-length peptide sequences) at an equivalent false discovery rate (FDR) compared to any individual tool. We present a methodology to reliably screen the sequence predictions to a desired FDR given the Postnovo sequence score. We validate Postnovo with multiple data sets and demonstrate its ability to identify proteins that are missed by database search even in samples with paired reference databases.
Collapse
Affiliation(s)
- Samuel E Miller
- Department of the Geophysical Sciences , University of Chicago , 5734 South Ellis Avenue , Chicago , Illinois 60637 , United States
| | - Adriana I Rizzo
- Department of the Geophysical Sciences , University of Chicago , 5734 South Ellis Avenue , Chicago , Illinois 60637 , United States
| | - Jacob R Waldbauer
- Department of the Geophysical Sciences , University of Chicago , 5734 South Ellis Avenue , Chicago , Illinois 60637 , United States
| |
Collapse
|
23
|
He C, Sun J, Shi J, Wang Y, Zhao J, Wu S, Chang L, Gao H, Liu F, Lv Z, He F, Zhang Y, Xu P. Digging for Missing Proteins Using Low-Molecular-Weight Protein Enrichment and a “Mirror Protease” Strategy. J Proteome Res 2018; 17:4178-4185. [DOI: 10.1021/acs.jproteome.8b00398] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Cuitong He
- State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
| | - Jinshuai Sun
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Jiahui Shi
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Yihao Wang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
| | - Jialing Zhao
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
- Key Laboratory of Combinational Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, School of Pharmaceutical Science, Wuhan University, Wuhan 430072, China
| | - Shujia Wu
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
- Key Laboratory of Combinational Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, School of Pharmaceutical Science, Wuhan University, Wuhan 430072, China
| | - Lei Chang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
| | - Huiying Gao
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
| | - Fengsong Liu
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Zhitang Lv
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
| | - Yao Zhang
- State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
| | - Ping Xu
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing Proteome Research Center, Beijing 102206, China
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
- Key Laboratory of Combinational Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, School of Pharmaceutical Science, Wuhan University, Wuhan 430072, China
| |
Collapse
|
24
|
Sjöstedt E, Sivertsson Å, Hikmet Noraddin F, Katona B, Näsström Å, Vuu J, Kesti D, Oksvold P, Edqvist PH, Olsson I, Uhlén M, Lindskog C. Integration of Transcriptomics and Antibody-Based Proteomics for Exploration of Proteins Expressed in Specialized Tissues. J Proteome Res 2018; 17:4127-4137. [DOI: 10.1021/acs.jproteome.8b00406] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Evelina Sjöstedt
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm SE 171 21, Sweden
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| | - Åsa Sivertsson
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm SE 171 21, Sweden
| | - Feria Hikmet Noraddin
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| | - Borbala Katona
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| | - Åsa Näsström
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| | - Jimmy Vuu
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| | - Dennis Kesti
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| | - Per Oksvold
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm SE 171 21, Sweden
| | - Per-Henrik Edqvist
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| | - Ingmarie Olsson
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| | - Mathias Uhlén
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm SE 171 21, Sweden
| | - Cecilia Lindskog
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala SE 752 37, Sweden
| |
Collapse
|
25
|
Paik YK, Overall CM, Deutsch EW, Hancock WS, Omenn GS. Progress in the Chromosome-Centric Human Proteome Project as Highlighted in the Annual Special Issue IV. J Proteome Res 2018; 15:3945-3950. [PMID: 27809547 DOI: 10.1021/acs.jproteome.6b00803] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Young-Ki Paik
- Yonsei Proteome Research Center and Department of Biochemistry, Yonsei University
| | - Christopher M Overall
- Centre for Blood Research, Departments of Oral Biological & Medical Sciences, and Biochemistry & Molecular Biology, Faculty of Dentistry, University of British Columbia
| | | | | | - Gilbert S Omenn
- Departments of Computational Medicine & Bioinformatics, Internal Medicine, and Human Genetics and School of Public Health, University of Michigan
| |
Collapse
|
26
|
Li S, He Y, Lin Z, Xu S, Zhou R, Liang F, Wang J, Yang H, Liu S, Ren Y. Digging More Missing Proteins Using an Enrichment Approach with ProteoMiner. J Proteome Res 2017; 16:4330-4339. [PMID: 28960076 DOI: 10.1021/acs.jproteome.7b00353] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Human Proteome Project (HPP) aims at mapping entire human proteins with a systematic effort upon all the emerging techniques, which would enhance understanding of human biology and lay a foundation for development of medical applications. Until now, 2563 missing proteins (MPs, PE2-4) are still undetected even using the most sensitive approach of protein detection. Herein, we propose that enrichment of low-abundance proteins benefits MPs finding. ProteoMiner is an equalizing technique by reducing high-abundance proteins and enriching low-abundance proteins in biological liquids. With triton X-100/TBS buffer extraction, ProteoMiner enrichment, and peptide fractionation, 20 MPs (at least two non-nested unique peptides with more than eight a.a. length) with 60 unique peptides were identified from four human tissues including eight membrane/secreted proteins and five nucleus proteins. Then 15 of them were confirmed with two non-nested unique peptides (≥9 a.a.) identified by matching well with their chemically synthetic peptides in PRM assay. Hence, these results demonstrated ProteoMiner as a powerful means in discovery of MPs.
Collapse
Affiliation(s)
- Siqi Li
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Yanbin He
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Zhilong Lin
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Shaohang Xu
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Ruo Zhou
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Feng Liang
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Jian Wang
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Huanming Yang
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou 310008, China
| | - Siqi Liu
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Yan Ren
- BGI-Shenzhen , Beishan Industrial Zone 11th building, Yantian District, Shenzhen, Guangdong 518083, China
| |
Collapse
|
27
|
Wang Y, Chen Y, Zhang Y, Wei W, Li Y, Zhang T, He F, Gao Y, Xu P. Multi-Protease Strategy Identifies Three PE2 Missing Proteins in Human Testis Tissue. J Proteome Res 2017; 16:4352-4363. [PMID: 28959888 DOI: 10.1021/acs.jproteome.7b00340] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Although 5 years of the missing proteins (MPs) study have been completed, searching for MPs remains one of the core missions of the Chromosome-Centric Human Proteome Project (C-HPP). Following the next-50-MPs challenge of the C-HPP, we have focused on the testis-enriched MPs by various strategies since 2015. On the basis of the theoretical analysis of MPs (2017-01, neXtProt) using multiprotease digestion, we found that nonconventional proteases (e.g. LysargiNase, GluC) could improve the peptide diversity and sequence coverage compared with Trypsin. Therefore, a multiprotease strategy was used for searching more MPs in the same human testis tissues separated by 10% SDS-PAGE, followed by high resolution LC-MS/MS system (Q Exactive HF). A total of 7838 proteins were identified. Among them, three PE2 MPs in neXtProt 2017-01 have been identified: beta-defensin 123 ( Q8N688 , chr 20q), cancer/testis antigen family 45 member A10 ( P0DMU9 , chr Xq), and Histone H2A-Bbd type 2/3 ( P0C5Z0 , chr Xq). However, because only one unique peptide of ≥9 AA was identified in beta-defensin 123 and Histone H2A-Bbd type 2/3, respectively, further analysis indicates that each falls under the exceptions clause of the HPP Guidelines v2.1. After a spectrum quality check, isobaric PTM and single amino acid variant (SAAV) filtering, and verification with a synthesized peptide, and based on overlapping peptides from different proteases, these three MPs should be considered as exemplary examples of MPs found by exceptional criteria. Other MPs were considered as candidates but need further validation. All MS data sets have been deposited to the ProteomeXchange with identifier PXD006465.
Collapse
Affiliation(s)
- Yihao Wang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Department of Pharmacology and Toxicology, Beijing Institute of Radiation Medicine , Beijing 100850, China
| | - Yang Chen
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yao Zhang
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, College of Ecology and Evolution, Sun Yat-Sen University , Guangzhou 510275, China
| | - Wei Wei
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yanchang Li
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Tao Zhang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yue Gao
- Department of Pharmacology and Toxicology, Beijing Institute of Radiation Medicine , Beijing 100850, China
| | - Ping Xu
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, Wuhan University , Wuhan 430072, China.,Graduate School, Anhui Medical University , Hefei 230032, China.,Tianjin Baodi Hospital , Tianjin 301800, China
| |
Collapse
|
28
|
Hwang H, Park GW, Park JY, Lee HK, Lee JY, Jeong JE, Park SKR, Yates JR, Kwon KH, Park YM, Lee HJ, Paik YK, Kim JY, Yoo JS. Next Generation Proteomic Pipeline for Chromosome-Based Proteomic Research Using NeXtProt and GENCODE Databases. J Proteome Res 2017; 16:4425-4434. [PMID: 28965411 DOI: 10.1021/acs.jproteome.7b00223] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Human Proteome Project aims to map all human proteins including missing proteins as well as proteoforms with post translational modifications, alternative splicing variants (ASVs), and single amino acid variants (SAAVs). neXtProt and Ensemble databases are usually used to provide curated information on human coding genes. However, to find these proteoforms, we (Chr #11 team) first introduce a streamlined pipeline using customized and concatenated neXtProt and GENCODE originated from Ensemble, with controlled false discovery rate (FDR). Because of large sized databases used in this pipeline, we found more stringent FDR filtering (0.1% at the peptide level and 1% at the protein level) to claim novel findings, such as GENCODE ASVs and missing proteins, from human hippocampus data set (MSV000081385) and ProteomeXchange (PXD007166). Using our next generation proteomic pipeline (nextPP) with neXtProt and GENCODE databases, two missing proteins such as activity-regulated cytoskeleton-associated protein (ARC, Chr 8) and glutamate receptor ionotropic, kainite 5 (GRIK5, Chr 19) were additionally identified with two or more unique peptides from human brain tissues. Additionally, by applying the pipeline to human brain related data sets such as cortex (PXD000067 and PXD000561), spinal cord, and fetal brain (PXD000561), seven GENCODE ASVs such as ACTN4-012 (Chr.19), DPYSL2-005 (Chr.8), MPRIP-003 (Chr.17), NCAM1-013 (Chr.11), EPB41L1-017 (Chr.20), AGAP1-004 (Chr.2), and CPNE5-005 (Chr.6) were identified from two or more data sets. The identified peptides of GENCODE ASVs were mapped onto novel exon insertions, alternative translations at 5'-untranslated region, or novel protein coding sequence. Applying the pipeline to male reproductive organ related data sets, 52 GENCODE ASVs were identified from two testis (PXD000561 and PXD002179) and a spermatozoa (PXD003947) data sets. Four out of 52 GENCODE ASVs such as RAB11FIP5-008 (Chr. 2), RP13-347D8.7-001 (Chr. X), PRDX4-002 (Chr. X), and RP11-666A8.13-001 (Chr. 17) were identified in all of the three samples.
Collapse
Affiliation(s)
- Heeyoun Hwang
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea
| | - Gun Wook Park
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea
| | - Ji Yeong Park
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea.,Graduate School of Analytical Science and Technology, Chungnam National University , Daejeon, Republic of Korea
| | - Hyun Kyoung Lee
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea.,Graduate School of Analytical Science and Technology, Chungnam National University , Daejeon, Republic of Korea
| | - Ju Yeon Lee
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea
| | - Ji Eun Jeong
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea.,Graduate School of Analytical Science and Technology, Chungnam National University , Daejeon, Republic of Korea
| | - Sung-Kyu Robin Park
- Department of Chemical Physiology, The Scripps Research Institute , La Jolla, California 92037, United States
| | - John R Yates
- Department of Chemical Physiology, The Scripps Research Institute , La Jolla, California 92037, United States
| | - Kyung-Hoon Kwon
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea
| | - Young Mok Park
- Center for Cognition and Sociality, Institute for Basic Science , Daejeon, Republic of Korea
| | - Hyoung-Joo Lee
- Yonsei Proteome Research Center and Department of Integrated OMICS for Biomedical Science, and Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University , Seoul, Republic of Korea
| | - Young-Ki Paik
- Yonsei Proteome Research Center and Department of Integrated OMICS for Biomedical Science, and Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University , Seoul, Republic of Korea
| | - Jin Young Kim
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea
| | - Jong Shin Yoo
- Biomedical Omics Group, Korea Basic Science Institute , Cheongju 28119, Republic of Korea.,Graduate School of Analytical Science and Technology, Chungnam National University , Daejeon, Republic of Korea
| |
Collapse
|
29
|
Carapito C, Duek P, Macron C, Seffals M, Rondel K, Delalande F, Lindskog C, Fréour T, Vandenbrouck Y, Lane L, Pineau C. Validating Missing Proteins in Human Sperm Cells by Targeted Mass-Spectrometry- and Antibody-based Methods. J Proteome Res 2017; 16:4340-4351. [DOI: 10.1021/acs.jproteome.7b00374] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Affiliation(s)
- Christine Carapito
- Laboratoire
de Spectrométrie de Masse BioOrganique (LSMBO), IPHC, Université de Strasbourg, CNRS UMR7178, 25 Rue Becquerel, Strasbourg F-67087, France
| | - Paula Duek
- CALIPHO
Group, SIB-Swiss Institute of Bioinformatics, CMU, rue Michel-Servet
1, CH-1211 Geneva
4, Switzerland
| | - Charlotte Macron
- Laboratoire
de Spectrométrie de Masse BioOrganique (LSMBO), IPHC, Université de Strasbourg, CNRS UMR7178, 25 Rue Becquerel, Strasbourg F-67087, France
| | - Marine Seffals
- H2P2
Core facility, UMS BioSit, University of Rennes 1, Rennes F-35040, France
| | - Karine Rondel
- Protim,
Inserm U1085, Irset, Campus de Beaulieu, Rennes F-35042, France
| | - François Delalande
- Laboratoire
de Spectrométrie de Masse BioOrganique (LSMBO), IPHC, Université de Strasbourg, CNRS UMR7178, 25 Rue Becquerel, Strasbourg F-67087, France
| | - Cecilia Lindskog
- Department
of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Thomas Fréour
- Service de
Médecine de la Reproduction, CHU de Nantes, 38 boulevard
Jean Monnet, Nantes F-44093, France
- Inserm UMR1064, Nantes F-44093, France
| | - Yves Vandenbrouck
- CEA, DRF, BIG,
Laboratoire de Biologie à Grande Echelle, 17, rue des Martyrs, Grenoble F-38054, France
- Inserm U1038, Grenoble F-38054, France
- Grenoble-Alpes University, Grenoble F-38054, France
| | - Lydie Lane
- CALIPHO
Group, SIB-Swiss Institute of Bioinformatics, CMU, rue Michel-Servet
1, CH-1211 Geneva
4, Switzerland
- Department
of Human Protein Sciences, Faculty of Medicine, University of Geneva, 1, rue Michel-Servet, 1211 Geneva 4, Switzerland
| | - Charles Pineau
- Protim,
Inserm U1085, Irset, Campus de Beaulieu, Rennes F-35042, France
| |
Collapse
|
30
|
Abstract
De novo peptide sequencing from tandem MS data is the key technology in proteomics for the characterization of proteins, especially for new sequences, such as mAbs. In this study, we propose a deep neural network model, DeepNovo, for de novo peptide sequencing. DeepNovo architecture combines recent advances in convolutional neural networks and recurrent neural networks to learn features of tandem mass spectra, fragment ions, and sequence patterns of peptides. The networks are further integrated with local dynamic programming to solve the complex optimization task of de novo sequencing. We evaluated the method on a wide variety of species and found that DeepNovo considerably outperformed state of the art methods, achieving 7.7-22.9% higher accuracy at the amino acid level and 38.1-64.0% higher accuracy at the peptide level. We further used DeepNovo to automatically reconstruct the complete sequences of antibody light and heavy chains of mouse, achieving 97.5-100% coverage and 97.2-99.5% accuracy, without assisting databases. Moreover, DeepNovo is retrainable to adapt to any sources of data and provides a complete end-to-end training and prediction solution to the de novo sequencing problem. Not only does our study extend the deep learning revolution to a new field, but it also shows an innovative approach in solving optimization problems by using deep learning and dynamic programming.
Collapse
|
31
|
Alikhani M, Mirzaei M, Sabbaghian M, Parsamatin P, Karamzadeh R, Adib S, Sodeifi N, Gilani MAS, Zabet-Moghaddam M, Parker L, Wu Y, Gupta V, Haynes PA, Gourabi H, Baharvand H, Salekdeh GH. Quantitative proteomic analysis of human testis reveals system-wide molecular and cellular pathways associated with non-obstructive azoospermia. J Proteomics 2017; 162:141-154. [DOI: 10.1016/j.jprot.2017.02.007] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2016] [Revised: 01/22/2017] [Accepted: 02/13/2017] [Indexed: 12/17/2022]
|
32
|
Segura V, Garin-Muga A, Guruceaga E, Corrales FJ. Progress and pitfalls in finding the 'missing proteins' from the human proteome map. Expert Rev Proteomics 2017; 14:9-14. [PMID: 27885863 DOI: 10.1080/14789450.2017.1265450] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Accepted: 11/23/2016] [Indexed: 01/09/2023]
Abstract
The Human Proteome Project was launched with two main goals: the comprehensive and systematic definition of the human proteome map and the development of ready to use analytical tools to measure relevant proteins in their biological context in health and disease. Despite the great progress in this endeavour, there is still a group of reluctant proteins with no, or scarce, experimental evidence supporting their existence. These are called the 'missing proteins' and represent one of the biggest challenges to complete the human proteome map. Areas covered: This review focuses on the description of the missing proteome based on the HUPO standards, the analysis of the reasons explaining the difficulty of detecting missing proteins and the strategies currently used in the search for missing proteins. The present and future of the quest for the missing proteins is critically revised hereafter. Expert commentary: An overarching multidisciplinary effort is currently being done under the HUPO umbrella to allow completion of the human proteome map. It is expected that the detection of missing proteins will grow in the coming years since the methods and the best tissue/cell type sample for their search are already on the table.
Collapse
Affiliation(s)
- Victor Segura
- a Proteomics and Bioinformatics Laboratory, CIMA , University of Navarra , Pamplona , Spain
| | - Alba Garin-Muga
- a Proteomics and Bioinformatics Laboratory, CIMA , University of Navarra , Pamplona , Spain
| | - Elizabeth Guruceaga
- a Proteomics and Bioinformatics Laboratory, CIMA , University of Navarra , Pamplona , Spain
| | - Fernando J Corrales
- a Proteomics and Bioinformatics Laboratory, CIMA , University of Navarra , Pamplona , Spain
| |
Collapse
|
33
|
Weisser H, Wright JC, Mudge JM, Gutenbrunner P, Choudhary JS. Flexible Data Analysis Pipeline for High-Confidence Proteogenomics. J Proteome Res 2016; 15:4686-4695. [PMID: 27786492 PMCID: PMC5703597 DOI: 10.1021/acs.jproteome.6b00765] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
Proteogenomics leverages information
derived from proteomic data
to improve genome annotations. Of particular interest are “novel”
peptides that provide direct evidence of protein expression for genomic
regions not previously annotated as protein-coding. We present a modular,
automated data analysis pipeline aimed at detecting such “novel”
peptides in proteomic data sets. This pipeline implements criteria
developed by proteomics and genome annotation experts for high-stringency
peptide identification and filtering. Our pipeline is based on the
OpenMS computational framework; it incorporates multiple database
search engines for peptide identification and applies a machine-learning
approach (Percolator) to post-process search results. We describe
several new and improved software tools that we developed to facilitate
proteogenomic analyses that enhance the wealth of tools provided by
OpenMS. We demonstrate the application of our pipeline to a human
testis tissue data set previously acquired for the Chromosome-Centric
Human Proteome Project, which led to the addition of five new gene
annotations on the human reference genome.
Collapse
Affiliation(s)
| | | | | | - Petra Gutenbrunner
- School of Informatics, Communications, and Media, University of Applied Sciences Upper Austria , Hagenberg 4232, Austria
| | | |
Collapse
|
34
|
Omenn GS, Lane L, Lundberg EK, Beavis RC, Overall CM, Deutsch EW. Metrics for the Human Proteome Project 2016: Progress on Identifying and Characterizing the Human Proteome, Including Post-Translational Modifications. J Proteome Res 2016; 15:3951-3960. [PMID: 27487407 PMCID: PMC5129622 DOI: 10.1021/acs.jproteome.6b00511] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
The HUPO Human Proteome Project (HPP) has two overall goals: (1) stepwise completion of the protein parts list-the draft human proteome including confidently identifying and characterizing at least one protein product from each protein-coding gene, with increasing emphasis on sequence variants, post-translational modifications (PTMs), and splice isoforms of those proteins; and (2) making proteomics an integrated counterpart to genomics throughout the biomedical and life sciences community. PeptideAtlas and GPMDB reanalyze all major human mass spectrometry data sets available through ProteomeXchange with standardized protocols and stringent quality filters; neXtProt curates and integrates mass spectrometry and other findings to present the most up to date authorative compendium of the human proteome. The HPP Guidelines for Mass Spectrometry Data Interpretation version 2.1 were applied to manuscripts submitted for this 2016 C-HPP-led special issue [ www.thehpp.org/guidelines ]. The Human Proteome presented as neXtProt version 2016-02 has 16,518 confident protein identifications (Protein Existence [PE] Level 1), up from 13,664 at 2012-12, 15,646 at 2013-09, and 16,491 at 2014-10. There are 485 proteins that would have been PE1 under the Guidelines v1.0 from 2012 but now have insufficient evidence due to the agreed-upon more stringent Guidelines v2.0 to reduce false positives. neXtProt and PeptideAtlas now both require two non-nested, uniquely mapping (proteotypic) peptides of at least 9 aa in length. There are 2,949 missing proteins (PE2+3+4) as the baseline for submissions for this fourth annual C-HPP special issue of Journal of Proteome Research. PeptideAtlas has 14,629 canonical (plus 1187 uncertain and 1755 redundant) entries. GPMDB has 16,190 EC4 entries, and the Human Protein Atlas has 10,475 entries with supportive evidence. neXtProt, PeptideAtlas, and GPMDB are rich resources of information about post-translational modifications (PTMs), single amino acid variants (SAAVSs), and splice isoforms. Meanwhile, the Biology- and Disease-driven (B/D)-HPP has created comprehensive SRM resources, generated popular protein lists to guide targeted proteomics assays for specific diseases, and launched an Early Career Researchers initiative.
Collapse
Affiliation(s)
- Gilbert S. Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics and Department of Human Protein Science, University of Geneva, CMU, Michel-Servet 1, 1211 Geneva 4, Switzerland
| | - Emma K. Lundberg
- SciLifeLab Stockholm and School of Biotechnology, KTH, Karolinska Institutet Science Park, Tomtebodavägen 23, SE-171 65 Solna, Sweden
| | - Ronald C. Beavis
- Biochemistry & Medical Genetics, University of Manitoba, Winnipeg, MB R3T 2N2, Canada
| | - Christopher M. Overall
- Biochemistry and Molecular Biology, and Oral Biological and Medical Sciences University of British Columbia, 2350 Health Sciences Mall, Room 4.401, Vancouver, BC V6T 1Z3, Canada
| | - Eric W. Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, Washington 98109-5263, United States
| |
Collapse
|
35
|
Poverennaya EV, Kopylov AT, Ponomarenko EA, Ilgisonis EV, Zgoda VG, Tikhonova OV, Novikova SE, Farafonova TE, Kiseleva YY, Radko SP, Vakhrushev IV, Yarygin KN, Moshkovskii SA, Kiseleva OI, Lisitsa AV, Sokolov AS, Mazur AM, Prokhortchouk EB, Skryabin KG, Kostrjukova ES, Tyakht AV, Gorbachev AY, Ilina EN, Govorun VM, Archakov AI. State of the Art of Chromosome 18-Centric HPP in 2016: Transcriptome and Proteome Profiling of Liver Tissue and HepG2 Cells. J Proteome Res 2016; 15:4030-4038. [PMID: 27527821 DOI: 10.1021/acs.jproteome.6b00380] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
A gene-centric approach was applied for a large-scale study of expression products of a single chromosome. Transcriptome profiling of liver tissue and HepG2 cell line was independently performed using two RNA-Seq platforms (SOLiD and Illumina) and also by Droplet Digital PCR (ddPCR) and quantitative RT-PCR. Proteome profiling was performed using shotgun LC-MS/MS as well as selected reaction monitoring with stable isotope-labeled standards (SRM/SIS) for liver tissue and HepG2 cells. On the basis of SRM/SIS measurements, protein copy numbers were estimated for the Chromosome 18 (Chr 18) encoded proteins in the selected types of biological material. These values were compared with expression levels of corresponding mRNA. As a result, we obtained information about 158 and 142 transcripts for HepG2 cell line and liver tissue, respectively. SRM/SIS measurements and shotgun LC-MS/MS allowed us to detect 91 Chr 18-encoded proteins in total, while an intersection between the HepG2 cell line and liver tissue proteomes was ∼66%. In total, there were 16 proteins specifically observed in HepG2 cell line, while 15 proteins were found solely in the liver tissue. Comparison between proteome and transcriptome revealed a poor correlation (R2 ≈ 0.1) between corresponding mRNA and protein expression levels. The SRM and shotgun data sets (obtained during 2015-2016) are available in PASSEL (PASS00697) and ProteomeExchange/PRIDE (PXD004407). All measurements were also uploaded into the in-house Chr 18 Knowledgebase at http://kb18.ru/protein/matrix/416126 .
Collapse
Affiliation(s)
| | - Arthur T Kopylov
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Elena A Ponomarenko
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | | | - Victor G Zgoda
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Olga V Tikhonova
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Svetlana E Novikova
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Tatyana E Farafonova
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Yana Yu Kiseleva
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Sergey P Radko
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Igor V Vakhrushev
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Konstantin N Yarygin
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Sergei A Moshkovskii
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia.,Pirogov Russian National Research Medical University , Ostrovitianov Str. 1, Moscow 117997, Russia
| | - Olga I Kiseleva
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Andrey V Lisitsa
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| | - Alexey S Sokolov
- Center "Bioengineering" Russian Academy of Sciences , Prospect 60-let Oktyabrya, 7, Build.1, Moscow 119071, Russia
| | - Alexander M Mazur
- Center "Bioengineering" Russian Academy of Sciences , Prospect 60-let Oktyabrya, 7, Build.1, Moscow 119071, Russia
| | - Egor B Prokhortchouk
- Center "Bioengineering" Russian Academy of Sciences , Prospect 60-let Oktyabrya, 7, Build.1, Moscow 119071, Russia
| | - Konstantin G Skryabin
- Center "Bioengineering" Russian Academy of Sciences , Prospect 60-let Oktyabrya, 7, Build.1, Moscow 119071, Russia
| | - Elena S Kostrjukova
- Scientific Research Institute of Physical-Chemical Medicine , Malaya Pirogovskaya, 1a, Moscow 119435, Russia
| | - Alexander V Tyakht
- Scientific Research Institute of Physical-Chemical Medicine , Malaya Pirogovskaya, 1a, Moscow 119435, Russia
| | - Alexey Yu Gorbachev
- Scientific Research Institute of Physical-Chemical Medicine , Malaya Pirogovskaya, 1a, Moscow 119435, Russia
| | - Elena N Ilina
- Scientific Research Institute of Physical-Chemical Medicine , Malaya Pirogovskaya, 1a, Moscow 119435, Russia
| | - Vadim M Govorun
- Scientific Research Institute of Physical-Chemical Medicine , Malaya Pirogovskaya, 1a, Moscow 119435, Russia
| | - Alexander I Archakov
- Institute of Biomedical Chemistry , Pogodinskaya Street, 10, Moscow 119121, Russia
| |
Collapse
|
36
|
Wei W, Luo W, Wu F, Peng X, Zhang Y, Zhang M, Zhao Y, Su N, Qi Y, Chen L, Zhang Y, Wen B, He F, Xu P. Deep Coverage Proteomics Identifies More Low-Abundance Missing Proteins in Human Testis Tissue with Q-Exactive HF Mass Spectrometer. J Proteome Res 2016; 15:3988-3997. [PMID: 27535590 DOI: 10.1021/acs.jproteome.6b00390] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Since 2012, missing proteins (MPs) investigation has been one of the critical missions of Chromosome-Centric Human Proteome Project (C-HPP) through various biochemical strategies. On the basis of our previous testis MPs study, faster scanning and higher resolution mass-spectrometry-based proteomics might be conducive to MPs exploration, especially for low-abundance proteins. In this study, Q-Exactive HF (HF) was used to survey proteins from the same testis tissues separated by two separating methods (tricine- and glycine-SDS-PAGE), as previously described. A total of 8526 proteins were identified, of which more low-abundance proteins were uniquely detected in HF data but not in our previous LTQ Orbitrap Velos (Velos) reanalysis data. Further transcriptomics analysis showed that these uniquely identified proteins by HF also had lower expression at the mRNA level. Of the 81 total identified MPs, 74 and 39 proteins were listed as MPs in HF and Velos data sets, respectively. Among the above MPs, 47 proteins (43 neXtProt PE2 and 4 PE3) were ranked as confirmed MPs after verifying with the stringent spectra match and isobaric and single amino acid variants filtering. Functional investigation of these 47 MPs revealed that 11 MPs were testis-specific proteins and 7 MPs were involved in spermatogenesis process. Therefore, we concluded that higher scanning speed and resolution of HF might be factors for improving the low-abundance MP identification in future C-HPP studies. All mass-spectrometry data from this study have been deposited in the ProteomeXchange with identifier PXD004092.
Collapse
Affiliation(s)
- Wei Wei
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Weijia Luo
- Graduate School, Anhui Medical University , Hefei 230032, China
| | - Feilin Wu
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Life Science College, Southwest Forestry University , Kunming, 650224, China
| | - Xuehui Peng
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of the Ministry of Education, School of Pharmaceutical Sciences, Wuhan University , Wuhan 430072, China
| | - Yao Zhang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Institute of Microbiology , Chinese Academy of Science, Beijing 100101, China
| | - Manli Zhang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yan Zhao
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Na Su
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - YingZi Qi
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Lingsheng Chen
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yangjun Zhang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Bo Wen
- BGI-Shenzhen , Shenzhen 518083, China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Ping Xu
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Graduate School, Anhui Medical University , Hefei 230032, China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of the Ministry of Education, School of Pharmaceutical Sciences, Wuhan University , Wuhan 430072, China
| |
Collapse
|
37
|
Duek P, Bairoch A, Gateau A, Vandenbrouck Y, Lane L. Missing Protein Landscape of Human Chromosomes 2 and 14: Progress and Current Status. J Proteome Res 2016; 15:3971-3978. [PMID: 27487287 DOI: 10.1021/acs.jproteome.6b00443] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Within the C-HPP, the Swiss and French teams are responsible for the annotation of proteins from chromosomes 2 and 14, respectively. neXtProt currently reports 1231 entries on chromosome 2 and 624 entries on chromosome 14; of these, 134 and 93 entries are still not experimentally validated and are thus considered as "missing proteins" (PE2-4), respectively. Among these entries, some may never be validated by conventional MS/MS approaches because of incompatible biochemical features. Others have already been validated but are still awaiting annotation. On the basis of information retrieved from the literature and from three of the main C-HPP resources (Human Protein Atlas, PeptideAtlas, and neXtProt), a subset of 40 theoretically detectable missing proteins (25 on chromosome 2 and 15 on chromosome 14) was defined for upcoming targeted studies in sperm samples. This list is proposed as a roadmap for the French and Swiss teams in the near future.
Collapse
Affiliation(s)
- Paula Duek
- CALIPHO Group, SIB-Swiss Institute of Bioinformatics, CMU, rue Michel-Servet 1, CH-1211 Geneva 4, Switzerland
| | - Amos Bairoch
- CALIPHO Group, SIB-Swiss Institute of Bioinformatics, CMU, rue Michel-Servet 1, CH-1211 Geneva 4, Switzerland.,Department of Human Protein Sciences, Faculty of Medicine, University of Geneva , CMU, rue Michel-Servet 1, CH-1211 Geneva 4, Switzerland
| | - Alain Gateau
- CALIPHO Group, SIB-Swiss Institute of Bioinformatics, CMU, rue Michel-Servet 1, CH-1211 Geneva 4, Switzerland
| | - Yves Vandenbrouck
- CEA, DRF, BIG, Laboratoire de Biologie à Grande Echelle, 17, rue des Martyrs, Grenoble F-38054, France.,Inserm U1038 , 17, rue des Martyrs, Grenoble F-38054, France.,Université de Grenoble , Grenoble F-38054, France
| | - Lydie Lane
- CALIPHO Group, SIB-Swiss Institute of Bioinformatics, CMU, rue Michel-Servet 1, CH-1211 Geneva 4, Switzerland.,Department of Human Protein Sciences, Faculty of Medicine, University of Geneva , CMU, rue Michel-Servet 1, CH-1211 Geneva 4, Switzerland
| |
Collapse
|
38
|
Zhao M, Wei W, Cheng L, Zhang Y, Wu F, He F, Xu P. Searching Missing Proteins Based on the Optimization of Membrane Protein Enrichment and Digestion Process. J Proteome Res 2016; 15:4020-4029. [PMID: 27485413 DOI: 10.1021/acs.jproteome.6b00389] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
A membrane protein enrichment method composed of ultracentrifugation and detergent-based extraction was first developed based on MCF7 cell line. Then, in-solution digestion with detergents and eFASP (enhanced filter-aided sample preparation) with detergents were compared with the time-consuming in-gel digestion method. Among the in-solution digestion strategies, the eFASP combined with RapiGest identified 1125 membrane proteins. Similarly, the eFASP combined with sodium deoxycholate identified 1069 membrane proteins; however, the in-gel digestion characterized 1091 membrane proteins. Totally, with the five digestion methods, 1390 membrane proteins were identified with ≥1 unique peptides, among which 1345 membrane proteins contain unique peptides ≥2. This is the biggest membrane protein data set for MCF7 cell line and even breast cancer tissue samples. Interestingly, we identified 13 unique peptides belonging to 8 missing proteins (MPs). Finally, eight unique peptides were validated by synthesized peptides. Two proteins were confirmed as MPs, and another two proteins were candidate detections.
Collapse
Affiliation(s)
- Mingzhi Zhao
- State Key Laboratory of Proteomics, National Engineering Research Center for Protein Drugs, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine , Beijing 102206, P. R. China
| | - Wei Wei
- State Key Laboratory of Proteomics, National Engineering Research Center for Protein Drugs, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine , Beijing 102206, P. R. China
| | - Long Cheng
- Department of Medical Molecular Biology, Beijing Institute of Biotechnology , 27 Tai-Ping Lu Road, Beijing 100850, China
| | - Yao Zhang
- State Key Laboratory of Proteomics, National Engineering Research Center for Protein Drugs, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine , Beijing 102206, P. R. China.,Institute of Microbiology, Chinese Academy of Science , Beijing 100101, China
| | - Feilin Wu
- State Key Laboratory of Proteomics, National Engineering Research Center for Protein Drugs, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine , Beijing 102206, P. R. China.,Life Science College, Southwest Forestry University , Kunming 650224, P. R. China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Engineering Research Center for Protein Drugs, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine , Beijing 102206, P. R. China
| | - Ping Xu
- State Key Laboratory of Proteomics, National Engineering Research Center for Protein Drugs, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine , Beijing 102206, P. R. China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, and Wuhan University School of Pharmaceutical Sciences , Wuhan 430071, P. R. China.,Anhui Medical University , Hefei 230032, Anhui, P. R. China
| |
Collapse
|
39
|
Guo J, Cui Y, Yan Z, Luo Y, Zhang W, Deng S, Tang S, Zhang G, He QY, Wang T. Phosphoproteome Characterization of Human Colorectal Cancer SW620 Cell-Derived Exosomes and New Phosphosite Discovery for C-HPP. J Proteome Res 2016; 15:4060-4072. [PMID: 27470641 DOI: 10.1021/acs.jproteome.6b00391] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- Jiahui Guo
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Yizhi Cui
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Ziqi Yan
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Yanzhang Luo
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Wanling Zhang
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Suyuan Deng
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Shengquan Tang
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Gong Zhang
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Qing-Yu He
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Tong Wang
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| |
Collapse
|
40
|
Paik YK, Omenn GS, Overall CM, Deutsch EW, Hancock WS. Recent Advances in the Chromosome-Centric Human Proteome Project: Missing Proteins in the Spot Light. J Proteome Res 2016; 14:3409-14. [PMID: 26337862 DOI: 10.1021/acs.jproteome.5b00785] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Young-Ki Paik
- Yonsei Proteome Research Center, Yonsei University , Seoul 120-749, Korea
| | - Gilbert S Omenn
- Center for Computational Medicine and Bioinformatics, University of Michigan , Ann Arbor, Michigan 48109, United States.,Yonsei Proteome Research Center, Yonsei University , Seoul 120-749, Korea
| | - Christopher M Overall
- Department of Biochemistry and Molecular Biology, University of British Columbia , Vancouver, British Columbia V6T 1Z3, Canada.,Yonsei Proteome Research Center, Yonsei University , Seoul 120-749, Korea
| | - Eric W Deutsch
- Institute for Systems Biology , Seattle, Washington 98109, United States.,Yonsei Proteome Research Center, Yonsei University , Seoul 120-749, Korea
| | - William S Hancock
- Department of Chemical Biology, Northeastern University , Boston, Massachusetts 02115, United States.,Yonsei Proteome Research Center, Yonsei University , Seoul 120-749, Korea
| |
Collapse
|
41
|
Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow. Nat Commun 2016; 7:11778. [PMID: 27250503 PMCID: PMC4895710 DOI: 10.1038/ncomms11778] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2015] [Accepted: 04/28/2016] [Indexed: 12/16/2022] Open
Abstract
Complete annotation of the human genome is indispensable for medical research. The GENCODE consortium strives to provide this, augmenting computational and experimental evidence with manual annotation. The rapidly developing field of proteogenomics provides evidence for the translation of genes into proteins and can be used to discover and refine gene models. However, for both the proteomics and annotation groups, there is a lack of guidelines for integrating this data. Here we report a stringent workflow for the interpretation of proteogenomic data that could be used by the annotation community to interpret novel proteogenomic evidence. Based on reprocessing of three large-scale publicly available human data sets, we show that a conservative approach, using stringent filtering is required to generate valid identifications. Evidence has been found supporting 16 novel protein-coding genes being added to GENCODE. Despite this many peptide identifications in pseudogenes cannot be annotated due to the absence of orthogonal supporting evidence. Identifying and annotating functional elements in the human genome remains a challenging but important task. Here the authors propose a priority annotation score to rank identifications and suggest how proteogenomics evidence can be interpreted and what additional information substantiates protein-coding potential for annotation.
Collapse
|
42
|
Zhao M, Cai M, Wu F, Zhang Y, Xiong Z, Xu P. Recombinant expression, refolding, purification and characterization of Pseudomonas aeruginosa protease IV in Escherichia coli. Protein Expr Purif 2016; 126:69-76. [PMID: 27260967 DOI: 10.1016/j.pep.2016.05.019] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2016] [Revised: 05/29/2016] [Accepted: 05/30/2016] [Indexed: 11/30/2022]
Abstract
Several protease IV enzymes are widely used in proteomic research. Specifically, protease IV from Pseudomonas aeruginosa has lysyl endopeptidase activity. Here, we report the recombinant expression, refolding, activation, and purification of this protease in Escherichia coli. Proteolytic instability of the activated intermediate, a major obstacle for efficient production, is controlled through ammonium sulfate precipitation. The purified protease IV exhibits superior lysyl endopeptidase activity compared to a commercial product.
Collapse
Affiliation(s)
- Mingzhi Zhao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine, Beijing, 102206, PR China
| | - Man Cai
- Prosit Sole Biotechnology, Co., Ltd., Beijing, 100085, PR China
| | - Feilin Wu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine, Beijing, 102206, PR China; Life Science College, Southwest Forestry University, Kunming, 650224, PR China
| | - Yao Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine, Beijing, 102206, PR China; Institute of Microbiology Chinese Academy of Science, Beijing, 100101, PR China
| | - Zhi Xiong
- Life Science College, Southwest Forestry University, Kunming, 650224, PR China
| | - Ping Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Radiation Medicine, Beijing, 102206, PR China; Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education and Wuhan University School of Pharmaceutical Sciences, Wuhan, 430071, PR China; Anhui Medical University, Hefei, 230032, Anhui, PR China.
| |
Collapse
|
43
|
Xu S, Zhou R, Ren Z, Zhou B, Lin Z, Hou G, Deng Y, Zi J, Lin L, Wang Q, Liu X, Xu X, Wen B, Liu S. Appraisal of the Missing Proteins Based on the mRNAs Bound to Ribosomes. J Proteome Res 2015; 14:4976-84. [PMID: 26500078 DOI: 10.1021/acs.jproteome.5b00476] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Considering the technical limitations of mass spectrometry in protein identification, the mRNAs bound to ribosomes (RNC-mRNA) are assumed to reflect the mRNAs participating in the translational process. The RNC-mRNA data are reasoned to be useful for appraising the missing proteins. A set of the multiomics data including free-mRNAs, RNC-mRNAs, and proteomes was acquired from three liver cancer cell lines. On the basis of the missing proteins in neXtProt (release 2014-09-19), the bioinformatics analysis was carried out in three phases: (1) finding how many neXtProt missing proteins have or do not have RNA-seq and/or MS/MS evidence, (2) analyzing specific physicochemical and biological properties of the missing proteins that lack both RNA-seq and MS/MS evidence, and (3) analyzing the combined properties of these missing proteins. Total of 1501 missing proteins were found by neither RNC-mRNA nor MS/MS in the three liver cancer cell lines. For these missing proteins, some are expected higher hydrophobicity, unsuitable detection, or sensory functions as properties at the protein level, while some are predicted to have nonexpressing chromatin structures on the corresponding gene level. With further integrated analysis, we could attribute 93% of them (1391/1501) to these causal factors, which result in the expression products scarcely detected by RNA-seq or MS/MS.
Collapse
Affiliation(s)
- Shaohang Xu
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Ruo Zhou
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Zhe Ren
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Baojin Zhou
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Zhilong Lin
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Guixue Hou
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , BeiChen West Road, Beijing 100101, China
| | - Yamei Deng
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , BeiChen West Road, Beijing 100101, China
| | - Jin Zi
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Liang Lin
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Quanhui Wang
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , BeiChen West Road, Beijing 100101, China
| | - Xin Liu
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Xun Xu
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Bo Wen
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Siqi Liu
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , BeiChen West Road, Beijing 100101, China
| |
Collapse
|