1
|
Lewis JM, Jebeli L, Coulon PML, Lay CE, Scott NE. Glycoproteomic and proteomic analysis of Burkholderia cenocepacia reveals glycosylation events within FliF and MotB are dispensable for motility. Microbiol Spectr 2024; 12:e0034624. [PMID: 38709084 DOI: 10.1128/spectrum.00346-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2024] [Accepted: 04/16/2024] [Indexed: 05/07/2024] Open
Abstract
Across the Burkholderia genus O-linked protein glycosylation is highly conserved. While the inhibition of glycosylation has been shown to be detrimental for virulence in Burkholderia cepacia complex species, such as Burkholderia cenocepacia, little is known about how specific glycosylation sites impact protein functionality. Within this study, we sought to improve our understanding of the breadth, dynamics, and requirement for glycosylation across the B. cenocepacia O-glycoproteome. Assessing the B. cenocepacia glycoproteome across different culture media using complementary glycoproteomic approaches, we increase the known glycoproteome to 141 glycoproteins. Leveraging this repertoire of glycoproteins, we quantitively assessed the glycoproteome of B. cenocepacia using Data-Independent Acquisition (DIA) revealing the B. cenocepacia glycoproteome is largely stable across conditions with most glycoproteins constitutively expressed. Examination of how the absence of glycosylation impacts the glycoproteome reveals that the protein abundance of only five glycoproteins (BCAL1086, BCAL2974, BCAL0525, BCAM0505, and BCAL0127) are altered by the loss of glycosylation. Assessing ΔfliF (ΔBCAL0525), ΔmotB (ΔBCAL0127), and ΔBCAM0505 strains, we demonstrate the loss of FliF, and to a lesser extent MotB, mirror the proteomic effects observed in the absence of glycosylation in ΔpglL. While both MotB and FliF are essential for motility, we find loss of glycosylation sites in MotB or FliF does not impact motility supporting these sites are dispensable for function. Combined this work broadens our understanding of the B. cenocepacia glycoproteome supporting that the loss of glycoproteins in the absence of glycosylation is not an indicator of the requirement for glycosylation for protein function. IMPORTANCE Burkholderia cenocepacia is an opportunistic pathogen of concern within the Cystic Fibrosis community. Despite a greater appreciation of the unique physiology of B. cenocepacia gained over the last 20 years a complete understanding of the proteome and especially the O-glycoproteome, is lacking. In this study, we utilize systems biology approaches to expand the known B. cenocepacia glycoproteome as well as track the dynamics of glycoproteins across growth phases, culturing media and in response to the loss of glycosylation. We show that the glycoproteome of B. cenocepacia is largely stable across conditions and that the loss of glycosylation only impacts five glycoproteins including the motility associated proteins FliF and MotB. Examination of MotB and FliF shows, while these proteins are essential for motility, glycosylation is dispensable. Combined this work supports that B. cenocepacia glycosylation can be dispensable for protein function and may influence protein properties beyond stability.
Collapse
Affiliation(s)
- Jessica M Lewis
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Leila Jebeli
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Pauline M L Coulon
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Catrina E Lay
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Nichollas E Scott
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| |
Collapse
|
2
|
Miravet-Verde S, Mazzolini R, Segura-Morales C, Broto A, Lluch-Senar M, Serrano L. ProTInSeq: transposon insertion tracking by ultra-deep DNA sequencing to identify translated large and small ORFs. Nat Commun 2024; 15:2091. [PMID: 38453908 PMCID: PMC10920889 DOI: 10.1038/s41467-024-46112-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 02/14/2024] [Indexed: 03/09/2024] Open
Abstract
Identifying open reading frames (ORFs) being translated is not a trivial task. ProTInSeq is a technique designed to characterize proteomes by sequencing transposon insertions engineered to express a selection marker when they occur in-frame within a protein-coding gene. In the bacterium Mycoplasma pneumoniae, ProTInSeq identifies 83% of its annotated proteins, along with 5 proteins and 153 small ORF-encoded proteins (SEPs; ≤100 aa) that were not previously annotated. Moreover, ProTInSeq can be utilized for detecting translational noise, as well as for relative quantification and transmembrane topology estimation of fitness and non-essential proteins. By integrating various identification approaches, the number of initially annotated SEPs in this bacterium increases from 27 to 329, with a quarter of them predicted to possess antimicrobial potential. Herein, we describe a methodology complementary to Ribo-Seq and mass spectroscopy that can identify SEPs while providing other insights in a proteome with a flexible and cost-effective DNA ultra-deep sequencing approach.
Collapse
Affiliation(s)
- Samuel Miravet-Verde
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003, Barcelona, Spain.
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zurich, Zurich, Switzerland.
| | | | - Carolina Segura-Morales
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003, Barcelona, Spain
| | - Alicia Broto
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003, Barcelona, Spain
| | - Maria Lluch-Senar
- Pulmobiotics, Dr Aiguader 88, 08003, Barcelona, Spain.
- Institute of Biotechnology and Biomedicine "Vicent Villar Palasi" (IBB), Universitat Autònoma de Barcelona, Barcelona, Spain.
| | - Luis Serrano
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003, Barcelona, Spain.
- Universitat Pompeu Fabra (UPF), Barcelona, Spain.
- ICREA, Pg. Lluis Companys 23, 08010, Barcelona, Spain.
| |
Collapse
|
3
|
Mao Y, Qian SB. Making sense of mRNA translational "noise". Semin Cell Dev Biol 2024; 154:114-122. [PMID: 36925447 PMCID: PMC10500040 DOI: 10.1016/j.semcdb.2023.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 03/09/2023] [Accepted: 03/09/2023] [Indexed: 03/15/2023]
Abstract
The importance of translation fidelity has been apparent since the discovery of genetic code. It is commonly believed that translation deviating from the main coding region is to be avoided at all times inside cells. However, ribosome profiling and mass spectrometry have revealed pervasive noncanonical translation. Both the scope and origin of translational "noise" are just beginning to be appreciated. Although largely overlooked, those translational "noises" are associated with a wide range of cellular functions, such as producing unannotated protein products. Furthermore, the dynamic nature of translational "noise" is responsive to stress conditions, highlighting the beneficial effect of translational "noise" in stress adaptation. Mechanistic investigation of translational "noise" will provide better insight into the mechanisms of translational regulation. Ultimately, they are not "noise" at all but represent a signature of cellular activities under pathophysiological conditions. Deciphering translational "noise" holds the therapeutic and diagnostic potential in a wide spectrum of human diseases.
Collapse
Affiliation(s)
- Yuanhui Mao
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Shu-Bing Qian
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
4
|
Lu Y, Ran Y, Li H, Wen J, Cui X, Zhang X, Guan X, Cheng M. Micropeptides: origins, identification, and potential role in metabolism-related diseases. J Zhejiang Univ Sci B 2023; 24:1106-1122. [PMID: 38057268 PMCID: PMC10710913 DOI: 10.1631/jzus.b2300128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 06/06/2023] [Indexed: 12/08/2023]
Abstract
With the development of modern sequencing techniques and bioinformatics, genomes that were once thought to be noncoding have been found to encode abundant functional micropeptides (miPs), a kind of small polypeptides. Although miPs are difficult to analyze and identify, a number of studies have begun to focus on them. More and more miPs have been revealed as essential for energy metabolism homeostasis, immune regulation, and tumor growth and development. Many reports have shown that miPs are especially essential for regulating glucose and lipid metabolism and regulating mitochondrial function. MiPs are also involved in the progression of related diseases. This paper reviews the sources and identification of miPs, as well as the functional significance of miPs for metabolism-related diseases, with the aim of revealing their potential clinical applications.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Min Cheng
- School of Basic Medicine Sciences, Weifang Medical University, Weifang 261053, China.
| |
Collapse
|
5
|
Mohsen JJ, Martel AA, Slavoff SA. Microproteins-Discovery, structure, and function. Proteomics 2023; 23:e2100211. [PMID: 37603371 PMCID: PMC10841188 DOI: 10.1002/pmic.202100211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/03/2023] [Accepted: 08/10/2023] [Indexed: 08/22/2023]
Abstract
Advances in proteogenomic technologies have revealed hundreds to thousands of translated small open reading frames (sORFs) that encode microproteins in genomes across evolutionary space. While many microproteins have now been shown to play critical roles in biology and human disease, a majority of recently identified microproteins have little or no experimental evidence regarding their functionality. Computational tools have some limitations for analysis of short, poorly conserved microprotein sequences, so additional approaches are needed to determine the role of each member of this recently discovered polypeptide class. A currently underexplored avenue in the study of microproteins is structure prediction and determination, which delivers a depth of functional information. In this review, we provide a brief overview of microprotein discovery methods, then examine examples of microprotein structures (and, conversely, intrinsic disorder) that have been experimentally determined using crystallography, cryo-electron microscopy, and NMR, which provide insight into their molecular functions and mechanisms. Additionally, we discuss examples of predicted microprotein structures that have provided insight or context regarding their function. Analysis of microprotein structure at the angstrom level, and confirmation of predicted structures, therefore, has potential to identify translated microproteins that are of biological importance and to provide molecular mechanism for their in vivo roles.
Collapse
Affiliation(s)
- Jessica J. Mohsen
- Department of Chemistry, Yale University, New Haven, CT, USA
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
| | - Alina A. Martel
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
| | - Sarah A. Slavoff
- Department of Chemistry, Yale University, New Haven, CT, USA
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| |
Collapse
|
6
|
Fan KT, Hsu CW, Chen YR. Mass spectrometry in the discovery of peptides involved in intercellular communication: From targeted to untargeted peptidomics approaches. MASS SPECTROMETRY REVIEWS 2023; 42:2404-2425. [PMID: 35765846 DOI: 10.1002/mas.21789] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 03/17/2022] [Accepted: 04/08/2022] [Indexed: 06/15/2023]
Abstract
Endogenous peptide hormones represent an essential class of biomolecules, which regulate cell-cell communications in diverse physiological processes of organisms. Mass spectrometry (MS) has been developed to be a powerful technology for identifying and quantifying peptides in a highly efficient manner. However, it is difficult to directly identify these peptide hormones due to their diverse characteristics, dynamic regulations, low abundance, and existence in a complicated biological matrix. Here, we summarize and discuss the roles of targeted and untargeted MS in discovering peptide hormones using bioassay-guided purification, bioinformatics screening, or the peptidomics-based approach. Although the peptidomics approach is expected to discover novel peptide hormones unbiasedly, only a limited number of successful cases have been reported. The critical challenges and corresponding measures for peptidomics from the steps of sample preparation, peptide extraction, and separation to the MS data acquisition and analysis are also discussed. We also identify emerging technologies and methods that can be integrated into the discovery platform toward the comprehensive study of endogenous peptide hormones.
Collapse
Affiliation(s)
- Kai-Ting Fan
- Agricultural Biotechnology Research Center, Academia Sinica, Taipei, Taiwan
| | - Chia-Wei Hsu
- Agricultural Biotechnology Research Center, Academia Sinica, Taipei, Taiwan
| | - Yet-Ran Chen
- Agricultural Biotechnology Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
7
|
Guo W, Xiao Y, Fu X, Long Z, Wu Y, Lin Q, Ren K, Jiang L. Identification of novel α-glucosidase and ACE inhibitory peptides from Douchi using peptidomics approach and molecular docking. Food Chem X 2023; 19:100779. [PMID: 37780236 PMCID: PMC10534093 DOI: 10.1016/j.fochx.2023.100779] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 06/13/2023] [Accepted: 06/28/2023] [Indexed: 10/03/2023] Open
Abstract
In this study, the effect of Douchi extract (DWE) on α-glucosidase and angiotensin-converting enzymes (ACE) were investigated, and several novel peptides with inhibitory activity against α-glucosidase and ACE were identified using peptidomics approach based on UPLC-MS/MS. The average inhibition rates of DWE on α-glucosidase and ACE were 73.75-78.10% and 4.56-27.07%, respectively. In the DWE, a total of 710 peptides were detected. Two novel peptides with potential inhibitory activity against α-glucosidase were identified using the correlation analysis, database alignment and molecular docking methods. They were DVFRAIPSEVL and DRPSINGLAGAN, with the IC50 values of 0.121 and 0.128 mg/mL, respectively. Also, four novel peptides with potential inhibitory activity against ACE were identified: PSSPFTDLWD, EEQDERQFPF, PVPVPVQQAFPF and PSSPFTDL, with IC50 values of 1.388, 0.041, 0.761 and 0.097 mg/mL, respectively. These results indicated that combining peptidomics and molecular docking is an effective alternative strategy for rapidly screening numbers of novel bioactive peptides from foods.
Collapse
Affiliation(s)
- Weidan Guo
- College of Food Science and Engineering, Central South University of Forestry and Technology, Changsha 410004, China
| | - Yu Xiao
- College of Food Science and Engineering, Central South University of Forestry and Technology, Changsha 410004, China
| | - Xiangjin Fu
- College of Food Science and Engineering, Central South University of Forestry and Technology, Changsha 410004, China
- Nutrition and Health Products Engineering Technology Research Center of Hunan Province, Changsha 410004, China
- Hunan Provincial Engineering Technology Research Center of Seasonings Green Manufacturing, Changsha 410004, China
- Hunan Provincial Key Laboratory of Special Medical Food, Central South University of Forestry and Technology, Changsha 410004, China
| | - Zhao Long
- College of Food Science and Engineering, Central South University of Forestry and Technology, Changsha 410004, China
- Hunan Provincial Engineering Technology Research Center of Seasonings Green Manufacturing, Changsha 410004, China
- Hunan Provincial Key Laboratory of Special Medical Food, Central South University of Forestry and Technology, Changsha 410004, China
| | - Yue Wu
- College of Food Science and Engineering, Central South University of Forestry and Technology, Changsha 410004, China
- Hunan Provincial Key Laboratory of Special Medical Food, Central South University of Forestry and Technology, Changsha 410004, China
| | - Qinlu Lin
- College of Food Science and Engineering, Central South University of Forestry and Technology, Changsha 410004, China
- Nutrition and Health Products Engineering Technology Research Center of Hunan Province, Changsha 410004, China
- Hunan Provincial Engineering Technology Research Center of Seasonings Green Manufacturing, Changsha 410004, China
- Hunan Provincial Key Laboratory of Special Medical Food, Central South University of Forestry and Technology, Changsha 410004, China
| | - Kangzi Ren
- College of Food Science and Engineering, Central South University of Forestry and Technology, Changsha 410004, China
| | - Liwen Jiang
- College of Food Science and Technology, Hunan Agricultural University, Changsha 410128, China
| |
Collapse
|
8
|
Zhang L, Tang M, Diao H, Xiong L, Yang X, Xing S. LncRNA-encoded peptides: unveiling their significance in cardiovascular physiology and pathology-current research insights. Cardiovasc Res 2023; 119:2165-2178. [PMID: 37517040 DOI: 10.1093/cvr/cvad112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 06/17/2023] [Accepted: 06/30/2023] [Indexed: 08/01/2023] Open
Abstract
Long non-coding RNAs (lncRNAs), which are RNA transcripts exceeding 200 nucleotides were believed to lack any protein-coding capacity. But advancements in -omics technology have revealed that some lncRNAs have small open reading frames (sORFs) that can be translated by ribosomes to encode peptides, some of which have important biological functions. These encoded peptides subserve important biological functions by interacting with their targets to modulate transcriptional or signalling axes, thereby enhancing or suppressing cardiovascular disease (CVD) occurrence and progression. In this review, we summarize what is known about the research strategy of lncRNA-encoded peptides, mainly comprising predictive websites/tools and experimental methods that have been widely used for prediction, identification, and validation. More importantly, we have compiled a list of lncRNA- encoded peptides, with a focus on those that play significant roles in cardiovascular physiology and pathology, including ENSRNOT (RNO)-sORF6/RNO-sORF7/RNO-sORF8, dwarf open reading frame (DOWRF), myoregulin (NLN), etc. Additionally, we have outlined the functions and mechanisms of these peptides in cardiovascular physiology and pathology, such as cardiomyocyte hypertrophy, myocardial contraction, myocardial infarction, and vascular remodelling. Finally, an overview of the existing challenges and potential future developments in the realm of lncRNA-encoded peptides was provided, with consideration given to prospective avenues for further research. Given that many lncRNA-encoded peptides have not been functionally annotated yet, their application in CVD diagnosis and treatment still requires further research.
Collapse
Affiliation(s)
- Li Zhang
- Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
- Hongqiao International Institute of Medicine, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200336, China
| | - Mi Tang
- Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
| | - Haoyang Diao
- Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
| | - Liling Xiong
- Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
| | - Xiao Yang
- Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
| | - Shasha Xing
- Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
| |
Collapse
|
9
|
Kussmann M. Mass spectrometry as a lens into molecular human nutrition and health. EUROPEAN JOURNAL OF MASS SPECTROMETRY (CHICHESTER, ENGLAND) 2023; 29:370-379. [PMID: 37587732 DOI: 10.1177/14690667231193555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Mass spectrometry (MS) has developed over the last decades into the most informative and versatile analytical technology in molecular and structural biology (). The platform enables discovery, identification, and characterisation of non-volatile biomolecules, such as proteins, peptides, DNA, RNA, nutrients, metabolites, and lipids at both speed and scale and can elucidate their interactions and effects. The versatility, robustness, and throughput have rendered MS a major research and development platform in molecular human health and biomedical science. More recently, MS has also been established as the central tool for 'Molecular Nutrition', enabling comprehensive and rapid identification and characterisation of macro- and micronutrients, bioactives, and other food compounds. 'Molecular Nutrition' thereby helps understand bioaccessibility, bioavailability, and bioefficacy of macro- and micronutrients and related health effects. Hence, MS provides a lens through which the fate of nutrients can be monitored along digestion via absorption to metabolism. This in turn provides the bioanalytical foundation for 'Personalised Nutrition' or 'Precision Nutrition' in which design and development of diets and nutritional products is tailored towards consumer and patient groups sharing similar genetic and environmental predisposition, health/disease conditions and lifestyles, and/or objectives of performance and wellbeing. The next level of integrated nutrition science is now being built as 'Systems Nutrition' where public and personal health data are correlated with life condition and lifestyle factors, to establish directional relationships between nutrition, lifestyle, environment, and health, eventually translating into science-based public and personal heath recommendations and actions. This account provides a condensed summary of the contributions of MS to a precise, quantitative, and comprehensive nutrition and health science and sketches an outlook on its future role in this fascinating and relevant field.
Collapse
Affiliation(s)
- Martin Kussmann
- Abteilung Wissenschaft, Kompetenzzentrum für Ernährung (KErn), Germany
- Kussmann Biotech GmbH, Germany
| |
Collapse
|
10
|
Dong X, Zhang K, Xun C, Chu T, Liang S, Zeng Y, Liu Z. Small Open Reading Frame-Encoded Micro-Peptides: An Emerging Protein World. Int J Mol Sci 2023; 24:10562. [PMID: 37445739 DOI: 10.3390/ijms241310562] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 06/20/2023] [Accepted: 06/21/2023] [Indexed: 07/15/2023] Open
Abstract
Small open reading frames (sORFs) are often overlooked features in genomes. In the past, they were labeled as noncoding or "transcriptional noise". However, accumulating evidence from recent years suggests that sORFs may be transcribed and translated to produce sORF-encoded polypeptides (SEPs) with less than 100 amino acids. The vigorous development of computational algorithms, ribosome profiling, and peptidome has facilitated the prediction and identification of many new SEPs. These SEPs were revealed to be involved in a wide range of basic biological processes, such as gene expression regulation, embryonic development, cellular metabolism, inflammation, and even carcinogenesis. To effectively understand the potential biological functions of SEPs, we discuss the history and development of the newly emerging research on sORFs and SEPs. In particular, we review a range of recently discovered bioinformatics tools for identifying, predicting, and validating SEPs as well as a variety of biochemical experiments for characterizing SEP functions. Lastly, this review underlines the challenges and future directions in identifying and validating sORFs and their encoded micropeptides, providing a significant reference for upcoming research on sORF-encoded peptides.
Collapse
Affiliation(s)
- Xiaoping Dong
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Kun Zhang
- The State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha 410081, China
| | - Chengfeng Xun
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Tianqi Chu
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Songping Liang
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Yong Zeng
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
- The State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha 410081, China
| | - Zhonghua Liu
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| |
Collapse
|
11
|
Zhao S, Meng J, Wekesa JS, Luan Y. Identification of small open reading frames in plant lncRNA using class-imbalance learning. Comput Biol Med 2023; 157:106773. [PMID: 36924731 DOI: 10.1016/j.compbiomed.2023.106773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 02/21/2023] [Accepted: 03/09/2023] [Indexed: 03/12/2023]
Abstract
Recently, small open reading frames (sORFs) in long noncoding RNA (lncRNA) have been demonstrated to encode small peptides that can help study the mechanisms of growth and development in organisms. Since machine learning-based computational methods are less costly compared with biological experiments, they can be used to identify sORFs and provide a basis for biological experiments. However, few computational methods and data resources have been exploited for identifying sORFs in plant lncRNA. Besides, machine learning models produce underperforming classifiers when faced with a class-imbalance problem. In this study, an alternative method called SMOTE based on weighted cosine distance (WCDSMOTE) which enables interaction with feature selection is put forward to synthesize minority class samples and weighted edited nearest neighbor (WENN) is applied to clean up majority class samples, thus, hybrid sampling WCDSMOTE-ENN is proposed to deal with imbalanced datasets with the multi-angle feature. A heterogeneous classifier ensemble is introduced to complete the classification task. Therefore, a novel computational method that is based on class-imbalance learning to identify the sORFs with coding potential in plant lncRNA (sORFplnc) is presented. Experimental results manifest that sORFplnc outperforms existing computational methods in identifying sORFs with coding potential. We anticipate that the proposed work can be a reference for relevant research and contribute to agriculture and biomedicine.
Collapse
Affiliation(s)
- Siyuan Zhao
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, 116024, China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, 116024, China.
| | - Jael Sanyanda Wekesa
- Department of Information Technology, Jomo Kenyatta University of Agriculture and Technology, Nairobi, 62000-00200, Kenya
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, Liaoning, 116024, China
| |
Collapse
|
12
|
Cassidy L, Kaulich PT, Tholey A. Proteoforms expand the world of microproteins and short open reading frame-encoded peptides. iScience 2023; 26:106069. [PMID: 36818287 PMCID: PMC9929600 DOI: 10.1016/j.isci.2023.106069] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Microproteins and short open reading frame-encoded peptides (SEPs) can, like all proteins, carry numerous posttranslational modifications. Together with posttranscriptional processes, this leads to a high number of possible distinct protein molecules, the proteoforms, out of a limited number of genes. The identification, quantification, and molecular characterization of proteoforms possess special challenges to established, mainly bottom-up proteomics (BUP) based analytical approaches. While BUP methods are powerful, proteins have to be inferred rather than directly identified, which hampers the detection of proteoforms. An alternative approach is top-down proteomics (TDP) which allows to identify intact proteoforms. This perspective article provides a brief overview of modified microproteins and SEPs, introduces the proteoform terminology, and compares present BUP and TDP workflows highlighting their major advantages and caveats. Necessary future developments in TDP to fully accentuate its potential for proteoform-centric analytics of microproteins and SEPs will be discussed.
Collapse
Affiliation(s)
- Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Philipp T. Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany,Corresponding author
| |
Collapse
|
13
|
Complementary peptides represent a credible alternative to agrochemicals by activating translation of targeted proteins. Nat Commun 2023; 14:254. [PMID: 36650156 PMCID: PMC9845214 DOI: 10.1038/s41467-023-35951-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 01/10/2023] [Indexed: 01/18/2023] Open
Abstract
The current agriculture main challenge is to maintain food production while facing multiple threats such as increasing world population, temperature increase, lack of agrochemicals due to health issues and uprising of weeds resistant to herbicides. Developing novel, alternative, and safe methods is hence of paramount importance. Here, we show that complementary peptides (cPEPs) from any gene can be designed to target specifically plant coding genes. External application of synthetic peptides increases the abundance of the targeted protein, leading to related phenotypes. Moreover, we provide evidence that cPEPs can be powerful tools in agronomy to improve plant traits, such as growth, resistance to pathogen or heat stress, without the needs of genetic approaches. Finally, by combining their activity they can also be used to reduce weed growth.
Collapse
|
14
|
Yang JE, Zhong WJ, Li JF, Lin YY, Liu FT, Tian H, Chen YJ, Luo XY, Zhuang SM. LINC00998-encoded micropeptide SMIM30 promotes the G1/S transition of cell cycle by regulating cytosolic calcium level. Mol Oncol 2022; 17:901-916. [PMID: 36495128 PMCID: PMC10158777 DOI: 10.1002/1878-0261.13358] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Revised: 10/04/2022] [Accepted: 12/09/2022] [Indexed: 12/14/2022] Open
Abstract
The biological functions of short open reading frame (sORF)-encoded micropeptides remain largely unknown. Here, we report that LINC00998, a previously annotated lncRNA, was upregulated in multiple cancer types and the sORF on LINC00998 encoded a micropeptide named SMIM30. SMIM30 was localized in the membranes of the endoplasmic reticulum (ER) and mitochondria. Silencing SMIM30 inhibited the proliferation of hepatoma cells in vitro and suppressed the growth of tumor xenografts and N-nitrosodiethylamine-induced hepatoma. Overexpression of the 5'UTR-sORF sequence of LINC00998, encoding wild-type SMIM30, enhanced tumor cell growth, but this was abolished when a premature stop codon was introduced into the sORF via single-base deletion. Gain- and loss-of-function studies revealed that SMIM30 peptide but not LINC00998 reduced cytosolic calcium level, increased CDK4, cyclin E2, phosphorylated-Rb and E2F1, and promoted the G1/S phase transition and cell proliferation. The effect of SMIM30 silencing was attenuated by a calcium chelator or the agonist of sarco/endoplasmic reticulum calcium ATPase (SERCA) pump. These findings suggest a novel function of micropeptide SMIM30 in promoting G1/S transition and cell proliferation by enhancing SERCA activity and reducing cytosolic calcium level.
Collapse
Affiliation(s)
- Jin-E Yang
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Wang-Jing Zhong
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Jin-Feng Li
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Ying-Ying Lin
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Feng-Ting Liu
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Hao Tian
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Ya-Jing Chen
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Xiao-Yu Luo
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Shi-Mei Zhuang
- MOE Key Laboratory of Gene Function and Regulation, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
15
|
Aguilera A, Distéfano A, Jauzein C, Correa-Aragunde N, Martinez D, Martin MV, Sueldo DJ. Do photosynthetic cells communicate with each other during cell death? From cyanobacteria to vascular plants. JOURNAL OF EXPERIMENTAL BOTANY 2022; 73:7219-7242. [PMID: 36179088 DOI: 10.1093/jxb/erac363] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Accepted: 09/15/2022] [Indexed: 06/16/2023]
Abstract
As in metazoans, life in oxygenic photosynthetic organisms relies on the accurate regulation of cell death. During development and in response to the environment, photosynthetic cells activate and execute cell death pathways that culminate in the death of a specific group of cells, a process known as regulated cell death (RCD). RCD control is instrumental, as its misregulation can lead to growth penalties and even the death of the entire organism. Intracellular molecules released during cell demise may act as 'survival' or 'death' signals and control the propagation of cell death to surrounding cells, even in unicellular organisms. This review explores different signals involved in cell-cell communication and systemic signalling in photosynthetic organisms, in particular Ca2+, reactive oxygen species, lipid derivates, nitric oxide, and eATP. We discuss their possible mode-of-action as either 'survival' or 'death' molecules and their potential role in determining cell fate in neighbouring cells. By comparing the knowledge available across the taxonomic spectrum of this coherent phylogenetic group, from cyanobacteria to vascular plants, we aim at contributing to the identification of conserved mechanisms that control cell death propagation in oxygenic photosynthetic organisms.
Collapse
Affiliation(s)
- Anabella Aguilera
- Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, 39231 Kalmar, Sweden
| | - Ayelén Distéfano
- Instituto de Investigaciones Biológicas-CONICET, Universidad Nacional de Mar del Plata, 7600 Mar del Plata, Argentina
| | - Cécile Jauzein
- Ifremer, Centre de Brest, DYNECO-Pelagos, F-29280 Plouzané, France
| | - Natalia Correa-Aragunde
- Instituto de Investigaciones Biológicas-CONICET, Universidad Nacional de Mar del Plata, 7600 Mar del Plata, Argentina
| | - Dana Martinez
- Instituto de Fisiología Vegetal (INFIVE-CONICET), Universidad Nacional de La Plata, 1900 La Plata, Argentina
| | - María Victoria Martin
- Instituto de Investigaciones en Biodiversidad y Biotecnología (INBIOTEC-CONICET), Fundación para Investigaciones Biológicas Aplicadas (FIBA), Universidad Nacional de Mar del Plata,7600 Mar del Plata, Argentina
| | - Daniela J Sueldo
- Norwegian University of Science and Technology, 7491 Trondheim, Norway
| |
Collapse
|
16
|
Prediction, Discovery, and Characterization of Plant- and Food-Derived Health-Beneficial Bioactive Peptides. Nutrients 2022; 14:nu14224810. [PMID: 36432497 PMCID: PMC9697201 DOI: 10.3390/nu14224810] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 10/31/2022] [Accepted: 11/11/2022] [Indexed: 11/16/2022] Open
Abstract
Nature may have the answer to many of our questions about human, animal, and environmental health. Natural bioactives, especially when harvested from sustainable plant and food sources, provide a plethora of molecular solutions to nutritionally actionable, chronic conditions. The spectrum of these conditions, such as metabolic, immune, and gastrointestinal disorders, has changed with prolonged human life span, which should be matched with an appropriately extended health span, which would in turn favour more sustainable health care: "adding years to life and adding life to years". To date, bioactive peptides have been undervalued and underexploited as food ingredients and drugs. The future of translational science on bioactive peptides-and natural bioactives in general-is being built on (a) systems-level rather than reductionist strategies for understanding their interdependent, and at times synergistic, functions; and (b) the leverage of artificial intelligence for prediction and discovery, thereby significantly reducing the time from idea and concept to finished solutions for consumers and patients. This new strategy follows the path from benefit definition via design to prediction and, eventually, validation and production.
Collapse
|
17
|
Pérez-Robles R, Salmerón-García A, Clemente-Bautista S, Jiménez-Lozano I, Cabañas-Poy MJ, Cabeza J, Navas N. Method for identification and quantification of intact teduglutide peptide using (RP)UHPLC-UV-(HESI/ORBITRAP)MS. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2022; 14:4359-4369. [PMID: 36263764 DOI: 10.1039/d2ay01254e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Teduglutide (Revestive®, 10 mg mL-1) is a recombinant human glucagon-like peptide 2 analogue, used in the treatment of short bowel syndrome, a serious and highly disabling condition which results from either too small a length of intestine or loss of critical intestinal function. The determination of therapeutic compounds of protein-nature is always challenging due to their complex structure. In this work, we present a fast, straightforward reversed phase (RP)UHPLC-UV-(HESI/ORBITRAP)MS method for the identification and quantification of the intact teduglutide peptide. The method has been developed and validated in accordance with the International Council for Harmonization of Technical Requirements for Pharmaceuticals for Human Use (ICH) guidelines; therefore, linearity, limits of detection and quantification, accuracy (precision and trueness), robustness, system suitability and specificity using the signal from the UV and MS, have been evaluated. The validation performance parameters obtained from the UV and MS signals were compared throughout the work, to select the most suitable. To study the specificity of the method and the impact of medicine mishandling under hospital conditions, force degradation studies were performed, i.e. thermal (40 °C and 60 °C), shaking (mechanical) and light (accelerated exposition) effects. Identification by the exact mass of teduglutide was achieved and it was confirmed that the peptide does not undergo any post-translational modifications (PTMs). To the best of our knowledge, the present work reports the first method developed for the simultaneous identification, structural characterization, and quantification of the therapeutic teduglutide peptide. Finally, the proposed method is able to indicate stability when quantifying the intact teduglutide since detects and characterises the exact mass of the degradation/modification products.
Collapse
Affiliation(s)
- Raquel Pérez-Robles
- Instituto de Investigación Biosanitaria de Granada (ibs.GRANADA), Granada, Spain
- Department of Analytical Chemistry, Science Faculty, University of Granada, Granada, Spain
- Fundación para la Investigación Biosanitaria de Andalucía Oriental-Alejandro Otero, Granada, Spain
| | - Antonio Salmerón-García
- Instituto de Investigación Biosanitaria de Granada (ibs.GRANADA), Granada, Spain
- Department of Clinical Pharmacy, San Cecilio University Hospital, Granada, Spain
| | | | - Inés Jiménez-Lozano
- Maternal and Child Pharmacy Service, Vall d'Hebron Hospital, Pharmacy, Barcelona, Spain
| | | | - Jose Cabeza
- Instituto de Investigación Biosanitaria de Granada (ibs.GRANADA), Granada, Spain
- Department of Clinical Pharmacy, San Cecilio University Hospital, Granada, Spain
| | - Natalia Navas
- Instituto de Investigación Biosanitaria de Granada (ibs.GRANADA), Granada, Spain
- Department of Analytical Chemistry, Science Faculty, University of Granada, Granada, Spain
| |
Collapse
|
18
|
Peptidomics as a tool to analyze endogenous peptides in milk and milk-related peptides. FOOD BIOSCI 2022. [DOI: 10.1016/j.fbio.2022.102199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
|
19
|
Sruthi KB, Menon A, P A, Vasudevan Soniya E. Pervasive translation of small open reading frames in plant long non-coding RNAs. FRONTIERS IN PLANT SCIENCE 2022; 13:975938. [PMID: 36352887 PMCID: PMC9638090 DOI: 10.3389/fpls.2022.975938] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 09/29/2022] [Indexed: 06/16/2023]
Abstract
Long non-coding RNAs (lncRNAs) are primarily recognized as non-coding transcripts longer than 200 nucleotides with low coding potential and are present in both eukaryotes and prokaryotes. Recent findings reveal that lncRNAs can code for micropeptides in various species. Micropeptides are generated from small open reading frames (smORFs) and have been discovered frequently in short mRNAs and non-coding RNAs, such as lncRNAs, circular RNAs, and pri-miRNAs. The most accepted definition of a smORF is an ORF containing fewer than 100 codons, and ribosome profiling and mass spectrometry are the most prevalent experimental techniques used to identify them. Although the majority of micropeptides perform critical roles throughout plant developmental processes and stress conditions, only a handful of their functions have been verified to date. Even though more research is being directed toward identifying micropeptides, there is still a dearth of information regarding these peptides in plants. This review outlines the lncRNA-encoded peptides, the evolutionary roles of such peptides in plants, and the techniques used to identify them. It also describes the functions of the pri-miRNA and circRNA-encoded peptides that have been identified in plants.
Collapse
|
20
|
Sun Y, Huang J, Wang Z, Pan N, Wan C. Identification of Microproteins in Saccharomyces cerevisiae under Different Stress Conditions. J Proteome Res 2022; 21:1939-1947. [PMID: 35838590 DOI: 10.1021/acs.jproteome.2c00212] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Small open reading frame-encoded peptides (SEPs) are microproteins with a length of 100 amino acids or less, which may play a critical role in maintaining cell homeostasis under stress. Therefore, we used mass spectrometry-based proteomics to explore microproteins potentially involved in cellular stress responses in Saccharomyces cerevisiae. A total of 225 microproteins with 1920 unique peptides were identified under six culture conditions: normal, oxidation, starvation, ultraviolet radiation, heat shock, and heat shock with starvation. Among these microproteins, we found 70 SEPs with 75 unique peptides. The annotated microproteins are involved in stress-related processes, such as cell redox reactions, cell wall modification, protein folding and degradation, and DNA damage repair. It suggests that SEPs may also play similar functions under stress conditions. For example, SEP IP_008057, translated from a short coding sequence of YJL159W, may play a role in heat shock. This study identified stress-responsive SEPs in S. cerevisiae and provided valuable information to determine the functions of these proteins, which enrich the genome and proteome of S. cerevisiae and show clues to improving the stress tolerance of S. cerevisiae.
Collapse
Affiliation(s)
- Yan Sun
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei 430079, People's Republic of China
| | - Jiangmei Huang
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei 430079, People's Republic of China
| | - Zhiwei Wang
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei 430079, People's Republic of China
| | - Ni Pan
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei 430079, People's Republic of China
| | - Cuihong Wan
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei 430079, People's Republic of China
| |
Collapse
|
21
|
Pan J, Wang R, Shang F, Ma R, Rong Y, Zhang Y. Functional Micropeptides Encoded by Long Non-Coding RNAs: A Comprehensive Review. Front Mol Biosci 2022; 9:817517. [PMID: 35769907 PMCID: PMC9234465 DOI: 10.3389/fmolb.2022.817517] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 05/24/2022] [Indexed: 12/03/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) were originally defined as non-coding RNAs (ncRNAs) which lack protein-coding ability. However, with the emergence of technologies such as ribosome profiling sequencing and ribosome-nascent chain complex sequencing, it has been demonstrated that most lncRNAs have short open reading frames hence the potential to encode functional micropeptides. Such micropeptides have been described to be widely involved in life-sustaining activities in several organisms, such as homeostasis regulation, disease, and tumor occurrence, and development, and morphological development of animals, and plants. In this review, we focus on the latest developments in the field of lncRNA-encoded micropeptides, and describe the relevant computational tools and techniques for micropeptide prediction and identification. This review aims to serve as a reference for future research studies on lncRNA-encoded micropeptides.
Collapse
Affiliation(s)
- Jianfeng Pan
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Ruijun Wang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
- Key Laboratory of Mutton Sheep Genetics and Breeding, Ministry of Agriculture, Hohhot, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Hohhot, China
- Engineering Research Center for Goat Genetics and Breeding, Hohhot, China
| | - Fangzheng Shang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Rong Ma
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Youjun Rong
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Yanjun Zhang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
- Key Laboratory of Mutton Sheep Genetics and Breeding, Ministry of Agriculture, Hohhot, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Hohhot, China
- Engineering Research Center for Goat Genetics and Breeding, Hohhot, China
- *Correspondence: Yanjun Zhang,
| |
Collapse
|
22
|
Fabre B, Choteau SA, Duboé C, Pichereaux C, Montigny A, Korona D, Deery MJ, Camus M, Brun C, Burlet-Schiltz O, Russell S, Combier JP, Lilley KS, Plaza S. In Depth Exploration of the Alternative Proteome of Drosophila melanogaster. Front Cell Dev Biol 2022; 10:901351. [PMID: 35721519 PMCID: PMC9204603 DOI: 10.3389/fcell.2022.901351] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 04/25/2022] [Indexed: 12/13/2022] Open
Abstract
Recent studies have shown that hundreds of small proteins were occulted when protein-coding genes were annotated. These proteins, called alternative proteins, have failed to be annotated notably due to the short length of their open reading frame (less than 100 codons) or the enforced rule establishing that messenger RNAs (mRNAs) are monocistronic. Several alternative proteins were shown to be biologically active molecules and seem to be involved in a wide range of biological functions. However, genome-wide exploration of the alternative proteome is still limited to a few species. In the present article, we describe a deep peptidomics workflow which enabled the identification of 401 alternative proteins in Drosophila melanogaster. Subcellular localization, protein domains, and short linear motifs were predicted for 235 of the alternative proteins identified and point toward specific functions of these small proteins. Several alternative proteins had approximated abundances higher than their canonical counterparts, suggesting that these alternative proteins are actually the main products of their corresponding genes. Finally, we observed 14 alternative proteins with developmentally regulated expression patterns and 10 induced upon the heat-shock treatment of embryos, demonstrating stage or stress-specific production of alternative proteins.
Collapse
Affiliation(s)
- Bertrand Fabre
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, INP, CNRS, Auzeville-Tolosane, France,Cambridge Centre for Proteomics, Cambridge Systems Biology Centre and Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom,*Correspondence: Bertrand Fabre, ; Serge Plaza,
| | - Sebastien A. Choteau
- Aix-Marseille Université, INSERM, TAGC, Turing Centre for Living Systems, Marseille, France
| | - Carine Duboé
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, INP, CNRS, Auzeville-Tolosane, France
| | - Carole Pichereaux
- Fédération de Recherche (FR3450), Agrobiosciences, Interactions et Biodiversité (AIB), CNRS, Toulouse, France,Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, UPS, Toulouse, France,Infrastructure Nationale de Protéomique, ProFI, FR 2048, Toulouse, France
| | - Audrey Montigny
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, INP, CNRS, Auzeville-Tolosane, France
| | - Dagmara Korona
- Cambridge Systems Biology Centre and Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Michael J. Deery
- Cambridge Centre for Proteomics, Cambridge Systems Biology Centre and Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Mylène Camus
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, UPS, Toulouse, France,Infrastructure Nationale de Protéomique, ProFI, FR 2048, Toulouse, France
| | - Christine Brun
- Aix-Marseille Université, INSERM, TAGC, Turing Centre for Living Systems, Marseille, France,CNRS, Marseille, France
| | - Odile Burlet-Schiltz
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, UPS, Toulouse, France,Infrastructure Nationale de Protéomique, ProFI, FR 2048, Toulouse, France
| | - Steven Russell
- Cambridge Systems Biology Centre and Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Jean-Philippe Combier
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, INP, CNRS, Auzeville-Tolosane, France
| | - Kathryn S. Lilley
- Cambridge Centre for Proteomics, Cambridge Systems Biology Centre and Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Serge Plaza
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, INP, CNRS, Auzeville-Tolosane, France,*Correspondence: Bertrand Fabre, ; Serge Plaza,
| |
Collapse
|
23
|
Zhang Z, Li Y, Yuan W, Wang Z, Wan C. Proteomic-driven identification of short open reading frame-encoded peptides. Proteomics 2022; 22:e2100312. [PMID: 35384297 DOI: 10.1002/pmic.202100312] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 03/29/2022] [Accepted: 03/30/2022] [Indexed: 11/10/2022]
Abstract
Accumulating evidence has shown that a large number of short open reading frames (sORFs) also have the ability to encode proteins. The discovery of sORFs opens up a new research area, leading to the identification and functional study of sORF encoded peptides (SEPs) at the omics level. Besides bioinformatics prediction and ribosomal profiling, mass spectrometry (MS) has become a significant tool as it directly detects the sequence of SEPs. Though MS-based proteomics methods have proved to be effective for qualitative and quantitative analysis of SEPs, the detection of SEPs is still a great challenge due to their low abundance and short sequence. To illustrate the progress in method development, we described and discussed the main steps of large-scale proteomics identification of SEPs, including SEP extraction and enrichment, MS detection, data processing and quality control, quantification, and function prediction and validation methods. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Zheng Zhang
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei, 430079, People's Republic of China
| | - Yujie Li
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei, 430079, People's Republic of China
| | - Wenqian Yuan
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei, 430079, People's Republic of China
| | - Zhiwei Wang
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei, 430079, People's Republic of China
| | - Cuihong Wan
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei, 430079, People's Republic of China
| |
Collapse
|
24
|
Leong AZX, Lee PY, Mohtar MA, Syafruddin SE, Pung YF, Low TY. Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures. J Biomed Sci 2022; 29:19. [PMID: 35300685 PMCID: PMC8928697 DOI: 10.1186/s12929-022-00802-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Accepted: 03/09/2022] [Indexed: 12/17/2022] Open
Abstract
A short open reading frame (sORFs) constitutes ≤ 300 bases, encoding a microprotein or sORF-encoded protein (SEP) which comprises ≤ 100 amino acids. Traditionally dismissed by genome annotation pipelines as meaningless noise, sORFs were found to possess coding potential with ribosome profiling (RIBO-Seq), which unveiled sORF-based transcripts at various genome locations. Nonetheless, the existence of corresponding microproteins that are stable and functional was little substantiated by experimental evidence initially. With recent advancements in multi-omics, the identification, validation, and functional characterisation of sORFs and microproteins have become feasible. In this review, we discuss the history and development of an emerging research field of sORFs and microproteins. In particular, we focus on an array of bioinformatics and OMICS approaches used for predicting, sequencing, validating, and characterizing these recently discovered entities. These strategies include RIBO-Seq which detects sORF transcripts via ribosome footprints, and mass spectrometry (MS)-based proteomics for sequencing the resultant microproteins. Subsequently, our discussion extends to the functional characterisation of microproteins by incorporating CRISPR/Cas9 screen and protein–protein interaction (PPI) studies. Our review discusses not only detection methodologies, but we also highlight on the challenges and potential solutions in identifying and validating sORFs and their microproteins. The novelty of this review lies within its validation for the functional role of microproteins, which could contribute towards the future landscape of microproteomics.
Collapse
Affiliation(s)
- Alyssa Zi-Xin Leong
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia
| | - Pey Yee Lee
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia
| | - M Aiman Mohtar
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia
| | - Saiful Effendi Syafruddin
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia
| | - Yuh-Fen Pung
- Division of Biomedical Science, School of Pharmacy, University of Nottingham Malaysia, Semenyih, 43500, Selangor, Malaysia
| | - Teck Yew Low
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia.
| |
Collapse
|
25
|
Li B, Zhang Z, Wan C. Identification of Microproteins in Hep3B Cells at Different Cell Cycle Stages. J Proteome Res 2022; 21:1052-1060. [PMID: 35199523 DOI: 10.1021/acs.jproteome.1c00926] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Microproteins are generated from small open reading frames and turn out to play various vital biological functions. As an essential biological event of eukaryotic cells, the cell cycle is involved in cell replication and division. For such a highly regulated event, microproteins associated with cell cycle regulation remained unclarified. Utilizing a combination of bottom-up and top-down proteomics, we analyzed microproteins at specific cell cycle stages of Hep3B cells. A total of 657 microproteins were identified under three cell cycle stages, including 151 in the G0/G1 stage, 163 in the S stage, and 132 in the G2/M stage. The annotation of these microproteins showed their cell cycle-specific functions, such as translation, nuclear assembly, chromatin organization, and the G2/M transition of the mitotic cell cycle. Meanwhile, more than 50% of identified microproteins were ncRNA-encoded. These nonannotated novel microproteins contain several function domains, such as the nucleoside diphosphate kinase domain, the high mobility group domain, and the DNA-binding domain. This suggested the potential functions of these novel microproteins in specific cell cycle stages. This study presented a large-scale profile of microproteins at different cell cycle stages from Hep3B and may provide new perspectives on the regulation mechanism of the cell cycle. Liquid chromatography-mass spectrometry data were deposited to ProteomeXchange using the identifier PXD030286.
Collapse
Affiliation(s)
- Bing Li
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei 430079, People's Republic of China
| | - Zheng Zhang
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei 430079, People's Republic of China
| | - Cuihong Wan
- School of Life Sciences and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan, Hubei 430079, People's Republic of China
| |
Collapse
|
26
|
Kute PM, Soukarieh O, Tjeldnes H, Trégouët DA, Valen E. Small Open Reading Frames, How to Find Them and Determine Their Function. Front Genet 2022; 12:796060. [PMID: 35154250 PMCID: PMC8831751 DOI: 10.3389/fgene.2021.796060] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 12/30/2021] [Indexed: 12/12/2022] Open
Abstract
Advances in genomics and molecular biology have revealed an abundance of small open reading frames (sORFs) across all types of transcripts. While these sORFs are often assumed to be non-functional, many have been implicated in physiological functions and a significant number of sORFs have been described in human diseases. Thus, sORFs may represent a hidden repository of functional elements that could serve as therapeutic targets. Unlike protein-coding genes, it is not necessarily the encoded peptide of an sORF that enacts its function, sometimes simply the act of translating an sORF might have a regulatory role. Indeed, the most studied sORFs are located in the 5′UTRs of coding transcripts and can have a regulatory impact on the translation of the downstream protein-coding sequence. However, sORFs have also been abundantly identified in non-coding RNAs including lncRNAs, circular RNAs and ribosomal RNAs suggesting that sORFs may be diverse in function. Of the many different experimental methods used to discover sORFs, the most commonly used are ribosome profiling and mass spectrometry. These can confirm interactions between transcripts and ribosomes and the production of a peptide, respectively. Extensions to ribosome profiling, which also capture scanning ribosomes, have further made it possible to see how sORFs impact the translation initiation of mRNAs. While high-throughput techniques have made the identification of sORFs less difficult, defining their function, if any, is typically more challenging. Together, the abundance and potential function of many of these sORFs argues for the necessity of including sORFs in gene annotations and systematically characterizing these to understand their potential functional roles. In this review, we will focus on the high-throughput methods used in the detection and characterization of sORFs and discuss techniques for validation and functional characterization.
Collapse
Affiliation(s)
- Preeti Madhav Kute
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
- Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen, Norway
| | - Omar Soukarieh
- Department of Molecular Epidemiology Of Vascular and Brain Disorders, INSERM, BPH, U1219, University of Bordeaux, Bordeaux, France
| | - Håkon Tjeldnes
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - David-Alexandre Trégouët
- Department of Molecular Epidemiology Of Vascular and Brain Disorders, INSERM, BPH, U1219, University of Bordeaux, Bordeaux, France
| | - Eivind Valen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
- Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen, Norway
- *Correspondence: Eivind Valen,
| |
Collapse
|
27
|
Chen L, Yang Y, Zhang Y, Li K, Cai H, Wang H, Zhao Q. The Small Open Reading Frame-Encoded Peptides: Advances in Methodologies and Functional Studies. Chembiochem 2021; 23:e202100534. [PMID: 34862721 DOI: 10.1002/cbic.202100534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 11/15/2021] [Indexed: 11/07/2022]
Abstract
Small open reading frames (sORFs) are an important class of genes with less than 100 codons. They were historically annotated as noncoding or even junk sequences. In recent years, accumulating evidence suggests that sORFs could encode a considerable number of polypeptides, many of which play important roles in both physiology and disease pathology. However, it has been technically challenging to directly detect sORF-encoded peptides (SEPs). Here, we discuss the latest advances in methodologies for identifying SEPs with mass spectrometry, as well as the progress on functional studies of SEPs.
Collapse
Affiliation(s)
- Lei Chen
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China.,Laboratory for Synthetic Chemistry and Chemical Biology Limited, Hong Kong Science and Technology Park, New Territories, Hong Kong SAR, 999077, P. R. China
| | - Ying Yang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Yuanliang Zhang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Kecheng Li
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Hongmin Cai
- School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510623, P. R. China
| | - Hongwei Wang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, 510623, P. R. China
| | - Qian Zhao
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| |
Collapse
|
28
|
Peeters MKR, Baggerman G, Gabriels R, Pepermans E, Menschaert G, Boonen K. Ion Mobility Coupled to a Time-of-Flight Mass Analyzer Combined With Fragment Intensity Predictions Improves Identification of Classical Bioactive Peptides and Small Open Reading Frame-Encoded Peptides. Front Cell Dev Biol 2021; 9:720570. [PMID: 34604223 PMCID: PMC8484717 DOI: 10.3389/fcell.2021.720570] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 08/25/2021] [Indexed: 12/29/2022] Open
Abstract
Bioactive peptides exhibit key roles in a wide variety of complex processes, such as regulation of body weight, learning, aging, and innate immune response. Next to the classical bioactive peptides, emerging from larger precursor proteins by specific proteolytic processing, a new class of peptides originating from small open reading frames (sORFs) have been recognized as important biological regulators. But their intrinsic properties, specific expression pattern and location on presumed non-coding regions have hindered the full characterization of the repertoire of bioactive peptides, despite their predominant role in various pathways. Although the development of peptidomics has offered the opportunity to study these peptides in vivo, it remains challenging to identify the full peptidome as the lack of cleavage enzyme specification and large search space complicates conventional database search approaches. In this study, we introduce a proteogenomics methodology using a new type of mass spectrometry instrument and the implementation of machine learning tools toward improved identification of potential bioactive peptides in the mouse brain. The application of trapped ion mobility spectrometry (tims) coupled to a time-of-flight mass analyzer (TOF) offers improved sensitivity, an enhanced peptide coverage, reduction in chemical noise and the reduced occurrence of chimeric spectra. Subsequent machine learning tools MS2PIP, predicting fragment ion intensities and DeepLC, predicting retention times, improve the database searching based on a large and comprehensive custom database containing both sORFs and alternative ORFs. Finally, the identification of peptides is further enhanced by applying the post-processing semi-supervised learning tool Percolator. Applying this workflow, the first peptidomics workflow combined with spectral intensity and retention time predictions, we identified a total of 167 predicted sORF-encoded peptides, of which 48 originating from presumed non-coding locations, next to 401 peptides from known neuropeptide precursors, linked to 66 annotated bioactive neuropeptides from within 22 different families. Additional PEAKS analysis expanded the pool of SEPs on presumed non-coding locations to 84, while an additional 204 peptides completed the list of peptides from neuropeptide precursors. Altogether, this study provides insights into a new robust pipeline that fuses technological advancements from different fields ensuring an improved coverage of the neuropeptidome in the mouse brain.
Collapse
Affiliation(s)
- Marlies K. R. Peeters
- BioBix, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Geert Baggerman
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Unit Environmental Risk and Health, Flemish Institute for Technological Research, Mol, Belgium
| | - Ralf Gabriels
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- VIB-UGent Center for Medical Biotechnology, Flanders Institute for Biotechnology, Ghent, Belgium
| | - Elise Pepermans
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Unit Environmental Risk and Health, Flemish Institute for Technological Research, Mol, Belgium
| | - Gerben Menschaert
- BioBix, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
- OHMX.bio, Ghent, Belgium
| | - Kurt Boonen
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Unit Environmental Risk and Health, Flemish Institute for Technological Research, Mol, Belgium
| |
Collapse
|
29
|
Pan N, Wang Z, Wang B, Wan J, Wan C. Mapping Microproteins and ncRNA-Encoded Polypeptides in Different Mouse Tissues. Front Cell Dev Biol 2021; 9:687748. [PMID: 34381774 PMCID: PMC8350139 DOI: 10.3389/fcell.2021.687748] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 06/30/2021] [Indexed: 12/30/2022] Open
Abstract
Small open reading frame encoded peptides (SEPs), also called microproteins, play a vital role in biological processes. Plenty of their open reading frames are located within the non-coding RNA (ncRNA) range. Recent research has demonstrated that ncRNA-encoded polypeptides have essential functions and exist ubiquitously in various tissues. To better understand the role of microproteins, especially ncRNA-encoded proteins, expressed in different tissues, we profiled the proteomic characterization of five mouse tissues by mass spectrometry, including bottom-up, top-down, and de novo sequencing strategies. Bottom-up and top-down with database-dependent searches identified 811 microproteins in the OpenProt database. De novo sequencing identified 290 microproteins, including 12 ncRNA-encoded microproteins that were not found in current databases. In this study, we discovered 1,074 microproteins in total, including 270 ncRNA-encoded microproteins. From the annotation of these microproteins, we found that the brain contains the largest number of neuropeptides, while the spleen contains the most immunoassociated microproteins. This suggests that microproteins in different tissues have tissue-specific functions. These unannotated ncRNA-coded microproteins have predicted domains, such as the macrophage migration inhibitory factor domain and the Prefoldin domain. These results expand the mouse proteome and provide insight into the molecular biology of mouse tissues.
Collapse
Affiliation(s)
- Ni Pan
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan, China
| | - Zhiwei Wang
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan, China
| | - Bing Wang
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan, China
| | - Jian Wan
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan, China
| | - Cuihong Wan
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan, China
| |
Collapse
|
30
|
Cassidy L, Kaulich PT, Maaß S, Bartel J, Becher D, Tholey A. Bottom-up and top-down proteomic approaches for the identification, characterization, and quantification of the low molecular weight proteome with focus on short open reading frame-encoded peptides. Proteomics 2021; 21:e2100008. [PMID: 34145981 DOI: 10.1002/pmic.202100008] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 06/09/2021] [Accepted: 06/09/2021] [Indexed: 01/14/2023]
Abstract
The recent discovery of alternative open reading frames creates a need for suitable analytical approaches to verify their translation and to characterize the corresponding gene products at the molecular level. As the analysis of small proteins within a background proteome by means of classical bottom-up proteomics is challenging, method development for the analysis of small open reading frame encoded peptides (SEPs) have become a focal point for research. Here, we highlight bottom-up and top-down proteomics approaches established for the analysis of SEPs in both pro- and eukaryotes. Major steps of analysis, including sample preparation and (small) proteome isolation, separation and mass spectrometry, data interpretation and quality control, quantification, the analysis of post-translational modifications, and exploration of functional aspects of the SEPs by means of proteomics technologies are described. These methods do not exclusively cover the analytics of SEPs but simultaneously include the low molecular weight proteome, and moreover, can also be used for the proteome-wide analysis of proteolytic processing events.
Collapse
Affiliation(s)
- Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Philipp T Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Sandra Maaß
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, Greifswald, Germany
| | - Jürgen Bartel
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, Greifswald, Germany
| | - Dörte Becher
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, Greifswald, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| |
Collapse
|