1
|
Beals J, Hu H, Li X. A survey of experimental and computational identification of small proteins. Brief Bioinform 2024; 25:bbae345. [PMID: 39007598 PMCID: PMC11247407 DOI: 10.1093/bib/bbae345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 05/27/2024] [Accepted: 07/02/2024] [Indexed: 07/16/2024] Open
Abstract
Small proteins (SPs) are typically characterized as eukaryotic proteins shorter than 100 amino acids and prokaryotic proteins shorter than 50 amino acids. Historically, they were disregarded because of the arbitrary size thresholds to define proteins. However, recent research has revealed the existence of many SPs and their crucial roles. Despite this, the identification of SPs and the elucidation of their functions are still in their infancy. To pave the way for future SP studies, we briefly introduce the limitations and advancements in experimental techniques for SP identification. We then provide an overview of available computational tools for SP identification, their constraints, and their evaluation. Additionally, we highlight existing resources for SP research. This survey aims to initiate further exploration into SPs and encourage the development of more sophisticated computational tools for SP identification in prokaryotes and microbiomes.
Collapse
Affiliation(s)
- Joshua Beals
- Burnett School of Biomedical Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| | - Haiyan Hu
- Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| | - Xiaoman Li
- Burnett School of Biomedical Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| |
Collapse
|
2
|
Sinha PR, Balasubramanian R, Hegde SR. Integrated sequence and -omic features reveal novel small proteome of Mycobacterium tuberculosis. Front Microbiol 2024; 15:1335310. [PMID: 38812687 PMCID: PMC11133741 DOI: 10.3389/fmicb.2024.1335310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Accepted: 04/15/2024] [Indexed: 05/31/2024] Open
Abstract
Bioinformatic studies on small proteins are under-represented due to difficulties in annotation posed by their small size. However, recent discoveries emphasize the functional significance of small proteins in cellular processes including cell signaling, metabolism, and adaptation to stress. In this study, we utilized a Random Forest classifier trained on sequence features, RNA-Seq, and Ribo-Seq data to uncover small proteins (smORFs) in M. tuberculosis. Independent predictions for the exponential and starvation conditions resulted in 695 potential smORFs. We examined the functional implications of these smORFs using homology searches, LC-MS/MS, and ChIP-seq data, testing their expression in diverse growth conditions, and identifying protein domains. We provide evidence that some of these smORFs could be part of operons, or exist as upstream ORFs. This expanded data resource for the proteins of M. tuberculosis would aid in fine-tuning the existing protein and gene regulatory networks, thereby improving system-wide studies. The primary goal of this study was to uncover and characterize smORFs in M. tuberculosis through bioinformatic analysis, shedding light on their functional roles and genomic organization. Further investigation of these potential smORFs would provide valuable insights into the genome organization and functional diversity of the M. tuberculosis proteome.
Collapse
Affiliation(s)
| | | | - Shubhada R. Hegde
- Institute of Bioinformatics and Applied Biotechnology (IBAB), Bengaluru, India
| |
Collapse
|
3
|
Iwaniak A, Minkiewicz P, Darewicz M. Bioinformatics and bioactive peptides from foods: Do they work together? ADVANCES IN FOOD AND NUTRITION RESEARCH 2024; 108:35-111. [PMID: 38461003 DOI: 10.1016/bs.afnr.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2024]
Abstract
We live in the Big Data Era which affects many aspects of science, including research on bioactive peptides derived from foods, which during the last few decades have been a focus of interest for scientists. These two issues, i.e., the development of computer technologies and progress in the discovery of novel peptides with health-beneficial properties, are closely interrelated. This Chapter presents the example applications of bioinformatics for studying biopeptides, focusing on main aspects of peptide analysis as the starting point, including: (i) the role of peptide databases; (ii) aspects of bioactivity prediction; (iii) simulation of peptide release from proteins. Bioinformatics can also be used for predicting other features of peptides, including ADMET, QSAR, structure, and taste. To answer the question asked "bioinformatics and bioactive peptides from foods: do they work together?", currently it is almost impossible to find examples of peptide research with no bioinformatics involved. However, theoretical predictions are not equivalent to experimental work and always require critical scrutiny. The aspects of compatibility of in silico and in vitro results are also summarized herein.
Collapse
Affiliation(s)
- Anna Iwaniak
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland.
| | - Piotr Minkiewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| | - Małgorzata Darewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| |
Collapse
|
4
|
In Silico and In Vitro Analyses Reveal Promising Antimicrobial Peptides from Myxobacteria. Probiotics Antimicrob Proteins 2023; 15:202-214. [PMID: 36586039 PMCID: PMC9839799 DOI: 10.1007/s12602-022-10036-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/19/2022] [Indexed: 01/01/2023]
Abstract
Antimicrobial resistance (AMR) is a global concern, and as soon as new antibiotics are introduced, resistance to those agents emerges. Therefore, there is an increased appetite for alternative antimicrobial agents to traditional antibiotics. Here, we used in silico methods to investigate potential antimicrobial peptides (AMPs) from predatory myxobacteria. Six hundred seventy-two potential AMP sequences were extracted from eight complete myxobacterial genomes. Most putative AMPs were predicted to be active against Klebsiella pneumoniae with least activity being predicted against Staphylococcus aureus. One hundred seventeen AMPs (defined here as 'potent putative AMPs') were predicted to have very good activity against more than two bacterial pathogens, and these were characterized further in silico. All potent putative AMPs were predicted to have anti-inflammatory and antifungal properties, but none was predicted to be active against viruses. Twenty six (22%) of them were predicted to be hemolytic to human erythrocytes, five were predicted to have anticancer properties, and 56 (47%) were predicted to be biofilm active. In vitro assays using four synthesized AMPs showed high MIC values (e.g. So_ce_56_913 250 µg/ml and Coral_AMP411 125 µg/ml against E. coli). However, antibiofilm assays showed a substantial reduction in numbers (e.g. Coral_AMP411 and Myxo_mac104 showed a 69% and 73% reduction, respectively, at the lowest concentration against E. coli) compared to traditional antibiotics. Fourteen putative AMPs had high sequence similarity to proteins which were functionally associated with proteins of known function. The myxobacterial genomes also possessed a variety of biosynthetic gene clusters (BGCs) that can encode antimicrobial secondary metabolites, but their numbers did not correlate with those of the AMPs. We suggest that AMPs from myxobacteria are a promising source of novel antimicrobial agents with a plethora of biological properties.
Collapse
|
5
|
López-García G, Dublan-García O, Arizmendi-Cotero D, Gómez Oliván LM. Antioxidant and Antimicrobial Peptides Derived from Food Proteins. Molecules 2022; 27:1343. [PMID: 35209132 PMCID: PMC8878547 DOI: 10.3390/molecules27041343] [Citation(s) in RCA: 54] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 02/11/2022] [Accepted: 02/13/2022] [Indexed: 12/12/2022] Open
Abstract
Recently, the demand for food proteins in the market has increased due to a rise in degenerative illnesses that are associated with the excessive production of free radicals and the unwanted side effects of various drugs, for which researchers have suggested diets rich in bioactive compounds. Some of the functional compounds present in foods are antioxidant and antimicrobial peptides, which are used to produce foods that promote health and to reduce the consumption of antibiotics. These peptides have been obtained from various sources of proteins, such as foods and agri-food by-products, via enzymatic hydrolysis and microbial fermentation. Peptides with antioxidant properties exert effective metal ion (Fe2+/Cu2+) chelating activity and lipid peroxidation inhibition, which may lead to notably beneficial effects in promoting human health and food processing. Antimicrobial peptides are small oligo-peptides generally containing from 10 to 100 amino acids, with a net positive charge and an amphipathic structure; they are the most important components of the antibacterial defense of organisms at almost all levels of life-bacteria, fungi, plants, amphibians, insects, birds and mammals-and have been suggested as natural compounds that neutralize the toxicity of reactive oxygen species generated by antibiotics and the stress generated by various exogenous sources. This review discusses what antioxidant and antimicrobial peptides are, their source, production, some bioinformatics tools used for their obtainment, emerging technologies, and health benefits.
Collapse
Affiliation(s)
- Guadalupe López-García
- Food and Environmental Toxicology Laboratory, Chemistry Faculty, Universidad Autónoma del Estado de México, Paseo Colón Intersección Paseo Tollocan s/n. Col. Residencial Colón, Toluca 50120, Mexico; (G.L.-G.); (L.M.G.O.)
| | - Octavio Dublan-García
- Food and Environmental Toxicology Laboratory, Chemistry Faculty, Universidad Autónoma del Estado de México, Paseo Colón Intersección Paseo Tollocan s/n. Col. Residencial Colón, Toluca 50120, Mexico; (G.L.-G.); (L.M.G.O.)
| | - Daniel Arizmendi-Cotero
- Department of Industrial Engineering, Engineering Faculty, Campus Toluca, Universidad Tecnológica de México (UNITEC), Estado de México, Toluca 50160, Mexico;
| | - Leobardo Manuel Gómez Oliván
- Food and Environmental Toxicology Laboratory, Chemistry Faculty, Universidad Autónoma del Estado de México, Paseo Colón Intersección Paseo Tollocan s/n. Col. Residencial Colón, Toluca 50120, Mexico; (G.L.-G.); (L.M.G.O.)
| |
Collapse
|
6
|
A workflow to identify novel proteins based on the direct mapping of peptide-spectrum-matches to genomic locations. BMC Bioinformatics 2021; 22:277. [PMID: 34039272 PMCID: PMC8157683 DOI: 10.1186/s12859-021-04159-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 04/27/2021] [Indexed: 02/06/2023] Open
Abstract
Background Small Proteins have received increasing attention in recent years. They have in particular been implicated as signals contributing to the coordination of bacterial communities. In genome annotations they are often missing or hidden among large numbers of hypothetical proteins because genome annotation pipelines often exclude short open reading frames or over-predict hypothetical proteins based on simple models. The validation of novel proteins, and in particular of small proteins (sProteins), therefore requires additional evidence. Proteogenomics is considered the gold standard for this purpose. It extends beyond established annotations and includes all possible open reading frames (ORFs) as potential sources of peptides, thus allowing the discovery of novel, unannotated proteins. Typically this results in large numbers of putative novel small proteins fraught with large fractions of false-positive predictions. Results We observe that number and quality of the peptide-spectrum matches (PSMs) that map to a candidate ORF can be highly informative for the purpose of distinguishing proteins from spurious ORF annotations. We report here on a workflow that aggregates PSM quality information and local context into simple descriptors and reliably separates likely proteins from the large pool of false-positive, i.e., most likely untranslated ORFs. We investigated the artificial gut microbiome model SIHUMIx, comprising eight different species, for which we validate 5114 proteins that have previously been annotated only as hypothetical ORFs. In addition, we identified 37 non-annotated protein candidates for which we found evidence at the proteomic and transcriptomic level. Half (19) of these candidates have close functional homologs in other species. Another 12 candidates have homologs designated as hypothetical proteins in other species. The remaining six candidates are short (< 100 AA) and are most likely bona fide novel proteins. Conclusions The aggregation of PSM quality information for predicted ORFs provides a robust and efficient method to identify novel proteins in proteomics data. The workflow is in particular capable of identifying small proteins and frameshift variants. Since PSMs are explicitly mapped to genomic locations, it furthermore facilitates the integration of transcriptomics data and other sources of genome-level information. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04159-8.
Collapse
|
7
|
Tao H, Zhang Y, Huang SY. Improving Protein-Peptide Docking Results via Pose-Clustering and Rescoring with a Combined Knowledge-Based and MM-GBSA Scoring Function. J Chem Inf Model 2020; 60:2377-2387. [PMID: 32267149 DOI: 10.1021/acs.jcim.0c00058] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Protein-peptide docking, which predicts the complex structure between a protein and a peptide, is a valuable computational tool in peptide therapeutics development and the mechanistic investigation of peptides involved in cellular processes. Although current peptide docking approaches are often able to sample near-native peptide binding modes, correctly identifying those near-native modes from decoys is still challenging because of the extremely high complexity of the peptide binding energy landscape. In this study, we have developed an efficient postdocking rescoring protocol using a combined scoring function of knowledge-based ITScorePP potentials and physics-based MM-GBSA energies. Tested on five benchmark/docking test sets, our postdocking strategy showed an overall significantly better performance in binding mode prediction and score-rmsd correlation than original docking approaches. Specifically, our postdocking protocol outperformed original docking approaches with success rates of 15.8 versus 10.5% for pepATTRACT on the Global_57 benchmark, 5.3 versus 5.3% for CABS-dock on the Global_57 benchmark, 17.0 versus 11.3% for FlexPepDock on the LEADS-PEP data set, 40.3 versus 33.9% for HPEPDOCK on the Local_62 benchmark, and 64.2 versus 52.8% for HPEPDOCK on the LEADS-PEP data set when the top prediction was considered. These results demonstrated the efficacy and robustness of our postdocking protocol.
Collapse
Affiliation(s)
- Huanyu Tao
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Yanjun Zhang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
8
|
Iwaniak A, Darewicz M, Mogut D, Minkiewicz P. Elucidation of the role of in silico methodologies in approaches to studying bioactive peptides derived from foods. J Funct Foods 2019. [DOI: 10.1016/j.jff.2019.103486] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
|
9
|
From biomedicinal to in silico models and back to therapeutics: a review on the advancement of peptidic modeling. Future Med Chem 2019; 11:2313-2331. [PMID: 31581914 DOI: 10.4155/fmc-2018-0365] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Bioactive peptides participate in numerous metabolic functions of living organisms and have emerged as potential therapeutics on a diverse range of diseases. Albeit peptide design does not go without challenges, overwhelming advancements on in silico methodologies have increased the scope of peptide-based drug design and discovery to an unprecedented amount. Within an in silico model versus an experimental validation scenario, this review aims to summarize and discuss how different in silico techniques contribute at present to the design of peptide-based molecules. Published in silico results from 2014 to 2018 were selected and discriminated in major methodological groups, allowing a transversal analysis, promoting a landscape vision and asserting its increasing value in drug design.
Collapse
|
10
|
de Vries SJ, Rey J, Schindler CEM, Zacharias M, Tuffery P. The pepATTRACT web server for blind, large-scale peptide-protein docking. Nucleic Acids Res 2019; 45:W361-W364. [PMID: 28460116 PMCID: PMC5570166 DOI: 10.1093/nar/gkx335] [Citation(s) in RCA: 72] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2017] [Accepted: 04/18/2017] [Indexed: 12/20/2022] Open
Abstract
Peptide–protein interactions are ubiquitous in the cell and form an important part of the interactome. Computational docking methods can complement experimental characterization of these complexes, but current protocols are not applicable on the proteome scale. pepATTRACT is a novel docking protocol that is fully blind, i.e. it does not require any information about the binding site. In various stages of its development, pepATTRACT has participated in CAPRI, making successful predictions for five out of seven protein–peptide targets. Its performance is similar or better than state-of-the-art local docking protocols that do require binding site information. Here we present a novel web server that carries out the rigid-body stage of pepATTRACT. On the peptiDB benchmark, the web server generates a correct model in the top 50 in 34% of the cases. Compared to the full pepATTRACT protocol, this leads to some loss of performance, but the computation time is reduced from ∼18 h to ∼10 min. Combined with the fact that it is fully blind, this makes the web server well-suited for large-scale in silico protein–peptide docking experiments. The rigid-body pepATTRACT server is freely available at http://bioserv.rpbs.univ-paris-diderot.fr/services/pepATTRACT.
Collapse
Affiliation(s)
- Sjoerd J de Vries
- INSERM UMR-S 973/Université Paris Diderot/Sorbonne Paris Cité/RPBS, Paris 75205, France
| | - Julien Rey
- INSERM UMR-S 973/Université Paris Diderot/Sorbonne Paris Cité/RPBS, Paris 75205, France
| | | | - Martin Zacharias
- Physik T38, Technische Universität München, Garching 85748, Germany
| | - Pierre Tuffery
- INSERM UMR-S 973/Université Paris Diderot/Sorbonne Paris Cité/RPBS, Paris 75205, France
| |
Collapse
|
11
|
Wang J, Yin T, Xiao X, He D, Xue Z, Jiang X, Wang Y. StraPep: a structure database of bioactive peptides. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:4974332. [PMID: 29688386 PMCID: PMC5905355 DOI: 10.1093/database/bay038] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2017] [Accepted: 03/21/2018] [Indexed: 12/03/2022]
Abstract
Bioactive peptides, with a variety of biological activities and wide distribution in nature, have attracted great research interest in biological and medical fields, especially in pharmaceutical industry. The structural information of bioactive peptide is important for the development of peptide-based drugs. Many databases have been developed cataloguing bioactive peptides. However, to our knowledge, database dedicated to collect all the bioactive peptides with known structure is not available yet. Thus, we developed StraPep, a structure database of bioactive peptides. StraPep holds 3791 bioactive peptide structures, which belong to 1312 unique bioactive peptide sequences. About 905 out of 1312 (68%) bioactive peptides in StraPep contain disulfide bonds, which is significantly higher than that (21%) of PDB. Interestingly, 150 out of 616 (24%) bioactive peptides with three or more disulfide bonds form a structural motif known as cystine knot, which confers considerable structural stability on proteins and is an attractive scaffold for drug design. Detailed information of each peptide, including the experimental structure, the location of disulfide bonds, secondary structure, classification, post-translational modification and so on, has been provided. A wide range of user-friendly tools, such as browsing, sequence and structure-based searching and so on, has been incorporated into StraPep. We hope that this database will be helpful for the research community. Database URL: http://isyslab.info/StraPep
Collapse
Affiliation(s)
- Jian Wang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Tailang Yin
- Reproductive Medicine Center, Renmin Hospital of Wuhan University, Wuhan, Hubei 430060, China
| | - Xuwen Xiao
- School of Software Engineering, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Dan He
- Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong 510080, China
| | - Zhidong Xue
- School of Software Engineering, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Xinnong Jiang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Yan Wang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| |
Collapse
|
12
|
Zhou P, Li B, Yan Y, Jin B, Wang L, Huang SY. Hierarchical Flexible Peptide Docking by Conformer Generation and Ensemble Docking of Peptides. J Chem Inf Model 2018; 58:1292-1302. [PMID: 29738247 DOI: 10.1021/acs.jcim.8b00142] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Given the importance of peptide-mediated protein interactions in cellular processes, protein-peptide docking has received increasing attention. Here, we have developed a Hierarchical flexible Peptide Docking approach through fast generation and ensemble docking of peptide conformations, which is referred to as HPepDock. Tested on the LEADS-PEP benchmark data set of 53 diverse complexes with peptides of 3-12 residues, HPepDock performed significantly better than the 11 docking protocols of five small-molecule docking programs (DOCK, AutoDock, AutoDock Vina, Surflex, and GOLD) in predicting near-native binding conformations. HPepDock was also evaluated on the 19 bound/unbound and 10 unbound/unbound protein-peptide complexes of the Glide SP-PEP benchmark and showed an overall better performance than Glide SP-PEP+MM-GBSA and FlexPepDock in both bound and unbound docking. HPepDock is computationally efficient, and the average running time for docking a peptide is ∼15 min with the range from about 1 min for short peptides to around 40 min for long peptides.
Collapse
Affiliation(s)
- Pei Zhou
- Institute of Biophysics, School of Physics , Huazhong University of Science and Technology , Wuhan , Hubei 430074 , China
| | - Botong Li
- Institute of Biophysics, School of Physics , Huazhong University of Science and Technology , Wuhan , Hubei 430074 , China
| | - Yumeng Yan
- Institute of Biophysics, School of Physics , Huazhong University of Science and Technology , Wuhan , Hubei 430074 , China
| | - Bowen Jin
- Institute of Biophysics, School of Physics , Huazhong University of Science and Technology , Wuhan , Hubei 430074 , China
| | - Libang Wang
- Institute of Biophysics, School of Physics , Huazhong University of Science and Technology , Wuhan , Hubei 430074 , China
| | - Sheng-You Huang
- Institute of Biophysics, School of Physics , Huazhong University of Science and Technology , Wuhan , Hubei 430074 , China
| |
Collapse
|
13
|
Yan Y, Zhang D, Huang SY. Efficient conformational ensemble generation of protein-bound peptides. J Cheminform 2017; 9:59. [PMID: 29168051 PMCID: PMC5700017 DOI: 10.1186/s13321-017-0246-7] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Accepted: 11/15/2017] [Indexed: 02/06/2023] Open
Abstract
Conformation generation of protein-bound peptides is critical for the determination of protein–peptide complex structures. Despite significant progress in conformer generation of small molecules, few methods have been developed for modeling protein-bound peptide conformations. Here, we have developed a fast de novo peptide modeling algorithm, referred to as MODPEP, for conformational sampling of protein-bound peptides. Given a sequence, MODPEP builds the peptide 3D structure from scratch by assembling amino acids or helix fragments based on constructed rotamer and helix libraries. The MODPEP algorithm was tested on a diverse set of 910 experimentally determined protein-bound peptides with 3–30 amino acids from the PDB and obtained an average accuracy of 1.90 Å when 200 conformations were sampled for each peptide. On average, MODPEP obtained a success rate of 74.3% for all the 910 peptides and ≥ 90% for short peptides with 3–10 amino acids in reproducing experimental protein-bound structures. Comparative evaluations of MODPEP with three other conformer generation methods, PEP-FOLD3, RDKit, and Balloon, have also been performed in both accuracy and success rate. MODPEP is fast and can generate 100 conformations for less than one second. The fast MODPEP will be beneficial for large-scale de novo modeling and docking of peptides. The MODPEP program and libraries are available for download at http://huanglab.phys.hust.edu.cn/.![]()
Collapse
Affiliation(s)
- Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, People's Republic of China
| | - Di Zhang
- School of Physics, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, People's Republic of China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, People's Republic of China.
| |
Collapse
|
14
|
Truman AW. Cyclisation mechanisms in the biosynthesis of ribosomally synthesised and post-translationally modified peptides. Beilstein J Org Chem 2016; 12:1250-68. [PMID: 27559376 PMCID: PMC4979651 DOI: 10.3762/bjoc.12.120] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Accepted: 06/02/2016] [Indexed: 12/15/2022] Open
Abstract
Ribosomally synthesised and post-translationally modified peptides (RiPPs) are a large class of natural products that are remarkably chemically diverse given an intrinsic requirement to be assembled from proteinogenic amino acids. The vast chemical space occupied by RiPPs means that they possess a wide variety of biological activities, and the class includes antibiotics, co-factors, signalling molecules, anticancer and anti-HIV compounds, and toxins. A considerable amount of RiPP chemical diversity is generated from cyclisation reactions, and the current mechanistic understanding of these reactions will be discussed here. These cyclisations involve a diverse array of chemical reactions, including 1,4-nucleophilic additions, [4 + 2] cycloadditions, ATP-dependent heterocyclisation to form thiazolines or oxazolines, and radical-mediated reactions between unactivated carbons. Future prospects for RiPP pathway discovery and characterisation will also be highlighted.
Collapse
Affiliation(s)
- Andrew W Truman
- Department of Molecular Microbiology, John Innes Centre, Colney Lane, Norwich, NR4 7UH, UK
| |
Collapse
|
15
|
Lamiable A, Thévenet P, Rey J, Vavrusa M, Derreumaux P, Tufféry P. PEP-FOLD3: faster de novo structure prediction for linear peptides in solution and in complex. Nucleic Acids Res 2016; 44:W449-54. [PMID: 27131374 PMCID: PMC4987898 DOI: 10.1093/nar/gkw329] [Citation(s) in RCA: 665] [Impact Index Per Article: 73.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2016] [Accepted: 04/17/2016] [Indexed: 01/15/2023] Open
Abstract
Structure determination of linear peptides of 5–50 amino acids in aqueous solution and interacting with proteins is a key aspect in structural biology. PEP-FOLD3 is a novel computational framework, that allows both (i) de novo free or biased prediction for linear peptides between 5 and 50 amino acids, and (ii) the generation of native-like conformations of peptides interacting with a protein when the interaction site is known in advance. PEP-FOLD3 is fast, and usually returns solutions in a few minutes. Testing PEP-FOLD3 on 56 peptides in aqueous solution led to experimental-like conformations for 80% of the targets. Using a benchmark of 61 peptide–protein targets starting from the unbound form of the protein receptor, PEP-FOLD3 was able to generate peptide poses deviating on average by 3.3Å from the experimental conformation and return a native-like pose in the first 10 clusters for 52% of the targets. PEP-FOLD3 is available at http://bioserv.rpbs.univ-paris-diderot.fr/services/PEP-FOLD3.
Collapse
Affiliation(s)
- Alexis Lamiable
- Molécules Thérapeutiques in Silico, RPBS, INSERM UMR-S 973, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex 13, France
| | - Pierre Thévenet
- Molécules Thérapeutiques in Silico, RPBS, INSERM UMR-S 973, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex 13, France
| | - Julien Rey
- Molécules Thérapeutiques in Silico, RPBS, INSERM UMR-S 973, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex 13, France
| | - Marek Vavrusa
- Molécules Thérapeutiques in Silico, RPBS, INSERM UMR-S 973, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex 13, France
| | - Philippe Derreumaux
- Institut de Biologie Physico Chimique, Laboratoire de Biochimie Théorique, Université Paris Diderot, Sorbonne Paris Cité, CNRS UPR 9080, 75005 Paris, France
| | - Pierre Tufféry
- Molécules Thérapeutiques in Silico, RPBS, INSERM UMR-S 973, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex 13, France
| |
Collapse
|
16
|
Escobar-Zepeda A, Vera-Ponce de León A, Sanchez-Flores A. The Road to Metagenomics: From Microbiology to DNA Sequencing Technologies and Bioinformatics. Front Genet 2015; 6:348. [PMID: 26734060 PMCID: PMC4681832 DOI: 10.3389/fgene.2015.00348] [Citation(s) in RCA: 161] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Accepted: 11/27/2015] [Indexed: 12/17/2022] Open
Abstract
The study of microorganisms that pervade each and every part of this planet has encountered many challenges through time such as the discovery of unknown organisms and the understanding of how they interact with their environment. The aim of this review is to take the reader along the timeline and major milestones that led us to modern metagenomics. This new and thriving area is likely to be an important contributor to solve different problems. The transition from classical microbiology to modern metagenomics studies has required the development of new branches of knowledge and specialization. Here, we will review how the availability of high-throughput sequencing technologies has transformed microbiology and bioinformatics and how to tackle the inherent computational challenges that arise from the DNA sequencing revolution. New computational methods are constantly developed to collect, process, and extract useful biological information from a variety of samples and complex datasets, but metagenomics needs the integration of several of these computational methods. Despite the level of specialization needed in bioinformatics, it is important that life-scientists have a good understanding of it for a correct experimental design, which allows them to reveal the information in a metagenome.
Collapse
Affiliation(s)
- Alejandra Escobar-Zepeda
- Unidad de Secuenciación Masiva y Bioinformática, Instituto de Biotecnología, Universidad Nacional Autónoma de MéxicoCuernavaca, México
| | - Arturo Vera-Ponce de León
- Programa de Ecología Genómica, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de MéxicoCuernavaca, México
| | - Alejandro Sanchez-Flores
- Unidad de Secuenciación Masiva y Bioinformática, Instituto de Biotecnología, Universidad Nacional Autónoma de MéxicoCuernavaca, México
| |
Collapse
|
17
|
Hao GF, Xu WF, Yang SG, Yang GF. Multiple Simulated Annealing-Molecular Dynamics (MSA-MD) for Conformational Space Search of Peptide and Miniprotein. Sci Rep 2015; 5:15568. [PMID: 26492886 PMCID: PMC4616061 DOI: 10.1038/srep15568] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2015] [Accepted: 09/29/2015] [Indexed: 12/03/2022] Open
Abstract
Protein and peptide structure predictions are of paramount importance for understanding their functions, as well as the interactions with other molecules. However, the use of molecular simulation techniques to directly predict the peptide structure from the primary amino acid sequence is always hindered by the rough topology of the conformational space and the limited simulation time scale. We developed here a new strategy, named Multiple Simulated Annealing-Molecular Dynamics (MSA-MD) to identify the native states of a peptide and miniprotein. A cluster of near native structures could be obtained by using the MSA-MD method, which turned out to be significantly more efficient in reaching the native structure compared to continuous MD and conventional SA-MD simulation.
Collapse
Affiliation(s)
- Ge-Fei Hao
- Key Laboratory of Pesticide &Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, P.R.China
| | - Wei-Fang Xu
- Key Laboratory of Pesticide &Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, P.R.China
| | - Sheng-Gang Yang
- Key Laboratory of Pesticide &Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, P.R.China
| | - Guang-Fu Yang
- Key Laboratory of Pesticide &Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, P.R.China.,Collaborative Innovation Center of Chemical Science and Engineering, Tianjing 300072, P.R.China
| |
Collapse
|