1
|
Stewart MR, Quentel A, Manalo E, Montoya Mira J, Ranganathan S, Branchaud BP, Fischer JM, Tu E, Civitci F, Chiu YJ, Yildirim A. Profiling protease cleavage patterns in plasma for pancreatic cancer detection. Sci Rep 2024; 14:31809. [PMID: 39738320 PMCID: PMC11686259 DOI: 10.1038/s41598-024-83077-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Accepted: 12/11/2024] [Indexed: 01/02/2025] Open
Abstract
Proteases are promising biomarkers for cancer early detection. Their enzymatic activity against peptide substrates allows for their straightforward detection using low-cost tests. However, the complexity of the human proteome makes it challenging to develop sensitive and selective tests against a specific protease biomarker. Here, we report a different approach by utilizing the total protease activity in plasma samples to detect pancreatic cancer. Instead of targeting a specific protease using a specific peptide substrate, we utilized an array of 360 FRET substrates to screen for cleavage patterns in plasma samples collected from screen negatives and pancreatitis or pancreatic ductal adenocarcinoma cancer (PDAC) patients. In this proof of concept study, we first screened all 360 substrates using a small cohort (n = 13) to identify the top 5 substrates that best separate different conditions. Then, we performed a validation study using a larger cohort (n = 86) and the selected substrates. There was a statistically significant increase in the total protease activity in PDAC samples compared to screen negative and pancreatitis samples. The selected substrates detected PDAC with an area under the curve (AUC) of 0.8. This work represents a novel strategy for identifying peptide substrates for the detection of PDAC and other cancers.
Collapse
Affiliation(s)
- Morgan R Stewart
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Arnaud Quentel
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Elise Manalo
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Jose Montoya Mira
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
- Department of Biomedical Engineering, School of Medicine, Oregon Health and Science University, Portland, OR, 97239, USA
| | - Srivathsan Ranganathan
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Bruce P Branchaud
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Jared M Fischer
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
- Department of Molecular and Medical Genetics, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Eugene Tu
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Fehmi Civitci
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Yu-Jui Chiu
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA
| | - Adem Yildirim
- CEDAR, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA.
- Department of Biomedical Engineering, School of Medicine, Oregon Health and Science University, Portland, OR, 97239, USA.
- Division of Oncological Sciences, Knight Cancer Institute, School of Medicine, Oregon Health and Science University, Portland, OR, 97201, USA.
| |
Collapse
|
2
|
Henehan GT, Ryan BJ, Kinsella GK. Approaches to Avoid Proteolysis During Protein Expression and Purification. Methods Mol Biol 2023; 2699:77-95. [PMID: 37646995 DOI: 10.1007/978-1-0716-3362-5_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Abstract
All cells contain proteases, which hydrolyze the peptide bonds between amino acids of a protein backbone. Typically, proteases are prevented from nonspecific proteolysis by regulation and by their physical separation into different subcellular compartments; however, this segregation is not retained during cell lysis, which is the initial step in any protein isolation procedure. Prevention of proteolysis during protein purification often takes the form of a two-pronged approach: first, inhibition of proteolysis in situ, followed by the early separation of the protease from the protein of interest via chromatographic purification. Protease inhibitors are routinely used to limit the effect of the proteases before they are physically separated from the protein of interest via column chromatography. In this chapter, commonly used approaches to reducing or avoiding proteolysis during protein expression and purification are reviewed.
Collapse
Affiliation(s)
- Gary T Henehan
- School of Food Science and Environmental Health, Technological University Dublin, Grangegorman, Dublin, Ireland
| | - Barry J Ryan
- School of Food Science and Environmental Health, Technological University Dublin, Grangegorman, Dublin, Ireland
| | - Gemma K Kinsella
- School of Food Science and Environmental Health, Technological University Dublin, Grangegorman, Dublin, Ireland.
| |
Collapse
|
3
|
Deep Learning-Based Advances In Protein Posttranslational Modification Site and Protein Cleavage Prediction. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2499:285-322. [PMID: 35696087 DOI: 10.1007/978-1-0716-2317-6_15] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Posttranslational modification (PTM ) is a ubiquitous phenomenon in both eukaryotes and prokaryotes which gives rise to enormous proteomic diversity. PTM mostly comes in two flavors: covalent modification to polypeptide chain and proteolytic cleavage. Understanding and characterization of PTM is a fundamental step toward understanding the underpinning of biology. Recent advances in experimental approaches, mainly mass-spectrometry-based approaches, have immensely helped in obtaining and characterizing PTMs. However, experimental approaches are not enough to understand and characterize more than 450 different types of PTMs and complementary computational approaches are becoming popular. Recently, due to the various advancements in the field of Deep Learning (DL), along with the explosion of applications of DL to various fields, the field of computational prediction of PTM has also witnessed the development of a plethora of deep learning (DL)-based approaches. In this book chapter, we first review some recent DL-based approaches in the field of PTM site prediction. In addition, we also review the recent advances in the not-so-studied PTM , that is, proteolytic cleavage predictions. We describe advances in PTM prediction by highlighting the Deep learning architecture, feature encoding, novelty of the approaches, and availability of the tools/approaches. Finally, we provide an outlook and possible future research directions for DL-based approaches for PTM prediction.
Collapse
|
4
|
Song J, Wang Y, Li F, Akutsu T, Rawlings ND, Webb GI, Chou KC. iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites. Brief Bioinform 2020; 20:638-658. [PMID: 29897410 PMCID: PMC6556904 DOI: 10.1093/bib/bby028] [Citation(s) in RCA: 128] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2018] [Revised: 03/02/2018] [Indexed: 01/03/2023] Open
Abstract
Regulation of proteolysis plays a critical role in a myriad of important cellular processes. The key to better understanding the mechanisms that control this process is to identify the specific substrates that each protease targets. To address this, we have developed iProt-Sub, a powerful bioinformatics tool for the accurate prediction of protease-specific substrates and their cleavage sites. Importantly, iProt-Sub represents a significantly advanced version of its successful predecessor, PROSPER. It provides optimized cleavage site prediction models with better prediction performance and coverage for more species-specific proteases (4 major protease families and 38 different proteases). iProt-Sub integrates heterogeneous sequence and structural features and uses a two-step feature selection procedure to further remove redundant and irrelevant features in an effort to improve the cleavage site prediction accuracy. Features used by iProt-Sub are encoded by 11 different sequence encoding schemes, including local amino acid sequence profile, secondary structure, solvent accessibility and native disorder, which will allow a more accurate representation of the protease specificity of approximately 38 proteases and training of the prediction models. Benchmarking experiments using cross-validation and independent tests showed that iProt-Sub is able to achieve a better performance than several existing generic tools. We anticipate that iProt-Sub will be a powerful tool for proteome-wide prediction of protease-specific substrates and their cleavage sites, and will facilitate hypothesis-driven functional interrogation of protease-specific substrate cleavage and proteolytic events.
Collapse
Affiliation(s)
- Jiangning Song
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia.,Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia and ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia
| | - Yanan Wang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, 200240, China
| | - Fuyi Li
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, 611-0011, Japan
| | - Neil D Rawlings
- EMBL European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Geoffrey I Webb
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
| | - Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, USA and Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|
5
|
Zhao X, Jiao Q, Li H, Wu Y, Wang H, Huang S, Wang G. ECFS-DEA: an ensemble classifier-based feature selection for differential expression analysis on expression profiles. BMC Bioinformatics 2020; 21:43. [PMID: 32024464 PMCID: PMC7003361 DOI: 10.1186/s12859-020-3388-y] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 01/27/2020] [Indexed: 11/27/2022] Open
Abstract
Background Various methods for differential expression analysis have been widely used to identify features which best distinguish between different categories of samples. Multiple hypothesis testing may leave out explanatory features, each of which may be composed of individually insignificant variables. Multivariate hypothesis testing holds a non-mainstream position, considering the large computation overhead of large-scale matrix operation. Random forest provides a classification strategy for calculation of variable importance. However, it may be unsuitable for different distributions of samples. Results Based on the thought of using an ensemble classifier, we develop a feature selection tool for differential expression analysis on expression profiles (i.e., ECFS-DEA for short). Considering the differences in sample distribution, a graphical user interface is designed to allow the selection of different base classifiers. Inspired by random forest, a common measure which is applicable to any base classifier is proposed for calculation of variable importance. After an interactive selection of a feature on sorted individual variables, a projection heatmap is presented using k-means clustering. ROC curve is also provided, both of which can intuitively demonstrate the effectiveness of the selected feature. Conclusions Feature selection through ensemble classifiers helps to select important variables and thus is applicable for different sample distributions. Experiments on simulation and realistic data demonstrate the effectiveness of ECFS-DEA for differential expression analysis on expression profiles. The software is available at http://bio-nefu.com/resource/ecfs-dea.
Collapse
Affiliation(s)
- Xudong Zhao
- College of Information and Computer Engineering, Northeast Forestry University, No.26 Hexing Road, Harbin, 150040, China
| | - Qing Jiao
- College of Information and Computer Engineering, Northeast Forestry University, No.26 Hexing Road, Harbin, 150040, China
| | - Hangyu Li
- College of Information and Computer Engineering, Northeast Forestry University, No.26 Hexing Road, Harbin, 150040, China
| | - Yiming Wu
- College of Information and Computer Engineering, Northeast Forestry University, No.26 Hexing Road, Harbin, 150040, China
| | - Hanxu Wang
- College of Information and Computer Engineering, Northeast Forestry University, No.26 Hexing Road, Harbin, 150040, China
| | - Shan Huang
- Department of Neurology, The 2nd Affiliated Hospital of Harbin Medical University, No. 246 Xuefu Road, Harbin, 150086, China
| | - Guohua Wang
- College of Information and Computer Engineering, Northeast Forestry University, No.26 Hexing Road, Harbin, 150040, China. .,State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, No.26 Hexing Road, Harbin, 150040, China.
| |
Collapse
|
6
|
Li F, Wang Y, Li C, Marquez-Lago TT, Leier A, Rawlings ND, Haffari G, Revote J, Akutsu T, Chou KC, Purcell AW, Pike RN, Webb GI, Ian Smith A, Lithgow T, Daly RJ, Whisstock JC, Song J. Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods. Brief Bioinform 2019; 20:2150-2166. [PMID: 30184176 PMCID: PMC6954447 DOI: 10.1093/bib/bby077] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Revised: 07/26/2018] [Accepted: 08/01/2018] [Indexed: 01/06/2023] Open
Abstract
The roles of proteolytic cleavage have been intensively investigated and discussed during the past two decades. This irreversible chemical process has been frequently reported to influence a number of crucial biological processes (BPs), such as cell cycle, protein regulation and inflammation. A number of advanced studies have been published aiming at deciphering the mechanisms of proteolytic cleavage. Given its significance and the large number of functionally enriched substrates targeted by specific proteases, many computational approaches have been established for accurate prediction of protease-specific substrates and their cleavage sites. Consequently, there is an urgent need to systematically assess the state-of-the-art computational approaches for protease-specific cleavage site prediction to further advance the existing methodologies and to improve the prediction performance. With this goal in mind, in this article, we carefully evaluated a total of 19 computational methods (including 8 scoring function-based methods and 11 machine learning-based methods) in terms of their underlying algorithm, calculated features, performance evaluation and software usability. Then, extensive independent tests were performed to assess the robustness and scalability of the reviewed methods using our carefully prepared independent test data sets with 3641 cleavage sites (specific to 10 proteases). The comparative experimental results demonstrate that PROSPERous is the most accurate generic method for predicting eight protease-specific cleavage sites, while GPS-CCD and LabCaS outperformed other predictors for calpain-specific cleavage sites. Based on our review, we then outlined some potential ways to improve the prediction performance and ease the computational burden by applying ensemble learning, deep learning, positive unlabeled learning and parallel and distributed computing techniques. We anticipate that our study will serve as a practical and useful guide for interested readers to further advance next-generation bioinformatics tools for protease-specific cleavage site prediction.
Collapse
Affiliation(s)
- Fuyi Li
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Yanan Wang
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Chen Li
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Department of Biology, Institute of Molecular Systems Biology,ETH Zürich, Zürich 8093, Switzerland
| | - Tatiana T Marquez-Lago
- Department of Genetics and Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, AL, USA
| | - André Leier
- Department of Genetics and Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, AL, USA
| | - Neil D Rawlings
- EMBL European Bioinformatics Institute, Wellcome Trust Genome Campus, Wellcome Trust Genome Campus,Hinxton, Cambridgeshire CB10 1SD, UK
| | - Gholamreza Haffari
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
| | - Jerico Revote
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto 611-0011, Japan
| | - Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, USA
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Anthony W Purcell
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Robert N Pike
- La Trobe Institute for Molecular Science, La Trobe University, Melbourne, VIC 3086, Australia
- ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia
| | - Geoffrey I Webb
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
| | - A Ian Smith
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia
| | - Trevor Lithgow
- Biomedicine Discovery Institute and Department of Microbiology, Monash University, Melbourne, Victoria 3800, Australia
| | - Roger J Daly
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - James C Whisstock
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
- ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia
| |
Collapse
|
7
|
Bao Y, Marini S, Tamura T, Kamada M, Maegawa S, Hosokawa H, Song J, Akutsu T. Toward more accurate prediction of caspase cleavage sites: a comprehensive review of current methods, tools and features. Brief Bioinform 2019; 20:1669-1684. [PMID: 29860277 PMCID: PMC6917222 DOI: 10.1093/bib/bby041] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Revised: 04/16/2018] [Indexed: 12/20/2022] Open
Abstract
As one of the few irreversible protein posttranslational modifications, proteolytic cleavage is involved in nearly all aspects of cellular activities, ranging from gene regulation to cell life-cycle regulation. Among the various protease-specific types of proteolytic cleavage, cleavages by casapses/granzyme B are considered as essential in the initiation and execution of programmed cell death and inflammation processes. Although a number of substrates for both types of proteolytic cleavage have been experimentally identified, the complete repertoire of caspases and granzyme B substrates remains to be fully characterized. To tackle this issue and complement experimental efforts for substrate identification, systematic bioinformatics studies of known cleavage sites provide important insights into caspase/granzyme B substrate specificity, and facilitate the discovery of novel substrates. In this article, we review and benchmark 12 state-of-the-art sequence-based bioinformatics approaches and tools for caspases/granzyme B cleavage prediction. We evaluate and compare these methods in terms of their input/output, algorithms used, prediction performance, validation methods and software availability and utility. In addition, we construct independent data sets consisting of caspases/granzyme B substrates from different species and accordingly assess the predictive power of these different predictors for the identification of cleavage sites. We find that the prediction results are highly variable among different predictors. Furthermore, we experimentally validate the predictions of a case study by performing caspase cleavage assay. We anticipate that this comprehensive review and survey analysis will provide an insightful resource for biologists and bioinformaticians who are interested in using and/or developing tools for caspase/granzyme B cleavage prediction.
Collapse
Affiliation(s)
- Yu Bao
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan
| | - Simone Marini
- Department of Computational Medicine and Bioinformatics, University of Michigan, 1241 E. Catherine St., 5940 Buhl, Ann Arbor 48109-5618, USA
| | - Takeyuki Tamura
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan
| | - Mayumi Kamada
- Graduate School of Medicine, Kyoto University, Sakyo-ku, Kyoto 606-8507, Japan
| | - Shingo Maegawa
- Graduate School of Informatics, Kyoto University, Sakyo-ku, Kyoto 606-8501, Japan
| | - Hiroshi Hosokawa
- Graduate School of Informatics, Kyoto University, Sakyo-ku, Kyoto 606-8501, Japan
| | - Jiangning Song
- Monash Biomedicine Discovery Institute, Monash Centre for Data Science and ARC Centre of Excellence in Advance Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan
| |
Collapse
|
8
|
Song J, Li F, Leier A, Marquez-Lago TT, Akutsu T, Haffari G, Chou KC, Webb GI, Pike RN, Hancock J. PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy. Bioinformatics 2019; 34:684-687. [PMID: 29069280 DOI: 10.1093/bioinformatics/btx670] [Citation(s) in RCA: 109] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Accepted: 10/18/2017] [Indexed: 11/13/2022] Open
Abstract
Summary Proteases are enzymes that specifically cleave the peptide backbone of their target proteins. As an important type of irreversible post-translational modification, protein cleavage underlies many key physiological processes. When dysregulated, proteases' actions are associated with numerous diseases. Many proteases are highly specific, cleaving only those target substrates that present certain particular amino acid sequence patterns. Therefore, tools that successfully identify potential target substrates for proteases may also identify previously unknown, physiologically relevant cleavage sites, thus providing insights into biological processes and guiding hypothesis-driven experiments aimed at verifying protease-substrate interaction. In this work, we present PROSPERous, a tool for rapid in silico prediction of protease-specific cleavage sites in substrate sequences. Our tool is based on logistic regression models and uses different scoring functions and their pairwise combinations to subsequently predict potential cleavage sites. PROSPERous represents a state-of-the-art tool that enables fast, accurate and high-throughput prediction of substrate cleavage sites for 90 proteases. Availability and implementation http://prosperous.erc.monash.edu/. Contact jiangning.song@monash.edu or geoff.webb@monash.edu or r.pike@latrobe.edu.au. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jiangning Song
- Monash Centre for Data Science, Faculty of Information Technology.,Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute.,ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Clayton, VIC 3800, Australia
| | - Fuyi Li
- Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute
| | - André Leier
- Department of Genetics, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA.,Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Tatiana T Marquez-Lago
- Department of Genetics, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA.,Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan
| | | | - Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, USA.,Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.,Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Geoffrey I Webb
- Monash Centre for Data Science, Faculty of Information Technology
| | - Robert N Pike
- ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Clayton, VIC 3800, Australia.,La Trobe Institute for Molecular Science, La Trobe University, Melbourne, VIC 3086, Australia
| | | |
Collapse
|
9
|
Radchenko T, Fontaine F, Morettoni L, Zamora I. Software-aided workflow for predicting protease-specific cleavage sites using physicochemical properties of the natural and unnatural amino acids in peptide-based drug discovery. PLoS One 2019; 14:e0199270. [PMID: 30620739 PMCID: PMC6324806 DOI: 10.1371/journal.pone.0199270] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2018] [Accepted: 12/18/2018] [Indexed: 12/03/2022] Open
Abstract
Peptide drugs have been used in the treatment of multiple pathologies. During peptide discovery, it is crucially important to be able to map the potential sites of cleavages of the proteases. This knowledge is used to later chemically modify the peptide drug to adapt it for the therapeutic use, making peptide stable against individual proteases or in complex medias. In some other cases it needed to make it specifically unstable for some proteases, as peptides could be used as a system to target delivery drugs on specific tissues or cells. The information about proteases, their sites of cleavages and substrates are widely spread across publications and collected in databases such as MEROPS. Therefore, it is possible to develop models to improve the understanding of the potential peptide drug proteolysis. We propose a new workflow to derive protease specificity rules and predict the potential scissile bonds in peptides for individual proteases. WebMetabase stores the information from experimental or external sources in a chemically aware database where each peptide and site of cleavage is represented as a sequence of structural blocks connected by amide bonds and characterized by its physicochemical properties described by Volsurf descriptors. Thus, this methodology could be applied in the case of non-standard amino acid. A frequency analysis can be performed in WebMetabase to discover the most frequent cleavage sites. These results were used to train several models using logistic regression, support vector machine and ensemble tree classifiers to map cleavage sites for several human proteases from four different families (serine, cysteine, aspartic and matrix metalloproteases). Finally, we compared the predictive performance of the developed models with other available public tools PROSPERous and SitePrediction.
Collapse
Affiliation(s)
- Tatiana Radchenko
- Pompeu Fabra University, Barcelona, Spain
- Lead Molecular Design, S. L, Sant Cugat del Vallés, Spain
- * E-mail: (TR); (IZ)
| | | | | | - Ismael Zamora
- Pompeu Fabra University, Barcelona, Spain
- Lead Molecular Design, S. L, Sant Cugat del Vallés, Spain
- * E-mail: (TR); (IZ)
| |
Collapse
|
10
|
Schneidman-Duhovny D, Khuri N, Dong GQ, Winter MB, Shifrut E, Friedman N, Craik CS, Pratt KP, Paz P, Aswad F, Sali A. Predicting CD4 T-cell epitopes based on antigen cleavage, MHCII presentation, and TCR recognition. PLoS One 2018; 13:e0206654. [PMID: 30399156 PMCID: PMC6219782 DOI: 10.1371/journal.pone.0206654] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Accepted: 10/17/2018] [Indexed: 12/16/2022] Open
Abstract
Accurate predictions of T-cell epitopes would be useful for designing vaccines, immunotherapies for cancer and autoimmune diseases, and improved protein therapies. The humoral immune response involves uptake of antigens by antigen presenting cells (APCs), APC processing and presentation of peptides on MHC class II (pMHCII), and T-cell receptor (TCR) recognition of pMHCII complexes. Most in silico methods predict only peptide-MHCII binding, resulting in significant over-prediction of CD4 T-cell epitopes. We present a method, ITCell, for prediction of T-cell epitopes within an input protein antigen sequence for given MHCII and TCR sequences. The method integrates information about three stages of the immune response pathway: antigen cleavage, MHCII presentation, and TCR recognition. First, antigen cleavage sites are predicted based on the cleavage profiles of cathepsins S, B, and H. Second, for each 12-mer peptide in the antigen sequence we predict whether it will bind to a given MHCII, based on the scores of modeled peptide-MHCII complexes. Third, we predict whether or not any of the top scoring peptide-MHCII complexes can bind to a given TCR, based on the scores of modeled ternary peptide-MHCII-TCR complexes and the distribution of predicted cleavage sites. Our benchmarks consist of epitope predictions generated by this algorithm, checked against 20 peptide-MHCII-TCR crystal structures, as well as epitope predictions for four peptide-MHCII-TCR complexes with known epitopes and TCR sequences but without crystal structures. ITCell successfully identified the correct epitopes as one of the 20 top scoring peptides for 22 of 24 benchmark cases. To validate the method using a clinically relevant application, we utilized five factor VIII-specific TCR sequences from hemophilia A subjects who developed an immune response to factor VIII replacement therapy. The known HLA-DR1-restricted factor VIII epitope was among the six top-scoring factor VIII peptides predicted by ITCall to bind HLA-DR1 and all five TCRs. Our integrative approach is more accurate than current single-stage epitope prediction algorithms applied to the same benchmarks. It is freely available as a web server (http://salilab.org/itcell).
Collapse
Affiliation(s)
- Dina Schneidman-Duhovny
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, United States of America
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, United States of America
- * E-mail: (AS); (DS); (PP); (FA)
| | - Natalia Khuri
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, United States of America
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, United States of America
- Graduate Group in Biophysics, University of California at San Francisco, San Francisco, CA, United States of America
| | - Guang Qiang Dong
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, United States of America
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, United States of America
| | - Michael B. Winter
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, United States of America
| | - Eric Shifrut
- Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
| | - Nir Friedman
- Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
| | - Charles S. Craik
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, United States of America
- California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, CA, United States of America
| | - Kathleen P. Pratt
- Uniformed Services University of the Health Sciences, Bethesda, MD, United States of America
| | - Pedro Paz
- Bayer HealthCare, San Francisco, CA, United States of America
- * E-mail: (AS); (DS); (PP); (FA)
| | - Fred Aswad
- Bayer HealthCare, San Francisco, CA, United States of America
- * E-mail: (AS); (DS); (PP); (FA)
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, United States of America
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, United States of America
- Graduate Group in Biophysics, University of California at San Francisco, San Francisco, CA, United States of America
- * E-mail: (AS); (DS); (PP); (FA)
| |
Collapse
|
11
|
Bhagwat SR, Hajela K, Kumar A. Proteolysis to Identify Protease Substrates: Cleave to Decipher. Proteomics 2018; 18:e1800011. [DOI: 10.1002/pmic.201800011] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Revised: 04/03/2018] [Indexed: 02/06/2023]
Affiliation(s)
- Sonali R. Bhagwat
- Discipline of Biosciences and Biomedical Engineering; Indian Institute of Technology; Indore 453552 Simrol India
| | - Krishnan Hajela
- School of Life Sciences; Devi Ahilya Vishwavidyalaya; Indore 452001 India
| | - Amit Kumar
- Discipline of Biosciences and Biomedical Engineering; Indian Institute of Technology; Indore 453552 Simrol India
| |
Collapse
|
12
|
Song J, Wang Y, Li F, Akutsu T, Rawlings ND, Webb GI, Chou KC. iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites. Brief Bioinform 2018. [DOI: 10.1093/bib/bby028 epub ahead of print].] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Jiangning Song
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia and ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia
| | - Yanan Wang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, 200240, China
| | - Fuyi Li
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, 611-0011, Japan
| | - Neil D Rawlings
- EMBL European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Geoffrey I Webb
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
| | - Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, USA and Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|
13
|
Abstract
All cells contain proteases which hydrolyze the peptide bonds between amino acids in a protein backbone. Typically, proteases are prevented from nonspecific proteolysis by regulation and by their physical separation into different subcellular compartments; however, this segregation is not retained during cell lysis, which is the initial step in any protein isolation procedure. Prevention of proteolysis during protein purification often takes the form of a two-pronged approach; firstly inhibition of proteolysis in situ, followed by the early separation of the protease from the protein of interest via chromatographical purification. Protease inhibitors are routinely used to limit the effect of the proteases before they are physically separated from the protein of interest via column chromatography. Here, commonly used approaches to reducing or avoiding proteolysis during protein purification and subsequent chromatography are reviewed.
Collapse
|
14
|
Savickas S, Auf dem Keller U. Targeted degradomics in protein terminomics and protease substrate discovery. Biol Chem 2017; 399:47-54. [PMID: 28850541 DOI: 10.1515/hsz-2017-0187] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 08/21/2017] [Indexed: 02/06/2023]
Abstract
Targeted degradomics integrates positional information into mass spectrometry (MS)-based targeted proteomics workflows and thereby enables analysis of proteolytic cleavage events with unprecedented specificity and sensitivity. Rapid progress in the establishment of protease-substrate relations provides extensive degradomics target lists that now can be tested with help of selected and parallel reaction monitoring (S/PRM) in complex biological systems, where proteases act in physiological environments. In this minireview, we describe the general principles of targeted degradomics, outline the generic experimental workflow of the methodology and highlight recent and future applications in protease research.
Collapse
Affiliation(s)
- Simonas Savickas
- Institute of Molecular Health Sciences, ETH Zurich, Otto-Stern-Weg 7, CH-8093 Zurich, Switzerland
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Anker Engelunds Vej, Building 301, DK-2800 Kgs. Lyngby, Denmark
| | - Ulrich Auf dem Keller
- Institute of Molecular Health Sciences, ETH Zurich, Otto-Stern-Weg 7, CH-8093 Zurich, Switzerland
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Anker Engelunds Vej, Building 301, DK-2800 Kgs. Lyngby, Denmark
| |
Collapse
|
15
|
Qi E, Wang D, Gao B, Li Y, Li G. Block-based characterization of protease specificity from substrate sequence profile. BMC Bioinformatics 2017; 18:438. [PMID: 28974219 PMCID: PMC5627433 DOI: 10.1186/s12859-017-1851-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2017] [Accepted: 09/26/2017] [Indexed: 01/12/2023] Open
Abstract
Background The mechanism of action of proteases has been widely studied based on substrate specificity. Prior research has been focused on the amino acids at a single amino acid site, but rarely on combinations of amino acids around the cleavage bond. Results We propose a novel block-based approach to reveal the potential combinations of amino acids which may regulate the action of proteases. Using the entropies of eight blocks centered at a cleavage bond, we created a distance matrix for 61 proteases to compare their specificities. After quantitative analysis, we discovered a number of prominent blocks, each of which consists of successive amino acids near a cleavage bond, intuitively characterizing the site cooperation of the substrate sequences. Conclusion This approach will help in the discovery of specific substrate sequences which may bridge between proteases and cleavage substrate as more substrate information becomes available. Electronic supplementary material The online version of this article (10.1186/s12859-017-1851-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Enfeng Qi
- School of Mathematics, Shandong University, Jinan, 250100, China
| | - Dongyu Wang
- The State Key Laboratory of Microbial Technology, Shandong University, Jinan, 250100, China
| | - Bo Gao
- School of Mathematics, Shandong University, Jinan, 250100, China
| | - Yang Li
- School of Mathematics, Shandong University, Jinan, 250100, China
| | - Guojun Li
- School of Mathematics, Shandong University, Jinan, 250100, China.
| |
Collapse
|
16
|
Yamamoto H, Saito S, Sawaguchi Y, Kimura M. Identification of Protease Specificity Using Biotin-Labeled Substrates. Open Biochem J 2017; 11:27-35. [PMID: 28567123 PMCID: PMC5418938 DOI: 10.2174/1874091x01711010027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2016] [Revised: 02/14/2017] [Accepted: 03/17/2017] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND Proteolysis constitutes a major post-translational modification. For example, proteases regulate the activation or inactivation of various proteins, such as enzymes, growth factors, and peptide hormones. Proteases have substrate specificity, and protease expression regulates the specific and regional activation or inactivation of several functional proteins. METHODS We demonstrate a novel method for determining protease specificity through the use of MALDI-TOF mass spectrometry with biotin-labeled substrates. RESULTS This method was able to determine the specificity of TPCK-trypsin, V8 protease, elastase and cyanogen bromide cleavage, and the results were similar to previous reports. In addition, the method can be used to measure crude samples, such as tumor extracts. CONCLUSION We demonstrated that this method could identify protease specificity after simple processing, even for crude samples.
Collapse
Affiliation(s)
- Hiroyuki Yamamoto
- Department of Microbiology and Molecular Cell Biology, Nihon Pharmaceutical University, 10281 Komuro, Inamachi, Kitaadachi-gun, Saitama, 362-0806, Japan
| | - Syota Saito
- Department of Microbiology and Molecular Cell Biology, Nihon Pharmaceutical University, 10281 Komuro, Inamachi, Kitaadachi-gun, Saitama, 362-0806, Japan
| | - Yoshikazu Sawaguchi
- Department of Clinical Pharmaceutics, Nihon Pharmaceutical University, 10281 Komuro, Inamachi, Kitaadachi-gun, Saitama, 362-0806, Japan
| | - Michio Kimura
- Department of Microbiology and Molecular Cell Biology, Nihon Pharmaceutical University, 10281 Komuro, Inamachi, Kitaadachi-gun, Saitama, 362-0806, Japan
| |
Collapse
|
17
|
Pethe MA, Rubenstein AB, Khare SD. Large-Scale Structure-Based Prediction and Identification of Novel Protease Substrates Using Computational Protein Design. J Mol Biol 2016; 429:220-236. [PMID: 27932294 DOI: 10.1016/j.jmb.2016.11.031] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2016] [Revised: 11/23/2016] [Accepted: 11/30/2016] [Indexed: 12/16/2022]
Abstract
Characterizing the substrate specificity of protease enzymes is critical for illuminating the molecular basis of their diverse and complex roles in a wide array of biological processes. Rapid and accurate prediction of their extended substrate specificity would also aid in the design of custom proteases capable of selectively and controllably cleaving biotechnologically or therapeutically relevant targets. However, current in silico approaches for protease specificity prediction, rely on, and are therefore limited by, machine learning of sequence patterns in known experimental data. Here, we describe a general approach for predicting peptidase substrates de novo using protein structure modeling and biophysical evaluation of enzyme-substrate complexes. We construct atomic resolution models of thousands of candidate substrate-enzyme complexes for each of five model proteases belonging to the four major protease mechanistic classes-serine, cysteine, aspartyl, and metallo-proteases-and develop a discriminatory scoring function using enzyme design modules from Rosetta and AMBER's MMPBSA. We rank putative substrates based on calculated interaction energy with a modeled near-attack conformation of the enzyme active site. We show that the energetic patterns obtained from these simulations can be used to robustly rank and classify known cleaved and uncleaved peptides and that these structural-energetic patterns have greater discriminatory power compared to purely sequence-based statistical inference. Combining sequence and energetic patterns using machine-learning algorithms further improves classification performance, and analysis of structural models provides physical insight into the structural basis for the observed specificities. We further tested the predictive capability of the model by designing and experimentally characterizing the cleavage of four novel substrate motifs for the hepatitis C virus NS3/4 protease using an in vivo assay. The presented structure-based approach is generalizable to other protease enzymes with known or modeled structures, and complements existing experimental methods for specificity determination.
Collapse
Affiliation(s)
- Manasi A Pethe
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Aliza B Rubenstein
- Computational Biology & Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Sagar D Khare
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Computational Biology & Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.
| |
Collapse
|
18
|
Rawlings ND. Peptidase specificity from the substrate cleavage collection in the MEROPS database and a tool to measure cleavage site conservation. Biochimie 2016; 122:5-30. [PMID: 26455268 PMCID: PMC4756867 DOI: 10.1016/j.biochi.2015.10.003] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 10/05/2015] [Indexed: 11/11/2022]
Abstract
One peptidase can usually be distinguished from another biochemically by its action on proteins, peptides and synthetic substrates. Since 1996, the MEROPS database (http://merops.sanger.ac.uk) has accumulated a collection of cleavages in substrates that now amounts to 66,615 cleavages. The total number of peptidases for which at least one cleavage is known is 1700 out of a total of 2457 different peptidases. This paper describes how the cleavages are obtained from the scientific literature, how they are annotated and how cleavages in peptides and proteins are cross-referenced to entries in the UniProt protein sequence database. The specificity profiles of 556 peptidases are shown for which ten or more substrate cleavages are known. However, it has been proposed that at least 40 cleavages in disparate proteins are required for specificity analysis to be meaningful, and only 163 peptidases (6.6%) fulfil this criterion. Also described are the various displays shown on the website to aid with the understanding of peptidase specificity, which are derived from the substrate cleavage collection. These displays include a logo, distribution matrix, and tables to summarize which amino acids or groups of amino acids are acceptable (or not acceptable) in each substrate binding pocket. For each protein substrate, there is a display to show how it is processed and degraded. Also described are tools on the website to help with the assessment of the physiological relevance of cleavages in a substrate. These tools rely on the hypothesis that a cleavage site that is conserved in orthologues is likely to be physiologically relevant, and alignments of substrate protein sequences are made utilizing the UniRef50 database, in which in each entry sequences are 50% or more identical. Conservation in this case means substitutions are permitted only if the amino acid is known to occupy the same substrate binding pocket from at least one other substrate cleaved by the same peptidase.
Collapse
Affiliation(s)
- Neil D Rawlings
- Wellcome Trust Sanger Institute and the EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, UK.
| |
Collapse
|
19
|
Rao VA. Perspectives on Engineering Biobetter Therapeutic Proteins with Greater Stability in Inflammatory Environments. BIOBETTERS 2015. [DOI: 10.1007/978-1-4939-2543-8_11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
20
|
Network analyses reveal pervasive functional regulation between proteases in the human protease web. PLoS Biol 2014; 12:e1001869. [PMID: 24865846 PMCID: PMC4035269 DOI: 10.1371/journal.pbio.1001869] [Citation(s) in RCA: 126] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2013] [Accepted: 04/16/2014] [Indexed: 11/21/2022] Open
Abstract
Network modeling of interactions between proteases and their inhibitors reveals a network of new protein connections and cascades in the protease web. Proteolytic processing is an irreversible posttranslational modification affecting a large portion of the proteome. Protease-cleaved mediators frequently exhibit altered activity, and biological pathways are often regulated by proteolytic processing. Many of these mechanisms have not been appreciated as being protease-dependent, and the potential in unraveling a complex new dimension of biological control is increasingly recognized. Proteases are currently believed to act individually or in isolated cascades. However, conclusive but scattered biochemical evidence indicates broader regulation of proteases by protease and inhibitor interactions. Therefore, to systematically study such interactions, we assembled curated protease cleavage and inhibition data into a global, computational representation, termed the protease web. This revealed that proteases pervasively influence the activity of other proteases directly or by cleaving intermediate proteases or protease inhibitors. The protease web spans four classes of proteases and inhibitors and so links both recently and classically described protease groups and cascades, which can no longer be viewed as operating in isolation in vivo. We demonstrated that this observation, termed reachability, is robust to alterations in the data and will only increase in the future as additional data are added. We further show how subnetworks of the web are operational in 23 different tissues reflecting different phenotypes. We applied our network to develop novel insights into biologically relevant protease interactions using cell-specific proteases of the polymorphonuclear leukocyte as a system. Predictions from the protease web on the activity of matrix metalloproteinase 8 (MMP8) and neutrophil elastase being linked by an inactivating cleavage of serpinA1 by MMP8 were validated and explain perplexing Mmp8−/− versus wild-type polymorphonuclear chemokine cleavages in vivo. Our findings supply systematically derived and validated evidence for the existence of the protease web, a network that affects the activity of most proteases and thereby influences the functional state of the proteome and cell activity. Proteases modify the structure and activity of all proteins by peptide bond hydrolysis and are increasingly recognized as integral regulatory components of numerous biological mechanisms. Deregulated protease activity is a common characteristic of many diseases. However, protease drug development is complicated by an incomplete understanding of protease biology. One missing piece in this puzzle is the interplay between proteases: Some proteases activate other proteases, whereas some proteases inactivate inhibitors, leading to currently unpredictable cleavage of additional proteins. Using database annotations we mathematically modeled protease interactions. Our model includes 1,230 proteins and shows connections between 141,523 pairs of proteases, substrates, and inhibitors. Thus, proteases interact on a large scale to form the protease web, which links most studied groups of proteases and their inhibitors, indicating that the potential of regulation through this network is very large. We found that this interplay is robust to targeted or untargeted pruning of the protease web and that protease inhibitors are central to network connectivity. Our model was used to decipher proteolytic pathways that drive inflammatory processes in vivo. Consequently, protease regulatory interactions should be recognized and explored further to understand in vivo roles and to select better drug targets that avoid side effects arising from inhibition of unexpected activities.
Collapse
|
21
|
Chang CCH, Tey BT, Song J, Ramanan RN. Towards more accurate prediction of protein folding rates: a review of the existing web-based bioinformatics approaches. Brief Bioinform 2014; 16:314-24. [DOI: 10.1093/bib/bbu007] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
|
22
|
Wang M, Zhao XM, Tan H, Akutsu T, Whisstock JC, Song J. Cascleave 2.0, a new approach for predicting caspase and granzyme cleavage targets. ACTA ACUST UNITED AC 2013; 30:71-80. [PMID: 24149049 DOI: 10.1093/bioinformatics/btt603] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Caspases and granzyme B (GrB) are important proteases involved in fundamental cellular processes and play essential roles in programmed cell death, necrosis and inflammation. Although a number of substrates for both types have been experimentally identified, the complete repertoire of caspases and granzyme B substrates remained to be fully characterized. Accordingly, systematic bioinformatics studies of known cleavage sites may provide important insights into their substrate specificity and facilitate the discovery of novel substrates. RESULTS We develop a new bioinformatics tool, termed Cascleave 2.0, which builds on previous success of the Cascleave tool for predicting generic caspase cleavage sites. It can be efficiently used to predict potential caspase-specific cleavage sites for the human caspase-1, 3, 6, 7, 8 and GrB. In particular, we integrate heterogeneous sequence and protein functional information from various sources to improve the prediction accuracy of Cascleave 2.0. During classification, we use both maximum relevance minimum redundancy and forward feature selection techniques to quantify the relative contribution of each feature to prediction and thus remove redundant as well as irrelevant features. A systematic evaluation of Cascleave 2.0 using the benchmark data and comparison with other state-of-the-art tools using independent test data indicate that Cascleave 2.0 outperforms other tools on protease-specific cleavage site prediction of caspase-1, 3, 6, 7 and GrB. Cascleave 2.0 is anticipated to be used as a powerful tool for identifying novel substrates and cleavage sites of caspases and GrB and help understand the functional roles of these important proteases in human proteolytic cascades. AVAILABILITY AND IMPLEMENTATION http://www.structbioinfor.org/cascleave2/.
Collapse
Affiliation(s)
- Mingjun Wang
- National Engineering Laboratory for Industrial Enzymes and Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, Department of Computer Science, School of Electronics and Information Engineering, Tongji University, Shanghai 201804, China, Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Victoria 3800, Australia, Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan and ARC Centre of Excellence for Structural and Functional Microbial Genomics, Monash University, Melbourne, Victoria 3800, Australia
| | | | | | | | | | | |
Collapse
|
23
|
Rendón-Ramírez A, Shukla M, Oda M, Chakraborty S, Minda R, Dandekar AM, Ásgeirsson B, Goñi FM, Rao BJ. A computational module assembled from different protease family motifs identifies PI PLC from Bacillus cereus as a putative prolyl peptidase with a serine protease scaffold. PLoS One 2013; 8:e70923. [PMID: 23940667 PMCID: PMC3733634 DOI: 10.1371/journal.pone.0070923] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2012] [Accepted: 06/28/2013] [Indexed: 12/12/2022] Open
Abstract
Proteolytic enzymes have evolved several mechanisms to cleave peptide bonds. These distinct types have been systematically categorized in the MEROPS database. While a BLAST search on these proteases identifies homologous proteins, sequence alignment methods often fail to identify relationships arising from convergent evolution, exon shuffling, and modular reuse of catalytic units. We have previously established a computational method to detect functions in proteins based on the spatial and electrostatic properties of the catalytic residues (CLASP). CLASP identified a promiscuous serine protease scaffold in alkaline phosphatases (AP) and a scaffold recognizing a β-lactam (imipenem) in a cold-active Vibrio AP. Subsequently, we defined a methodology to quantify promiscuous activities in a wide range of proteins. Here, we assemble a module which encapsulates the multifarious motifs used by protease families listed in the MEROPS database. Since APs and proteases are an integral component of outer membrane vesicles (OMV), we sought to query other OMV proteins, like phospholipase C (PLC), using this search module. Our analysis indicated that phosphoinositide-specific PLC from Bacillus cereus is a serine protease. This was validated by protease assays, mass spectrometry and by inhibition of the native phospholipase activity of PI-PLC by the well-known serine protease inhibitor AEBSF (IC50 = 0.018 mM). Edman degradation analysis linked the specificity of the protease activity to a proline in the amino terminal, suggesting that the PI-PLC is a prolyl peptidase. Thus, we propose a computational method of extending protein families based on the spatial and electrostatic congruence of active site residues.
Collapse
Affiliation(s)
- Adela Rendón-Ramírez
- Unidad de Biofísica (Consejo Superior de Investigaciones Científicas, Universidad del Pais Vasco/Euskal Herriko Unibertsitatea) and Departamento de Bioquímica, Universidad del País Vasco, Bilbao, Spain
| | - Manish Shukla
- Department of Biological Sciences, Tata Institute of Fundamental Research, Homi Bhabha Road, Mumbai, India
| | - Masataka Oda
- Department of Microbiology, Faculty of Pharmaceutical Science, Tokushima Bunri University, Tokushima, Japan
| | - Sandeep Chakraborty
- Department of Biological Sciences, Tata Institute of Fundamental Research, Mumbai, India
- * E-mail:
| | - Renu Minda
- Department of Biological Sciences, Tata Institute of Fundamental Research, Mumbai, India
| | - Abhaya M. Dandekar
- Plant Sciences Department, University of California, Davis, Davis, California, United States of America
| | - Bjarni Ásgeirsson
- Science Institute, Department of Biochemistry, University of Iceland, Dunhaga, Reykjavik, Iceland
| | - Félix M. Goñi
- Unidad de Biofísica (Consejo Superior de Investigaciones Científicas, Universidad del Pais Vasco/Euskal Herriko Unibertsitatea) and Departamento de Bioquímica, Universidad del País Vasco, Bilbao, Spain
| | - Basuthkar J. Rao
- Department of Biological Sciences, Tata Institute of Fundamental Research, Mumbai, India
| |
Collapse
|
24
|
PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites. PLoS One 2012; 7:e50300. [PMID: 23209700 PMCID: PMC3510211 DOI: 10.1371/journal.pone.0050300] [Citation(s) in RCA: 228] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2012] [Accepted: 10/18/2012] [Indexed: 12/04/2022] Open
Abstract
The ability to catalytically cleave protein substrates after synthesis is fundamental for all forms of life. Accordingly, site-specific proteolysis is one of the most important post-translational modifications. The key to understanding the physiological role of a protease is to identify its natural substrate(s). Knowledge of the substrate specificity of a protease can dramatically improve our ability to predict its target protein substrates, but this information must be utilized in an effective manner in order to efficiently identify protein substrates by in silico approaches. To address this problem, we present PROSPER, an integrated feature-based server for in silico identification of protease substrates and their cleavage sites for twenty-four different proteases. PROSPER utilizes established specificity information for these proteases (derived from the MEROPS database) with a machine learning approach to predict protease cleavage sites by using different, but complementary sequence and structure characteristics. Features used by PROSPER include local amino acid sequence profile, predicted secondary structure, solvent accessibility and predicted native disorder. Thus, for proteases with known amino acid specificity, PROSPER provides a convenient, pre-prepared tool for use in identifying protein substrates for the enzymes. Systematic prediction analysis for the twenty-four proteases thus far included in the database revealed that the features we have included in the tool strongly improve performance in terms of cleavage site prediction, as evidenced by their contribution to performance improvement in terms of identifying known cleavage sites in substrates for these enzymes. In comparison with two state-of-the-art prediction tools, PoPS and SitePrediction, PROSPER achieves greater accuracy and coverage. To our knowledge, PROSPER is the first comprehensive server capable of predicting cleavage sites of multiple proteases within a single substrate sequence using machine learning techniques. It is freely available at http://lightning.med.monash.edu.au/PROSPER/.
Collapse
|
25
|
RECENT ADVANCES ON STRUCTURAL BIOINFORMATICS, CELL MOTION SIMULATION, FUNCTIONAL MODULE IDENTIFICATION, COPY NUMBER VARIATION, AND PROTEASE SUBSTRATE PREDICTION AND SOME CRITICAL COMMENTS ON HMMER2. J Bioinform Comput Biol 2011. [DOI: 10.1142/s0219720011005380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
26
|
Song J, Matthews AY, Reboul CF, Kaiserman D, Pike RN, Bird PI, Whisstock JC. Predicting serpin/protease interactions. Methods Enzymol 2011; 501:237-73. [PMID: 22078538 DOI: 10.1016/b978-0-12-385950-1.00012-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Proteases are tightly regulated by specific inhibitors, such as serpins, which are able to undergo considerable and irreversible conformational changes in order to trap their targets. There has been a considerable effort to investigate serpin structure and functions in the past few decades; however, the specific interactions between proteases and serpins remain elusive. In this chapter, we describe detailed experimental protocols to determine and characterize the extended substrate specificity of proteases based on a substrate phage display technique. We also describe how to employ a bioinformatics system to analyze the substrate specificity data obtained from this technique and predict the potential inhibitory serpin partners of a protease (in this case, the immune protease, granzyme B) in a step-by-step manner. The method described here could also be applied to other proteases for more generalized substrate specificity analysis and substrate discovery.
Collapse
Affiliation(s)
- Jiangning Song
- Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Victoria, Australia
| | | | | | | | | | | | | |
Collapse
|