1
|
Topcu E, Ridgeway NH, Biggar KK. PeSA 2.0: A software tool for peptide specificity analysis implementing positive and negative motifs and motif-based peptide scoring. Comput Biol Chem 2022; 101:107753. [PMID: 35998543 DOI: 10.1016/j.compbiolchem.2022.107753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 08/05/2022] [Accepted: 08/09/2022] [Indexed: 11/26/2022]
Abstract
There are a vast number of molecular interactions that occur at the cellular level. Among these molecular interactions, interactions between multiple proteins are a widely studied area of research due to the importance of these interactions in cellular function and their potential in drug development. PeSA is a desktop application developed to facilitate the in vitro peptide study analysis to predict protein-protein interactions. PeSA can effortlessly generate visual outputs like motifs, bar charts, and visual matrices. Our implementation of PeSA version 2.0 includes additional tools, including the ability to further score peptide lists for consensus amongst interactions. The software is also able to design de novo peptides based on sequence motifs (sequence generator), which can be used to help design additional experiments for motif validation. Further, the efficacy of the sequence generator was validated using the lysine methyltransferase, SETD8, to identify new substrates of methylation based on motif-based predictions developed using PeSA2.0.
Collapse
Affiliation(s)
- Emine Topcu
- Institute of Biochemistry and Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario K1N 5B6, Canada
| | - Nashira H Ridgeway
- Institute of Biochemistry and Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario K1N 5B6, Canada
| | - Kyle K Biggar
- Institute of Biochemistry and Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario K1N 5B6, Canada.
| |
Collapse
|
2
|
Meng F, Liang Z, Zhao K, Luo C. Drug design targeting active posttranslational modification protein isoforms. Med Res Rev 2020; 41:1701-1750. [PMID: 33355944 DOI: 10.1002/med.21774] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 11/29/2020] [Accepted: 12/03/2020] [Indexed: 12/11/2022]
Abstract
Modern drug design aims to discover novel lead compounds with attractable chemical profiles to enable further exploration of the intersection of chemical space and biological space. Identification of small molecules with good ligand efficiency, high activity, and selectivity is crucial toward developing effective and safe drugs. However, the intersection is one of the most challenging tasks in the pharmaceutical industry, as chemical space is almost infinity and continuous, whereas the biological space is very limited and discrete. This bottleneck potentially limits the discovery of molecules with desirable properties for lead optimization. Herein, we present a new direction leveraging posttranslational modification (PTM) protein isoforms target space to inspire drug design termed as "Post-translational Modification Inspired Drug Design (PTMI-DD)." PTMI-DD aims to extend the intersections of chemical space and biological space. We further rationalized and highlighted the importance of PTM protein isoforms and their roles in various diseases and biological functions. We then laid out a few directions to elaborate the PTMI-DD in drug design including discovering covalent binding inhibitors mimicking PTMs, targeting PTM protein isoforms with distinctive binding sites from that of wild-type counterpart, targeting protein-protein interactions involving PTMs, and hijacking protein degeneration by ubiquitination for PTM protein isoforms. These directions will lead to a significant expansion of the biological space and/or increase the tractability of compounds, primarily due to precisely targeting PTM protein isoforms or complexes which are highly relevant to biological functions. Importantly, this new avenue will further enrich the personalized treatment opportunity through precision medicine targeting PTM isoforms.
Collapse
Affiliation(s)
- Fanwang Meng
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,Department of Chemistry and Chemical Biology, McMaster University, Hamilton, Ontario, Canada
| | - Zhongjie Liang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Kehao Zhao
- School of Pharmacy, Key Laboratory of Molecular Pharmacology and Drug Evaluation (Yantai University), Ministry of Education, Collaborative Innovation Center of Advanced Drug Delivery System and Biotech Drugs in Universities of Shandong, Yantai University, Yantai, China
| | - Cheng Luo
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
3
|
Bernheim S, Meilhac SM. Mesoderm patterning by a dynamic gradient of retinoic acid signalling. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190556. [PMID: 32829679 PMCID: PMC7482219 DOI: 10.1098/rstb.2019.0556] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/11/2020] [Indexed: 12/15/2022] Open
Abstract
Retinoic acid (RA), derived from vitamin A, is a major teratogen, clinically recognized in 1983. Identification of its natural presence in the embryo and dissection of its molecular mechanism of action became possible in the animal model with the advent of molecular biology, starting with the cloning of its nuclear receptor. In normal development, the dose of RA is tightly controlled to regulate organ formation. Its production depends on enzymes, which have a dynamic expression profile during embryonic development. As a small molecule, it diffuses rapidly and acts as a morphogen. Here, we review advances in deciphering how endogenously produced RA provides positional information to cells. We compare three mesodermal tissues, the limb, the somites and the heart, and discuss how RA signalling regulates antero-posterior and left-right patterning. A common principle is the establishment of its spatio-temporal dynamics by positive and negative feedback mechanisms and by antagonistic signalling by FGF. However, the response is cell-specific, pointing to the existence of cofactors and effectors, which are as yet incompletely characterized. This article is part of a discussion meeting issue 'Contemporary morphogenesis'.
Collapse
Affiliation(s)
- Ségolène Bernheim
- Imagine-Institut Pasteur, Laboratory of Heart Morphogenesis, 75015 Paris, France
- INSERM UMR1163, 75015 Paris, France
- Université de Paris, Paris, France
| | - Sigolène M. Meilhac
- Imagine-Institut Pasteur, Laboratory of Heart Morphogenesis, 75015 Paris, France
- INSERM UMR1163, 75015 Paris, France
- Université de Paris, Paris, France
| |
Collapse
|
5
|
Behjati Ardakani F, Schmidt F, Schulz MH. Predicting transcription factor binding using ensemble random forest models. F1000Res 2019; 7:1603. [PMID: 31723409 PMCID: PMC6823902 DOI: 10.12688/f1000research.16200.2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/15/2019] [Indexed: 12/03/2022] Open
Abstract
Background: Understanding the location and cell-type specific binding of Transcription Factors (TFs) is important in the study of gene regulation. Computational prediction of TF binding sites is challenging, because TFs often bind only to short DNA motifs and cell-type specific co-factors may work together with the same TF to determine binding. Here, we consider the problem of learning a general model for the prediction of TF binding using DNase1-seq data and TF motif description in form of position specific energy matrices (PSEMs). Methods: We use TF ChIP-seq data as a gold-standard for model training and evaluation. Our contribution is a novel ensemble learning approach using random forest classifiers. In the context of the
ENCODE-DREAM in vivo TF binding site prediction challenge we consider different learning setups. Results: Our results indicate that the ensemble learning approach is able to better generalize across tissues and cell-types compared to individual tissue-specific classifiers or a classifier built based upon data aggregated across tissues. Furthermore, we show that incorporating DNase1-seq peaks is essential to reduce the false positive rate of TF binding predictions compared to considering the raw DNase1 signal. Conclusions: Analysis of important features reveals that the models preferentially select motifs of other TFs that are close interaction partners in existing protein protein-interaction networks. Code generated in the scope of this project is available on GitHub:
https://github.com/SchulzLab/TFAnalysis (DOI: 10.5281/zenodo.1409697).
Collapse
Affiliation(s)
- Fatemeh Behjati Ardakani
- High throughput Genomics and Systems Biology, Cluster of Excellence on Multimodel Computing and Interaction, Saarland University, Saarbruecken,, Saarland, 66123, Germany.,Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbruecken, Saarland, 66123, Germany.,Graduate School of computer science, Saarland University, Saarbruecken, Saarland, 66123, Germany
| | - Florian Schmidt
- High throughput Genomics and Systems Biology, Cluster of Excellence on Multimodel Computing and Interaction, Saarland University, Saarbruecken,, Saarland, 66123, Germany.,Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbruecken, Saarland, 66123, Germany.,Graduate School of computer science, Saarland University, Saarbruecken, Saarland, 66123, Germany.,Computational Systems Biology, Genome Institute of Singapore, Singapore, Singapore
| | - Marcel H Schulz
- High throughput Genomics and Systems Biology, Cluster of Excellence on Multimodel Computing and Interaction, Saarland University, Saarbruecken,, Saarland, 66123, Germany.,Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbruecken, Saarland, 66123, Germany.,Institute for Cardiovasular Regeneration, Goethe University Frankfurt Am Main, Frankfurt Am Main, Hessen, 60590, Germany
| |
Collapse
|
6
|
Gao C, Sun H, Wang T, Tang M, Bohnen NI, Müller MLTM, Herman T, Giladi N, Kalinin A, Spino C, Dauer W, Hausdorff JM, Dinov ID. Model-based and Model-free Machine Learning Techniques for Diagnostic Prediction and Classification of Clinical Outcomes in Parkinson's Disease. Sci Rep 2018; 8:7129. [PMID: 29740058 PMCID: PMC5940671 DOI: 10.1038/s41598-018-24783-4] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 04/10/2018] [Indexed: 01/08/2023] Open
Abstract
In this study, we apply a multidisciplinary approach to investigate falls in PD patients using clinical, demographic and neuroimaging data from two independent initiatives (University of Michigan and Tel Aviv Sourasky Medical Center). Using machine learning techniques, we construct predictive models to discriminate fallers and non-fallers. Through controlled feature selection, we identified the most salient predictors of patient falls including gait speed, Hoehn and Yahr stage, postural instability and gait difficulty-related measurements. The model-based and model-free analytical methods we employed included logistic regression, random forests, support vector machines, and XGboost. The reliability of the forecasts was assessed by internal statistical (5-fold) cross validation as well as by external out-of-bag validation. Four specific challenges were addressed in the study: Challenge 1, develop a protocol for harmonizing and aggregating complex, multisource, and multi-site Parkinson's disease data; Challenge 2, identify salient predictive features associated with specific clinical traits, e.g., patient falls; Challenge 3, forecast patient falls and evaluate the classification performance; and Challenge 4, predict tremor dominance (TD) vs. posture instability and gait difficulty (PIGD). Our findings suggest that, compared to other approaches, model-free machine learning based techniques provide a more reliable clinical outcome forecasting of falls in Parkinson's patients, for example, with a classification accuracy of about 70-80%.
Collapse
Affiliation(s)
- Chao Gao
- Statistics Online Computational Resource, Department of Health Behavior and Biological Sciences, University of Michigan, Ann Arbor, MI, United States
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States
| | - Hanbo Sun
- Statistics Online Computational Resource, Department of Health Behavior and Biological Sciences, University of Michigan, Ann Arbor, MI, United States
- Department of Statistics, University of Michigan, Ann Arbor, MI, United States
| | - Tuo Wang
- Statistics Online Computational Resource, Department of Health Behavior and Biological Sciences, University of Michigan, Ann Arbor, MI, United States
- Department of Statistics, University of Michigan, Ann Arbor, MI, United States
| | - Ming Tang
- Statistics Online Computational Resource, Department of Health Behavior and Biological Sciences, University of Michigan, Ann Arbor, MI, United States
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States
| | - Nicolaas I Bohnen
- Department of Radiology, University of Michigan, Ann Arbor, MI, United States
- Department of Neurology and Ann Arbor VA Medical Center, University of Michigan, Ann Arbor, MI, United States
- Morris K. Udall Center of Excellence for Parkinson's Disease Research, University of Michigan, Ann Arbor, MI, United States
| | - Martijn L T M Müller
- Department of Radiology, University of Michigan, Ann Arbor, MI, United States
- Department of Neurology and Ann Arbor VA Medical Center, University of Michigan, Ann Arbor, MI, United States
- Morris K. Udall Center of Excellence for Parkinson's Disease Research, University of Michigan, Ann Arbor, MI, United States
| | - Talia Herman
- The Center for the Study of Movement, Cognition and Mobility, Neurological Institute, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Nir Giladi
- The Center for the Study of Movement, Cognition and Mobility, Neurological Institute, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
- Department of Neurology and Sieratzki Chair in Neurology, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Alexandr Kalinin
- Statistics Online Computational Resource, Department of Health Behavior and Biological Sciences, University of Michigan, Ann Arbor, MI, United States
- Morris K. Udall Center of Excellence for Parkinson's Disease Research, University of Michigan, Ann Arbor, MI, United States
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Cathie Spino
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States
- Morris K. Udall Center of Excellence for Parkinson's Disease Research, University of Michigan, Ann Arbor, MI, United States
| | - William Dauer
- Department of Neurology and Ann Arbor VA Medical Center, University of Michigan, Ann Arbor, MI, United States
- Morris K. Udall Center of Excellence for Parkinson's Disease Research, University of Michigan, Ann Arbor, MI, United States
| | - Jeffrey M Hausdorff
- The Center for the Study of Movement, Cognition and Mobility, Neurological Institute, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
- Sagol School of Neuroscience and Department of Physical Therapy, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Rush Alzheimer's Disease Center & Orthopaedic Surgery, Rush University, Chicago, IL, USA
| | - Ivo D Dinov
- Statistics Online Computational Resource, Department of Health Behavior and Biological Sciences, University of Michigan, Ann Arbor, MI, United States.
- Morris K. Udall Center of Excellence for Parkinson's Disease Research, University of Michigan, Ann Arbor, MI, United States.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA.
- Michigan Institute for Data Science, University of Michigan, Ann Arbor, MI, 48109, USA.
| |
Collapse
|