1
|
Shirvanizadeh N, Vihinen M. VariBench, new variation benchmark categories and data sets. FRONTIERS IN BIOINFORMATICS 2023; 3:1248732. [PMID: 37795169 PMCID: PMC10546188 DOI: 10.3389/fbinf.2023.1248732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 09/08/2023] [Indexed: 10/06/2023] Open
Affiliation(s)
| | - Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, Sweden
| |
Collapse
|
2
|
Santos-Gómez A, García-Recio A, Miguez-Cabello F, Soto D, Altafaj X, Olivella M. Identification of homologous GluN subunits variants accelerates GRIN variants stratification. Front Cell Neurosci 2022; 16:998719. [PMID: 36619673 PMCID: PMC9816381 DOI: 10.3389/fncel.2022.998719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 12/01/2022] [Indexed: 12/24/2022] Open
Abstract
The clinical spectrum of GRIN-related neurodevelopmental disorders (GRD) results from gene- and variant-dependent primary alterations of the NMDA receptor, disturbing glutamatergic neurotransmission. Despite GRIN gene variants' functional annotations being dually critical for stratification and precision medicine design, genetically diagnosed pathogenic GRIN variants currently outnumber their relative functional annotations. Based on high-resolution crystal 3D models and topological domains conservation between GluN1, GluN2A, and GluN2B subunits of the NMDAR, we have generated GluN1-GluN2A-GluN2B subunits structural superimposition model to find equivalent positions between GluN subunits. We have developed a GRIN structural algorithm that predicts functional changes in the equivalent structural positions in other GluN subunits. GRIN structural algorithm was computationally evaluated to the full GRIN missense variants repertoire, consisting of 4,525 variants. The analysis of this structure-based model revealed an absolute predictive power for GluN1, GluN2A, and GluN2B subunits, both in terms of pathogenicity-association (benign vs. pathogenic variants) and functional impact (loss-of-function, benign, gain-of-function). Further, we validated this computational algorithm experimentally, using an in silico library of GluN2B-equivalent GluN2A artificial variants, designed from pathogenic GluN2B variants. Thus, the implementation of the GRIN structural algorithm allows to computationally predict the pathogenicity and functional annotations of GRIN variants, resulting in the duplication of pathogenic GRIN variants assignment, reduction by 30% of GRIN variants with uncertain significance, and increase by 70% of functionally annotated GRIN variants. Finally, GRIN structural algorithm has been implemented into GRIN variants Database (http://lmc.uab.es/grindb), providing a computational tool that accelerates GRIN missense variants stratification, contributing to clinical therapeutic decisions for this neurodevelopmental disorder.
Collapse
Affiliation(s)
- Ana Santos-Gómez
- Neurophysiology Laboratory, Department of Biomedicine, Faculty of Medicine and Health Sciences, Institute of Neurosciences, University of Barcelona, Barcelona, Spain,August Pi i Sunyer Biomedical Research Institute (IDIBAPS), Barcelona, Spain
| | - Adrián García-Recio
- Bioinfomatics and Medical Statistics Group, University of Vic—Central University of Catalonia, Barcelona, Spain
| | - Federico Miguez-Cabello
- Neurophysiology Laboratory, Department of Biomedicine, Faculty of Medicine and Health Sciences, Institute of Neurosciences, University of Barcelona, Barcelona, Spain,August Pi i Sunyer Biomedical Research Institute (IDIBAPS), Barcelona, Spain
| | - David Soto
- Neurophysiology Laboratory, Department of Biomedicine, Faculty of Medicine and Health Sciences, Institute of Neurosciences, University of Barcelona, Barcelona, Spain,August Pi i Sunyer Biomedical Research Institute (IDIBAPS), Barcelona, Spain
| | - Xavier Altafaj
- Neurophysiology Laboratory, Department of Biomedicine, Faculty of Medicine and Health Sciences, Institute of Neurosciences, University of Barcelona, Barcelona, Spain,August Pi i Sunyer Biomedical Research Institute (IDIBAPS), Barcelona, Spain,*Correspondence: Xavier Altafaj Mireia Olivella
| | - Mireia Olivella
- Bioinfomatics and Medical Statistics Group, University of Vic—Central University of Catalonia, Barcelona, Spain,*Correspondence: Xavier Altafaj Mireia Olivella
| |
Collapse
|
3
|
Ge F, Zhu YH, Xu J, Muhammad A, Song J, Yu DJ. MutTMPredictor: Robust and accurate cascade XGBoost classifier for prediction of mutations in transmembrane proteins. Comput Struct Biotechnol J 2021; 19:6400-6416. [PMID: 34938415 PMCID: PMC8649221 DOI: 10.1016/j.csbj.2021.11.024] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Revised: 11/05/2021] [Accepted: 11/15/2021] [Indexed: 12/11/2022] Open
Abstract
Transmembrane proteins have critical biological functions and play a role in a multitude of cellular processes including cell signaling, transport of molecules and ions across membranes. Approximately 60% of transmembrane proteins are considered as drug targets. Missense mutations in such proteins can lead to many diverse diseases and disorders, such as neurodegenerative diseases and cystic fibrosis. However, there are limited studies on mutations in transmembrane proteins. In this work, we first design a new feature encoding method, termed weight attenuation position-specific scoring matrix (WAPSSM), which builds upon the protein evolutionary information. Then, we propose a new mutation prediction algorithm (cascade XGBoost) by leveraging the idea learned from consensus predictors and gcForest. Multi-level experiments illustrate the effectiveness of WAPSSM and cascade XGBoost algorithms. Finally, based on WAPSSM and other three types of features, in combination with the cascade XGBoost algorithm, we develop a new transmembrane protein mutation predictor, named MutTMPredictor. We benchmark the performance of MutTMPredictor against several existing predictors on seven datasets. On the 546 mutations dataset, MutTMPredictor achieves the accuracy (ACC) of 0.9661 and the Matthew's Correlation Coefficient (MCC) of 0.8950. While on the 67,584 dataset, MutTMPredictor achieves an MCC of 0.7523 and area under curve (AUC) of 0.8746, which are 0.1625 and 0.0801 respectively higher than those of the existing best predictor (fathmm). Besides, MutTMPredictor also outperforms two specific predictors on the Pred-MutHTP datasets. The results suggest that MutTMPredictor can be used as an effective method for predicting and prioritizing missense mutations in transmembrane proteins. The MutTMPredictor webserver and datasets are freely accessible at http://csbio.njust.edu.cn/bioinf/muttmpredictor/ for academic use.
Collapse
Key Words
- 1000 Genomes, 1000 genomes project consortium
- APOGEE, pathogenicity prediction through the logistic model tree
- BorodaTM, boosted regression trees for disease-associated mutations in transmembrane proteins
- COSMIC, catalogue of somatic mutations in cancer
- Cascade XGBoost
- ClinVar, clinical variants
- Condel, consensus deleteriousness score of missense mutations
- Disease-associated mutations
- Entprise, entropy and predicted protein structure
- ExAC, the exome aggregation consortium
- Meta-SNP, meta single nucleotide polymorphism
- Mutation prediction
- PROVEAN, protein variation effect analyzer
- PolyPhen, polymorphism phenotyping
- PolyPhen-2, polymorphism phenotyping v2
- Pred-MutHTP, prediction of mutations in human transmembrane proteins
- PredictSNP1, predict single nucleotide polymorphism v1
- Protein evolutionary information
- REVEL, rare exome variant ensemble learner
- SDM, site-directed mutate
- SIFT, sorting intolerant from tolerant
- SNAP, screening for non-acceptable polymorphisms
- SNP&GO, single nucleotide polymorphisms and gene ontology annotations
- SwissVar, variants in UniProtKB/Swiss-Prot
- TMSNP, transmembrane single nucleotide polymorphisms
- Transmembrane protein
- WEKA, waikato environment for knowledge analysis
- fathmm, functional analysis through hidden markov models
- humsavar, human polymorphisms and disease mutations
Collapse
Affiliation(s)
- Fang Ge
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Yi-Heng Zhu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Jian Xu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Arif Muhammad
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
- School of Systems and Technology, Department of Informatics and System, University of Management and Technology, Lahore, 54770, Pakistan
| | - Jiangning Song
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| |
Collapse
|
4
|
Garcia-Recio A, Gómez-Tamayo JC, Reina I, Campillo M, Cordomí A, Olivella M. Correction to 'TMSNP: a web server to predict pathogenesis of missense mutations in the transmembrane region of membrane proteins'. NAR Genom Bioinform 2021; 3:lqab076. [PMID: 34431838 PMCID: PMC8378517 DOI: 10.1093/nargab/lqab076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Affiliation(s)
- Adrián Garcia-Recio
- Laboratori de Medicina Computacional, Facultat de Medicina, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - José Carlos Gómez-Tamayo
- Pharmacoinformatics Group, Research Program on Biomedical Informatics (IMIM/UPF), 08003 Barcelona, Spain
| | - Iker Reina
- Laboratori de Medicina Computacional, Facultat de Medicina, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Mercedes Campillo
- Laboratori de Medicina Computacional, Facultat de Medicina, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Arnau Cordomí
- Laboratori de Medicina Computacional, Facultat de Medicina, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Mireia Olivella
- Bioinformatics and Medical Statistics Group, Facultat de Ciències i Tecnologia, UVIC-UCC, 08500 Vic, Barcelona, Spain
| |
Collapse
|