1
|
Al-Nakhle H, Al-Shahrani R, Al-Ahmadi J, Al-Madani W, Al-Juhani R. Integrative In Silico Analysis to Identify Functional and Structural Impacts of nsSNPs on Programmed Cell Death Protein 1 (PD-1) Protein and UTRs: Potential Biomarkers for Cancer Susceptibility. Genes (Basel) 2025; 16:307. [PMID: 40149458 PMCID: PMC11942535 DOI: 10.3390/genes16030307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2025] [Revised: 02/20/2025] [Accepted: 02/26/2025] [Indexed: 03/29/2025] Open
Abstract
Background: Programmed cell death protein 1 (PD-1), encoded by the PDCD1 gene, is critical in immune checkpoint regulation and cancer immune evasion. Variants in PDCD1 may alter its function, impacting cancer susceptibility and disease progression. Objectives: This study evaluates the structural, functional, and regulatory impacts of non-synonymous single-nucleotide polymorphisms (nsSNPs) in the PDCD1 gene, focusing on their pathogenic and oncogenic roles. Methods: Computational tools, including PredictSNP1.0, I-Mutant2.0, MUpro, HOPE, MutPred2, Cscape, Cscape-Somatic, GEPIA2, cBioPortal, and STRING, were used to analyze 695 nsSNPs in the PD1 protein. The analysis covered structural impacts, stability changes, regulatory effects, and oncogenic potential, focusing on conserved domains and protein-ligand interactions. Results: The analysis identified 84 deleterious variants, with 45 mapped to conserved regions like the Ig V-set domain essential for ligand-binding interactions. Stability analyses identified 78 destabilizing variants with significant protein instability (ΔΔG values). Ten nsSNPs were identified as potential cancer drivers. Expression profiling showed differential PDCD1 expression in tumor versus normal tissues, correlating with improved survival in skin melanoma but limited value in ovarian cancer. Regulatory SNPs disrupted miRNA-binding sites and transcriptional regulation, affecting PDCD1 expression. STRING analysis revealed key PD-1 protein partners within immune pathways, including PD-L1 and PD-L2. Conclusions: This study highlights the significance of PDCD1 nsSNPs as potential biomarkers for cancer susceptibility, advancing the understanding of PD-1 regulation. Experimental validation and multi-omics integration are crucial to refine these findings and enhance theraputic strategies.
Collapse
Affiliation(s)
- Hakeemah Al-Nakhle
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah 42353, Saudi Arabia
| | | | | | | | | |
Collapse
|
2
|
Zeng G, Zhao C, Li G, Huang Z, Zhuang J, Liang X, Yu X, Fang S. Identifying somatic driver mutations in cancer with a language model of the human genome. Comput Struct Biotechnol J 2025; 27:531-540. [PMID: 39968174 PMCID: PMC11833646 DOI: 10.1016/j.csbj.2025.01.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Revised: 01/12/2025] [Accepted: 01/14/2025] [Indexed: 02/20/2025] Open
Abstract
Somatic driver mutations play important roles in cancer and must be precisely identified to advance our understanding of tumorigenesis and its promotion and progression. However, identifying somatic driver mutations remains challenging in Homo sapiens genomics due to the random nature of mutations and the high cost of qualitative experiments. Building on the powerful sequence interpretation capabilities of language models, we propose a self-attention-based contextualized pretrained language model for somatic driver mutation identification. We pretrained the model with the Homo sapiens reference genome to equip it with the ability to understand genome sequences and then fine-tuned it for oncogene and tumor suppressor gene prediction tasks, enabling it to extract features related to driver genes from the original genome sequence. The fine-tuned model was used to obtain the mutations' carcinogenic effect characteristics to further identify whether the mutation is a driver or a passenger. Compared with other computational algorithms, our method achieved excellent somatic driver mutation identification performance on the test set, with an absolute improvement of 4.31% in AUROC over the best comparison method. The strong performance of our method indicates that it can provide new insights into the discovery of cancer drivers.
Collapse
Affiliation(s)
- Guangjian Zeng
- School of Biomedical Engineering, Shenzhen University, Shenzhen, China
- School of Public Health and Emergency Management, Southern University of Science and Technology, Shenzhen, China
| | - Chengzhi Zhao
- School of Public Health and Emergency Management, Southern University of Science and Technology, Shenzhen, China
| | - Guanpeng Li
- School of Public Health and Emergency Management, Southern University of Science and Technology, Shenzhen, China
| | - Zhengyang Huang
- School of Biomedical Engineering, Shenzhen University, Shenzhen, China
| | - Jinhu Zhuang
- Shenzhen Health Development Research and Data Management Center, Guangdong, China
| | - Xiaohua Liang
- Department of Clinical Epidemiology and Biostatistics, Children's Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Chongqing, China
| | - Xiaxia Yu
- School of Biomedical Engineering, Shenzhen University, Shenzhen, China
| | - Shenying Fang
- School of Public Health and Emergency Management, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
3
|
Bi C, Shi Y, Xia J, Liang Z, Wu Z, Xu K, Cheng N. Ensemble learning-based predictor for driver synonymous mutation with sequence representation. PLoS Comput Biol 2025; 21:e1012744. [PMID: 39761306 PMCID: PMC11737855 DOI: 10.1371/journal.pcbi.1012744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2024] [Revised: 01/16/2025] [Accepted: 12/21/2024] [Indexed: 01/18/2025] Open
Abstract
Synonymous mutations, once considered neutral, are now understood to have significant implications for a variety of diseases, particularly cancer. It is indispensable to identify these driver synonymous mutations in human cancers, yet current methods are constrained by data limitations. In this study, we initially investigate the impact of sequence-based features, including DNA shape, physicochemical properties and one-hot encoding of nucleotides, and deep learning-derived features from pre-trained chemical molecule language models based on BERT. Subsequently, we propose EPEL, an effect predictor for synonymous mutations employing ensemble learning. EPEL combines five tree-based models and optimizes feature selection to enhance predictive accuracy. Notably, the incorporation of DNA shape features and deep learning-derived features from chemical molecule represents a pioneering effect in assessing the impact of synonymous mutations in cancer. Compared to existing state-of-the-art methods, EPEL demonstrates superior performance on the independent test dataset. Furthermore, our analysis reveals a significant correlation between effect scores and patient outcomes across various cancer types. Interestingly, while deep learning methods have shown promise in other fields, their DNA sequence representations do not significantly enhance the identification of driver synonymous mutations in this study. Overall, we anticipate that EPEL will facilitate researchers to more precisely target driver synonymous mutations. EPEL is designed with flexibility, allowing users to retrain the prediction model and generate effect scores for synonymous mutations in human cancers. A user-friendly web server for EPEL is available at http://ahmu.EPEL.bio/.
Collapse
Affiliation(s)
- Chuanmei Bi
- School of Biomedical Engineering, Anhui Medical University, Hefei, China
| | - Yong Shi
- School of Biomedical Engineering, Anhui Medical University, Hefei, China
| | - Junfeng Xia
- Institutes of Physical Science and Information Technology, Anhui University, Hefei, China
| | - Zhen Liang
- School of Biomedical Engineering, Anhui Medical University, Hefei, China
- Affiliated Chuzhou hospital of Anhui Medical University, Chuzhou, China
| | - Zhiqiang Wu
- School of Biomedical Engineering, Anhui Medical University, Hefei, China
| | - Kai Xu
- School of Biomedical Engineering, Anhui Medical University, Hefei, China
| | - Na Cheng
- School of Biomedical Engineering, Anhui Medical University, Hefei, China
| |
Collapse
|
4
|
Al-nakhle HH, Yagoub HS, Alrehaili RY, Shaqroon OA, Khan MK, Alsharif GS. Elucidating the role of MLL1 nsSNPs: Structural and functional alterations and their contribution to leukemia development. PLoS One 2024; 19:e0304986. [PMID: 39405275 PMCID: PMC11478856 DOI: 10.1371/journal.pone.0304986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 05/21/2024] [Indexed: 10/19/2024] Open
Abstract
(1) BACKGROUND The Mixed lineage leukemia 1 (MLL1) gene, located on chromosome 11q23, plays a pivotal role in histone lysine-specific methylation and is consistently associated with various types of leukemia. Non-synonymous Single Nucleotide Polymorphisms (nsSNPs) have been tied to numerous diseases, including cancers, and have become valuable cancer biomarkers. There's a notable gap in studies probing the influence of SNPs on MLL1 protein structure, function, and subsequent modifications. (2) METHODS We utilized an array of bioinformatics tools, including PredictSNP, InterPro, ConSurf, I-Mutant2.0, MUpro, Musitedeep, Project HOPE, RegulomeDB, Mutpred2, and both CScape and CScape Somatic, to meticulously analyze the consequences of nsSNPs in the MLL1 gene. (3) RESULTS Out of 2,097 nsSNPs analyzed, 62 were determined to be significantly pathogenic by the PredictSNP tool, with ten crucial MLL1 functional domains identified using InterPro. Additionally, 50 of these nsSNPs had high conservation scores, hinting at potential effects on protein structure and function, while 32 were found to undermine MLL1 protein stability. Notably, four nsSNPs were deemed oncogenic, with two identified as cancer drivers. The nsSNP, D2724G, between the MLL1 protein's FY-rich domains, could disrupt proteolytic cleavage, altering gene expression patterns and potentially promoting cancer. (4) CONCLUSIONS Our research provides a comprehensive assessment of nsSNPs' impact in the MLL1 protein structure and function and consequently on leukemia development, suggesting potential avenues for personalized treatment, early detection, improved prognosis, and a deeper understanding of hematological malignancy genesis.
Collapse
Affiliation(s)
- Hakeemah H. Al-nakhle
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah, Saudi Arabia
| | - Hind S. Yagoub
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah, Saudi Arabia
- Faculty of Medical Laboratory Sciences, Omdurman Islamic University, Omdurman, Sudan
| | - Rahaf Y. Alrehaili
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah, Saudi Arabia
| | - Ola A. Shaqroon
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah, Saudi Arabia
| | - Minna K. Khan
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah, Saudi Arabia
| | - Ghaidaa S. Alsharif
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah, Saudi Arabia
| |
Collapse
|
5
|
Wang L, Sun H, Yue Z, Xia J, Li X. CDMPred: a tool for predicting cancer driver missense mutations with high-quality passenger mutations. PeerJ 2024; 12:e17991. [PMID: 39253604 PMCID: PMC11382650 DOI: 10.7717/peerj.17991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Accepted: 08/07/2024] [Indexed: 09/11/2024] Open
Abstract
Most computational methods for predicting driver mutations have been trained using positive samples, while negative samples are typically derived from statistical methods or putative samples. The representativeness of these negative samples in capturing the diversity of passenger mutations remains to be determined. To tackle these issues, we curated a balanced dataset comprising driver mutations sourced from the COSMIC database and high-quality passenger mutations obtained from the Cancer Passenger Mutation database. Subsequently, we encoded the distinctive features of these mutations. Utilizing feature correlation analysis, we developed a cancer driver missense mutation predictor called CDMPred employing feature selection through the ensemble learning technique XGBoost. The proposed CDMPred method, utilizing the top 10 features and XGBoost, achieved an area under the receiver operating characteristic curve (AUC) value of 0.83 and 0.80 on the training and independent test sets, respectively. Furthermore, CDMPred demonstrated superior performance compared to existing state-of-the-art methods for cancer-specific and general diseases, as measured by AUC and area under the precision-recall curve. Including high-quality passenger mutations in the training data proves advantageous for CDMPred's prediction performance. We anticipate that CDMPred will be a valuable tool for predicting cancer driver mutations, furthering our understanding of personalized therapy.
Collapse
Affiliation(s)
- Lihua Wang
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
- School of Information Engineering, Huangshan University, Huangshan, Anhui, China
| | - Haiyang Sun
- State Key Laboratory of Medicinal Chemical Biology, NanKai University, Tianjin, Tianjin, China
| | - Zhenyu Yue
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, China
| | - Junfeng Xia
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
| | - Xiaoyan Li
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
| |
Collapse
|
6
|
Giovannetti A, Lazzari S, Mangoni M, Traversa A, Mazza T, Parisi C, Caputo V. Exploring non-coding genetic variability in ACE2: Functional annotation and in vitro validation of regulatory variants. Gene 2024; 915:148422. [PMID: 38570058 DOI: 10.1016/j.gene.2024.148422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 02/23/2024] [Accepted: 03/13/2024] [Indexed: 04/05/2024]
Abstract
The surge in human whole-genome sequencing data has facilitated the study of non-coding region variations, yet understanding their biological significance remains a challenge. We used a computational workflow to assess the regulatory potential of non-coding variants, with a particular focus on the Angiotensin Converting Enzyme 2 (ACE2) gene. This gene is crucial in physiological processes and serves as the entry point for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus causing coronavirus disease 19 (COVID-19). In our analysis, using data from the gnomAD population database and functional annotation, we identified 17 significant Single Nucleotide Variants (SNVs) in ACE2, particularly in its enhancers, promoters, and 3' untranslated regions (UTRs). We found preliminary evidence supporting the regulatory impact of some of these variants on ACE2 expression. Our detailed examination of two SNVs, rs147718775 and rs140394675, in the ACE2 promoter revealed that these co-occurring SNVs, when mutated, significantly enhance promoter activity, suggesting a possible increase in specific ACE2 isoform expression. This method proves effective in identifying and interpreting impactful non-coding variants, aiding in further studies and enhancing understanding of molecular bases of monogenic and complex traits.
Collapse
Affiliation(s)
- Agnese Giovannetti
- Clinical Genomics Laboratory, Fondazione IRCCS Casa Sollievo della Sofferenza, Viale Cappuccini, snc, 71013 S. Giovanni Rotondo (FG), Italy.
| | - Sara Lazzari
- Department of Experimental Medicine, Sapienza University of Rome, Viale Regina Elena, 324, 00161 Rome, Italy.
| | - Manuel Mangoni
- Department of Experimental Medicine, Sapienza University of Rome, Viale Regina Elena, 324, 00161 Rome, Italy; Bioinformatics Laboratory, Fondazione IRCCS Casa Sollievo della Sofferenza, Viale Cappuccini, snc, 71013 S. Giovanni Rotondo (FG), Italy.
| | - Alice Traversa
- Department of Experimental Medicine, Sapienza University of Rome, Viale Regina Elena, 324, 00161 Rome, Italy; Dipartimento di Scienze della Vita, della Salute e delle Professioni Sanitarie, Università degli Studi "Link Campus University", Via del Casale di San Pio V 44, 00165 Roma, Italy.
| | - Tommaso Mazza
- Bioinformatics Laboratory, Fondazione IRCCS Casa Sollievo della Sofferenza, Viale Cappuccini, snc, 71013 S. Giovanni Rotondo (FG), Italy.
| | - Chiara Parisi
- Institute of Biochemistry and Cell Biology, CNR-National Research Council, Via Ercole Ramarini, 32, 00015 Monterotondo Scalo (RM), Italy.
| | - Viviana Caputo
- Department of Experimental Medicine, Sapienza University of Rome, Viale Regina Elena, 324, 00161 Rome, Italy.
| |
Collapse
|
7
|
Nakamura T, Ueda J, Mizuno S, Honda K, Kazuno AA, Yamamoto H, Hara T, Takata A. Topologically associating domains define the impact of de novo promoter variants on autism spectrum disorder risk. CELL GENOMICS 2024; 4:100488. [PMID: 38280381 PMCID: PMC10879036 DOI: 10.1016/j.xgen.2024.100488] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 08/24/2023] [Accepted: 01/02/2024] [Indexed: 01/29/2024]
Abstract
Whole-genome sequencing (WGS) studies of autism spectrum disorder (ASD) have demonstrated the roles of rare promoter de novo variants (DNVs). However, most promoter DNVs in ASD are not located immediately upstream of known ASD genes. In this study analyzing WGS data of 5,044 ASD probands, 4,095 unaffected siblings, and their parents, we show that promoter DNVs within topologically associating domains (TADs) containing ASD genes are significantly and specifically associated with ASD. An analysis considering TADs as functional units identified specific TADs enriched for promoter DNVs in ASD and indicated that common variants in these regions also confer ASD heritability. Experimental validation using human induced pluripotent stem cells (iPSCs) showed that likely deleterious promoter DNVs in ASD can influence multiple genes within the same TAD, resulting in overall dysregulation of ASD-associated genes. These results highlight the importance of TADs and gene-regulatory mechanisms in better understanding the genetic architecture of ASD.
Collapse
Affiliation(s)
- Takumi Nakamura
- Laboratory for Molecular Pathology of Psychiatric Disorders, RIKEN Center for Brain Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
| | - Junko Ueda
- Laboratory for Molecular Pathology of Psychiatric Disorders, RIKEN Center for Brain Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.
| | - Shota Mizuno
- Laboratory for Molecular Pathology of Psychiatric Disorders, RIKEN Center for Brain Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
| | - Kurara Honda
- Laboratory for Molecular Pathology of Psychiatric Disorders, RIKEN Center for Brain Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
| | - An-A Kazuno
- Laboratory for Molecular Pathology of Psychiatric Disorders, RIKEN Center for Brain Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
| | - Hirona Yamamoto
- Laboratory for Molecular Pathology of Psychiatric Disorders, RIKEN Center for Brain Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan; Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8654, Japan
| | - Tomonori Hara
- Laboratory for Molecular Pathology of Psychiatric Disorders, RIKEN Center for Brain Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan; Department of Organ Anatomy, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8575, Japan
| | - Atsushi Takata
- Laboratory for Molecular Pathology of Psychiatric Disorders, RIKEN Center for Brain Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan; Research Institute for Diseases of Old Age, Juntendo University Graduate School of Medicine, 2-1-1 Hongo, Bunkyo-ku, Tokyo 113-8421, Japan.
| |
Collapse
|
8
|
Cheng N, Bi C, Shi Y, Liu M, Cao A, Ren M, Xia J, Liang Z. Effect Predictor of Driver Synonymous Mutations Based on Multi-Feature Fusion and Iterative Feature Representation Learning. IEEE J Biomed Health Inform 2024; 28:1144-1151. [PMID: 38096097 DOI: 10.1109/jbhi.2023.3343075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Accurate identification of driver mutations is crucial in genetic studies of human cancers. While numerous cancer driver missense mutations have been identified, research into potential cancer drivers for synonymous mutations has shown limited success to date. Here, we developed a novel machine learning framework, epSMic, for predicting cancer driver synonymous mutations. epSMic employs an iterative feature representation scheme that facilitates the learning of discriminative features from various sequential models in a supervised iterative mode. We constructed the benchmark datasets and encoded the embedding sequence, physicochemical property, and basic information such as conservation and splicing feature. The evaluation results on benchmark test datasets demonstrate that epSMic outperforms existing methods, making it a valuable tool for researchers in identifying functional synonymous mutations in cancer. We hope epSMic can enable researchers to concentrate on synonymous mutations that have a functional impact on cancer.
Collapse
|
9
|
Nourbakhsh M, Degn K, Saksager A, Tiberti M, Papaleo E. Prediction of cancer driver genes and mutations: the potential of integrative computational frameworks. Brief Bioinform 2024; 25:bbad519. [PMID: 38261338 PMCID: PMC10805075 DOI: 10.1093/bib/bbad519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 11/27/2023] [Accepted: 12/11/2023] [Indexed: 01/24/2024] Open
Abstract
The vast amount of available sequencing data allows the scientific community to explore different genetic alterations that may drive cancer or favor cancer progression. Software developers have proposed a myriad of predictive tools, allowing researchers and clinicians to compare and prioritize driver genes and mutations and their relative pathogenicity. However, there is little consensus on the computational approach or a golden standard for comparison. Hence, benchmarking the different tools depends highly on the input data, indicating that overfitting is still a massive problem. One of the solutions is to limit the scope and usage of specific tools. However, such limitations force researchers to walk on a tightrope between creating and using high-quality tools for a specific purpose and describing the complex alterations driving cancer. While the knowledge of cancer development increases daily, many bioinformatic pipelines rely on single nucleotide variants or alterations in a vacuum without accounting for cellular compartments, mutational burden or disease progression. Even within bioinformatics and computational cancer biology, the research fields work in silos, risking overlooking potential synergies or breakthroughs. Here, we provide an overview of databases and datasets for building or testing predictive cancer driver tools. Furthermore, we introduce predictive tools for driver genes, driver mutations, and the impact of these based on structural analysis. Additionally, we suggest and recommend directions in the field to avoid silo-research, moving towards integrative frameworks.
Collapse
Affiliation(s)
- Mona Nourbakhsh
- Cancer Systems Biology, Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Kristine Degn
- Cancer Systems Biology, Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Astrid Saksager
- Cancer Systems Biology, Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Matteo Tiberti
- Cancer Structural Biology, Danish Cancer Institute, 2100 Copenhagen, Denmark
| | - Elena Papaleo
- Cancer Systems Biology, Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
- Cancer Structural Biology, Danish Cancer Institute, 2100 Copenhagen, Denmark
| |
Collapse
|
10
|
Al-nakhle HH, Yagoub HS, Anbarkhan SH, Alamri GA, Alsubaie NM. In Silico Evaluation of Coding and Non-Coding nsSNPs in the Thrombopoietin Receptor ( MPL) Proto-Oncogene: Assessing Their Influence on Protein Stability, Structure, and Function. Curr Issues Mol Biol 2023; 45:9390-9412. [PMID: 38132435 PMCID: PMC10742084 DOI: 10.3390/cimb45120589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 11/12/2023] [Accepted: 11/21/2023] [Indexed: 12/23/2023] Open
Abstract
The thrombopoietin receptor (MPL) gene is a critical regulator of hematopoiesis, and any alterations in its structure or function can result in a range of hematological disorders. Non-synonymous single nucleotide polymorphisms (nsSNPs) in MPL have the potential to disrupt normal protein function, prompting our investigation into the most deleterious MPL SNPs and the associated structural changes affecting protein-protein interactions. We employed a comprehensive suite of bioinformatics tools, including PredictSNP, InterPro, ConSurf, I-Mutant2.0, MUpro, Musitedeep, Project HOPE, STRING, RegulomeDB, Mutpred2, CScape, and CScape Somatic, to analyze 635 nsSNPs within the MPL gene. Among the analyzed nsSNPs, PredictSNP identified 28 as significantly pathogenic, revealing three critical functional domains within MPL. Ten of these nsSNPs exhibited high conservation scores, indicating potential effects on protein structure and function, while 14 were found to compromise MPL protein stability. Although the most harmful nsSNPs did not directly impact post-translational modification sites, 13 had the capacity to substantially alter the protein's physicochemical properties. Some mutations posed a risk to vital protein-protein interactions crucial for hematological functions, and three non-coding region nsSNPs displayed significant regulatory potential with potential implications for hematopoiesis. Furthermore, 13 out of 21 nsSNPs evaluated were classified as high-risk pathogenic variants by Mutpred2. Notably, amino acid alterations such as C291S, T293N, D295G, and W435C, while impactful on protein stability and function, were deemed non-oncogenic "passenger" mutations. Our study underscores the substantial impact of missense nsSNPs on MPL protein structure and function. Given MPL's central role in hematopoiesis, these mutations can significantly disrupt hematological processes, potentially leading to a variety of disorders. The identified high-risk pathogenic nsSNPs may hold promise as potential biomarkers or therapeutic targets for hematological diseases. This research lays the foundation for future investigations into the MPL gene's role in the realm of hematological health and diseases.
Collapse
Affiliation(s)
- Hakeemah H. Al-nakhle
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah 42353, Saudi Arabia; (H.S.Y.); (S.H.A.); (N.M.A.)
| | - Hind S. Yagoub
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah 42353, Saudi Arabia; (H.S.Y.); (S.H.A.); (N.M.A.)
- Faculty of Medical Laboratory Sciences, Omdurman Islamic University, Omdurman 14415, Sudan
| | - Sadin H. Anbarkhan
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah 42353, Saudi Arabia; (H.S.Y.); (S.H.A.); (N.M.A.)
| | - Ghadah A. Alamri
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah 42353, Saudi Arabia; (H.S.Y.); (S.H.A.); (N.M.A.)
| | - Norah M. Alsubaie
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taibah University, Al-Madinah Al-Monawarah 42353, Saudi Arabia; (H.S.Y.); (S.H.A.); (N.M.A.)
| |
Collapse
|
11
|
Ahmad RM, Ali BR, Al-Jasmi F, Sinnott RO, Al Dhaheri N, Mohamad MS. A review of genetic variant databases and machine learning tools for predicting the pathogenicity of breast cancer. Brief Bioinform 2023; 25:bbad479. [PMID: 38149678 PMCID: PMC10782903 DOI: 10.1093/bib/bbad479] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 09/22/2023] [Accepted: 12/04/2023] [Indexed: 12/28/2023] Open
Abstract
Studies continue to uncover contributing risk factors for breast cancer (BC) development including genetic variants. Advances in machine learning and big data generated from genetic sequencing can now be used for predicting BC pathogenicity. However, it is unclear which tool developed for pathogenicity prediction is most suited for predicting the impact and pathogenicity of variant effects. A significant challenge is to determine the most suitable data source for each tool since different tools can yield different prediction results with different data inputs. To this end, this work reviews genetic variant databases and tools used specifically for the prediction of BC pathogenicity. We provide a description of existing genetic variants databases and, where appropriate, the diseases for which they have been established. Through example, we illustrate how they can be used for prediction of BC pathogenicity and discuss their associated advantages and disadvantages. We conclude that the tools that are specialized by training on multiple diverse datasets from different databases for the same disease have enhanced accuracy and specificity and are thereby more helpful to the clinicians in predicting and diagnosing BC as early as possible.
Collapse
Affiliation(s)
- Rahaf M Ahmad
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
| | - Bassam R Ali
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
| | - Fatma Al-Jasmi
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
- Division of Metabolic Genetics, Department of Pediatrics, Tawam Hospital, Al Ain, United Arab Emirates
| | - Richard O Sinnott
- School of Computing and Information System, Faculty of Engineering and Information Technology, The University of Melbourne, Melbourne, Victoria, Australia
| | - Noura Al Dhaheri
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
- Division of Metabolic Genetics, Department of Pediatrics, Tawam Hospital, Al Ain, United Arab Emirates
| | - Mohd Saberi Mohamad
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
| |
Collapse
|
12
|
Nourbakhsh M, Saksager A, Tom N, Chen XS, Colaprico A, Olsen C, Tiberti M, Papaleo E. A workflow to study mechanistic indicators for driver gene prediction with Moonlight. Brief Bioinform 2023; 24:bbad274. [PMID: 37551622 PMCID: PMC10516357 DOI: 10.1093/bib/bbad274] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 06/28/2023] [Accepted: 07/10/2023] [Indexed: 08/09/2023] Open
Abstract
Prediction of driver genes (tumor suppressors and oncogenes) is an essential step in understanding cancer development and discovering potential novel treatments. We recently proposed Moonlight as a bioinformatics framework to predict driver genes and analyze them in a system-biology-oriented manner based on -omics integration. Moonlight uses gene expression as a primary data source and combines it with patterns related to cancer hallmarks and regulatory networks to identify oncogenic mediators. Once the oncogenic mediators are identified, it is important to include extra levels of evidence, called mechanistic indicators, to identify driver genes and to link the observed gene expression changes to the underlying alteration that promotes them. Such a mechanistic indicator could be for example a mutation in the regulatory regions for the candidate gene. Here, we developed new functionalities and released Moonlight2 to provide the user with a mutation-based mechanistic indicator as a second layer of evidence. These functionalities analyze mutations in a cancer cohort to classify them into driver and passenger mutations. Those oncogenic mediators with at least one driver mutation are retained as the final set of driver genes. We applied Moonlight2 to the basal-like breast cancer subtype, lung adenocarcinoma and thyroid carcinoma using data from The Cancer Genome Atlas. For example, in basal-like breast cancer, we found four oncogenes (COPZ2, SF3B4, KRTCAP2 and POLR2J) and nine tumor suppressor genes (KIR2DL4, KIF26B, ARL15, ARHGAP25, EMCN, GMFG, TPK1, NR5A2 and TEK) containing a driver mutation in their promoter region, possibly explaining their deregulation. Moonlight2R is available at https://github.com/ELELAB/Moonlight2R.
Collapse
Affiliation(s)
- Mona Nourbakhsh
- Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, Lyngby, Denmark
| | - Astrid Saksager
- Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, Lyngby, Denmark
| | - Nikola Tom
- Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, Lyngby, Denmark
| | - Xi Steven Chen
- Department of Public Health Sciences, University of Miami Miller School of Medicine, Miami, FL 33136, USA
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Antonio Colaprico
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Catharina Olsen
- Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Clinical Sciences, Reproduction and Genetics
- Brussels Interuniversity Genomics High Throughput core (BRIGHTcore), VUB-ULB, Brussels 1090, Belgium
- Interuniversity Institute of Bioinformatics in Brussels (IB)2, Brussels 1050, Belgium
| | - Matteo Tiberti
- Cancer Structural Biology, Danish Cancer Institute, Copenhagen, Denmark
| | - Elena Papaleo
- Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, Lyngby, Denmark
- Cancer Structural Biology, Danish Cancer Institute, Copenhagen, Denmark
| |
Collapse
|
13
|
Urwyler-Rösselet C, Tanghe G, Devos M, Hulpiau P, Saeys Y, Declercq W. Functions of the RIP kinase family members in the skin. Cell Mol Life Sci 2023; 80:285. [PMID: 37688617 PMCID: PMC10492769 DOI: 10.1007/s00018-023-04917-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 07/08/2023] [Accepted: 08/08/2023] [Indexed: 09/11/2023]
Abstract
The receptor interacting protein kinases (RIPK) are a family of serine/threonine kinases that are involved in the integration of various stress signals. In response to several extracellular and/or intracellular stimuli, RIP kinases engage signaling cascades leading to the activation of NF-κB and mitogen-activated protein kinases, cell death, inflammation, differentiation and Wnt signaling and can have kinase-dependent and kinase-independent functions. Although it was previously suggested that seven RIPKs are part of the RIPK family, phylogenetic analysis indicates that there are only five genuine RIPKs. RIPK1 and RIPK3 are mainly involved in controlling and executing necroptosis in keratinocytes, while RIPK4 controls proliferation and differentiation of keratinocytes and thereby can act as a tumor suppressor in skin. Therefore, in this review we summarize and discuss the functions of RIPKs in skin homeostasis as well as the signaling pathways involved.
Collapse
Affiliation(s)
- Corinne Urwyler-Rösselet
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
- VIB Center for Inflammation Research, Ghent, Belgium
- Department of Biology, Institute of Molecular Health Sciences, ETH Zurich, 8093, Zurich, Switzerland
| | - Giel Tanghe
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
- VIB Center for Inflammation Research, Ghent, Belgium
| | - Michael Devos
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
- VIB Center for Inflammation Research, Ghent, Belgium
| | - Paco Hulpiau
- VIB Center for Inflammation Research, Ghent, Belgium
- Howest University of Applied Sciences, Brugge, Belgium
| | - Yvan Saeys
- VIB Center for Inflammation Research, Ghent, Belgium
- Department of Applied Mathematics and Computer Science, Ghent University, Ghent, Belgium
| | - Wim Declercq
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium.
- VIB Center for Inflammation Research, Ghent, Belgium.
| |
Collapse
|
14
|
Johnson A, Ng PKS, Kahle M, Castillo J, Amador B, Wang Y, Zeng J, Holla V, Vu T, Su F, Kim SH, Conway T, Jiang X, Chen K, Shaw KRM, Yap TA, Rodon J, Mills GB, Meric-Bernstam F. Actionability classification of variants of unknown significance correlates with functional effect. NPJ Precis Oncol 2023; 7:67. [PMID: 37454202 DOI: 10.1038/s41698-023-00420-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 07/03/2023] [Indexed: 07/18/2023] Open
Abstract
Genomically-informed therapy requires consideration of the functional impact of genomic alterations on protein expression and/or function. However, a substantial number of variants are of unknown significance (VUS). The MD Anderson Precision Oncology Decision Support (PODS) team developed an actionability classification scheme that categorizes VUS as either "Unknown" or "Potentially" actionable based on their location within functional domains and/or proximity to known oncogenic variants. We then compared PODS VUS actionability classification with results from a functional genomics platform consisting of mutant generation and cell viability assays. 106 (24%) of 438 VUS in 20 actionable genes were classified as oncogenic in functional assays. Variants categorized by PODS as Potentially actionable (N = 204) were more likely to be oncogenic than those categorized as Unknown (N = 230) (37% vs 13%, p = 4.08e-09). Our results demonstrate that rule-based actionability classification of VUS can identify patients more likely to have actionable variants for consideration with genomically-matched therapy.
Collapse
Affiliation(s)
- Amber Johnson
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Patrick Kwok-Shing Ng
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Michael Kahle
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Julia Castillo
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Bianca Amador
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Yujia Wang
- Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Jia Zeng
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Vijaykumar Holla
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Thuy Vu
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Fei Su
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Sun-Hee Kim
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Tara Conway
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Xianli Jiang
- Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Ken Chen
- Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Kenna R Mills Shaw
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Timothy A Yap
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Investigational Cancer Therapeutics (Phase I Program), The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Jordi Rodon
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Investigational Cancer Therapeutics (Phase I Program), The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Gordon B Mills
- Division of Oncological Sciences, Knight Cancer Institute, Oregon Health and Science University, Portland, OR, USA
| | - Funda Meric-Bernstam
- Institute for Personalized Cancer Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- Investigational Cancer Therapeutics (Phase I Program), The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
| |
Collapse
|
15
|
Wang L, Sun J, Ma S, Xia J, Li X. PredDSMC: A predictor for driver synonymous mutations in human cancers. Front Genet 2023; 14:1164593. [PMID: 37051593 PMCID: PMC10083435 DOI: 10.3389/fgene.2023.1164593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 03/09/2023] [Indexed: 03/29/2023] Open
Abstract
Introduction: Driver mutations play a critical role in the occurrence and development of human cancers. Most studies have focused on missense mutations that function as drivers in cancer. However, accumulating experimental evidence indicates that synonymous mutations can also act as driver mutations.Methods: Here, we proposed a computational method called PredDSMC to accurately predict driver synonymous mutations in human cancers. We first systematically explored four categories of multimodal features, including sequence features, splicing features, conservation scores, and functional scores. Further feature selection was carried out to remove redundant features and improve the model performance. Finally, we utilized the random forest classifier to build PredDSMC.Results: The results of two independent test sets indicated that PredDSMC outperformed the state-of-the-art methods in differentiating driver synonymous mutations from passenger mutations.Discussion: In conclusion, we expect that PredDSMC, as a driver synonymous mutation prediction method, will be a valuable method for gaining a deeper understanding of synonymous mutations in human cancers.
Collapse
|
16
|
Pandey M, Anoosha P, Yesudhas D, Gromiha MM. Identification of potential driver mutations in glioblastoma using machine learning. Brief Bioinform 2022; 23:6764546. [PMID: 36266243 DOI: 10.1093/bib/bbac451] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 09/13/2022] [Accepted: 09/22/2022] [Indexed: 12/14/2022] Open
Abstract
Glioblastoma is a fast and aggressively growing tumor in the brain and spinal cord. Mutation of amino acid residues in targets proteins, which are involved in glioblastoma, alters the structure and function and may lead to disease. In this study, we collected a set of 9386 disease-causing (drivers) mutations based on the recurrence in patient samples and experimentally annotated as pathogenic and 8728 as neutral (passenger) mutations. We observed that Arg is highly preferred at the mutant sites of drivers, whereas Met and Ile showed preferences in passengers. Inspecting neighboring residues at the mutant sites revealed that the motifs YP, CP and GRH, are preferred in drivers, whereas SI, IQ and TVI are dominant in neutral. In addition, we have computed other sequence-based features such as conservation scores, Position Specific Scoring Matrices (PSSM) and physicochemical properties, and developed a machine learning-based method, GBMDriver (GlioBlastoma Multiforme Drivers), for distinguishing between driver and passenger mutations. Our method showed an accuracy and AUC of 73.59% and 0.82, respectively, on 10-fold cross-validation and 81.99% and 0.87 in a blind set of 1809 mutants. The tool is available at https://web.iitm.ac.in/bioinfo2/GBMDriver/index.html. We envisage that the present method is helpful to prioritize driver mutations in glioblastoma and assist in identifying therapeutic targets.
Collapse
Affiliation(s)
- Medha Pandey
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - P Anoosha
- Division of Medical Oncology, Department of Internal Medicine, Comprehensive Cancer Center, The Ohio State University, Columbus, Ohio, USA
| | - Dhanusha Yesudhas
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| |
Collapse
|
17
|
Saxena S, Krishna Murthy TP, Chandrashekhar CR, Patil LS, Aditya A, Shukla R, Yadav AK, Singh TR, Samantaray M, Ramaswamy A. A bioinformatics approach to the identification of novel deleterious mutations of human TPMT through validated screening and molecular dynamics. Sci Rep 2022; 12:18872. [PMID: 36344599 PMCID: PMC9640560 DOI: 10.1038/s41598-022-23488-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Accepted: 11/01/2022] [Indexed: 11/09/2022] Open
Abstract
Polymorphisms of Thiopurine S-methyltransferase (TPMT) are known to be associated with leukemia, inflammatory bowel diseases, and more. The objective of the present study was to identify novel deleterious missense SNPs of TPMT through a comprehensive in silico protocol. The initial SNP screening protocol used to identify deleterious SNPs from the pool of all TPMT SNPs in the dbSNP database yielded an accuracy of 83.33% in identifying extremely dangerous variants. Five novel deleterious missense SNPs (W33G, W78R, V89E, W150G, and L182P) of TPMT were identified through the aforementioned screening protocol. These 5 SNPs were then subjected to conservation analysis, interaction analysis, oncogenic and phenotypic analysis, structural analysis, PTM analysis, and molecular dynamics simulations (MDS) analysis to further assess and analyze their deleterious nature. Oncogenic analysis revealed that all five SNPs are oncogenic. MDS analysis revealed that all SNPs are deleterious due to the alterations they cause in the binding energy of the wild-type protein. Plasticity-induced instability caused by most of the mutations as indicated by the MDS results has been hypothesized to be the reason for this alteration. While in vivo or in vitro protocols are more conclusive, they are often more challenging and expensive. Hence, future research endeavors targeted at TPMT polymorphisms and/or their consequences in relevant disease progressions or treatments, through in vitro or in vivo means can give a higher priority to these SNPs rather than considering the massive pool of all SNPs of TPMT.
Collapse
Affiliation(s)
- Sidharth Saxena
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, Karnataka, 560054, India
| | - T P Krishna Murthy
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, Karnataka, 560054, India.
| | - C R Chandrashekhar
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, Karnataka, 560054, India
| | - Lavan S Patil
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, Karnataka, 560054, India
| | - Abhinav Aditya
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, Karnataka, 560054, India
| | - Rohit Shukla
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology (JUIT), Solan, Himachal Pradesh, 173234, India
| | - Arvind Kumar Yadav
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology (JUIT), Solan, Himachal Pradesh, 173234, India
| | - Tiratha Raj Singh
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology (JUIT), Solan, Himachal Pradesh, 173234, India
| | - Mahesh Samantaray
- Department of Bioinformatics, Pondicherry University, Pondicherry, 605014, India
| | - Amutha Ramaswamy
- Department of Bioinformatics, Pondicherry University, Pondicherry, 605014, India
| |
Collapse
|
18
|
Saxena S, Murthy TPK, Chandramohan V, Achyuth S, Maansi M, Das P, Sineagha V, Prakash S. In-silico analysis of deleterious single nucleotide polymorphisms of PNMT gene. MOLECULAR SIMULATION 2022. [DOI: 10.1080/08927022.2022.2094922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Affiliation(s)
- Sidharth Saxena
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, India
| | | | - Vivek Chandramohan
- Department of Biotechnology, Siddaganga Institute of Technology, Tumakuru, India
| | - Sai Achyuth
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, India
| | - M. Maansi
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, India
| | - Papiya Das
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, India
| | - V. Sineagha
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, India
| | - Sriraksha Prakash
- Department of Biotechnology, Ramaiah Institute of Technology, Bengaluru, India
| |
Collapse
|
19
|
Cancer-related Mutations with Local or Long-range Effects on an Allosteric Loop of p53. J Mol Biol 2022; 434:167663. [PMID: 35659507 DOI: 10.1016/j.jmb.2022.167663] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 05/19/2022] [Accepted: 05/25/2022] [Indexed: 12/31/2022]
Abstract
The tumor protein 53 (p53) is involved in transcription-dependent and independent processes. Several p53 variants related to cancer have been found to impact protein stability. Other variants, on the contrary, might have little impact on structural stability and have local or long-range effects on the p53 interactome. Our group previously identified a loop in the DNA binding domain (DBD) of p53 (residues 207-213) which can recruit different interactors. Experimental structures of p53 in complex with other proteins strengthen the importance of this interface for protein-protein interactions. We here characterized with structure-based approaches somatic and germline variants of p53 which could have a marginal effect in terms of stability and act locally or allosterically on the region 207-213 with consequences on the cytosolic functions of this protein. To this goal, we studied 1132 variants in the p53 DBD with structure-based approaches, accounting also for protein dynamics. We focused on variants predicted with marginal effects on structural stability. We then investigated each of these variants for their impact on DNA binding, dimerization of the p53 DBD, and intramolecular contacts with the 207-213 region. Furthermore, we identified variants that could modulate long-range the conformation of the region 207-213 using a coarse-grain model for allostery and all-atom molecular dynamics simulations. Our predictions have been further validated using enhanced sampling methods for 15 variants. The methodologies used in this study could be more broadly applied to other p53 variants or cases where conformational changes of loop regions are essential in the function of disease-related proteins.
Collapse
|
20
|
Structural and functional analysis of disease-associated mutations in GOT1 gene: An in silico study. Comput Biol Med 2021; 136:104695. [PMID: 34352456 DOI: 10.1016/j.compbiomed.2021.104695] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 07/23/2021] [Indexed: 11/20/2022]
Abstract
Disease-associated single nucleotide polymorphisms (SNPs) alter the natural functioning and the structure of proteins. Glutamic-oxaloacetic transaminase 1 (GOT1) is a gene associated with multiple cancers and neurodegenerative diseases which codes for aspartate aminotransferase. The present study involved a comprehensive in-silico analysis of the disease-associated SNPs of human GOT1. Four highly deleterious nsSNPs (L36R, Y159C, W162C and L345P) were identified through SNP screening using several sequence-based and structure-based tools. Conservation analysis and oncogenic analysis showed that most of the nsSNPs are at highly conserved residues, oncogenic in nature and cancer drivers. Molecular dynamics simulations (MDS) analysis was performed to understand the dynamic behaviour of native and mutant proteins. PTM analysis revealed that the nsSNP Y159C is at a PTM site and will mostly affect phosphorylation at that site. Based on the overall analyses carried out in this study, L36R is the most deleterious mutation amongst the aforementioned deleterious mutations of GOT1.
Collapse
|
21
|
Rogers MF, Gaunt TR, Campbell C. Prediction of driver variants in the cancer genome via machine learning methodologies. Brief Bioinform 2021; 22:bbaa250. [PMID: 33094325 PMCID: PMC8293831 DOI: 10.1093/bib/bbaa250] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Revised: 09/04/2020] [Accepted: 09/06/2020] [Indexed: 01/18/2023] Open
Abstract
Sequencing technologies have led to the identification of many variants in the human genome which could act as disease-drivers. As a consequence, a variety of bioinformatics tools have been proposed for predicting which variants may drive disease, and which may be causatively neutral. After briefly reviewing generic tools, we focus on a subset of these methods specifically geared toward predicting which variants in the human cancer genome may act as enablers of unregulated cell proliferation. We consider the resultant view of the cancer genome indicated by these predictors and discuss ways in which these types of prediction tools may be progressed by further research.
Collapse
Affiliation(s)
| | - Tom R Gaunt
- MRC Integrative Epidemiology Unit, University of Bristol
| | - Colin Campbell
- University of Bristol with interests in machine learning and medical bioinformatics
| |
Collapse
|
22
|
Nussinov R, Jang H, Nir G, Tsai CJ, Cheng F. A new precision medicine initiative at the dawn of exascale computing. Signal Transduct Target Ther 2021; 6:3. [PMID: 33402669 PMCID: PMC7785737 DOI: 10.1038/s41392-020-00420-3] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 10/27/2020] [Accepted: 10/30/2020] [Indexed: 12/14/2022] Open
Abstract
Which signaling pathway and protein to select to mitigate the patient's expected drug resistance? The number of possibilities facing the physician is massive, and the drug combination should fit the patient status. Here, we briefly review current approaches and data and map an innovative patient-specific strategy to forecast drug resistance targets that centers on parallel (or redundant) proliferation pathways in specialized cells. It considers the availability of each protein in each pathway in the specific cell, its activating mutations, and the chromatin accessibility of its encoding gene. The construction of the resulting Proliferation Pathway Network Atlas will harness the emerging exascale computing and advanced artificial intelligence (AI) methods for therapeutic development. Merging the resulting set of targets, pathways, and proteins, with current strategies will augment the choice for the attending physicians to thwart resistance.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD, 21702, USA.
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv, 69978, Israel.
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD, 21702, USA
| | - Guy Nir
- Department of Genetics, Harvard Medical School, Boston, MA, 02115, USA
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, 02115, USA
- Department of Biochemistry & Molecular Biology, Department of Neuroscience, Cell Biology and Anatomy, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, TX, 77555, USA
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD, 21702, USA
| | - Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, 44106, USA
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, 44195, USA
| |
Collapse
|