1
|
Phan A, Joshi P, Kadelka C, Friedberg I. A longitudinal analysis of function annotations of the human proteome reveals consistently high biases. Database (Oxford) 2025; 2025:baaf036. [PMID: 40338520 PMCID: PMC12060720 DOI: 10.1093/database/baaf036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2024] [Revised: 02/28/2025] [Accepted: 04/08/2025] [Indexed: 05/09/2025]
Abstract
The resources required to study gene function are limited, especially when considering the number of genes in the human genome and the complexity of their function. Therefore, genes are prioritized for experimental studies based on many different considerations, including, but not limited to, perceived biomedical importance, such as disease-associated genes, or the understanding of biological processes, such as cell signalling pathways. At the same time, most genes are not studied or are under-characterized, which hampers our understanding of their function and potential effects on human health and wellness. Understanding function annotation disparity is a necessary first step toward understanding how much functional knowledge is gained from the human genome, and toward guidelines for better targeting future studies of the genes in the human genome effectively. Here, we present a comprehensive longitudinal analysis of the human proteome utilizing data analysis tools from economics and information theory. Specifically, we view the human proteome as a population of proteins within a knowledge economy: we treat the quantified knowledge of the protein's function as the analogue of wealth and examine the distribution of information in a population of proteins in the proteome in the same manner distribution of wealth is studied in societies. Our results show a highly skewed distribution of information about human proteins over the last decade, in which the inequality in the annotations given to the proteins remains high. Additionally, we examine the correlation between the knowledge about protein function as captured in databases and the interest in proteins as reflected by mentions in the scientific literature. We show a large gap between knowledge and interest and dissect the factors leading to this gap. In conclusion, our study shows that research efforts should be redirected to less studied proteins to mitigate the disparity among human proteins both in databases and literature.
Collapse
Affiliation(s)
- An Phan
- Program in Bioinformatics and Computational Biology, Iowa State University, Ames, IA, United States
- Department of Mathematics, Iowa State University, Ames, IA, United States
| | - Parnal Joshi
- Program in Bioinformatics and Computational Biology, Iowa State University, Ames, IA, United States
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, United States
| | - Claus Kadelka
- Department of Mathematics, Iowa State University, Ames, IA, United States
| | - Iddo Friedberg
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, United States
| |
Collapse
|
2
|
Hari K, Harlapur P, Saxena A, Haldar K, Girish A, Malpani T, Levine H, Jolly MK. Low dimensionality of phenotypic space as an emergent property of coordinated teams in biological regulatory networks. iScience 2025; 28:111730. [PMID: 39898023 PMCID: PMC11787609 DOI: 10.1016/j.isci.2024.111730] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 09/14/2024] [Accepted: 12/30/2024] [Indexed: 02/04/2025] Open
Abstract
Cell-fate decisions involve coordinated genome-wide expression changes, typically leading to a limited number of phenotypes. Although often modeled as simple toggle switches, these rather simplistic representations often disregard the complexity of regulatory networks governing these changes. Here, we unravel design principles underlying complex cell decision-making networks in multiple contexts. We show that the emergent dynamics of these networks and corresponding transcriptomic data are consistently low-dimensional, as quantified by the variance explained by principal component 1 (PC1). This low dimensionality in phenotypic space arises from extensive feedback loops in these networks arranged to effectively enable the formation of two teams of mutually inhibiting nodes. We use team strength as a metric to quantify these feedback interactions and show its strong correlation with PC1 variance. Using artificial networks of varied topologies, we also establish the conditions for generating canalized cell-fate landscapes, offering insights into diverse binary cellular decision-making networks.
Collapse
Affiliation(s)
- Kishore Hari
- Department of Bioengineering, Indian Institute of Science, Bengaluru, Karnataka 560012, India
- Center for Theoretical Biological Physics, Northeastern University, Boston, MA 02115, USA
- Department of Physics, Northeastern University, Boston, MA 02115, USA
| | - Pradyumna Harlapur
- Department of Bioengineering, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| | - Aashna Saxena
- Department of Bioengineering, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| | - Kushal Haldar
- Department of Bioengineering, Indian Institute of Science, Bengaluru, Karnataka 560012, India
- Indian Institute of Science Education and Research Kolkata, Kolkata, West Bengal 741246, India
| | - Aishwarya Girish
- Department of Bioengineering, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| | - Tanisha Malpani
- Department of Bioengineering, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| | - Herbert Levine
- Center for Theoretical Biological Physics, Northeastern University, Boston, MA 02115, USA
- Department of Physics, Northeastern University, Boston, MA 02115, USA
| | - Mohit Kumar Jolly
- Department of Bioengineering, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| |
Collapse
|
3
|
Kagan D, Jubran J, Yeger-Lotem E, Fire M. Network-based anomaly detection algorithm reveals proteins with major roles in human tissues. Gigascience 2025; 14:giaf034. [PMID: 40197822 PMCID: PMC11976396 DOI: 10.1093/gigascience/giaf034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Revised: 12/27/2024] [Accepted: 03/05/2025] [Indexed: 04/10/2025] Open
Abstract
BACKGROUND Proteins act through physical interactions with other molecules to maintain organismal health. Protein-protein interaction (PPI) networks have proved to be a powerful framework for obtaining insight into protein functions, cellular organization, response to signals, and disease states. In multicellular organisms, protein content varies between tissues, influencing tissue morphology and function. Weighted PPI networks, reflecting the likelihood of interactions in specific tissues, offer insights into tissue-specific processes and disease mechanisms. We hypothesized that detecting anomalous nodes in these networks could reveal proteins with key tissue-specific functions. RESULTS Here, we introduce Weighted Graph Anomalous Node Detection (WGAND), a novel machine-learning algorithm to identify anomalous nodes in weighted graphs. WGAND estimates expected edge weights and uses deviations to generate anomaly detection features, which are then used to score network nodes. We applied WGAND to weighted PPI networks of 17 human tissues. High-ranking anomalous nodes were enriched for proteins associated with tissue-specific diseases and tissue-specific biological processes, such as neuron signaling in the brain and spermatogenesis in the testis. WGAND outperformed other methods in terms of area under the ROC curve and precision at K, highlighting its effectiveness in uncovering biologically meaningful anomalies. CONCLUSIONS Our findings demonstrate WGAND's potential as a powerful tool for detecting anomalous proteins with significant biological roles. By identifying proteins involved in critical tissue-specific processes and diseases, WGAND offers valuable insights for discovering novel biomarkers and therapeutic targets. Its versatile algorithm is suitable for any weighted graph and is broadly applicable across various fields. The WGAND algorithm is available as an open-source Python library at https://github.com/data4goodlab/wgand.
Collapse
Affiliation(s)
- Dima Kagan
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Juman Jubran
- Department of Clinical Biochemistry and Pharmacology, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Esti Yeger-Lotem
- Department of Clinical Biochemistry and Pharmacology, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
- The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Michael Fire
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| |
Collapse
|
4
|
Argov CM, Shneyour A, Jubran J, Sabag E, Mansbach A, Sepunaru Y, Filtzer E, Gruber G, Volozhinsky M, Yogev Y, Birk O, Chalifa-Caspi V, Rokach L, Yeger-Lotem E. Tissue-aware interpretation of genetic variants advances the etiology of rare diseases. Mol Syst Biol 2024; 20:1187-1206. [PMID: 39285047 PMCID: PMC11535248 DOI: 10.1038/s44320-024-00061-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 08/08/2024] [Accepted: 08/09/2024] [Indexed: 09/19/2024] Open
Abstract
Pathogenic variants underlying Mendelian diseases often disrupt the normal physiology of a few tissues and organs. However, variant effect prediction tools that aim to identify pathogenic variants are typically oblivious to tissue contexts. Here we report a machine-learning framework, denoted "Tissue Risk Assessment of Causality by Expression for variants" (TRACEvar, https://netbio.bgu.ac.il/TRACEvar/ ), that offers two advancements. First, TRACEvar predicts pathogenic variants that disrupt the normal physiology of specific tissues. This was achieved by creating 14 tissue-specific models that were trained on over 14,000 variants and combined 84 attributes of genetic variants with 495 attributes derived from tissue omics. TRACEvar outperformed 10 well-established and tissue-oblivious variant effect prediction tools. Second, the resulting models are interpretable, thereby illuminating variants' mode of action. Application of TRACEvar to variants of 52 rare-disease patients highlighted pathogenicity mechanisms and relevant disease processes. Lastly, the interpretation of all tissue models revealed that top-ranking determinants of pathogenicity included attributes of disease-affected tissues, particularly cellular process activities. Collectively, these results show that tissue contexts and interpretable machine-learning models can greatly enhance the etiology of rare diseases.
Collapse
Affiliation(s)
- Chanan M Argov
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Ariel Shneyour
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Juman Jubran
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Eric Sabag
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Avigdor Mansbach
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Yair Sepunaru
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Emmi Filtzer
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Gil Gruber
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Miri Volozhinsky
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Yuval Yogev
- Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health Sciences, Ben Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Ohad Birk
- Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health Sciences, Ben Gurion University of the Negev, Beer Sheva, 84105, Israel
- The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Vered Chalifa-Caspi
- Ilse Katz Institute for Nanoscale Science & Technology, Ben-Gurion University of the Negev, Beer-Sheva, 84105, Israel
| | - Lior Rokach
- Department of Software & Information Systems Engineering, Faculty of Engineering Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
| | - Esti Yeger-Lotem
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel.
- The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel.
| |
Collapse
|
5
|
da Silva Rosa SC, Barzegar Behrooz A, Guedes S, Vitorino R, Ghavami S. Prioritization of genes for translation: a computational approach. Expert Rev Proteomics 2024; 21:125-147. [PMID: 38563427 DOI: 10.1080/14789450.2024.2337004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 02/21/2024] [Indexed: 04/04/2024]
Abstract
INTRODUCTION Gene identification for genetic diseases is critical for the development of new diagnostic approaches and personalized treatment options. Prioritization of gene translation is an important consideration in the molecular biology field, allowing researchers to focus on the most promising candidates for further investigation. AREAS COVERED In this paper, we discussed different approaches to prioritize genes for translation, including the use of computational tools and machine learning algorithms, as well as experimental techniques such as knockdown and overexpression studies. We also explored the potential biases and limitations of these approaches and proposed strategies to improve the accuracy and reliability of gene prioritization methods. Although numerous computational methods have been developed for this purpose, there is a need for computational methods that incorporate tissue-specific information to enable more accurate prioritization of candidate genes. Such methods should provide tissue-specific predictions, insights into underlying disease mechanisms, and more accurate prioritization of genes. EXPERT OPINION Using advanced computational tools and machine learning algorithms to prioritize genes, we can identify potential targets for therapeutic intervention of complex diseases. This represents an up-and-coming method for drug development and personalized medicine.
Collapse
Affiliation(s)
- Simone C da Silva Rosa
- Department of Human Anatomy and Cell Science, Max Rady College of Medicine, Rady Faculty of Health Science, University of Manitoba, Winnipeg, Canada
| | - Amir Barzegar Behrooz
- Department of Human Anatomy and Cell Science, Max Rady College of Medicine, Rady Faculty of Health Science, University of Manitoba, Winnipeg, Canada
- Electrophysiology Research Center, Neuroscience Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Sofia Guedes
- LAQV/REQUIMTE, Department of Chemistry, University of Aveiro, Aveiro, Portugal
| | - Rui Vitorino
- LAQV/REQUIMTE, Department of Chemistry, University of Aveiro, Aveiro, Portugal
- Department of Medical Sciences, Institute of Biomedicine-iBiMED, University of Aveiro, Aveiro, Portugal
- UnIC@RISE, Department of Surgery and Physiology, Faculty of Medicine of the University of Porto, Porto, Portugal
| | - Saeid Ghavami
- Department of Human Anatomy and Cell Science, Max Rady College of Medicine, Rady Faculty of Health Science, University of Manitoba, Winnipeg, Canada
- Faculty of Medicine in Zabrze, Academia of Silesia, Katowice, Poland
- Research Institute of Oncology and Hematology, Cancer Care Manitoba, University of Manitoba, Winnipeg, Canada
| |
Collapse
|
6
|
Simonovsky E, Sharon M, Ziv M, Mauer O, Hekselman I, Jubran J, Vinogradov E, Argov CM, Basha O, Kerber L, Yogev Y, Segrè AV, Im HK, GTEx Consortium, Birk O, Rokach L, Yeger‐Lotem E. Predicting molecular mechanisms of hereditary diseases by using their tissue-selective manifestation. Mol Syst Biol 2023; 19:e11407. [PMID: 37232043 PMCID: PMC10407743 DOI: 10.15252/msb.202211407] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 04/30/2023] [Accepted: 05/10/2023] [Indexed: 05/27/2023] Open
Abstract
How do aberrations in widely expressed genes lead to tissue-selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed "Tissue Risk Assessment of Causality by Expression" (TRACE), a machine learning approach to predict genes that underlie tissue-selective diseases and selectivity-related features. TRACE utilized 4,744 biologically interpretable tissue-specific gene features that were inferred from heterogeneous omics datasets. Application of TRACE to 1,031 disease genes uncovered known and novel selectivity-related features, the most common of which was previously overlooked. Next, we created a catalog of tissue-associated risks for 18,927 protein-coding genes (https://netbio.bgu.ac.il/trace/). As proof-of-concept, we prioritized candidate disease genes identified in 48 rare-disease patients. TRACE ranked the verified disease gene among the patient's candidate genes significantly better than gene prioritization methods that rank by gene constraint or tissue expression. Thus, tissue selectivity combined with machine learning enhances genetic and clinical understanding of hereditary diseases.
Collapse
Affiliation(s)
- Eyal Simonovsky
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Moran Sharon
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Maya Ziv
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Omry Mauer
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Idan Hekselman
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Juman Jubran
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Ekaterina Vinogradov
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Chanan M Argov
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Omer Basha
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Lior Kerber
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Yuval Yogev
- Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health SciencesBen Gurion University of the NegevBeer ShevaIsrael
| | - Ayellet V Segrè
- Ocular Genomics Institute, Massachusetts Eye and EarHarvard Medical SchoolBostonMAUSA
- The Broad Institute of MIT and HarvardCambridgeMAUSA
| | - Hae Kyung Im
- Section of Genetic Medicine, Department of MedicineThe University of ChicagoChicagoILUSA
| | | | - Ohad Birk
- Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health SciencesBen Gurion University of the NegevBeer ShevaIsrael
- The National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Lior Rokach
- Department of Software & Information Systems EngineeringBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Esti Yeger‐Lotem
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
- The National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeer ShevaIsrael
| |
Collapse
|
7
|
Zhang X, Gunda A, Kranenbarg EMK, Liefers GJ, Savitha BA, Shrivastava P, Serkad CPVK, Kaur T, Eshwaraiah MS, Tollenaar RAEM, van de Velde CJH, Seynaeve CMJ, Bakre M, Kuppen PJK. Ten-year distant-recurrence risk prediction in breast cancer by CanAssist Breast (CAB) in Dutch sub-cohort of the randomized TEAM trial. Breast Cancer Res 2023; 25:40. [PMID: 37060036 PMCID: PMC10103430 DOI: 10.1186/s13058-023-01643-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 03/30/2023] [Indexed: 04/16/2023] Open
Abstract
BACKGROUND Hormone receptor (HR)-positive, HER2/neu-negative breast cancers have a sustained risk of recurrence up to 20 years from diagnosis. TEAM (Tamoxifen, Exemestane Adjuvant Multinational) is a large, multi-country, phase III trial that randomized 9776 women for the use of hormonal therapy. Of these 2754 were Dutch patients. The current study aims for the first time to correlate the ten-year clinical outcomes with predictions by CanAssist Breast (CAB)-a prognostic test developed in South East Asia, on a Dutch sub-cohort that participated in the TEAM. The total Dutch TEAM cohort and the current Dutch sub-cohort were almost similar with respect to patient age and tumor anatomical features. METHODS Of the 2754 patients from the Netherlands, which are part of the original TEAM trial, 592 patients' samples were available with Leiden University Medical Center (LUMC). The risk stratification of CAB was correlated with outcomes of patients using logistic regression approaches entailing Kaplan-Meier survival curves, univariate and multivariate cox-regression hazards model. We used hazard ratios (HRs), the cumulative incidence of distant metastasis/death due to breast cancer (DM), and distant recurrence-free interval (DRFi) for assessment. RESULTS Out of 433 patients finally included, the majority, 68.4% had lymph node-positive disease, while only a minority received chemotherapy (20.8%) in addition to endocrine therapy. CAB stratified 67.5% of the total cohort as low-risk [DM = 11.5% (95% CI, 7.6-15.2)] and 32.5% as high-risk [DM = 30.2% (95% CI, 21.9-37.6)] with an HR of 2.90 (95% CI, 1.75-4.80; P < 0.001) at ten years. CAB risk score was an independent prognostic factor in the consideration of clinical parameters in multivariate analysis. At ten years, CAB high-risk had the worst DRFi of 69.8%, CAB low-risk in the exemestane monotherapy arm had the best DRFi of 92.7% [vs CAB high-risk, HR, 0.21 (95% CI, 0.11-0.43), P < 0.001], and CAB low-risk in the sequential arm had a DRFi of 84.2% [vs CAB high-risk, HR, 0.48 (95% CI, 0.28-0.82), P = 0.009]. CONCLUSIONS Cost-effective CAB is a statistically robust prognostic and predictive tool for ten-year DM for postmenopausal women with HR+/HER2-, early breast cancer. CAB low-risk patients who received exemestane monotherapy had an excellent ten-year DRFi.
Collapse
Affiliation(s)
- Xi Zhang
- Department of Surgery, Leiden University Medical Center (LUMC), Albinusdreef 2, Leiden, 2333 ZA, The Netherlands
| | - Aparna Gunda
- OncoStem Diagnostics Pvt Ltd, #4, Raja Ram Mohan Roy Road, Aanand Tower, 2nd Floor, Bangalore, 560027, India
| | | | - Gerrit-Jan Liefers
- Geriatric Oncology Research Group, Leiden University Medical Center (LUMC), Leiden, The Netherlands
| | | | - Payal Shrivastava
- OncoStem Diagnostics Pvt Ltd, #4, Raja Ram Mohan Roy Road, Aanand Tower, 2nd Floor, Bangalore, 560027, India
| | | | - Taranjot Kaur
- OncoStem Diagnostics Pvt Ltd, #4, Raja Ram Mohan Roy Road, Aanand Tower, 2nd Floor, Bangalore, 560027, India
| | | | - Rob A E M Tollenaar
- Department of Surgery, Leiden University Medical Center (LUMC), Albinusdreef 2, Leiden, 2333 ZA, The Netherlands
| | - Cornelis J H van de Velde
- Department of Surgery, Leiden University Medical Center (LUMC), Albinusdreef 2, Leiden, 2333 ZA, The Netherlands
| | | | - Manjiri Bakre
- OncoStem Diagnostics Pvt Ltd, #4, Raja Ram Mohan Roy Road, Aanand Tower, 2nd Floor, Bangalore, 560027, India.
| | - Peter J K Kuppen
- Department of Surgery, Leiden University Medical Center (LUMC), Albinusdreef 2, Leiden, 2333 ZA, The Netherlands.
| |
Collapse
|
8
|
Azmi MB, Naeem U, Saleem A, Jawed A, Usman H, Qureshi SA, Azim MK. In silico identification of the rare-coding pathogenic mutations and structural modeling of human NNAT gene associated with anorexia nervosa. Eat Weight Disord 2022; 27:2725-2744. [PMID: 35655118 DOI: 10.1007/s40519-022-01422-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 05/10/2022] [Indexed: 10/18/2022] Open
Abstract
PURPOSE Increased susceptibility towards anorexia nervosa (AN) was reported with reduced levels of neuronatin (NNAT) gene. We sought to investigate the most pathogenic rare-coding missense mutations, non-synonymous single-nucleotide polymorphisms (nsSNPs) of NNAT and their potential damaging impact on protein function through transcript level sequence and structure based in silico approaches. METHODS Gene sequence, single nucleotide polymorphisms (SNPs) of NNAT was retrieved from public databases and the putative post-translational modification (PTM) sites were analyzed. Distinctive in silico algorithms were recruited for transcript level SNPs analyses and to characterized high-risk rare-coding nsSNPs along with their impact on protein stability function. Ab initio 3D-modeling of wild-type, alternate model prediction for most deleterious nsSNP, validation and recognition of druggable binding pockets were also performed. AN 3D therapeutic compounds that followed rule of drug-likeness were docked with most pathogenic variant of NNAT to estimate the drugs' binding free energies. RESULTS Conclusively, 10 transcript (201-205)-based nsSNPs from 3 rare-coding missense variants, i.e., rs539681368, rs542858994, rs560845323 out of 840 exonic SNPs were identified. Transcript-based functional impact analyses predicted rs539681368 (C30Y) from NNAT-204 as the high-risk rare-coding pathogenic nsSNP, deviating protein functions. The 3D-modeling analysis of AN drugs' binding energies indicated lowest binding free energy (ΔG) and significant inhibition constant (Ki) with mutant models C30Y. CONCLUSIONS Mutant model (C30Y) exhibiting significant drug binding affinity and the commonest interaction observed at the acetylation site K59. Thus, based on these findings, we concluded that the identified nsSNP may serve as potential targets for various studies, diagnosis and therapeutic interventions. LEVEL OF EVIDENCE No level of evidence-open access bioinformatics research.
Collapse
Affiliation(s)
- Muhammad Bilal Azmi
- Department of Biochemistry, Dow Medical College, Dow University of Health Sciences, Karachi, Pakistan.
| | - Unaiza Naeem
- Dow Medical College, Dow University of Health Sciences, Karachi, Pakistan
| | - Arisha Saleem
- Dow Medical College, Dow University of Health Sciences, Karachi, Pakistan
| | - Areesha Jawed
- Dow Medical College, Dow University of Health Sciences, Karachi, Pakistan
| | - Haroon Usman
- Department of Biochemistry, University of Karachi, Karachi, Pakistan
| | | | - M Kamran Azim
- Department of Biosciences, Mohammad Ali Jinnah University, Karachi, Pakistan
| |
Collapse
|