1
|
Luppino F, Lenz S, Chow CFW, Toth-Petroczy A. Deep learning tools predict variants in disordered regions with lower sensitivity. BMC Genomics 2025; 26:367. [PMID: 40221640 PMCID: PMC11992697 DOI: 10.1186/s12864-025-11534-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2025] [Accepted: 03/27/2025] [Indexed: 04/14/2025] Open
Abstract
BACKGROUND The recent AI breakthrough of AlphaFold2 has revolutionized 3D protein structural modeling, proving crucial for protein design and variant effects prediction. However, intrinsically disordered regions-known for their lack of well-defined structure and lower sequence conservation-often yield low-confidence models. The latest Variant Effect Predictor (VEP), AlphaMissense, leverages AlphaFold2 models, achieving over 90% sensitivity and specificity in predicting variant effects. However, the effectiveness of tools for variants in disordered regions, which account for 30% of the human proteome, remains unclear. RESULTS In this study, we found that predicting pathogenicity for variants in disordered regions is less accurate than in ordered regions, particularly for mutations at the first N-Methionine site. Investigations into the efficacy of variant effect predictors on intrinsically disordered regions (IDRs) indicated that mutations in IDRs are predicted with lower sensitivity and the gap between sensitivity and specificity is largest in disordered regions, especially for AlphaMissense and VARITY. CONCLUSIONS The prevalence of IDRs within the human proteome, coupled with the increasing repertoire of biological functions they are known to perform, necessitated an investigation into the efficacy of state-of-the-art VEPs on such regions. This analysis revealed their consistently reduced sensitivity and differing prediction performance profile to ordered regions, indicating that new IDR-specific features and paradigms are needed to accurately classify disease mutations within those regions.
Collapse
Affiliation(s)
- Federica Luppino
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307, Dresden, Germany
- Center for Systems Biology Dresden, Pfotenhauerstrasse 108, 01307, Dresden, Germany
| | - Swantje Lenz
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307, Dresden, Germany
- Center for Systems Biology Dresden, Pfotenhauerstrasse 108, 01307, Dresden, Germany
| | - Chi Fung Willis Chow
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307, Dresden, Germany
- Center for Systems Biology Dresden, Pfotenhauerstrasse 108, 01307, Dresden, Germany
- Cluster of Excellence Physics of Life, TU Dresden, 01062, Dresden, Germany
| | - Agnes Toth-Petroczy
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307, Dresden, Germany.
- Center for Systems Biology Dresden, Pfotenhauerstrasse 108, 01307, Dresden, Germany.
- Cluster of Excellence Physics of Life, TU Dresden, 01062, Dresden, Germany.
| |
Collapse
|
2
|
Ohno S, Manabe N, Yamaguchi Y. Prediction of protein structure and AI. J Hum Genet 2024; 69:477-480. [PMID: 38177398 DOI: 10.1038/s10038-023-01215-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 12/10/2023] [Indexed: 01/06/2024]
Abstract
AlphaFold, an artificial intelligence (AI)-based tool for predicting the 3D structure of proteins, is now widely recognized for its high accuracy and versatility in the folding of human proteins. AlphaFold is useful for understanding structure-function relationships from protein 3D structure models and can serve as a template or a reference for experimental structural analysis including X-ray crystallography, NMR and cryo-EM analysis. Its use is expanding among researchers, not only in structural biology but also in other research fields. Researchers are currently exploring the full potential of AlphaFold-generated protein models. Predicting disease severity caused by missense mutations is one such application. This article provides an overview of the 3D structural modeling of AlphaFold based on deep learning techniques and highlights the challenges in predicting the pathogenicity of missense mutations.
Collapse
Affiliation(s)
- Shiho Ohno
- Division of Structural Glycobiology, Institute of Molecular Biomembrane and Glycobiology, Tohoku Medical and Pharmaceutical University, 4-4-1 Komatsushima, Aoba-ku, Sendai, Miyagi, 981-8558, Japan
| | - Noriyoshi Manabe
- Division of Structural Glycobiology, Institute of Molecular Biomembrane and Glycobiology, Tohoku Medical and Pharmaceutical University, 4-4-1 Komatsushima, Aoba-ku, Sendai, Miyagi, 981-8558, Japan
| | - Yoshiki Yamaguchi
- Division of Structural Glycobiology, Institute of Molecular Biomembrane and Glycobiology, Tohoku Medical and Pharmaceutical University, 4-4-1 Komatsushima, Aoba-ku, Sendai, Miyagi, 981-8558, Japan.
| |
Collapse
|
3
|
Shirvanizadeh N, Vihinen M. VariBench, new variation benchmark categories and data sets. FRONTIERS IN BIOINFORMATICS 2023; 3:1248732. [PMID: 37795169 PMCID: PMC10546188 DOI: 10.3389/fbinf.2023.1248732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 09/08/2023] [Indexed: 10/06/2023] Open
Affiliation(s)
| | - Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, Sweden
| |
Collapse
|
4
|
Lindorff-Larsen K, Kragelund BB. On the potential of machine learning to examine the relationship between sequence, structure, dynamics and function of intrinsically disordered proteins. J Mol Biol 2021; 433:167196. [PMID: 34390736 DOI: 10.1016/j.jmb.2021.167196] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Revised: 08/03/2021] [Accepted: 08/04/2021] [Indexed: 11/29/2022]
Abstract
Intrinsically disordered proteins (IDPs) constitute a broad set of proteins with few uniting and many diverging properties. IDPs-and intrinsically disordered regions (IDRs) interspersed between folded domains-are generally characterized as having no persistent tertiary structure; instead they interconvert between a large number of different and often expanded structures. IDPs and IDRs are involved in an enormously wide range of biological functions and reveal novel mechanisms of interactions, and while they defy the common structure-function paradigm of folded proteins, their structural preferences and dynamics are important for their function. We here discuss open questions in the field of IDPs and IDRs, focusing on areas where machine learning and other computational methods play a role. We discuss computational methods aimed to predict transiently formed local and long-range structure, including methods for integrative structural biology. We discuss the many different ways in which IDPs and IDRs can bind to other molecules, both via short linear motifs, as well as in the formation of larger dynamic complexes such as biomolecular condensates. We discuss how experiments are providing insight into such complexes and may enable more accurate predictions. Finally, we discuss the role of IDPs in disease and how new methods are needed to interpret the mechanistic effects of genomic variants in IDPs.
Collapse
Affiliation(s)
- Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen. Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark.
| | - Birthe B Kragelund
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen. Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark.
| |
Collapse
|
5
|
An K, Zhou JB, Xiong Y, Han W, Wang T, Ye ZQ, Wu YD. Computational Studies of the Structural Basis of Human RPS19 Mutations Associated With Diamond-Blackfan Anemia. Front Genet 2021; 12:650897. [PMID: 34108988 PMCID: PMC8181406 DOI: 10.3389/fgene.2021.650897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 04/28/2021] [Indexed: 11/13/2022] Open
Abstract
Diamond-Blackfan Anemia (DBA) is an inherited rare disease characterized with severe pure red cell aplasia, and it is caused by the defective ribosome biogenesis stemming from the impairment of ribosomal proteins. Among all DBA-associated ribosomal proteins, RPS19 affects most patients and carries most DBA mutations. Revealing how these mutations lead to the impairment of RPS19 is highly demanded for understanding the pathogenesis of DBA, but a systematic study is currently lacking. In this work, based on the complex structure of human ribosome, we comprehensively studied the structural basis of DBA mutations of RPS19 by using computational methods. Main structure elements and five conserved surface patches involved in RPS19-18S rRNA interaction were identified. We further revealed that DBA mutations would destabilize RPS19 through disrupting the hydrophobic core or breaking the helix, or perturb the RPS19-18S rRNA interaction through destroying hydrogen bonds, introducing steric hindrance effect, or altering surface electrostatic property at the interface. Moreover, we trained a machine-learning model to predict the pathogenicity of all possible RPS19 mutations. Our work has laid a foundation for revealing the pathogenesis of DBA from the structural perspective.
Collapse
Affiliation(s)
- Ke An
- State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Jing-Bo Zhou
- State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Yao Xiong
- State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Wei Han
- State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Tao Wang
- Shenzhen Bay Laboratory, Shenzhen, China
| | - Zhi-Qiang Ye
- State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, China
- Shenzhen Bay Laboratory, Shenzhen, China
| | - Yun-Dong Wu
- State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, China
- Shenzhen Bay Laboratory, Shenzhen, China
- College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| |
Collapse
|
6
|
Valadez-Barba V, Cota-Coronado A, Hernández-Pérez O, Lugo-Fabres PH, Padilla-Camberos E, Díaz NF, Díaz-Martínez NE. iPSC for modeling neurodegenerative disorders. Regen Ther 2021; 15:332-339. [PMID: 33426236 PMCID: PMC7770414 DOI: 10.1016/j.reth.2020.11.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 11/10/2020] [Accepted: 11/19/2020] [Indexed: 02/07/2023] Open
Abstract
Neurodegenerative disorders such as Parkinson's and Alzheimer's disease, are fundamental health concerns all around the world. The development of novel treatments and new techniques to address these disorders, are being actively studied by researchers and medical personnel. In the present review we will discuss the application of induced Pluripotent Stem Cells (iPSCs) for cell-therapy replacement and disease modelling. The aim of iPSCs is to restore the functionality of the damaged tissue by replacing the impaired cells with competitive ones. To achieve this objective, iPSCs can be properly differentiated into virtually any cell fate and can be strongly translated into human health via in vitro and in vivo disease modeling for the development of new therapies, the discovery of biomarkers for several disorders, the elaboration and testing of new drugs as novel treatments, and as a tool for personalized medicine. Novel treatments to address neurodegenerative disorders. Induced pluripotent stem cell therapy and disease modelling. Parkinson's & Alzheimer's disease research.
Collapse
Key Words
- AD, Alzheimer's disease
- AFP, Alpha-Fetoprotein
- Alzheimer
- Aβ, β-Amyloid
- B-III-TUB, β–III–Tubulin
- BBB, Blood Brain Barrier
- CRISPR, Clustered Regularly Interspaced Short Palindromic Repeats
- DOPAL, 3,4-Dihydroxyphenylacetaldehyde
- EBs, Embryoid Bodies
- FLASH, Fast Length Adjustment of Short Reads
- LUHMES, Lund Human Mesencephalic Cell Line
- MHC, Mayor Histocompatibility Complex
- Neurodegenerative diseasaes
- PCR, Polymerase Chain Reaction
- PD, Parkinson's Disease
- Parkinson
- ROS, Reactive Oxygen Species
- SCs, Stem Cells
- SMA, Smooth-Muscle Antibody
- SNPc, Substantia Nigra Pars Compacta
- TH, Tyrosine Hydroxylase
- WGS, Whole Genome Sequencing
- gRNA, guide RNA
- hESC, Human Embryonic Stem Cells
- iPSCs
- iPSCs, Induced Pluripotent Stem Cells
- nsSNVs, nonsynonymous single nucleotide variants
- pTau, Phosphorylated Tau
Collapse
Affiliation(s)
- Valeria Valadez-Barba
- Department of Medical and Pharmaceutical Biotechnology, Center for Research and Assistance in Technology and Design of the State of Jalisco, A.C. Av. Normalistas 800, Colinas de las Normal, Jalisco, Mexico, P.C.44270
| | - A. Cota-Coronado
- Department of Medical and Pharmaceutical Biotechnology, Center for Research and Assistance in Technology and Design of the State of Jalisco, A.C. Av. Normalistas 800, Colinas de las Normal, Jalisco, Mexico, P.C.44270
- The Florey Institute for Neuroscience and Mental Health, The University of Melbourne, 30 Royal Parade, Parkville, VIC, 3052, Australia
| | - O.R. Hernández-Pérez
- Department of Medical and Pharmaceutical Biotechnology, Center for Research and Assistance in Technology and Design of the State of Jalisco, A.C. Av. Normalistas 800, Colinas de las Normal, Jalisco, Mexico, P.C.44270
| | - Pavel H. Lugo-Fabres
- Department of Medical and Pharmaceutical Biotechnology, Center for Research and Assistance in Technology and Design of the State of Jalisco, A.C. Av. Normalistas 800, Colinas de las Normal, Jalisco, Mexico, P.C.44270
| | - Eduardo Padilla-Camberos
- Department of Medical and Pharmaceutical Biotechnology, Center for Research and Assistance in Technology and Design of the State of Jalisco, A.C. Av. Normalistas 800, Colinas de las Normal, Jalisco, Mexico, P.C.44270
| | - Néstor Fabián Díaz
- Departamento de Fisiología y Desarrollo Celular, Instituto Nacional de Perinatología Isidro Espinosa de los Reyes, Mexico City, Mexico
| | - N. Emmanuel Díaz-Martínez
- Department of Medical and Pharmaceutical Biotechnology, Center for Research and Assistance in Technology and Design of the State of Jalisco, A.C. Av. Normalistas 800, Colinas de las Normal, Jalisco, Mexico, P.C.44270
- Corresponding author. Department of Medical and Pharmaceutical Biotechnology, Center for Research and Assistance in Technology and Design of the State of Jalisco, A.C. Jalisco, Mexico.
| |
Collapse
|