1
|
Kimura H, Lahouel K, Tomasetti C, Roberts NJ. Functional characterization of all CDKN2A missense variants and comparison to in silico models of pathogenicity. eLife 2025; 13:RP95347. [PMID: 40238651 PMCID: PMC12002794 DOI: 10.7554/elife.95347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/18/2025] Open
Abstract
Interpretation of variants identified during genetic testing is a significant clinical challenge. In this study, we developed a high-throughput CDKN2A functional assay and characterized all possible human CDKN2A missense variants. We found that 17.7% of all missense variants were functionally deleterious. We also used our functional classifications to assess the performance of in silico models that predict the effect of variants, including recently reported models based on machine learning. Notably, we found that all in silico models performed similarly when compared to our functional classifications with accuracies of 39.5-85.4%. Furthermore, while we found that functionally deleterious variants were enriched within ankyrin repeats, we did not identify any residues where all missense variants were functionally deleterious. Our functional classifications are a resource to aid the interpretation of CDKN2A variants and have important implications for the application of variant interpretation guidelines, particularly the use of in silico models for clinical variant interpretation.
Collapse
Affiliation(s)
- Hirokazu Kimura
- Department of Pathology, the Johns Hopkins University School of MedicineBaltimoreUnited States
| | - Kamel Lahouel
- Division of Integrated Genomics, Translational Genomics Research InstitutePhoenixUnited States
- Department of Computational and Quantitative Medicine, Beckman Research Institute, City of HopeDuarteUnited States
| | - Cristian Tomasetti
- Division of Integrated Genomics, Translational Genomics Research InstitutePhoenixUnited States
- Department of Computational and Quantitative Medicine, Beckman Research Institute, City of HopeDuarteUnited States
| | - Nicholas Jason Roberts
- Department of Pathology, the Johns Hopkins University School of MedicineBaltimoreUnited States
- Department of Oncology, the Johns Hopkins University School of MedicineBaltimoreUnited States
| |
Collapse
|
2
|
Baskin A, Soudah N, Gilad N, Halevi N, Darlyuk-Saadon I, Schoffman H, Engelberg D. All intrinsically active Erk1/2 mutants autophosphorylate threonine207/188, a plausible regulator of the TEY motif phosphorylation. J Biol Chem 2025; 301:108509. [PMID: 40222547 DOI: 10.1016/j.jbc.2025.108509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2024] [Revised: 03/19/2025] [Accepted: 04/08/2025] [Indexed: 04/15/2025] Open
Abstract
The extracellular-activated kinases 1 & 2 (Erk1/2) are catalytically active when dually phosphorylated on a TEY motif located at the activation loop. In human patients with cardiac hypertrophy, Erk1/2 are phosphorylated on yet another activation loop's residue, T207/188. Intrinsically active variants of Erk1/2, mutated at R84/65, are also (auto)phosphorylated on T207/188. It is not known whether T207/188 phosphorylation is restricted to these cases, nor how it affects Erks' activity. We report that T207/188 phosphorylation is not rare, as we found that: 1) All known auto-activated Erk1/2 variants are phosphorylated on T207/188. 2) It occurs in various cell lines and mouse tissues. 3) It is extremely high in patients with skeletal muscle atrophies or myopathies. We propose that T207/188 controls the permissiveness of the TEY motif for phosphorylation because T207/188-mutated Erk1/2 and the yeast Erk/Mpk1 were efficiently dually phosphorylated when expressed in HEK293 or yeast cells, respectively. The T207/188-mutated Mpk1 was not TEY-phosphorylated in cells knocked out for MEKs, suggesting that its enhanced phosphorylation in wild-type cells is MEK-dependent. Thus, as T207/188-mutated Erk1/2 and Mpk1 recruit MEKs, the role of T207/188 is to impede MEKs' ability to phosphorylate Erks. T207/188 also impedes autophosphorylation as recombinant Erk2 mutated at T188 is spontaneously autophosphorylated, although exclusively on Y185. The role of T207/188 in regulating activation loop phosphorylation may be common to most Ser/Thr kinases, as 86% of them (in the human kinome) possess T207/188 orthologs, and 160 of them were already reported to be phosphorylated on this residue.
Collapse
Affiliation(s)
- Alexey Baskin
- Department of Biological Chemistry, The Institute of Life Science, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nadine Soudah
- Department of Biological Chemistry, The Institute of Life Science, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nechama Gilad
- Department of Biological Chemistry, The Institute of Life Science, The Hebrew University of Jerusalem, Jerusalem, Israel; Singapore-HUJ Alliance for Research and Enterprise, Mechanisms of Liver Inflammatory Diseases Program, National University of Singapore, Singapore
| | - Neriya Halevi
- Department of Biological Chemistry, The Institute of Life Science, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Ilona Darlyuk-Saadon
- Singapore-HUJ Alliance for Research and Enterprise, Mechanisms of Liver Inflammatory Diseases Program, National University of Singapore, Singapore
| | - Hanan Schoffman
- Stein Family Mass Spectrometry Unit, The Research Infrastructure Center, The Institute of Life Science, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - David Engelberg
- Department of Biological Chemistry, The Institute of Life Science, The Hebrew University of Jerusalem, Jerusalem, Israel; Singapore-HUJ Alliance for Research and Enterprise, Mechanisms of Liver Inflammatory Diseases Program, National University of Singapore, Singapore; Department of Microbiology and Immunology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
| |
Collapse
|
3
|
Torres Robles J, Stiegler AL, Boggon TJ, Turk BE. Cancer hotspot mutations rewire ERK2 specificity by selective exclusion of docking interactions. J Biol Chem 2025; 301:108348. [PMID: 40015635 PMCID: PMC11982978 DOI: 10.1016/j.jbc.2025.108348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Revised: 02/11/2025] [Accepted: 02/20/2025] [Indexed: 03/01/2025] Open
Abstract
The protein kinase ERK2 is recurrently mutated in human squamous cell carcinomas and other tumors. ERK2 mutations cluster in an essential docking recruitment site that interacts with short linear motifs found within intrinsically disordered regions of ERK substrates and regulators. Cancer-associated mutations do not disrupt ERK2 docking interactions altogether but selectively inhibit some interactions while sparing others. However, the full scope of disrupted or maintained interactions remains unknown, limiting our understanding of how these mutations contribute to cancer. We recently defined the docking interactome of wild-type ERK2 by screening a yeast two-hybrid library of proteomic short linear motifs. Here, we apply this approach to the two most recurrent cancer-associated mutants. We find that most sequences binding to WT ERK2 also interact with both mutant forms. Analysis of differentially interacting sequences revealed that ERK2 mutants selectively lose the ability to bind sequences conforming to a specific motif. We solved the co-crystal structure of ERK2 in complex with a peptide fragment of ISG20, a screening hit that binds exclusively to the WT kinase. This structure demonstrated the mechanism by which cancer hotspot mutations at Glu81, Arg135, Asp321, and Glu322 selectively impact peptide binding. Finally, we found that cancer-associated ERK2 mutations had decreased activity in phosphorylating GEF-H1/ARHGEF2, a known ERK substrate harboring a WT-selective docking motif. Collectively, our studies provide a structural rationale for how a broad set of interactions are disrupted by ERK2 hotspot mutations, suggesting mechanisms for pathway rewiring in cancers harboring these mutations.
Collapse
Affiliation(s)
- Jaylissa Torres Robles
- Department of Chemistry, Yale University, New Haven, Connecticut, USA; Department of Pharmacology, Yale School of Medicine, New Haven, Connecticut, USA
| | - Amy L Stiegler
- Department of Pharmacology, Yale School of Medicine, New Haven, Connecticut, USA
| | - Titus J Boggon
- Department of Pharmacology, Yale School of Medicine, New Haven, Connecticut, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, USA
| | - Benjamin E Turk
- Department of Pharmacology, Yale School of Medicine, New Haven, Connecticut, USA.
| |
Collapse
|
4
|
Estevam GO, Linossi E, Rao J, Macdonald CB, Ravikumar A, Chrispens KM, Capra JA, Coyote-Maestas W, Pimentel H, Collisson EA, Jura N, Fraser JS. Mapping kinase domain resistance mechanisms for the MET receptor tyrosine kinase via deep mutational scanning. eLife 2025; 13:RP101882. [PMID: 39960754 PMCID: PMC11832172 DOI: 10.7554/elife.101882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2025] Open
Abstract
Mutations in the kinase and juxtamembrane domains of the MET Receptor Tyrosine Kinase are responsible for oncogenesis in various cancers and can drive resistance to MET-directed treatments. Determining the most effective inhibitor for each mutational profile is a major challenge for MET-driven cancer treatment in precision medicine. Here, we used a deep mutational scan (DMS) of ~5764 MET kinase domain variants to profile the growth of each mutation against a panel of 11 inhibitors that are reported to target the MET kinase domain. We validate previously identified resistance mutations, pinpoint common resistance sites across type I, type II, and type I ½ inhibitors, unveil unique resistance and sensitizing mutations for each inhibitor, and verify non-cross-resistant sensitivities for type I and type II inhibitor pairs. We augment a protein language model with biophysical and chemical features to improve the predictive performance for inhibitor-treated datasets. Together, our study demonstrates a pooled experimental pipeline for identifying resistance mutations, provides a reference dictionary for mutations that are sensitized to specific therapies, and offers insights for future drug development.
Collapse
Affiliation(s)
- Gabriella O Estevam
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Tetrad Graduate Program, University of California, San FranciscoSan FranciscoUnited States
| | - Edmond Linossi
- Cardiovascular Research Institute, University of California, San FranciscoSan FranciscoUnited States
- Department of Cellular and Molecular Pharmacology, University of California, San FranciscoSan FranciscoUnited States
| | - Jingyou Rao
- Department of Computer Science, University of California, Los AngelesLos AngelesUnited States
| | - Christian B Macdonald
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Ashraya Ravikumar
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Karson M Chrispens
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Biophysics Graduate ProgramSan FranciscoUnited States
| | - John A Capra
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San FranciscoSan FranciscoUnited States
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Quantitative Biosciences Institute, University of California, San FranciscoSan FranciscoUnited States
| | - Harold Pimentel
- Department of Computer Science, University of California, Los AngelesLos AngelesUnited States
- Department of Computational Medicine and Human Genetics, University of California, Los AngelesLos AngelesUnited States
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los AngelesLos AngelesUnited States
| | - Eric A Collisson
- Human Biology, Fred Hutchinson Cancer CenterSeattleUnited States
- Department of Medicine, University of WashingtonSeattleUnited States
| | - Natalia Jura
- Cardiovascular Research Institute, University of California, San FranciscoSan FranciscoUnited States
- Department of Cellular and Molecular Pharmacology, University of California, San FranciscoSan FranciscoUnited States
- Quantitative Biosciences Institute, University of California, San FranciscoSan FranciscoUnited States
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Quantitative Biosciences Institute, University of California, San FranciscoSan FranciscoUnited States
| |
Collapse
|
5
|
Jiang K, Yan Z, Di Bernardo M, Sgrizzi SR, Villiger L, Kayabolen A, Kim BJ, Carscadden JK, Hiraizumi M, Nishimasu H, Gootenberg JS, Abudayyeh OO. Rapid in silico directed evolution by a protein language model with EVOLVEpro. Science 2025; 387:eadr6006. [PMID: 39571002 DOI: 10.1126/science.adr6006] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Accepted: 11/12/2024] [Indexed: 01/25/2025]
Abstract
Directed protein evolution is central to biomedical applications but faces challenges such as experimental complexity, inefficient multiproperty optimization, and local maxima traps. Although in silico methods that use protein language models (PLMs) can provide modeled fitness landscape guidance, they struggle to generalize across diverse protein families and map to protein activity. We present EVOLVEpro, a few-shot active learning framework that combines PLMs and regression models to rapidly improve protein activity. EVOLVEpro surpasses current methods, yielding up to 100-fold improvements in desired properties. We demonstrate its effectiveness across six proteins in RNA production, genome editing, and antibody binding applications. These results highlight the advantages of few-shot active learning with minimal experimental data over zero-shot predictions. EVOLVEpro opens new possibilities for artificial intelligence-guided protein engineering in biology and medicine.
Collapse
Affiliation(s)
- Kaiyi Jiang
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
- Department of Bioengineering Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Zhaoqing Yan
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - Matteo Di Bernardo
- Whitehead Institute Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Samantha R Sgrizzi
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - Lukas Villiger
- Department of Dermatology and Allergology Kantonspital St. Gallen, St. Gallen, Switzerland
| | - Alisan Kayabolen
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - B J Kim
- Koch Institute for Integrative Cancer Research at MIT Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Josephine K Carscadden
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - Masahiro Hiraizumi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
| | - Hiroshi Nishimasu
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
- Structural Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo, Japan
- Inamori Research Institute for Science, 620 Suiginya-cho, Shimogyo-ku, Kyoto, Japan
| | - Jonathan S Gootenberg
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - Omar O Abudayyeh
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| |
Collapse
|
6
|
Estevam GO, Linossi EM, Rao J, Macdonald CB, Ravikumar A, Chrispens KM, Capra JA, Coyote-Maestas W, Pimentel H, Collisson EA, Jura N, Fraser JS. Mapping kinase domain resistance mechanisms for the MET receptor tyrosine kinase via deep mutational scanning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.16.603579. [PMID: 39071407 PMCID: PMC11275805 DOI: 10.1101/2024.07.16.603579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Mutations in the kinase and juxtamembrane domains of the MET Receptor Tyrosine Kinase are responsible for oncogenesis in various cancers and can drive resistance to MET-directed treatments. Determining the most effective inhibitor for each mutational profile is a major challenge for MET-driven cancer treatment in precision medicine. Here, we used a deep mutational scan (DMS) of ~5,764 MET kinase domain variants to profile the growth of each mutation against a panel of 11 inhibitors that are reported to target the MET kinase domain. We validate previously identified resistance mutations, pinpoint common resistance sites across type I, type II, and type I ½ inhibitors, unveil unique resistance and sensitizing mutations for each inhibitor, and verify non-cross-resistant sensitivities for type I and type II inhibitor pairs. We augment a protein language model with biophysical and chemical features to improve the predictive performance for inhibitor-treated datasets. Together, our study demonstrates a pooled experimental pipeline for identifying resistance mutations, provides a reference dictionary for mutations that are sensitized to specific therapies, and offers insights for future drug development.
Collapse
Affiliation(s)
- Gabriella O. Estevam
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, United States
- Tetrad Graduate Program, UCSF, San Francisco, CA, United States
| | - Edmond M. Linossi
- Cardiovascular Research Institute, UCSF, San Francisco, CA, United States
- Department of Cellular and Molecular Pharmacology, UCSF, San Francisco, CA, United States
| | - Jingyou Rao
- Department of Computer Science, UCLA, Los Angeles, CA, United States
| | - Christian B. Macdonald
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, United States
| | - Ashraya Ravikumar
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, United States
| | - Karson M. Chrispens
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, United States
- Biophysics Graduate Program, UCSF, San Francisco, CA, United States
| | - John A. Capra
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, UCSF, San Francisco, CA, United States
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, United States
- Quantitative Biosciences Institute, UCSF, San Francisco, CA, United States
| | - Harold Pimentel
- Department of Computer Science, UCLA, Los Angeles, CA, United States
- Department of Computational Medicine and Human Genetics, UCLA, Los Angeles, CA, United States
- Department of Human Genetics, David Geffen School of Medicine, UCLA, Los Angeles, CA, United States
| | - Eric A. Collisson
- Human Biology, Fred Hutchinson Cancer Center, Seattle, Washington, United States
- Department of Medicine, University of Washington, Seattle, Washington, United States
| | - Natalia Jura
- Cardiovascular Research Institute, UCSF, San Francisco, CA, United States
- Department of Cellular and Molecular Pharmacology, UCSF, San Francisco, CA, United States
- Quantitative Biosciences Institute, UCSF, San Francisco, CA, United States
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, United States
- Quantitative Biosciences Institute, UCSF, San Francisco, CA, United States
| |
Collapse
|
7
|
Blaabjerg LM, Jonsson N, Boomsma W, Stein A, Lindorff-Larsen K. SSEmb: A joint embedding of protein sequence and structure enables robust variant effect predictions. Nat Commun 2024; 15:9646. [PMID: 39511177 PMCID: PMC11544099 DOI: 10.1038/s41467-024-53982-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Accepted: 10/28/2024] [Indexed: 11/15/2024] Open
Abstract
The ability to predict how amino acid changes affect proteins has a wide range of applications including in disease variant classification and protein engineering. Many existing methods focus on learning from patterns found in either protein sequences or protein structures. Here, we present a method for integrating information from sequence and structure in a single model that we term SSEmb (Sequence Structure Embedding). SSEmb combines a graph representation for the protein structure with a transformer model for processing multiple sequence alignments. We show that by integrating both types of information we obtain a variant effect prediction model that is robust when sequence information is scarce. We also show that SSEmb learns embeddings of the sequence and structure that are useful for other downstream tasks such as to predict protein-protein binding sites. We envisage that SSEmb may be useful both for variant effect predictions and as a representation for learning to predict protein properties that depend on sequence and structure.
Collapse
Affiliation(s)
- Lasse M Blaabjerg
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Nicolas Jonsson
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Wouter Boomsma
- Center for Basic Machine Learning Research in Life Science, Department of Computer Science, University of Copenhagen, Copenhagen N, Denmark.
| | - Amelie Stein
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen N, Denmark.
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen N, Denmark.
| |
Collapse
|
8
|
Estevam GO, Linossi EM, Macdonald CB, Espinoza CA, Michaud JM, Coyote-Maestas W, Collisson EA, Jura N, Fraser JS. Conserved regulatory motifs in the juxtamembrane domain and kinase N-lobe revealed through deep mutational scanning of the MET receptor tyrosine kinase domain. eLife 2024; 12:RP91619. [PMID: 39268701 PMCID: PMC11398868 DOI: 10.7554/elife.91619] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024] Open
Abstract
MET is a receptor tyrosine kinase (RTK) responsible for initiating signaling pathways involved in development and wound repair. MET activation relies on ligand binding to the extracellular receptor, which prompts dimerization, intracellular phosphorylation, and recruitment of associated signaling proteins. Mutations, which are predominantly observed clinically in the intracellular juxtamembrane and kinase domains, can disrupt typical MET regulatory mechanisms. Understanding how juxtamembrane variants, such as exon 14 skipping (METΔEx14), and rare kinase domain mutations can increase signaling, often leading to cancer, remains a challenge. Here, we perform a parallel deep mutational scan (DMS) of the MET intracellular kinase domain in two fusion protein backgrounds: wild-type and METΔEx14. Our comparative approach has revealed a critical hydrophobic interaction between a juxtamembrane segment and the kinase ⍺C-helix, pointing to potential differences in regulatory mechanisms between MET and other RTKs. Additionally, we have uncovered a β5 motif that acts as a structural pivot for the kinase domain in MET and other TAM family of kinases. We also describe a number of previously unknown activating mutations, aiding the effort to annotate driver, passenger, and drug resistance mutations in the MET kinase domain.
Collapse
Affiliation(s)
- Gabriella O Estevam
- Tetrad Graduate Program, University of California, San FranciscoSan FranciscoUnited States
- Cardiovascular Research Institute, University of California, San FranciscoSan FranciscoUnited States
| | - Edmond M Linossi
- Department of Cellular and Molecular Pharmacology, University of California, San FranciscoSan FranciscoUnited States
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Christian B Macdonald
- Tetrad Graduate Program, University of California, San FranciscoSan FranciscoUnited States
| | - Carla A Espinoza
- Cardiovascular Research Institute, University of California, San FranciscoSan FranciscoUnited States
- Department of Cellular and Molecular Pharmacology, University of California, San FranciscoSan FranciscoUnited States
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Jennifer M Michaud
- Tetrad Graduate Program, University of California, San FranciscoSan FranciscoUnited States
| | - Willow Coyote-Maestas
- Tetrad Graduate Program, University of California, San FranciscoSan FranciscoUnited States
- Quantitative Biosciences Institute, University of California, San FranciscoSan FranciscoUnited States
| | - Eric A Collisson
- Helen Diller Family Comprehensive Cancer Center, University of California, San FranciscoSan FranciscoUnited States
- Department of Medicine/Hematology and Oncology, University of California, San FranciscoSan FranciscoUnited States
| | - Natalia Jura
- Department of Cellular and Molecular Pharmacology, University of California, San FranciscoSan FranciscoUnited States
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Quantitative Biosciences Institute, University of California, San FranciscoSan FranciscoUnited States
| | - James S Fraser
- Tetrad Graduate Program, University of California, San FranciscoSan FranciscoUnited States
- Quantitative Biosciences Institute, University of California, San FranciscoSan FranciscoUnited States
| |
Collapse
|
9
|
Padigepati SR, Stafford DA, Tan CA, Silvis MR, Jamieson K, Keyser A, Nunez PAC, Nicoludis JM, Manders T, Fresard L, Kobayashi Y, Araya CL, Aradhya S, Johnson B, Nykamp K, Reuter JA. Scalable approaches for generating, validating and incorporating data from high-throughput functional assays to improve clinical variant classification. Hum Genet 2024; 143:995-1004. [PMID: 39085601 PMCID: PMC11303574 DOI: 10.1007/s00439-024-02691-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 07/12/2024] [Indexed: 08/02/2024]
Abstract
As the adoption and scope of genetic testing continue to expand, interpreting the clinical significance of DNA sequence variants at scale remains a formidable challenge, with a high proportion classified as variants of uncertain significance (VUSs). Genetic testing laboratories have historically relied, in part, on functional data from academic literature to support variant classification. High-throughput functional assays or multiplex assays of variant effect (MAVEs), designed to assess the effects of DNA variants on protein stability and function, represent an important and increasingly available source of evidence for variant classification, but their potential is just beginning to be realized in clinical lab settings. Here, we describe a framework for generating, validating and incorporating data from MAVEs into a semi-quantitative variant classification method applied to clinical genetic testing. Using single-cell gene expression measurements, cellular evidence models were built to assess the effects of DNA variation in 44 genes of clinical interest. This framework was also applied to models for an additional 22 genes with previously published MAVE datasets. In total, modeling data was incorporated from 24 genes into our variant classification method. These data contributed evidence for classifying 4043 observed variants in over 57,000 individuals. Genetic testing laboratories are uniquely positioned to generate, analyze, validate, and incorporate evidence from high-throughput functional data and ultimately enable the use of these data to provide definitive clinical variant classifications for more patients.
Collapse
Affiliation(s)
| | | | | | - Melanie R Silvis
- Invitae Corporation, San Francisco, CA, 94103, USA
- Epic Bio, South San Francisco, CA, 94080, USA
| | - Kirsty Jamieson
- Invitae Corporation, San Francisco, CA, 94103, USA
- Epic Bio, South San Francisco, CA, 94080, USA
| | - Andrew Keyser
- Invitae Corporation, San Francisco, CA, 94103, USA
- Calico Life Sciences, South San Francisco, CA, 94080, USA
| | | | - John M Nicoludis
- Invitae Corporation, San Francisco, CA, 94103, USA
- Department of Structural Biology, Genentech, South San Francisco, CA, 94080, USA
| | - Toby Manders
- Invitae Corporation, San Francisco, CA, 94103, USA
| | | | | | - Carlos L Araya
- Invitae Corporation, San Francisco, CA, 94103, USA
- Tapanti.org, Santa Barbara, CA, 93108, USA
| | | | - Britt Johnson
- Invitae Corporation, San Francisco, CA, 94103, USA
- GeneDx, Stamford, CT, 06902, USA
| | - Keith Nykamp
- Invitae Corporation, San Francisco, CA, 94103, USA.
| | | |
Collapse
|
10
|
Gordon MG, Kathail P, Choy B, Kim MC, Mazumder T, Gearing M, Ye CJ. Population Diversity at the Single-Cell Level. Annu Rev Genomics Hum Genet 2024; 25:27-49. [PMID: 38382493 DOI: 10.1146/annurev-genom-021623-083207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
Population-scale single-cell genomics is a transformative approach for unraveling the intricate links between genetic and cellular variation. This approach is facilitated by cutting-edge experimental methodologies, including the development of high-throughput single-cell multiomics and advances in multiplexed environmental and genetic perturbations. Examining the effects of natural or synthetic genetic variants across cellular contexts provides insights into the mutual influence of genetics and the environment in shaping cellular heterogeneity. The development of computational methodologies further enables detailed quantitative analysis of molecular variation, offering an opportunity to examine the respective roles of stochastic, intercellular, and interindividual variation. Future opportunities lie in leveraging long-read sequencing, refining disease-relevant cellular models, and embracing predictive and generative machine learning models. These advancements hold the potential for a deeper understanding of the genetic architecture of human molecular traits, which in turn has important implications for understanding the genetic causes of human disease.
Collapse
Affiliation(s)
| | - Pooja Kathail
- Center for Computational Biology, University of California, Berkeley, California, USA
| | - Bryson Choy
- Division of Rheumatology, Department of Medicine, University of California, San Francisco, California, USA
- Institute for Human Genetics, University of California, San Francisco, California, USA
| | - Min Cheol Kim
- Division of Rheumatology, Department of Medicine, University of California, San Francisco, California, USA
- Institute for Human Genetics, University of California, San Francisco, California, USA
| | - Thomas Mazumder
- Division of Rheumatology, Department of Medicine, University of California, San Francisco, California, USA
- Institute for Human Genetics, University of California, San Francisco, California, USA
| | - Melissa Gearing
- Division of Rheumatology, Department of Medicine, University of California, San Francisco, California, USA
- Institute for Human Genetics, University of California, San Francisco, California, USA
| | - Chun Jimmie Ye
- Arc Institute, Palo Alto, California, USA
- Division of Rheumatology, Department of Medicine, University of California, San Francisco, California, USA
- Institute for Human Genetics, University of California, San Francisco, California, USA
- Bakar Computational Health Sciences Institute, Gladstone-UCSF Institute of Genomic Immunology, Parker Institute for Cancer Immunotherapy, Department of Epidemiology and Biostatistics, Department of Microbiology and Immunology, and Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA;
| |
Collapse
|
11
|
Jiang K, Yan Z, Di Bernardo M, Sgrizzi SR, Villiger L, Kayabolen A, Kim B, Carscadden JK, Hiraizumi M, Nishimasu H, Gootenberg JS, Abudayyeh OO. Rapid protein evolution by few-shot learning with a protein language model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.604015. [PMID: 39071429 PMCID: PMC11275896 DOI: 10.1101/2024.07.17.604015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Directed evolution of proteins is critical for applications in basic biological research, therapeutics, diagnostics, and sustainability. However, directed evolution methods are labor intensive, cannot efficiently optimize over multiple protein properties, and are often trapped by local maxima. In silico-directed evolution methods incorporating protein language models (PLMs) have the potential to accelerate this engineering process, but current approaches fail to generalize across diverse protein families. We introduce EVOLVEpro, a few-shot active learning framework to rapidly improve protein activity using a combination of PLMs and protein activity predictors, achieving improved activity with as few as four rounds of evolution. EVOLVEpro substantially enhances the efficiency and effectiveness of in silico protein evolution, surpassing current state-of-the-art methods and yielding proteins with up to 100-fold improvement of desired properties. We showcase EVOLVEpro for five proteins across three applications: T7 RNA polymerase for RNA production, a miniature CRISPR nuclease, a prime editor, and an integrase for genome editing, and a monoclonal antibody for epitope binding. These results demonstrate the advantages of few-shot active learning with small amounts of experimental data over zero-shot predictions. EVOLVEpro paves the way for broader applications of AI-guided protein engineering in biology and medicine.
Collapse
Affiliation(s)
- Kaiyi Jiang
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
- Department of Bioengineering Massachusetts Institute of Technology Cambridge, 02139 MA, USA
| | - Zhaoqing Yan
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Matteo Di Bernardo
- Department of Bioengineering Massachusetts Institute of Technology Cambridge, 02139 MA, USA
| | - Samantha R. Sgrizzi
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Lukas Villiger
- Department of Dermatology and Allergology Kantonspital St. Gallen St. Gallen, 9000, Switzerland
| | - Alisan Kayabolen
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Byungji Kim
- Koch Institute for Integrative Cancer Research At MIT Massachusetts Institute of Technology Cambridge, 02139 MA, USA
| | - Josephine K. Carscadden
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Masahiro Hiraizumi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
| | - Hiroshi Nishimasu
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
- Structural Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo 4-6-1 Komaba, Meguro-ku, Tokyo 153-8904, Japan
- Inamori Research Institute for Science 620 Suiginya-cho, Shimogyo-ku, Kyoto 600-8411, Japan
| | - Jonathan S. Gootenberg
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Omar O. Abudayyeh
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| |
Collapse
|
12
|
Shanker VR, Bruun TU, Hie BL, Kim PS. Unsupervised evolution of protein and antibody complexes with a structure-informed language model. Science 2024; 385:46-53. [PMID: 38963838 PMCID: PMC11616794 DOI: 10.1126/science.adk8946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 05/29/2024] [Indexed: 07/06/2024]
Abstract
Large language models trained on sequence information alone can learn high-level principles of protein design. However, beyond sequence, the three-dimensional structures of proteins determine their specific function, activity, and evolvability. Here, we show that a general protein language model augmented with protein structure backbone coordinates can guide evolution for diverse proteins without the need to model individual functional tasks. We also demonstrate that ESM-IF1, which was only trained on single-chain structures, can be extended to engineer protein complexes. Using this approach, we screened about 30 variants of two therapeutic clinical antibodies used to treat severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. We achieved up to 25-fold improvement in neutralization and 37-fold improvement in affinity against antibody-escaped viral variants of concern BQ.1.1 and XBB.1.5, respectively. These findings highlight the advantage of integrating structural information to identify efficient protein evolution trajectories without requiring any task-specific training data.
Collapse
Affiliation(s)
- Varun R. Shanker
- Stanford Biophysics Program, Stanford University School of Medicine; Stanford, CA 94305, USA
- Stanford Medical Scientist Training Program, Stanford University School of Medicine; Stanford CA 94305, USA
- Sarafan ChEM-H, Stanford University; Stanford, CA 94305, USA
| | - Theodora U.J. Bruun
- Stanford Medical Scientist Training Program, Stanford University School of Medicine; Stanford CA 94305, USA
- Sarafan ChEM-H, Stanford University; Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine; Stanford, CA 94305, USA
| | - Brian L. Hie
- Sarafan ChEM-H, Stanford University; Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine; Stanford, CA 94305, USA
| | - Peter S. Kim
- Sarafan ChEM-H, Stanford University; Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine; Stanford, CA 94305, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| |
Collapse
|
13
|
Hauser BM, Luo Y, Nathan A, Al-Moujahed A, Vavvas DG, Comander J, Pierce EA, Place EM, Bujakowska KM, Gaiha GD, Rossin EJ. Structure-based network analysis predicts pathogenic variants in human proteins associated with inherited retinal disease. NPJ Genom Med 2024; 9:31. [PMID: 38802398 PMCID: PMC11130145 DOI: 10.1038/s41525-024-00416-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Accepted: 05/02/2024] [Indexed: 05/29/2024] Open
Abstract
Advances in gene sequencing technologies have accelerated the identification of genetic variants, but better tools are needed to understand which are causal of disease. This would be particularly useful in fields where gene therapy is a potential therapeutic modality for a disease-causing variant such as inherited retinal disease (IRD). Here, we apply structure-based network analysis (SBNA), which has been successfully utilized to identify variant-constrained amino acid residues in viral proteins, to identify residues that may cause IRD if subject to missense mutation. SBNA is based entirely on structural first principles and is not fit to specific outcome data, which makes it distinct from other contemporary missense prediction tools. In 4 well-studied human disease-associated proteins (BRCA1, HRAS, PTEN, and ERK2) with high-quality structural data, we find that SBNA scores correlate strongly with deep mutagenesis data. When applied to 47 IRD genes with available high-quality crystal structure data, SBNA scores reliably identified disease-causing variants according to phenotype definitions from the ClinVar database. Finally, we applied this approach to 63 patients at Massachusetts Eye and Ear (MEE) with IRD but for whom no genetic cause had been identified. Untrained models built using SBNA scores and BLOSUM62 scores for IRD-associated genes successfully predicted the pathogenicity of novel variants (AUC = 0.851), allowing us to identify likely causative disease variants in 40 IRD patients. Model performance was further augmented by incorporating orthogonal data from EVE scores (AUC = 0.927), which are based on evolutionary multiple sequence alignments. In conclusion, SBNA can used to successfully identify variants as causal of disease in human proteins and may help predict variants causative of IRD in an unbiased fashion.
Collapse
Affiliation(s)
| | - Yuyang Luo
- Harvard Medical School, Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Anusha Nathan
- Ragon Institute of Mass General, MIT, and Harvard, Cambridge, MA, USA
| | - Ahmad Al-Moujahed
- Harvard Medical School, Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Demetrios G Vavvas
- Harvard Medical School, Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Jason Comander
- Harvard Medical School, Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Eric A Pierce
- Harvard Medical School, Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Emily M Place
- Harvard Medical School, Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Kinga M Bujakowska
- Harvard Medical School, Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Gaurav D Gaiha
- Ragon Institute of Mass General, MIT, and Harvard, Cambridge, MA, USA
- Division of Gastroenterology, Massachusetts General Hospital, Boston, MA, USA
| | - Elizabeth J Rossin
- Harvard Medical School, Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA.
| |
Collapse
|
14
|
Estevam GO, Linossi EM, Macdonald CB, Espinoza CA, Michaud JM, Coyote-Maestas W, Collisson EA, Jura N, Fraser JS. Conserved regulatory motifs in the juxtamembrane domain and kinase N-lobe revealed through deep mutational scanning of the MET receptor tyrosine kinase domain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.03.551866. [PMID: 37577651 PMCID: PMC10418267 DOI: 10.1101/2023.08.03.551866] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
MET is a receptor tyrosine kinase (RTK) responsible for initiating signaling pathways involved in development and wound repair. MET activation relies on ligand binding to the extracellular receptor, which prompts dimerization, intracellular phosphorylation, and recruitment of associated signaling proteins. Mutations, which are predominantly observed clinically in the intracellular juxtamembrane and kinase domains, can disrupt typical MET regulatory mechanisms. Understanding how juxtamembrane variants, such as exon 14 skipping (METΔEx14), and rare kinase domain mutations can increase signaling, often leading to cancer, remains a challenge. Here, we perform a parallel deep mutational scan (DMS) of the MET intracellular kinase domain in two fusion protein backgrounds: wild type and METΔEx14. Our comparative approach has revealed a critical hydrophobic interaction between a juxtamembrane segment and the kinase αC-helix, pointing to potential differences in regulatory mechanisms between MET and other RTKs. Additionally, we have uncovered a β5 motif that acts as a structural pivot for the kinase domain in MET and other TAM family of kinases. We also describe a number of previously unknown activating mutations, aiding the effort to annotate driver, passenger, and drug resistance mutations in the MET kinase domain.
Collapse
Affiliation(s)
- Gabriella O. Estevam
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco,United States
- Tetrad Graduate Program, University of California San Francisco, San Francisco, United States
| | - Edmond M. Linossi
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, United States
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, United States
| | - Christian B. Macdonald
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco,United States
| | - Carla A. Espinoza
- Tetrad Graduate Program, University of California San Francisco, San Francisco, United States
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, United States
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, United States
| | - Jennifer M. Michaud
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco,United States
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco,United States
- Quantitative Biosciences Institute, University of California, San Francisco, United States, United States
| | - Eric A. Collisson
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, United States
- Department of Medicine/Hematology and Oncology, University of California, San Francisco, United States
| | - Natalia Jura
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, United States
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, United States
- Quantitative Biosciences Institute, University of California, San Francisco, United States, United States
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco,United States
- Quantitative Biosciences Institute, University of California, San Francisco, United States, United States
| |
Collapse
|
15
|
Lampson BL, Ramίrez AS, Baro M, He L, Hegde M, Koduri V, Pfaff JL, Hanna RE, Kowal J, Shirole NH, He Y, Doench JG, Contessa JN, Locher KP, Kaelin WG. Positive selection CRISPR screens reveal a druggable pocket in an oligosaccharyltransferase required for inflammatory signaling to NF-κB. Cell 2024; 187:2209-2223.e16. [PMID: 38670073 PMCID: PMC11149550 DOI: 10.1016/j.cell.2024.03.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 09/29/2023] [Accepted: 03/18/2024] [Indexed: 04/28/2024]
Abstract
Nuclear factor κB (NF-κB) plays roles in various diseases. Many inflammatory signals, such as circulating lipopolysaccharides (LPSs), activate NF-κB via specific receptors. Using whole-genome CRISPR-Cas9 screens of LPS-treated cells that express an NF-κB-driven suicide gene, we discovered that the LPS receptor Toll-like receptor 4 (TLR4) is specifically dependent on the oligosaccharyltransferase complex OST-A for N-glycosylation and cell-surface localization. The tool compound NGI-1 inhibits OST complexes in vivo, but the underlying molecular mechanism remained unknown. We did a CRISPR base-editor screen for NGI-1-resistant variants of STT3A, the catalytic subunit of OST-A. These variants, in conjunction with cryoelectron microscopy studies, revealed that NGI-1 binds the catalytic site of STT3A, where it traps a molecule of the donor substrate dolichyl-PP-GlcNAc2-Man9-Glc3, suggesting an uncompetitive inhibition mechanism. Our results provide a rationale for and an initial step toward the development of STT3A-specific inhibitors and illustrate the power of contemporaneous base-editor and structural studies to define drug mechanism of action.
Collapse
Affiliation(s)
- Benjamin L Lampson
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02215, USA
| | - Ana S Ramίrez
- Institute of Molecular Biology and Biophysics, Eidgenössische Technische Hochschule (ETH), Zürich, Switzerland
| | - Marta Baro
- Department of Therapeutic Radiology, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Lixia He
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02215, USA
| | - Mudra Hegde
- Genetic Perturbation Platform, Broad Institute, Cambridge, MA 02142, USA
| | - Vidyasagar Koduri
- Division of Hematology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02215, USA
| | - Jamie L Pfaff
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02215, USA
| | - Ruth E Hanna
- Genetic Perturbation Platform, Broad Institute, Cambridge, MA 02142, USA
| | - Julia Kowal
- Institute of Molecular Biology and Biophysics, Eidgenössische Technische Hochschule (ETH), Zürich, Switzerland
| | - Nitin H Shirole
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02215, USA
| | - Yanfeng He
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02215, USA
| | - John G Doench
- Genetic Perturbation Platform, Broad Institute, Cambridge, MA 02142, USA
| | - Joseph N Contessa
- Department of Therapeutic Radiology, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Kaspar P Locher
- Institute of Molecular Biology and Biophysics, Eidgenössische Technische Hochschule (ETH), Zürich, Switzerland.
| | - William G Kaelin
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02215, USA.
| |
Collapse
|
16
|
Hayes TK, Aquilanti E, Persky NS, Yang X, Kim EE, Brenan L, Goodale AB, Alan D, Sharpe T, Shue RE, Westlake L, Golomb L, Silverman BR, Morris MD, Fisher TR, Beyene E, Li YY, Cherniack AD, Piccioni F, Hicks JK, Chi AS, Cahill DP, Dietrich J, Batchelor TT, Root DE, Johannessen CM, Meyerson M. Comprehensive mutational scanning of EGFR reveals TKI sensitivities of extracellular domain mutants. Nat Commun 2024; 15:2742. [PMID: 38548752 PMCID: PMC10978866 DOI: 10.1038/s41467-024-45594-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 01/30/2024] [Indexed: 04/01/2024] Open
Abstract
The epidermal growth factor receptor, EGFR, is frequently activated in lung cancer and glioblastoma by genomic alterations including missense mutations. The different mutation spectra in these diseases are reflected in divergent responses to EGFR inhibition: significant patient benefit in lung cancer, but limited in glioblastoma. Here, we report a comprehensive mutational analysis of EGFR function. We perform saturation mutagenesis of EGFR and assess function of ~22,500 variants in a human EGFR-dependent lung cancer cell line. This approach reveals enrichment of erlotinib-insensitive variants of known and unknown significance in the dimerization, transmembrane, and kinase domains. Multiple EGFR extracellular domain variants, not associated with approved targeted therapies, are sensitive to afatinib and dacomitinib in vitro. Two glioblastoma patients with somatic EGFR G598V dimerization domain mutations show responses to dacomitinib treatment followed by within-pathway resistance mutation in one case. In summary, this comprehensive screen expands the landscape of functional EGFR variants and suggests broader clinical investigation of EGFR inhibition for cancers harboring extracellular domain mutations.
Collapse
Affiliation(s)
- Tikvah K Hayes
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
- Department of Molecular and Medical Pharmacology, University of California, Los Angeles, CA, USA
| | - Elisa Aquilanti
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Nicole S Persky
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
- Genetic Perturbation Platform, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
- Aera Therapeutics, Cambridge, MA, USA
| | - Xiaoping Yang
- Genetic Perturbation Platform, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Erica E Kim
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA
| | - Lisa Brenan
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Amy B Goodale
- Genetic Perturbation Platform, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Douglas Alan
- Genetic Perturbation Platform, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Ted Sharpe
- Data Science Platform, The Broad Institute of M.I.T. and Harvard Cambridge, Cambridge, MA, USA
| | - Robert E Shue
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Lindsay Westlake
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Lior Golomb
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Brianna R Silverman
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA
| | - Myshal D Morris
- Summer Honors Undergraduate Research Program, Harvard Medical School, Boston, MA, USA
| | - Ty Running Fisher
- Summer Honors Undergraduate Research Program, Harvard Medical School, Boston, MA, USA
| | - Eden Beyene
- Summer Honors Undergraduate Research Program, Harvard Medical School, Boston, MA, USA
| | - Yvonne Y Li
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Andrew D Cherniack
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Federica Piccioni
- Genetic Perturbation Platform, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
- Merck Research Laboratories, Cambridge, MA, USA
| | - J Kevin Hicks
- Department of Pathology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| | - Andrew S Chi
- Center for Neuro-Oncology, Division of Neuro-Oncology, Massachusetts General Hospital, Boston, MA, USA
| | - Daniel P Cahill
- Center for Neuro-Oncology, Division of Neuro-Oncology, Massachusetts General Hospital, Boston, MA, USA
| | - Jorg Dietrich
- Department of Neurology, Division of Neuro-Oncology, Massachusetts General Hospital, Boston, MA, USA
| | - Tracy T Batchelor
- Department of Neurology, Brigham and Women's Hospital & Harvard Medical School, Boston, MA, USA
| | - David E Root
- Genetic Perturbation Platform, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
| | - Cory M Johannessen
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA
- Department of Oncology, Novartis Institutes for Biomedical Research, Cambridge, MA, USA
| | - Matthew Meyerson
- Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA.
- Cancer Program, The Broad Institute of M.I.T. and Harvard, Cambridge, MA, USA.
| |
Collapse
|
17
|
Timofeev O, Giron P, Lawo S, Pichler M, Noeparast M. ERK pathway agonism for cancer therapy: evidence, insights, and a target discovery framework. NPJ Precis Oncol 2024; 8:70. [PMID: 38485987 PMCID: PMC10940698 DOI: 10.1038/s41698-024-00554-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Accepted: 02/16/2024] [Indexed: 03/18/2024] Open
Abstract
At least 40% of human cancers are associated with aberrant ERK pathway activity (ERKp). Inhibitors targeting various effectors within the ERKp have been developed and explored for over two decades. Conversely, a substantial body of evidence suggests that both normal human cells and, notably to a greater extent, cancer cells exhibit susceptibility to hyperactivation of ERKp. However, this vulnerability of cancer cells remains relatively unexplored. In this review, we reexamine the evidence on the selective lethality of highly elevated ERKp activity in human cancer cells of varying backgrounds. We synthesize the insights proposed for harnessing this vulnerability of ERK-associated cancers for therapeutical approaches and contextualize these insights within established pharmacological cancer-targeting models. Moreover, we compile the intriguing preclinical findings of ERK pathway agonism in diverse cancer models. Lastly, we present a conceptual framework for target discovery regarding ERKp agonism, emphasizing the utilization of mutual exclusivity among oncogenes to develop novel targeted therapies for precision oncology.
Collapse
Affiliation(s)
- Oleg Timofeev
- Institute of Molecular Oncology, Member of the German Center for Lung Research (DZL), Philipps University, 35043, Marburg, Germany
| | - Philippe Giron
- Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Clinical Sciences, Research group Genetics, Reproduction and Development, Centre for Medical Genetics, Laarbeeklaan 101, 1090, Brussels, Belgium
| | - Steffen Lawo
- CRISPR Screening Core Facility, Max Planck Institute for Biology of Ageing, 50931, Cologne, Germany
| | - Martin Pichler
- Translational Oncology, II. Med Clinics Hematology and Oncology, 86156, Augsburg, Germany
| | - Maxim Noeparast
- Translational Oncology, II. Med Clinics Hematology and Oncology, 86156, Augsburg, Germany.
| |
Collapse
|
18
|
Hurabielle C, LaFlam TN, Gearing M, Ye CJ. Functional genomics in inborn errors of immunity. Immunol Rev 2024; 322:53-70. [PMID: 38329267 PMCID: PMC10950534 DOI: 10.1111/imr.13309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]
Abstract
Inborn errors of immunity (IEI) comprise a diverse spectrum of 485 disorders as recognized by the International Union of Immunological Societies Committee on Inborn Error of Immunity in 2022. While IEI are monogenic by definition, they illuminate various pathways involved in the pathogenesis of polygenic immune dysregulation as in autoimmune or autoinflammatory syndromes, or in more common infectious diseases that may not have a significant genetic basis. Rapid improvement in genomic technologies has been the main driver of the accelerated rate of discovery of IEI and has led to the development of innovative treatment strategies. In this review, we will explore various facets of IEI, delving into the distinctions between PIDD and PIRD. We will examine how Mendelian inheritance patterns contribute to these disorders and discuss advancements in functional genomics that aid in characterizing new IEI. Additionally, we will explore how emerging genomic tools help to characterize new IEI as well as how they are paving the way for innovative treatment approaches for managing and potentially curing these complex immune conditions.
Collapse
Affiliation(s)
- Charlotte Hurabielle
- Division of Rheumatology, Department of Medicine, UCSF, San Francisco, California, USA
| | - Taylor N LaFlam
- Division of Pediatric Rheumatology, Department of Pediatrics, UCSF, San Francisco, California, USA
| | - Melissa Gearing
- Division of Rheumatology, Department of Medicine, UCSF, San Francisco, California, USA
| | - Chun Jimmie Ye
- Institute for Human Genetics, UCSF, San Francisco, California, USA
- Institute of Computational Health Sciences, UCSF, San Francisco, California, USA
- Gladstone Genomic Immunology Institute, San Francisco, California, USA
- Parker Institute for Cancer Immunotherapy, UCSF, San Francisco, California, USA
- Department of Epidemiology and Biostatistics, UCSF, San Francisco, California, USA
- Department of Microbiology and Immunology, UCSF, San Francisco, California, USA
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, California, USA
- Arc Institute, Palo Alto, California, USA
| |
Collapse
|
19
|
Hie BL, Shanker VR, Xu D, Bruun TUJ, Weidenbacher PA, Tang S, Wu W, Pak JE, Kim PS. Efficient evolution of human antibodies from general protein language models. Nat Biotechnol 2024; 42:275-283. [PMID: 37095349 PMCID: PMC10869273 DOI: 10.1038/s41587-023-01763-2] [Citation(s) in RCA: 135] [Impact Index Per Article: 135.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 03/28/2023] [Indexed: 04/26/2023]
Abstract
Natural evolution must explore a vast landscape of possible sequences for desirable yet rare mutations, suggesting that learning from natural evolutionary strategies could guide artificial evolution. Here we report that general protein language models can efficiently evolve human antibodies by suggesting mutations that are evolutionarily plausible, despite providing the model with no information about the target antigen, binding specificity or protein structure. We performed language-model-guided affinity maturation of seven antibodies, screening 20 or fewer variants of each antibody across only two rounds of laboratory evolution, and improved the binding affinities of four clinically relevant, highly mature antibodies up to sevenfold and three unmatured antibodies up to 160-fold, with many designs also demonstrating favorable thermostability and viral neutralization activity against Ebola and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pseudoviruses. The same models that improve antibody binding also guide efficient evolution across diverse protein families and selection pressures, including antibiotic resistance and enzyme activity, suggesting that these results generalize to many settings.
Collapse
Affiliation(s)
- Brian L Hie
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA.
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA.
| | - Varun R Shanker
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA
- Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford, CA, USA
| | - Duo Xu
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA
| | - Theodora U J Bruun
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA
- Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford, CA, USA
| | - Payton A Weidenbacher
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA
- Department of Chemistry, Stanford University, Stanford, CA, USA
| | - Shaogeng Tang
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA
| | - Wesley Wu
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| | - John E Pak
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| | - Peter S Kim
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA.
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA.
- Chan Zuckerberg Biohub, San Francisco, CA, USA.
| |
Collapse
|
20
|
Chiang CY, Zhang M, Huang J, Zeng J, Chen C, Pan D, Yang H, Zhang T, Yang M, Han Q, Wang Z, Xiao T, Chen Y, Zou Y, Yin F, Li Z, Zhu L, Zheng D. A novel selective ERK1/2 inhibitor, Laxiflorin B, targets EGFR mutation subtypes in non-small-cell lung cancer. Acta Pharmacol Sin 2024; 45:422-435. [PMID: 37816856 PMCID: PMC10789733 DOI: 10.1038/s41401-023-01164-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 09/01/2023] [Indexed: 10/12/2023]
Abstract
Extracellular regulated protein kinases 1/2 (ERK1/2) are key members of multiple signaling pathways, including the ErbB axis. Ectopic ERK1/2 activation contributes to various types of cancer, especially drug resistance to inhibitors of RTK, RAF and MEK, and specific ERK1/2 inhibitors are scarce. In this study, we identified a potential novel covalent ERK inhibitor, Laxiflorin B, which is a herbal compound with anticancer activity. However, Laxiflorin B is present at low levels in herbs; therefore, we adopted a semi-synthetic process for the efficient production of Laxiflorin B to improve the yield. Laxiflorin B induced mitochondria-mediated apoptosis via BAD activation in non-small-cell lung cancer (NSCLC) cells, especially in EGFR mutant subtypes. Transcriptomic analysis suggested that Laxiflorin B inhibits amphiregulin (AREG) and epiregulin (EREG) expression through ERK inhibition, and suppressed the activation of their receptors, ErbBs, via a positive feedback loop. Moreover, mass spectrometry analysis combined with computer simulation revealed that Laxiflorin B binds covalently to Cys-183 in the ATP-binding pocket of ERK1 via the D-ring, and Cys-178 of ERK1 through non-inhibitory binding of the A-ring. In a NSCLC tumor xenograft model in nude mice, Laxiflorin B also exhibited strong tumor suppressive effects with low toxicity and AREG and EREG were identified as biomarkers of Laxiflorin B efficacy. Finally, Laxiflorin B-4, a C-6 analog of Laxiflorin B, exhibited higher binding affinity for ERK1/2 and stronger tumor suppression. These findings provide a new approach to tumor inhibition using natural anticancer compounds.
Collapse
Affiliation(s)
- Cheng-Yao Chiang
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Min Zhang
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Junrong Huang
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Juan Zeng
- School of Biomedical Engineering, Guangdong Medical University, Dongguan, 523808, China
| | - Chunlan Chen
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Dongmei Pan
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Heng Yang
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Tiantian Zhang
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Min Yang
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Qiangqiang Han
- SpecAlly Life Technology Co., Ltd, Wuhan, 430075, China
- Wuhan Biobank Co., Ltd, Wuhan, 430074, China
| | - Zou Wang
- Wuhan Biobank Co., Ltd, Wuhan, 430074, China
| | - Tian Xiao
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Yangchao Chen
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, NT, Hong Kong
| | - Yongdong Zou
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China
| | - Feng Yin
- Laboratory of Chemical Genomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen University Town, Xili, Shenzhen, 518055, China
| | - Zigang Li
- Laboratory of Chemical Genomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen University Town, Xili, Shenzhen, 518055, China
| | - Lizhi Zhu
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China.
- Guangdong Provincial Key Laboratory of Systems Biology and Synthetic Biology for Urogenital Tumors, Shenzhen, 518035, China.
| | - Duo Zheng
- Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, International Cancer Center, Department of Cell Biology and Genetics, Shenzhen University Medical School; College of Life Sciences and Oceanography, Shenzhen University; Department of Pharmacy, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital (Shenzhen Institute of Translational Medicine), Shenzhen, 518055, China.
| |
Collapse
|
21
|
Shanker VR, Bruun TU, Hie BL, Kim PS. Inverse folding of protein complexes with a structure-informed language model enables unsupervised antibody evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.19.572475. [PMID: 38187780 PMCID: PMC10769282 DOI: 10.1101/2023.12.19.572475] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
Large language models trained on sequence information alone are capable of learning high level principles of protein design. However, beyond sequence, the three-dimensional structures of proteins determine their specific function, activity, and evolvability. Here we show that a general protein language model augmented with protein structure backbone coordinates and trained on the inverse folding problem can guide evolution for diverse proteins without needing to explicitly model individual functional tasks. We demonstrate inverse folding to be an effective unsupervised, structure-based sequence optimization strategy that also generalizes to multimeric complexes by implicitly learning features of binding and amino acid epistasis. Using this approach, we screened ~30 variants of two therapeutic clinical antibodies used to treat SARS-CoV-2 infection and achieved up to 26-fold improvement in neutralization and 37-fold improvement in affinity against antibody-escaped viral variants-of-concern BQ.1.1 and XBB.1.5, respectively. In addition to substantial overall improvements in protein function, we find inverse folding performs with leading experimental success rates among other reported machine learning-guided directed evolution methods, without requiring any task-specific training data.
Collapse
Affiliation(s)
- Varun R. Shanker
- Stanford Biophysics Program, Stanford University School of Medicine, Stanford, CA 94305, USA
- Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Theodora U.J. Bruun
- Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Brian L. Hie
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Peter S. Kim
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| |
Collapse
|
22
|
Bunch H, Kim D, Naganuma M, Nakagawa R, Cong A, Jeong J, Ehara H, Vu H, Chang JH, Schellenberg MJ, Sekine SI. ERK2-topoisomerase II regulatory axis is important for gene activation in immediate early genes. Nat Commun 2023; 14:8341. [PMID: 38097570 PMCID: PMC10721843 DOI: 10.1038/s41467-023-44089-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 11/29/2023] [Indexed: 12/17/2023] Open
Abstract
The function of the mitogen-activated protein kinase signaling pathway is required for the activation of immediate early genes (IEGs), including EGR1 and FOS, for cell growth and proliferation. Recent studies have identified topoisomerase II (TOP2) as one of the important regulators of the transcriptional activation of IEGs. However, the mechanism underlying transcriptional regulation involving TOP2 in IEG activation has remained unknown. Here, we demonstrate that ERK2, but not ERK1, is important for IEG transcriptional activation and report a critical ELK1 binding sequence for ERK2 function at the EGR1 gene. Our data indicate that both ERK1 and ERK2 extensively phosphorylate the C-terminal domain of TOP2B at mutual and distinctive residues. Although both ERK1 and ERK2 enhance the catalytic rate of TOP2B required to relax positive DNA supercoiling, ERK2 delays TOP2B catalysis of negative DNA supercoiling. In addition, ERK1 may relax DNA supercoiling by itself. ERK2 catalytic inhibition or knock-down interferes with transcription and deregulates TOP2B in IEGs. Furthermore, we present the first cryo-EM structure of the human cell-purified TOP2B and etoposide together with the EGR1 transcriptional start site (-30 to +20) that has the strongest affinity to TOP2B within -423 to +332. The structure shows TOP2B-mediated breakage and dramatic bending of the DNA. Transcription is activated by etoposide, while it is inhibited by ICRF193 at EGR1 and FOS, suggesting that TOP2B-mediated DNA break to favor transcriptional activation. Taken together, this study suggests that activated ERK2 phosphorylates TOP2B to regulate TOP2-DNA interactions and favor transcriptional activation in IEGs. We propose that TOP2B association, catalysis, and dissociation on its substrate DNA are important processes for regulating transcription and that ERK2-mediated TOP2B phosphorylation may be key for the catalysis and dissociation steps.
Collapse
Affiliation(s)
- Heeyoun Bunch
- Department of Applied Biosciences, Kyungpook National University, Daegu, 41566, Republic of Korea.
- School of Applied Biosciences, College of Agriculture & Life Sciences, Kyungpook National University, Daegu, 41566, Republic of Korea.
| | - Deukyeong Kim
- School of Applied Biosciences, College of Agriculture & Life Sciences, Kyungpook National University, Daegu, 41566, Republic of Korea
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea
| | - Masahiro Naganuma
- Laboratory for Transcription Structural Biology, RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Japan
| | - Reiko Nakagawa
- RIKEN BDR Laboratory for Phyloinformatics, Hyogo, 650-0047, Japan
| | - Anh Cong
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Jaehyeon Jeong
- Department of Applied Biosciences, Kyungpook National University, Daegu, 41566, Republic of Korea
| | - Haruhiko Ehara
- Laboratory for Transcription Structural Biology, RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Japan
| | - Hongha Vu
- Department of Biology Education, Kyungpook National University, Daegu, 41566, Republic of Korea
| | - Jeong Ho Chang
- Department of Biology Education, Kyungpook National University, Daegu, 41566, Republic of Korea
| | - Matthew J Schellenberg
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Shun-Ichi Sekine
- Laboratory for Transcription Structural Biology, RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Japan
| |
Collapse
|
23
|
Notin P, Kollasch AW, Ritter D, van Niekerk L, Paul S, Spinner H, Rollins N, Shaw A, Weitzman R, Frazer J, Dias M, Franceschi D, Orenbuch R, Gal Y, Marks DS. ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570727. [PMID: 38106144 PMCID: PMC10723403 DOI: 10.1101/2023.12.07.570727] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Predicting the effects of mutations in proteins is critical to many applications, from understanding genetic disease to designing novel proteins that can address our most pressing challenges in climate, agriculture and healthcare. Despite a surge in machine learning-based protein models to tackle these questions, an assessment of their respective benefits is challenging due to the use of distinct, often contrived, experimental datasets, and the variable performance of models across different protein families. Addressing these challenges requires scale. To that end we introduce ProteinGym, a large-scale and holistic set of benchmarks specifically designed for protein fitness prediction and design. It encompasses both a broad collection of over 250 standardized deep mutational scanning assays, spanning millions of mutated sequences, as well as curated clinical datasets providing high-quality expert annotations about mutation effects. We devise a robust evaluation framework that combines metrics for both fitness prediction and design, factors in known limitations of the underlying experimental methods, and covers both zero-shot and supervised settings. We report the performance of a diverse set of over 70 high-performing models from various subfields (eg., alignment-based, inverse folding) into a unified benchmark suite. We open source the corresponding codebase, datasets, MSAs, structures, model predictions and develop a user-friendly website that facilitates data access and analysis.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Ada Shaw
- Applied Mathematics, Harvard University
| | | | | | - Mafalda Dias
- Centre for Genomic Regulation, Universitat Pompeu Fabra
| | | | | | - Yarin Gal
- Computer Science, University of Oxford
| | | |
Collapse
|
24
|
Martin-Vega A, Cobb MH. Navigating the ERK1/2 MAPK Cascade. Biomolecules 2023; 13:1555. [PMID: 37892237 PMCID: PMC10605237 DOI: 10.3390/biom13101555] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 10/16/2023] [Accepted: 10/18/2023] [Indexed: 10/29/2023] Open
Abstract
The RAS-ERK pathway is a fundamental signaling cascade crucial for many biological processes including proliferation, cell cycle control, growth, and survival; common across all cell types. Notably, ERK1/2 are implicated in specific processes in a context-dependent manner as in stem cells and pancreatic β-cells. Alterations in the different components of this cascade result in dysregulation of the effector kinases ERK1/2 which communicate with hundreds of substrates. Aberrant activation of the pathway contributes to a range of disorders, including cancer. This review provides an overview of the structure, activation, regulation, and mutational frequency of the different tiers of the cascade; with a particular focus on ERK1/2. We highlight the importance of scaffold proteins that contribute to kinase localization and coordinate interaction dynamics of the kinases with substrates, activators, and inhibitors. Additionally, we explore innovative therapeutic approaches emphasizing promising avenues in this field.
Collapse
Affiliation(s)
- Ana Martin-Vega
- Department of Pharmacology, UT Southwestern Medical Center, 6001 Forest Park Rd., Dallas, TX 75390, USA;
| | - Melanie H. Cobb
- Department of Pharmacology, UT Southwestern Medical Center, 6001 Forest Park Rd., Dallas, TX 75390, USA;
- Simmons Comprehensive Cancer Center, UT Southwestern Medical Center, 6001 Forest Park Rd., Dallas, TX 75390, USA
| |
Collapse
|
25
|
Tian Y, Zhang M, Heng P, Hou H, Wang B. Computational Investigations on Reaction Mechanisms of the Covalent Inhibitors Ponatinib and Analogs Targeting the Extracellular Signal-Regulated Kinases. Int J Mol Sci 2023; 24:15223. [PMID: 37894903 PMCID: PMC10607051 DOI: 10.3390/ijms242015223] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Revised: 10/08/2023] [Accepted: 10/14/2023] [Indexed: 10/29/2023] Open
Abstract
As an important cancer therapeutic target, extracellular signal-regulated kinases (ERK) are involved in triggering various cellular responses in tumors. Regulation of the ERK signaling pathway by the small molecular inhibitors is highly desired for the sake of cancer therapy. In contrast to the routine inhibitors targeting ERKs through long-range non-bonding interactions, Ponatinib, a covalent inhibitor to ERK2 with a macrocyclic structure characterized by the α,β-C=C unsaturated ketone, can form the stable -C(S)-C(H)-type complex via the four-center barrier due to the nucleophilic addition reaction of the thiol group of the Cys166 residue of ERK2 with the C=C double bond of Ponatinib with reaction free-energy barrier of 47.2 kcal/mol. Reaction mechanisms for the covalent binding were calculated using QM/MM methods and molecular dynamics simulations. The interaction modes and the corresponding binding free energies were obtained for the non-covalent and covalent complexation. The binding free energies of the non-covalent and covalent inhibitions are 14.8 kcal/mol and 33.4 kcal/mol, respectively. The mechanistic study stimulated a rational design on the modified Ponatinib structure by substituting the C=C bond with the C=N bond. It was demonstrated that the new compound exhibits better inhibition activity toward ERK2 in term of both thermodynamic and kinetic aspects through the covalent binding with a lower reaction free-energy barrier of 23.1 kcal/mol. The present theoretical work sheds new light on the development of the covalent inhibitors for the regulation of ERKs.
Collapse
Affiliation(s)
| | | | | | | | - Baoshan Wang
- College of Chemistry and Molecular Sciences, Wuhan University, Wuhan 430072, China; (Y.T.); (M.Z.); (P.H.); (H.H.)
| |
Collapse
|
26
|
Kugler V, Lieb A, Guerin N, Donald BR, Stefan E, Kaserer T. Disruptor: Computational identification of oncogenic mutants disrupting protein-protein and protein-DNA interactions. Commun Biol 2023; 6:720. [PMID: 37443295 PMCID: PMC10344873 DOI: 10.1038/s42003-023-05089-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 06/29/2023] [Indexed: 07/15/2023] Open
Abstract
We report an Osprey-based computational protocol to prospectively identify oncogenic mutations that act via disruption of molecular interactions. It is applicable to analyse both protein-protein and protein-DNA interfaces and it is validated on a dataset of clinically relevant mutations. In addition, it is used to predict previously uncharacterised patient mutations in CDK6 and p16 genes, which are experimentally confirmed to impair complex formation.
Collapse
Affiliation(s)
- Valentina Kugler
- Institute of Biochemistry and Center for Molecular Biosciences, University of Innsbruck, Innsbruck, Austria
| | - Andreas Lieb
- Institute of Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Nathan Guerin
- Department of Computer Science, Duke University, Durham, NC, USA
| | - Bruce R Donald
- Department of Computer Science, Duke University, Durham, NC, USA
- Department of Biochemistry, Duke University Medical Center, Durham, NC, USA
- Department of Chemistry, Duke University, Durham, NC, USA
- Department of Mathematics, Duke University, Durham, NC, USA
| | - Eduard Stefan
- Institute of Biochemistry and Center for Molecular Biosciences, University of Innsbruck, Innsbruck, Austria
- Tyrolean Cancer Research Institute (TKFI), Innrain 66, 6020, Innsbruck, Austria
- Institute of Molecular Biology, University of Innsbruck, Innsbruck, Austria
| | - Teresa Kaserer
- Institute of Pharmacy/Pharmaceutical Chemistry, University of Innsbruck, Innsbruck, Austria.
| |
Collapse
|
27
|
Hauser BM, Luo Y, Nathan A, Gaiha GD, Vavvas D, Comander J, Pierce EA, Place EM, Bujakowska KM, Rossin EJ. Structure-based network analysis predicts mutations associated with inherited retinal disease. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.07.05.23292247. [PMID: 37461650 PMCID: PMC10350150 DOI: 10.1101/2023.07.05.23292247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
With continued advances in gene sequencing technologies comes the need to develop better tools to understand which mutations cause disease. Here we validate structure-based network analysis (SBNA)1,2 in well-studied human proteins and report results of using SBNA to identify critical amino acids that may cause retinal disease if subject to missense mutation. We computed SBNA scores for genes with high-quality structural data, starting with validating the method using 4 well-studied human disease-associated proteins. We then analyzed 47 inherited retinal disease (IRD) genes. We compared SBNA scores to phenotype data from the ClinVar database and found a significant difference between benign and pathogenic mutations with respect to network score. Finally, we applied this approach to 65 patients at Massachusetts Eye and Ear (MEE) who were diagnosed with IRD but for whom no genetic cause was found. Multivariable logistic regression models built using SBNA scores for IRD-associated genes successfully predicted pathogenicity of novel mutations, allowing us to identify likely causative disease variants in 37 patients with IRD from our clinic. In conclusion, SBNA can be meaningfully applied to human proteins and may help predict mutations causative of IRD.
Collapse
Affiliation(s)
| | - Yuyang Luo
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Anusha Nathan
- Ragon Institute of Mass General, MIT, and Harvard, Cambridge, MA
| | - Gaurav D. Gaiha
- Ragon Institute of Mass General, MIT, and Harvard, Cambridge, MA
| | - Demetrios Vavvas
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Jason Comander
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Eric A. Pierce
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Emily M. Place
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Kinga M. Bujakowska
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Elizabeth J. Rossin
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| |
Collapse
|
28
|
Martin-Vega A, Earnest S, Augustyn A, Wichaidit C, Gazdar A, Girard L, Peyton M, Kollipara RK, Minna JD, Johnson JE, Cobb MH. ASCL1-ERK1/2 Axis: ASCL1 restrains ERK1/2 via the dual specificity phosphatase DUSP6 to promote survival of a subset of neuroendocrine lung cancers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.15.545148. [PMID: 37398419 PMCID: PMC10312738 DOI: 10.1101/2023.06.15.545148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
The transcription factor achaete-scute complex homolog 1 (ASCL1) is a lineage oncogene that is central for the growth and survival of small cell lung cancers (SCLC) and neuroendocrine non-small cell lung cancers (NSCLC-NE) that express it. Targeting ASCL1, or its downstream pathways, remains a challenge. However, a potential clue to overcoming this challenage has been information that SCLC and NSCLC-NE that express ASCL1 exhibit extremely low ERK1/2 activity, and efforts to increase ERK1/2 activity lead to inhibition of SCLC growth and surival. Of course, this is in dramatic contrast to the majority of NSCLCs where high activity of the ERK pathway plays a major role in cancer pathogenesis. A major knowledge gap is defining the mechanism(s) underlying the low ERK1/2 activity in SCLC, determining if ERK1/2 activity and ASCL1 function are inter-related, and if manipulating ERK1/2 activity provides a new therapeutic strategy for SCLC. We first found that expression of ERK signaling and ASCL1 have an inverse relationship in NE lung cancers: knocking down ASCL1 in SCLCs and NE-NSCLCs increased active ERK1/2, while inhibition of residual SCLC/NSCLC-NE ERK1/2 activity with a MEK inhibitor increased ASCL1 expression. To determine the effects of ERK activity on expression of other genes, we obtained RNA-seq from ASCL1-expressing lung tumor cells treated with an ERK pathway MEK inhibitor and identified down-regulated genes (such as SPRY4, ETV5, DUSP6, SPRED1) that potentially could influence SCLC/NSCLC-NE tumor cell survival. This led us to discover that genes regulated by MEK inhibition suppress ERK activation and CHIP-seq demonstrated these are bound by ASCL1. In addition, SPRY4, DUSP6, SPRED1 are known suppressors of the ERK1/2 pathway, while ETV5 regulates DUSP6. Survival of NE lung tumors was inhibited by activation of ERK1/2 and a subset of ASCL1-high NE lung tumors expressed DUSP6. Because the dual specificity phosphatase 6 (DUSP6) is an ERK1/2-selective phosphatase that inactivates these kinases and has a pharmacologic inhibitor, we focused mechanistic studies on DUSP6. These studies showed: Inhibition of DUSP6 increased active ERK1/2, which accumulated in the nucleus; pharmacologic and genetic inhibition of DUSP6 affected proliferation and survival of ASCL1-high NE lung cancers; and that knockout of DUSP6 "cured" some SCLCs while in others resistance rapidly developed indicating a bypass mechanism was activated. Thus, our findings fill this knowledge gap and indicate that combined expression of ASCL1, DUSP6 and low phospho-ERK1/2 identify some neuroendocrine lung cancers for which DUSP6 may be a therapeutic target.
Collapse
|
29
|
Gao H, Hamp T, Ede J, Schraiber JG, McRae J, Singer-Berk M, Yang Y, Dietrich ASD, Fiziev PP, Kuderna LFK, Sundaram L, Wu Y, Adhikari A, Field Y, Chen C, Batzoglou S, Aguet F, Lemire G, Reimers R, Balick D, Janiak MC, Kuhlwilm M, Orkin JD, Manu S, Valenzuela A, Bergman J, Rousselle M, Silva FE, Agueda L, Blanc J, Gut M, de Vries D, Goodhead I, Harris RA, Raveendran M, Jensen A, Chuma IS, Horvath JE, Hvilsom C, Juan D, Frandsen P, de Melo FR, Bertuol F, Byrne H, Sampaio I, Farias I, do Amaral JV, Messias M, da Silva MNF, Trivedi M, Rossi R, Hrbek T, Andriaholinirina N, Rabarivola CJ, Zaramody A, Jolly CJ, Phillips-Conroy J, Wilkerson G, Abee C, Simmons JH, Fernandez-Duque E, Kanthaswamy S, Shiferaw F, Wu D, Zhou L, Shao Y, Zhang G, Keyyu JD, Knauf S, Le MD, Lizano E, Merker S, Navarro A, Bataillon T, Nadler T, Khor CC, Lee J, Tan P, Lim WK, Kitchener AC, Zinner D, Gut I, Melin A, Guschanski K, Schierup MH, Beck RMD, Umapathy G, Roos C, Boubli JP, Lek M, Sunyaev S, O'Donnell-Luria A, Rehm HL, Xu J, Rogers J, Marques-Bonet T, Farh KKH. The landscape of tolerated genetic variation in humans and primates. Science 2023; 380:eabn8153. [PMID: 37262156 DOI: 10.1126/science.abn8197] [Citation(s) in RCA: 78] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Accepted: 03/22/2023] [Indexed: 06/03/2023]
Abstract
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 million common protein-altering variants with orthologs in humans. We show that these variants can be inferred to have nondeleterious effects in humans based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases.
Collapse
Affiliation(s)
- Hong Gao
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Tobias Hamp
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Jeffrey Ede
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Joshua G Schraiber
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Jeremy McRae
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Moriel Singer-Berk
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Boston, MA, 02142, USA
| | - Yanshen Yang
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | | | - Petko P Fiziev
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Lukas F K Kuderna
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Laksshman Sundaram
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Yibing Wu
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Aashish Adhikari
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Yair Field
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Chen Chen
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Serafim Batzoglou
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Francois Aguet
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Gabrielle Lemire
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Boston, MA, 02142, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Rebecca Reimers
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
| | - Daniel Balick
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Mareike C Janiak
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - Martin Kuhlwilm
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Department of Evolutionary Anthropology, University of Vienna, Djerassiplatz 1, 1030 Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, 1030 Vienna, Austria
| | - Joseph D Orkin
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Département d'anthropologie, Université de Montréal, 3150 Jean-Brillant, Montréal, QC H3T 1N8, Canada
| | - Shivakumara Manu
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology, Hyderabad 500007, India
| | - Alejandro Valenzuela
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Juraj Bergman
- Bioinformatics Research Centre, Aarhus University, Aarhus 8000, Denmark
- Section for Ecoinformatics & Biodiversity, Department of Biology, Aarhus University, 8000 Aarhus, Denmark
| | | | - Felipe Ennes Silva
- Research Group on Primate Biology and Conservation, Mamirauá Institute for Sustainable Development, Estrada da Bexiga 2584, Tefé, Amazonas, CEP 69553-225, Brazil
- Evolutionary Biology and Ecology (EBE), Département de Biologie des Organismes, Université libre de Bruxelles (ULB), Av. Franklin D. Roosevelt 50, CP 160/12, B-1050 Brussels, Belgium
| | - Lidia Agueda
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Julie Blanc
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Dorien de Vries
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - Ian Goodhead
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - R Alan Harris
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Muthuswamy Raveendran
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Axel Jensen
- Department of Ecology and Genetics, Animal Ecology, Uppsala University, SE-75236 Uppsala, Sweden
| | | | - Julie E Horvath
- North Carolina Museum of Natural Sciences, Raleigh, NC 27601, USA
- Department of Biological and Biomedical Sciences, North Carolina Central University, Durham, NC 27707, USA
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA
- Department of Evolutionary Anthropology, Duke University, Durham, NC 27708, USA
- Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | | | - David Juan
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | | | - Fabrício Bertuol
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL), Manaus, Amazonas, 69080-900, Brazil
| | - Hazel Byrne
- Department of Anthropology, University of Utah, Salt Lake City, UT 84102, USA
| | - Iracilda Sampaio
- Universidade Federal do Para, Guamá, Belém - PA, 66075-110, Brazil
| | - Izeni Farias
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL), Manaus, Amazonas, 69080-900, Brazil
| | - João Valsecchi do Amaral
- Research Group on Terrestrial Vertebrate Ecology, Mamirauá Institute for Sustainable Development, Tefé, Amazonas, 69553-225, Brazil
- Rede de Pesquisa para Estudos sobre Diversidade, Conservação e Uso da Fauna na Amazônia - RedeFauna, Manaus, Amazonas, 69080-900, Brazil
- Comunidad de Manejo de Fauna Silvestre en la Amazonía y en Latinoamérica - ComFauna, Iquitos, Loreto, 16001, Peru
| | - Mariluce Messias
- Universidade Federal de Rondonia, Porto Velho, Rondônia, 78900-000, Brazil
- PPGREN - Programa de Pós-Graduação "Conservação e Uso dos Recursos Naturais and BIONORTE - Programa de Pós-Graduação em Biodiversidade e Biotecnologia da Rede BIONORTE, Universidade Federal de Rondonia, Porto Velho, Rondônia, 78900-000, Brazil
| | - Maria N F da Silva
- Instituto Nacional de Pesquisas da Amazonia, Petrópolis, Manaus - AM, 69067-375, Brazil
| | - Mihir Trivedi
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology, Hyderabad 500007, India
| | - Rogerio Rossi
- Universidade Federal do Mato Grosso, Boa Esperança, Cuiabá - MT, 78060-900, Brazil
| | - Tomas Hrbek
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL), Manaus, Amazonas, 69080-900, Brazil
- Department of Biology, Trinity University, San Antonio, TX 78212, USA
| | - Nicole Andriaholinirina
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga, Mahajanga, 401, Madagascar
| | - Clément J Rabarivola
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga, Mahajanga, 401, Madagascar
| | - Alphonse Zaramody
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga, Mahajanga, 401, Madagascar
| | | | | | - Gregory Wilkerson
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Christian Abee
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Joe H Simmons
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Eduardo Fernandez-Duque
- Yale University, New Haven, CT 06520, USA
- Universidad Nacional de Formosa, Argentina Fundacion ECO, Formosa, Argentina
| | | | - Fekadu Shiferaw
- Guinea Worm Eradication Program, The Carter Center Ethiopia, PoB 16316, Addis Ababa 1000, Ethiopia
| | - Dongdong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Long Zhou
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou 310058, China
| | - Yong Shao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Guojie Zhang
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou 310058, China
- Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
- Liangzhu Laboratory, Zhejiang University Medical Center, 1369 West Wenyi Road, Hangzhou 311121, China
- Women's Hospital, School of Medicine, Zhejiang University, 1 Xueshi Road, Shangcheng District, Hangzhou 310006, China
| | - Julius D Keyyu
- Tanzania Wildlife Research Institute (TAWIRI), Head Office, P.O. Box 661, Arusha, Tanzania
| | - Sascha Knauf
- Institute of International Animal Health/One Health, Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health, 17493 Greifswald - Insei Riems, Germany
| | - Minh D Le
- Department of Environmental Ecology, Faculty of Environmental Sciences, University of Science and Central Institute for Natural Resources and Environmental Studies, Vietnam National University, Hanoi 100000, Vietnam
| | - Esther Lizano
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010 Barcelona, Spain
| | - Stefan Merker
- Department of Zoology, State Museum of Natural History Stuttgart, 70191 Stuttgart, Germany
| | - Arcadi Navarro
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Av. Doctor Aiguader, N88, 08003 Barcelona, Spain
- BarcelonaBeta Brain Research Center, Pasqual Maragall Foundation, C. Wellington 30, 08005 Barcelona, Spain
| | - Thomas Bataillon
- Bioinformatics Research Centre, Aarhus University, Aarhus 8000, Denmark
| | - Tilo Nadler
- Cuc Phuong Commune, Nho Quan District, Ninh Binh Province 430000, Vietnam
| | - Chiea Chuen Khor
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
| | - Jessica Lee
- Mandai Nature, 80 Mandai Lake Road, Singapore 729826, Republic of Singapore
| | - Patrick Tan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
- SingHealth Duke-NUS Institute of Precision Medicine (PRISM), Singapore 168582, Republic of Singapore
- Cancer and Stem Cell Biology Program, Duke-NUS Medical School, Singapore 168582, Republic of Singapore
| | - Weng Khong Lim
- SingHealth Duke-NUS Institute of Precision Medicine (PRISM), Singapore 168582, Republic of Singapore
- Cancer and Stem Cell Biology Program, Duke-NUS Medical School, Singapore 168582, Republic of Singapore
- SingHealth Duke-NUS Genomic Medicine Centre, Singapore 168582, Republic of Singapore
| | - Andrew C Kitchener
- Department of Natural Sciences, National Museums Scotland, Chambers Street, Edinburgh EH1 1JF, UK
- School of Geosciences, University of Edinburgh, Drummond Street, Edinburgh EH8 9XP, UK
| | - Dietmar Zinner
- Cognitive Ethology Laboratory, Germany Primate Center, Leibniz Institute for Primate Research, 37077 Göttingen, Germany
- Department of Primate Cognition, Georg-August-Universität Göttingen, 37077 Göttingen, Germany
- Leibniz Science Campus Primate Cognition, 37077 Göttingen, Germany
| | - Ivo Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
- Universitat Pompeu Fabra, Pg. Luís Companys 23, 08010 Barcelona, Spain
| | - Amanda Melin
- Department of Anthropology & Archaeology, University of Calgary, 2500 University Dr NW, Calgary, AB T2N 1N4, Canada
- Department of Medical Genetics, 3330 Hospital Drive NW, HMRB 202, Calgary, AB T2N 4N1, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, 2500 University Dr NW, Calgary, AB T2N 1N4, Canada
| | - Katerina Guschanski
- Department of Ecology and Genetics, Animal Ecology, Uppsala University, SE-75236 Uppsala, Sweden
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh EH8 9XP, UK
| | | | - Robin M D Beck
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - Govindhaswamy Umapathy
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology, Hyderabad 500007, India
| | - Christian Roos
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, Kellnerweg 4, 37077 Göttingen, Germany
| | - Jean P Boubli
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - Monkol Lek
- Department of Genetics, Yale School of Medicine, New Haven, CT 06520, USA
| | - Shamil Sunyaev
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Anne O'Donnell-Luria
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Boston, MA, 02142, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Heidi L Rehm
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Boston, MA, 02142, USA
- Department of Genetics, Yale School of Medicine, New Haven, CT 06520, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Jinbo Xu
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
- Toyota Technological Institute at Chicago, Chicago, IL 60637, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
| | - Kyle Kai-How Farh
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| |
Collapse
|
30
|
Sherkatghanad Z, Abdar M, Charlier J, Makarenkov V. Using traditional machine learning and deep learning methods for on- and off-target prediction in CRISPR/Cas9: a review. Brief Bioinform 2023; 24:bbad131. [PMID: 37080758 PMCID: PMC10199778 DOI: 10.1093/bib/bbad131] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 03/07/2023] [Accepted: 03/13/2023] [Indexed: 04/22/2023] Open
Abstract
CRISPR/Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9) is a popular and effective two-component technology used for targeted genetic manipulation. It is currently the most versatile and accurate method of gene and genome editing, which benefits from a large variety of practical applications. For example, in biomedicine, it has been used in research related to cancer, virus infections, pathogen detection, and genetic diseases. Current CRISPR/Cas9 research is based on data-driven models for on- and off-target prediction as a cleavage may occur at non-target sequence locations. Nowadays, conventional machine learning and deep learning methods are applied on a regular basis to accurately predict on-target knockout efficacy and off-target profile of given single-guide RNAs (sgRNAs). In this paper, we present an overview and a comparative analysis of traditional machine learning and deep learning models used in CRISPR/Cas9. We highlight the key research challenges and directions associated with target activity prediction. We discuss recent advances in the sgRNA-DNA sequence encoding used in state-of-the-art on- and off-target prediction models. Furthermore, we present the most popular deep learning neural network architectures used in CRISPR/Cas9 prediction models. Finally, we summarize the existing challenges and discuss possible future investigations in the field of on- and off-target prediction. Our paper provides valuable support for academic and industrial researchers interested in the application of machine learning methods in the field of CRISPR/Cas9 genome editing.
Collapse
Affiliation(s)
- Zeinab Sherkatghanad
- Departement d’Informatique, Universite du Quebec a Montreal, H2X 3Y7, Montreal, QC, Canada
| | - Moloud Abdar
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, 3216, Geelong, VIC, Australia
| | - Jeremy Charlier
- Departement d’Informatique, Universite du Quebec a Montreal, H2X 3Y7, Montreal, QC, Canada
| | - Vladimir Makarenkov
- Departement d’Informatique, Universite du Quebec a Montreal, H2X 3Y7, Montreal, QC, Canada
| |
Collapse
|
31
|
Gao H, Hamp T, Ede J, Schraiber JG, McRae J, Singer-Berk M, Yang Y, Dietrich A, Fiziev P, Kuderna L, Sundaram L, Wu Y, Adhikari A, Field Y, Chen C, Batzoglou S, Aguet F, Lemire G, Reimers R, Balick D, Janiak MC, Kuhlwilm M, Orkin JD, Manu S, Valenzuela A, Bergman J, Rouselle M, Silva FE, Agueda L, Blanc J, Gut M, de Vries D, Goodhead I, Harris RA, Raveendran M, Jensen A, Chuma IS, Horvath J, Hvilsom C, Juan D, Frandsen P, de Melo FR, Bertuol F, Byrne H, Sampaio I, Farias I, do Amaral JV, Messias M, da Silva MNF, Trivedi M, Rossi R, Hrbek T, Andriaholinirina N, Rabarivola CJ, Zaramody A, Jolly CJ, Phillips-Conroy J, Wilkerson G, Abee C, Simmons JH, Fernandez-Duque E, Kanthaswamy S, Shiferaw F, Wu D, Zhou L, Shao Y, Zhang G, Keyyu JD, Knauf S, Le MD, Lizano E, Merker S, Navarro A, Batallion T, Nadler T, Khor CC, Lee J, Tan P, Lim WK, Kitchener AC, Zinner D, Gut I, Melin A, Guschanski K, Schierup MH, Beck RMD, Umapathy G, Roos C, Boubli JP, Lek M, Sunyaev S, O’Donnell A, Rehm H, Xu J, Rogers J, Marques-Bonet T, Kai-How Farh K. The landscape of tolerated genetic variation in humans and primates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.01.538953. [PMID: 37205491 PMCID: PMC10187174 DOI: 10.1101/2023.05.01.538953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole genome sequencing data for 809 individuals from 233 primate species, and identified 4.3 million common protein-altering variants with orthologs in human. We show that these variants can be inferred to have non-deleterious effects in human based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases. One Sentence Summary Deep learning classifier trained on 4.3 million common primate missense variants predicts variant pathogenicity in humans.
Collapse
Affiliation(s)
- Hong Gao
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Tobias Hamp
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Jeffrey Ede
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Joshua G. Schraiber
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Jeremy McRae
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Moriel Singer-Berk
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard; Boston, Massachusetts, 02142, USA
| | - Yanshen Yang
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Anastasia Dietrich
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Petko Fiziev
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Lukas Kuderna
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Laksshman Sundaram
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Yibing Wu
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Aashish Adhikari
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Yair Field
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Chen Chen
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Serafim Batzoglou
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Francois Aguet
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Gabrielle Lemire
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard; Boston, Massachusetts, 02142, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
| | - Rebecca Reimers
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
| | - Daniel Balick
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
| | - Mareike C. Janiak
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - Martin Kuhlwilm
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Department of Evolutionary Anthropology, University of Vienna; Djerassiplatz 1, 1030, Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna; 1030, Vienna, Austria
| | - Joseph D. Orkin
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Département d’anthropologie, Université de Montréal; 3150 Jean-Brillant, Montréal, QC, H3T 1N8, Canada
| | - Shivakumara Manu
- Academy of Scientific and Innovative Research (AcSIR); Ghaziabad, 201002, India
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology; Hyderabad, 500007, India
| | - Alejandro Valenzuela
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Juraj Bergman
- Bioinformatics Research Centre, Aarhus University; Aarhus, 8000, Denmark
- Section for Ecoinformatics & Biodiversity, Department of Biology, Aarhus University; Aarhus, 8000, Denmark
| | | | - Felipe Ennes Silva
- Research Group on Primate Biology and Conservation, Mamirauá Institute for Sustainable Development; Estrada da Bexiga 2584, Tefé, Amazonas, CEP 69553-225, Brazil
- Faculty of Sciences, Department of Organismal Biology, Unit of Evolutionary Biology and Ecology, Université Libre de Bruxelles (ULB); Avenue Franklin D. Roosevelt 50, 1050, Brussels, Belgium
| | - Lidia Agueda
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
| | - Julie Blanc
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
| | - Dorien de Vries
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - Ian Goodhead
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - R. Alan Harris
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine; Houston, Texas, 77030, USA
| | - Muthuswamy Raveendran
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine; Houston, Texas, 77030, USA
| | - Axel Jensen
- Department of Ecology and Genetics, Animal Ecology, Uppsala University; SE-75236, Uppsala, Sweden
| | | | - Julie Horvath
- North Carolina Museum of Natural Sciences; Raleigh, North Carolina, 27601, USA
- Department of Biological and Biomedical Sciences, North Carolina Central University; Durham, North Carolina , 27707, USA
- Department of Biological Sciences, North Carolina State University; Raleigh, North Carolina , 27695, USA
- Department of Evolutionary Anthropology, Duke University; Durham, North Carolina , 27708, USA
- Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | | | - David Juan
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | | | - Fabricio Bertuol
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL); Manaus, Amazonas, 69080-900, Brazil
| | - Hazel Byrne
- Department of Anthropology, University of Utah; Salt Lake City, Utah, 84102, USA
| | - Iracilda Sampaio
- Universidade Federal do Para; Guamá, Belém - PA, 66075-110, Brazil
| | - Izeni Farias
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL); Manaus, Amazonas, 69080-900, Brazil
| | - João Valsecchi do Amaral
- Research Group on Terrestrial Vertebrate Ecology, Mamirauá Institute for Sustainable Development; Tefé, Amazonas, 69553-225, Brazil
- Rede de Pesquisa para Estudos sobre Diversidade, Conservação e Uso da Fauna na Amazônia – RedeFauna; Manaus, Amazonas, 69080-900, Brazil
- Comunidad de Manejo de Fauna Silvestre en la Amazonía y en Latinoamérica – ComFauna; Iquitos, Loreto, 16001, Peru
| | - Mariluce Messias
- Universidade Federal de Rondonia; Porto Velho, Rondônia, 78900-000, Brazil
- PPGREN - Programa de Pós-Graduação “Conservação e Uso dos Recursos Naturais and BIONORTE - Programa de Pós-Graduação em Biodiversidade e Biotecnologia da Rede BIONORTE, Universidade Federal de Rondonia; Porto Velho, Rondônia, 78900-000, Brazil
| | - Maria N. F. da Silva
- Instituto Nacional de Pesquisas da Amazonia; Petrópolis, Manaus - AM, 69067-375, Brazil
| | - Mihir Trivedi
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology; Hyderabad, 500007, India
| | - Rogerio Rossi
- Universidade Federal do Mato Grosso; Boa Esperança, Cuiabá - MT, 78060-900, Brazil
| | - Tomas Hrbek
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL); Manaus, Amazonas, 69080-900, Brazil
- Department of Biology, Trinity University; San Antonio, Texas, 78212, USA
| | - Nicole Andriaholinirina
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga; Mahajanga, 401, Madagascar
| | - Clément J. Rabarivola
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga; Mahajanga, 401, Madagascar
| | - Alphonse Zaramody
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga; Mahajanga, 401, Madagascar
| | | | | | - Gregory Wilkerson
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center; Houston, Texas, 77030, USA
| | | | - Joe H. Simmons
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center; Houston, Texas, 77030, USA
| | - Eduardo Fernandez-Duque
- Yale University; New Haven, Connecticut, 06520, USA
- Universidad Nacional de Formosa, Argentina Fundacion ECO, Formosa, Argentina
| | | | | | - Dongdong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences; Kunming, Yunnan, 650223, China
| | - Long Zhou
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Yong Shao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences; Kunming, Yunnan, 650223, China
| | - Guojie Zhang
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen; Copenhagen, DK-2100, Denmark
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
- Liangzhu Laboratory, Zhejiang University Medical Center; 1369 West Wenyi Road, Hangzhou, 311121, China
- Women’s Hospital, School of Medicine, Zhejiang University; 1 Xueshi Road, Shangcheng District, Hangzhou, 310006, China
| | - Julius D. Keyyu
- Tanzania Wildlife Research Institute (TAWIRI), Head Office; P.O.Box 661, Arusha, Tanzania
| | - Sascha Knauf
- Institute of International Animal Health/One Health, Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health; 17493 Greifswald - Isle of Riems, Germany
| | - Minh D. Le
- Department of Environmental Ecology, Faculty of Environmental Sciences, University of Science and Central Institute for Natural Resources and Environmental Studies, Vietnam National University; Hanoi, 100000, Vietnam
| | - Esther Lizano
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain
| | - Stefan Merker
- Department of Zoology, State Museum of Natural History Stuttgart; 70191 Stuttgart, Germany
| | - Arcadi Navarro
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA) and Universitat Pompeu Fabra, Pg. Luís Companys 23, Barcelona, 08010, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology; Av. Doctor Aiguader, N88, Barcelona, 08003, Spain
- BarcelonaBeta Brain Research Center, Pasqual Maragall Foundation; C. Wellington 30, Barcelona, 08005, Spain
| | - Thomas Batallion
- Bioinformatics Research Centre, Aarhus University; Aarhus, 8000, Denmark
| | - Tilo Nadler
- Cuc Phuong Commune; Nho Quan District, Ninh Binh Province, 430000, Vietnam
| | - Chiea Chuen Khor
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
| | - Jessica Lee
- Mandai Nature; 80 Mandai Lake Road, Singapore 729826, Republic of Singapore
| | - Patrick Tan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
- SingHealth Duke-NUS Institute of Precision Medicine (PRISM); Singapore 168582, Republic of Singapore
- Cancer and Stem Cell Biology Program, Duke-NUS Medical School; Singapore 168582, Republic of Singapore
| | - Weng Khong Lim
- SingHealth Duke-NUS Institute of Precision Medicine (PRISM); Singapore 168582, Republic of Singapore
- Cancer and Stem Cell Biology Program, Duke-NUS Medical School; Singapore 168582, Republic of Singapore
- SingHealth Duke-NUS Genomic Medicine Centre; Singapore 168582, Republic of Singapore
| | - Andrew C. Kitchener
- Department of Natural Sciences, National Museums Scotland; Chambers Street, Edinburgh, EH1 1JF, UK
- School of Geosciences, University of Edinburgh; Drummond Street, Edinburgh, EH8 9XP, UK
| | - Dietmar Zinner
- Cognitive Ethology Laboratory, Germany Primate Center, Leibniz Institute for Primate Research; 37077 Göttingen, Germany
- Department of Primate Cognition, Georg-August-Universität Göttingen; 37077 Göttingen, Germany
| | - Ivo Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
- Universitat Pompeu Fabra, Pg. Luís Companys 23, Barcelona, 08010, Spain
| | - Amanda Melin
- Leibniz Science Campus Primate Cognition; 37077 Göttingen, Germany
- Department of Anthropology & Archaeology and Department of Medical Genetics
| | - Katerina Guschanski
- Department of Ecology and Genetics, Animal Ecology, Uppsala University; SE-75236, Uppsala, Sweden
- Alberta Children’s Hospital Research Institute; University of Calgary; 2500 University Dr NW T2N 1N4, Calgary, Alberta, Canada
| | | | - Robin M. D. Beck
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - Govindhaswamy Umapathy
- Academy of Scientific and Innovative Research (AcSIR); Ghaziabad, 201002, India
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology; Hyderabad, 500007, India
| | - Christian Roos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh; Edinburgh, EH8 9XP, UK
| | - Jean P. Boubli
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - Monkol Lek
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research; Kellnerweg 4, 37077 Göttingen, Germany
| | - Shamil Sunyaev
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
- Department of Genetics, Yale School of Medicine; New Haven, Connecticut, 06520, USA
| | - Anne O’Donnell
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard; Boston, Massachusetts, 02142, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, 02115, USA
| | - Heidi Rehm
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard; Boston, Massachusetts, 02142, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School; Boston, Massachusetts, 02115, USA
| | - Jinbo Xu
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
- Toyota Technological Institute at Chicago; Chicago, Illinois, 60637, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine; Houston, Texas, 77030, USA
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA) and Universitat Pompeu Fabra, Pg. Luís Companys 23, Barcelona, 08010, Spain
| | - Kyle Kai-How Farh
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| |
Collapse
|
32
|
Flynn J, Samant N, Schneider-Nachum G, Tenzin T, Bolon DNA. Mutational fitness landscape and drug resistance. Curr Opin Struct Biol 2023; 78:102525. [PMID: 36621152 PMCID: PMC10243218 DOI: 10.1016/j.sbi.2022.102525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 11/29/2022] [Accepted: 12/06/2022] [Indexed: 01/08/2023]
Abstract
Robust technology has been developed to systematically quantify fitness landscapes that provide valuable opportunities to improve our understanding of drug resistance and define new avenues to develop drugs with reduced resistance susceptibility. We outline the critical importance of drug resistance studies and the potential for fitness landscape approaches to contribute to this effort. We describe the major technical advancements in mutational scanning, which is the primary approach used to quantify protein fitness landscapes. There are many complex steps to consider in planning and executing mutational scanning projects including developing a selection scheme, generating mutant libraries, tracking the frequency of variants using next-generation sequencing, and processing and interpreting the data. Key experimental parameters impacting each of these steps are discussed to aid in planning fitness landscape studies. There is a strong need for improved understanding of drug resistance, and fitness landscapes provide a promising new approach.
Collapse
Affiliation(s)
- Julia Flynn
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Neha Samant
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Gily Schneider-Nachum
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Tsepal Tenzin
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Daniel N A Bolon
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA.
| |
Collapse
|
33
|
Tartaglia M, Aoki Y, Gelb BD. The molecular genetics of RASopathies: An update on novel disease genes and new disorders. AMERICAN JOURNAL OF MEDICAL GENETICS. PART C, SEMINARS IN MEDICAL GENETICS 2022; 190:425-439. [PMID: 36394128 PMCID: PMC10100036 DOI: 10.1002/ajmg.c.32012] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 10/31/2022] [Accepted: 11/05/2022] [Indexed: 11/18/2022]
Abstract
Enhanced signaling through RAS and the mitogen-associated protein kinase (MAPK) cascade underlies the RASopathies, a family of clinically related disorders affecting development and growth. In RASopathies, increased RAS-MAPK signaling can result from the upregulated activity of various RAS GTPases, enhanced function of proteins positively controlling RAS function or favoring the efficient transmission of RAS signaling to downstream transducers, functional upregulation of RAS effectors belonging to the MAPK cascade, or inefficient signaling switch-off operated by feedback mechanisms acting at different levels. The massive effort in RASopathy gene discovery performed in the last 20 years has identified more than 20 genes implicated in these disorders. It has also facilitated the characterization of several molecular activating mechanisms that had remained unappreciated due to their minor impact in oncogenesis. Here, we provide an overview on the discoveries collected during the last 5 years that have delivered unexpected insights (e.g., Noonan syndrome as a recessive disease) and allowed to profile new RASopathies, novel disease genes and new molecular circuits contributing to the control of RAS-MAPK signaling.
Collapse
Affiliation(s)
- Marco Tartaglia
- Genetics and Rare Diseases Research DivisionOspedale Pediatrico Bambino Gesù, IRCCSRomeItaly
| | - Yoko Aoki
- Department of Medical GeneticsTohoku University School of MedicineSendaiJapan
| | - Bruce D. Gelb
- Mindich Child Health and Development InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Pediatrics and GeneticsIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| |
Collapse
|
34
|
Hain C, Stadler R, Kalinowski J. Unraveling the Structural Variations of Early-Stage Mycosis Fungoides-CD3 Based Purification and Third Generation Sequencing as Novel Tools for the Genomic Landscape in CTCL. Cancers (Basel) 2022; 14:4466. [PMID: 36139626 PMCID: PMC9497107 DOI: 10.3390/cancers14184466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 09/08/2022] [Accepted: 09/12/2022] [Indexed: 11/16/2022] Open
Abstract
Mycosis fungoides (MF) is the most common cutaneous T-cell lymphoma (CTCL). At present, knowledge of genetic changes in early-stage MF is insufficient. Additionally, low tumor cell fraction renders calling of copy-number variations as the predominant mutations in MF challenging, thereby impeding further investigations. We show that enrichment of T cells from a biopsy of a stage I MF patient greatly increases tumor fraction. This improvement enables accurate calling of recurrent MF copy-number variants such as ARID1A and CDKN2A deletion and STAT5 amplification, undetected in the unprocessed biopsy. Furthermore, we demonstrate that application of long-read nanopore sequencing is especially useful for the structural variant rich CTCL. We detect the structural variants underlying recurrent MF copy-number variants and show phasing of multiple breakpoints into complex structural variant haplotypes. Additionally, we record multiple occurrences of templated insertion structural variants in this sample. Taken together, this study suggests a workflow to make the early stages of MF accessible for genetic analysis, and indicates long-read sequencing as a major tool for genetic analysis for MF.
Collapse
Affiliation(s)
- Carsten Hain
- Center for Biotechnology (CeBiTec), Bielefeld University, 33615 Bielefeld, Germany
| | - Rudolf Stadler
- University Clinic for Dermatology, Johannes Wesling Medical Centre, UKRUB, University of Bochum, 32429 Minden, Germany
| | - Jörn Kalinowski
- Center for Biotechnology (CeBiTec), Bielefeld University, 33615 Bielefeld, Germany
| |
Collapse
|
35
|
Zhang Y, Truong B, Fahl SP, Martinez E, Cai KQ, Al-Saleem ED, Gong Y, Liebermann DA, Soboloff J, Dunbrack R, Levine RL, Fletcher S, Kappes D, Sykes SM, Shapiro P, Wiest DL. The ERK2-DBP domain opposes pathogenesis of a mouse JAK2V617F-driven myeloproliferative neoplasm. Blood 2022; 140:359-373. [PMID: 35436326 PMCID: PMC9335498 DOI: 10.1182/blood.2021013068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 03/30/2022] [Indexed: 01/18/2023] Open
Abstract
Although Ras/mitogen-activated protein kinase (MAPK) signaling is activated in most human cancers, attempts to target this pathway using kinase-active site inhibitors have not typically led to durable clinical benefit. To address this shortcoming, we sought to test the feasibility of an alternative targeting strategy, focused on the ERK2 substrate binding domains, D and DEF binding pocket (DBP). Disabling the ERK2-DBP domain in mice caused baseline erythrocytosis. Consequently, we investigated the role of the ERK2-D and -DBP domains in disease, using a JAK2-dependent model of polycythemia vera (PV). Of note, inactivation of the ERK2-DBP domain promoted the progression of disease from PV to myelofibrosis, suggesting that the ERK2-DBP domain normally opposes progression. ERK2-DBP inactivation also prevented oncogenic JAK2 kinase (JAK2V617F) from promoting oncogene-induced senescence in vitro. The ERK2-DBP mutation attenuated JAK2-mediated oncogene-induced senescence by preventing the physical interaction of ERK2 with the transcription factor Egr1. Because inactivation of the ERK2-DBP created a functional ERK2 kinase limited to binding substrates through its D domain, these data suggested that the D domain substrates were responsible for promoting oncogene-induced progenitor growth and tumor progression and that pharmacologic targeting of the ERK2-D domain may attenuate cancer cell growth. Indeed, pharmacologic agents targeting the ERK2-D domain were effective in attenuating the growth of JAK2-dependent myeloproliferative neoplasm cell lines. Taken together, these data indicate that the ERK-D and -DBP domains can play distinct roles in the progression of neoplasms and that the D domain has the potential to be a potent therapeutic target in Ras/MAPK-dependent cancers.
Collapse
Affiliation(s)
- Yong Zhang
- Blood Cell Development and Function Program
| | | | | | | | | | | | - Yulan Gong
- Department of Pathology, Fox Chase Cancer Center, Philadelphia, PA
| | - Dan A Liebermann
- Fels Institute for Personalized Medicine and Molecular Biology, Lewis Katz School of Medicine, Temple University, Philadelphia, PA
| | - Jonathan Soboloff
- Fels Institute for Personalized Medicine and Molecular Biology, Lewis Katz School of Medicine, Temple University, Philadelphia, PA
| | - Roland Dunbrack
- Molecular Therapeutics Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Ross L Levine
- Department of Medicine, Leukemia Service, Center for Hematologic Malignancies, Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY; and
| | - Steven Fletcher
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Baltimore, MD
| | | | | | - Paul Shapiro
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Baltimore, MD
| | | |
Collapse
|
36
|
Kim Y, Lee S, Cho S, Park J, Chae D, Park T, Minna JD, Kim HH. High-throughput functional evaluation of human cancer-associated mutations using base editors. Nat Biotechnol 2022; 40:874-884. [PMID: 35411116 PMCID: PMC10243181 DOI: 10.1038/s41587-022-01276-4] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Accepted: 03/10/2022] [Indexed: 12/26/2022]
Abstract
Comprehensive phenotypic characterization of the many mutations found in cancer tissues is one of the biggest challenges in cancer genomics. In this study, we evaluated the functional effects of 29,060 cancer-related transition mutations that result in protein variants on the survival and proliferation of non-tumorigenic lung cells using cytosine and adenine base editors and single guide RNA (sgRNA) libraries. By monitoring base editing efficiencies and outcomes using surrogate target sequences paired with sgRNA-encoding sequences on the lentiviral delivery construct, we identified sgRNAs that induced a single primary protein variant per sgRNA, enabling linking those mutations to the cellular phenotypes caused by base editing. The functions of the vast majority of the protein variants (28,458 variants, 98%) were classified as neutral or likely neutral; only 18 (0.06%) and 157 (0.5%) variants caused outgrowing and likely outgrowing phenotypes, respectively. We expect that our approach can be extended to more variants of unknown significance and other tumor types.
Collapse
Affiliation(s)
- Younggwang Kim
- Department of Pharmacology, Yonsei University College of Medicine, Seoul, Republic of Korea
- Graduate School of Medical Science, Brain Korea 21 Plus Project for Medical Sciences, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Seungho Lee
- Department of Pharmacology, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Soohyuk Cho
- Department of Pharmacology, Yonsei University College of Medicine, Seoul, Republic of Korea
- Graduate School of Medical Science, Brain Korea 21 Plus Project for Medical Sciences, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Jinman Park
- Department of Pharmacology, Yonsei University College of Medicine, Seoul, Republic of Korea
- Graduate School of Medical Science, Brain Korea 21 Plus Project for Medical Sciences, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Dongwoo Chae
- Department of Pharmacology, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Taeyoung Park
- Department of Applied Statistics, Yonsei University, Seoul, Republic of Korea
| | - John D Minna
- Hamon Center for Therapeutic Oncology Research, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Hyongbum Henry Kim
- Department of Pharmacology, Yonsei University College of Medicine, Seoul, Republic of Korea.
- Graduate School of Medical Science, Brain Korea 21 Plus Project for Medical Sciences, Yonsei University College of Medicine, Seoul, Republic of Korea.
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Republic of Korea.
- Center for Nanomedicine, Institute for Basic Science (IBS), Seoul, Republic of Korea.
- Yonsei-IBS Institute, Yonsei University, Seoul, Republic of Korea.
- Institute for Immunology and Immunological Diseases, Yonsei University College of Medicine, Seoul, Republic of Korea.
| |
Collapse
|
37
|
Livesey BJ, Marsh JA. Interpreting protein variant effects with computational predictors and deep mutational scanning. Dis Model Mech 2022; 15:275742. [PMID: 35736673 PMCID: PMC9235876 DOI: 10.1242/dmm.049510] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Computational predictors of genetic variant effect have advanced rapidly in recent years. These programs provide clinical and research laboratories with a rapid and scalable method to assess the likely impacts of novel variants. However, it can be difficult to know to what extent we can trust their results. To benchmark their performance, predictors are often tested against large datasets of known pathogenic and benign variants. These benchmarking data may overlap with the data used to train some supervised predictors, which leads to data re-use or circularity, resulting in inflated performance estimates for those predictors. Furthermore, new predictors are usually found by their authors to be superior to all previous predictors, which suggests some degree of computational bias in their benchmarking. Large-scale functional assays known as deep mutational scans provide one possible solution to this problem, providing independent datasets of variant effect measurements. In this Review, we discuss some of the key advances in predictor methodology, current benchmarking strategies and how data derived from deep mutational scans can be used to overcome the issue of data circularity. We also discuss the ability of such functional assays to directly predict clinical impacts of mutations and how this might affect the future need for variant effect predictors.
Collapse
Affiliation(s)
- Benjamin J Livesey
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Joseph A Marsh
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
| |
Collapse
|
38
|
Deák G, Cook AG. Missense Variants Reveal Functional Insights Into the Human ARID Family of Gene Regulators. J Mol Biol 2022; 434:167529. [PMID: 35257783 PMCID: PMC9077328 DOI: 10.1016/j.jmb.2022.167529] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 02/10/2022] [Accepted: 03/01/2022] [Indexed: 11/16/2022]
Abstract
Missense variants are alterations to protein coding sequences that result in amino acid substitutions. They can be deleterious if the amino acid is required for maintaining structure or/and function, but are likely to be tolerated at other sites. Consequently, missense variation within a healthy population can mirror the effects of negative selection on protein structure and function, such that functional sites on proteins are often depleted of missense variants. Advances in high-throughput sequencing have dramatically increased the sample size of available human variation data, allowing for population-wide analysis of selective pressures. In this study, we developed a convenient set of tools, called 1D-to-3D, for visualizing the positions of missense variants on protein sequences and structures. We used these tools to characterize human homologues of the ARID family of gene regulators. ARID family members are implicated in multiple cancer types, developmental disorders, and immunological diseases but current understanding of their mechanistic roles is incomplete. Combined with phylogenetic and structural analyses, our approach allowed us to characterise sites important for protein-protein interactions, histone modification recognition, and DNA binding by the ARID proteins. We find that comparing missense depletion patterns among paralogs can reveal sub-functionalization at the level of domains. We propose that visualizing missense variants and their depletion on structures can serve as a valuable tool for complementing evolutionary and experimental findings.
Collapse
Affiliation(s)
- Gauri Deák
- Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, United Kingdom. https://twitter.com/GauriDeak
| | - Atlanta G Cook
- Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, United Kingdom.
| |
Collapse
|
39
|
Scheele RA, Lindenburg LH, Petek M, Schober M, Dalby KN, Hollfelder F. Droplet-based screening of phosphate transfer catalysis reveals how epistasis shapes MAP kinase interactions with substrates. Nat Commun 2022; 13:844. [PMID: 35149678 PMCID: PMC8837617 DOI: 10.1038/s41467-022-28396-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 01/10/2022] [Indexed: 11/20/2022] Open
Abstract
The combination of ultrahigh-throughput screening and sequencing informs on function and intragenic epistasis within combinatorial protein mutant libraries. Establishing a droplet-based, in vitro compartmentalised approach for robust expression and screening of protein kinase cascades (>107 variants/day) allowed us to dissect the intrinsic molecular features of the MKK-ERK signalling pathway, without interference from endogenous cellular components. In a six-residue combinatorial library of the MKK1 docking domain, we identified 29,563 sequence permutations that allow MKK1 to efficiently phosphorylate and activate its downstream target kinase ERK2. A flexibly placed hydrophobic sequence motif emerges which is defined by higher order epistatic interactions between six residues, suggesting synergy that enables high connectivity in the sequence landscape. Through positive epistasis, MKK1 maintains function during mutagenesis, establishing the importance of co-dependent residues in mammalian protein kinase-substrate interactions, and creating a scenario for the evolution of diverse human signalling networks. Here, the authors use a droplet-based screen for phosphate transfer catalysis, testing variants of the human protein kinase MKK1 for its ability to activate its downstream target ERK2. Data reveal a flexible motif in the MKK1 docking domain that promotes efficient activation of ERK2, and suggest epistasis between the residues within that sequence.
Collapse
Affiliation(s)
- Remkes A Scheele
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
| | | | - Maya Petek
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK.,Faculty of Medicine, University of Maribor, SI-2000, Maribor, Slovenia
| | - Markus Schober
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
| | - Kevin N Dalby
- Division of Chemical Biology and Medicinal Chemistry, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Florian Hollfelder
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK.
| |
Collapse
|
40
|
Trivedi VD, Chappell TC, Krishna NB, Shetty A, Sigamani GG, Mohan K, Ramesh A, R PK, Nair NU. In-Depth Sequence–Function Characterization Reveals Multiple Pathways to Enhance Enzymatic Activity. ACS Catal 2022. [DOI: 10.1021/acscatal.1c05508] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Vikas D. Trivedi
- Department of Chemical and Biological Engineering, Tufts University, Medford, Massachusetts 02155, United States
| | - Todd C. Chappell
- Department of Chemical and Biological Engineering, Tufts University, Medford, Massachusetts 02155, United States
| | | | - Anuj Shetty
- Kcat Enzymatic Private Limited, Bengaluru, Karnataka, India 560005
| | | | - Karishma Mohan
- Department of Chemical and Biological Engineering, Tufts University, Medford, Massachusetts 02155, United States
| | - Athreya Ramesh
- Department of Chemical and Biological Engineering, Tufts University, Medford, Massachusetts 02155, United States
| | - Pravin Kumar R
- Kcat Enzymatic Private Limited, Bengaluru, Karnataka, India 560005
| | - Nikhil U. Nair
- Department of Chemical and Biological Engineering, Tufts University, Medford, Massachusetts 02155, United States
| |
Collapse
|
41
|
Massively parallel phenotyping of coding variants in cancer with Perturb-seq. Nat Biotechnol 2022; 40:896-905. [DOI: 10.1038/s41587-021-01160-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Accepted: 11/11/2021] [Indexed: 02/08/2023]
|
42
|
Høie MH, Cagiada M, Beck Frederiksen AH, Stein A, Lindorff-Larsen K. Predicting and interpreting large-scale mutagenesis data using analyses of protein stability and conservation. Cell Rep 2022; 38:110207. [PMID: 35021073 DOI: 10.1016/j.celrep.2021.110207] [Citation(s) in RCA: 64] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 10/01/2021] [Accepted: 12/13/2021] [Indexed: 01/23/2023] Open
Abstract
Understanding and predicting the functional consequences of single amino acid changes is central in many areas of protein science. Here, we collect and analyze experimental measurements of effects of >150,000 variants in 29 proteins. We use biophysical calculations to predict changes in stability for each variant and assess them in light of sequence conservation. We find that the sequence analyses give more accurate prediction of variant effects than predictions of stability and that about half of the variants that show loss of function do so due to stability effects. We construct a machine learning model to predict variant effects from protein structure and sequence alignments and show how the two sources of information support one another and enable mechanistic interpretations. Together, our results show how one can leverage large-scale experimental assessments of variant effects to gain deeper and general insights into the mechanisms that cause loss of function.
Collapse
Affiliation(s)
- Magnus Haraldson Høie
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| | - Matteo Cagiada
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| | - Anders Haagen Beck Frederiksen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| | - Amelie Stein
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200 Copenhagen N, Denmark.
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200 Copenhagen N, Denmark.
| |
Collapse
|
43
|
Paralog knockout profiling identifies DUSP4 and DUSP6 as a digenic dependence in MAPK pathway-driven cancers. Nat Genet 2021; 53:1664-1672. [PMID: 34857952 DOI: 10.1038/s41588-021-00967-z] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 10/14/2021] [Indexed: 12/26/2022]
Abstract
Although single-gene perturbation screens have revealed a number of new targets, vulnerabilities specific to frequently altered drivers have not been uncovered. An important question is whether the compensatory relationship between functionally redundant genes masks potential therapeutic targets in single-gene perturbation studies. To identify digenic dependencies, we developed a CRISPR paralog targeting library to investigate the viability effects of disrupting 3,284 genes, 5,065 paralog pairs and 815 paralog families. We identified that dual inactivation of DUSP4 and DUSP6 selectively impairs growth in NRAS and BRAF mutant cells through the hyperactivation of MAPK signaling. Furthermore, cells resistant to MAPK pathway therapeutics become cross-sensitized to DUSP4 and DUSP6 perturbations such that the mechanisms of resistance to the inhibitors reinforce this mechanism of vulnerability. Together, multigene perturbation technologies unveil previously unrecognized digenic vulnerabilities that may be leveraged as new therapeutic targets in cancer.
Collapse
|
44
|
Freedy AM, Liau BB. Discovering new biology with drug-resistance alleles. Nat Chem Biol 2021; 17:1219-1229. [PMID: 34799733 PMCID: PMC9530778 DOI: 10.1038/s41589-021-00865-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 07/21/2021] [Indexed: 02/08/2023]
Abstract
Small molecule drugs form the backbone of modern medicine's therapeutic arsenal. Often less appreciated is the role that small molecules have had in advancing basic biology. In this Review, we highlight how resistance mutations have unlocked the potential of small molecule chemical probes to discover new biology. We describe key instances in which resistance mutations and related genetic variants yielded foundational biological insight and categorize these examples on the basis of their role in the discovery of novel molecular mechanisms, protein allostery, physiology and cell signaling. Next, we suggest ways in which emerging technologies can be leveraged to systematically introduce and characterize resistance mutations to catalyze basic biology research and drug discovery. By recognizing how resistance mutations have propelled biological discovery, we can better harness new technologies and maximize the potential of small molecules to advance our understanding of biology and improve human health.
Collapse
Affiliation(s)
- Allyson M. Freedy
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA.,Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Brian B. Liau
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA.,Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Correspondence should be addressed to Brian B. Liau,
| |
Collapse
|
45
|
ERK: A Double-Edged Sword in Cancer. ERK-Dependent Apoptosis as a Potential Therapeutic Strategy for Cancer. Cells 2021; 10:cells10102509. [PMID: 34685488 PMCID: PMC8533760 DOI: 10.3390/cells10102509] [Citation(s) in RCA: 184] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Revised: 09/16/2021] [Accepted: 09/17/2021] [Indexed: 12/12/2022] Open
Abstract
The RAF/MEK/ERK signaling pathway regulates diverse cellular processes as exemplified by cell proliferation, differentiation, motility, and survival. Activation of ERK1/2 generally promotes cell proliferation, and its deregulated activity is a hallmark of many cancers. Therefore, components and regulators of the ERK pathway are considered potential therapeutic targets for cancer, and inhibitors of this pathway, including some MEK and BRAF inhibitors, are already being used in the clinic. Notably, ERK1/2 kinases also have pro-apoptotic functions under certain conditions and enhanced ERK1/2 signaling can cause tumor cell death. Although the repertoire of the compounds which mediate ERK activation and apoptosis is expanding, and various anti-cancer compounds induce ERK activation while exerting their anti-proliferative effects, the mechanisms underlying ERK1/2-mediated cell death are still vague. Recent studies highlight the importance of dual-specificity phosphatases (DUSPs) in determining the pro- versus anti-apoptotic function of ERK in cancer. In this review, we will summarize the recent major findings in understanding the role of ERK in apoptosis, focusing on the major compounds mediating ERK-dependent apoptosis. Studies that further define the molecular targets of these compounds relevant to cell death will be essential to harnessing these compounds for developing effective cancer treatments.
Collapse
|
46
|
Bepler T, Berger B. Learning the protein language: Evolution, structure, and function. Cell Syst 2021; 12:654-669.e3. [PMID: 34139171 PMCID: PMC8238390 DOI: 10.1016/j.cels.2021.05.017] [Citation(s) in RCA: 212] [Impact Index Per Article: 53.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 05/20/2021] [Accepted: 05/20/2021] [Indexed: 02/06/2023]
Abstract
Language models have recently emerged as a powerful machine-learning approach for distilling information from massive protein sequence databases. From readily available sequence data alone, these models discover evolutionary, structural, and functional organization across protein space. Using language models, we can encode amino-acid sequences into distributed vector representations that capture their structural and functional properties, as well as evaluate the evolutionary fitness of sequence variants. We discuss recent advances in protein language modeling and their applications to downstream protein property prediction problems. We then consider how these models can be enriched with prior biological knowledge and introduce an approach for encoding protein structural knowledge into the learned representations. The knowledge distilled by these models allows us to improve downstream function prediction through transfer learning. Deep protein language models are revolutionizing protein biology. They suggest new ways to approach protein and therapeutic design. However, further developments are needed to encode strong biological priors into protein language models and to increase their accessibility to the broader community.
Collapse
Affiliation(s)
- Tristan Bepler
- Simons Machine Learning Center, New York Structural Biology Center, New York, NY, USA; Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA; Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Bonnie Berger
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
47
|
Huang KL, Scott AD, Zhou DC, Wang LB, Weerasinghe A, Elmas A, Liu R, Wu Y, Wendl MC, Wyczalkowski MA, Baral J, Sengupta S, Lai CW, Ruggles K, Payne SH, Raphael B, Fenyö D, Chen K, Mills G, Ding L. Spatially interacting phosphorylation sites and mutations in cancer. Nat Commun 2021; 12:2313. [PMID: 33875650 PMCID: PMC8055881 DOI: 10.1038/s41467-021-22481-w] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2020] [Accepted: 02/17/2021] [Indexed: 11/18/2022] Open
Abstract
Advances in mass-spectrometry have generated increasingly large-scale proteomics datasets containing tens of thousands of phosphorylation sites (phosphosites) that require prioritization. We develop a bioinformatics tool called HotPho and systematically discover 3D co-clustering of phosphosites and cancer mutations on protein structures. HotPho identifies 474 such hybrid clusters containing 1255 co-clustering phosphosites, including RET p.S904/Y928, the conserved HRAS/KRAS p.Y96, and IDH1 p.Y139/IDH2 p.Y179 that are adjacent to recurrent mutations on protein structures not found by linear proximity approaches. Hybrid clusters, enriched in histone and kinase domains, frequently include expression-associated mutations experimentally shown as activating and conferring genetic dependency. Approximately 300 co-clustering phosphosites are verified in patient samples of 5 cancer types or previously implicated in cancer, including CTNNB1 p.S29/Y30, EGFR p.S720, MAPK1 p.S142, and PTPN12 p.S275. In summary, systematic 3D clustering analysis highlights nearly 3,000 likely functional mutations and over 1000 cancer phosphosites for downstream investigation and evaluation of potential clinical relevance.
Collapse
Affiliation(s)
- Kuan-Lin Huang
- Department of Genetics and Genomics, Tisch Cancer Institute, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| | - Adam D Scott
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Daniel Cui Zhou
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Liang-Bo Wang
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Amila Weerasinghe
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Abdulkadir Elmas
- Department of Genetics and Genomics, Tisch Cancer Institute, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ruiyang Liu
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Yige Wu
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Michael C Wendl
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Matthew A Wyczalkowski
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Jessika Baral
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Sohini Sengupta
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA
| | - Chin-Wen Lai
- Department of Pathology and Immunology, Washington University in St. Louis, St. Louis, MO, USA
| | - Kelly Ruggles
- Center for Health Informatics and Bioinformatics, New York University School of Medicine, New York, NY, USA
| | - Samuel H Payne
- Department of Biology, Brigham Young University, Provo, UT, USA
| | - Benjamin Raphael
- Lewis-Sigler Institute, Princeton University, Princeton, NJ, USA
| | - David Fenyö
- Center for Health Informatics and Bioinformatics, New York University School of Medicine, New York, NY, USA
| | - Ken Chen
- Departments of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Gordon Mills
- Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA
| | - Li Ding
- Department of Medicine, McDonnell Genome Institute, Department of Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, USA.
| |
Collapse
|
48
|
Sruthi CK, Prakash MK. Disentangling the Contribution of Each Descriptive Characteristic of Every Single Mutation to Its Functional Effects. J Chem Inf Model 2021; 61:2090-2098. [PMID: 33754712 DOI: 10.1021/acs.jcim.0c01223] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Mutational effects predictions continue to improve in accuracy as advanced artificial intelligence (AI) algorithms are trained on exhaustive experimental data. The next natural questions to ask are if it is possible to gain insights into which attribute of the mutation contributes how much to the mutational effects and if one can develop universal rules for mapping the descriptors to mutational effects. In this work, we mainly address the former aspect using a framework of interpretable AI. Relations between the physicochemical descriptors and their contributions to the mutational effects are extracted by analyzing the data on 29,832 variants from eight systematic deep mutational scan studies. An opposite trend in the dependence of fitness and solubility on the distance of the amino acid from the catalytic sites could be extracted and quantified. The dependence of the mutational effect contributions on the position-specific scoring matrix (PSSM) score for the amino acid after mutation or the BLOSUM score of the substitution showed universal trends. Our attempts in the present work to explain the quantitative differences in the dependence on conservation and SASA across proteins were not successful. The work nevertheless brings transparency into the predictions and development of rules, and will hopefully lead to empirically uncovering the universalities among these rules.
Collapse
Affiliation(s)
- C K Sruthi
- Theoretical Sciences Unit, Jawaharlal Nehru Centre for Advanced Scientific Research, Bangalore 560064, India
| | - Meher K Prakash
- Theoretical Sciences Unit, Jawaharlal Nehru Centre for Advanced Scientific Research, Bangalore 560064, India
| |
Collapse
|
49
|
Nedrud D, Coyote-Maestas W, Schmidt D. A large-scale survey of pairwise epistasis reveals a mechanism for evolutionary expansion and specialization of PDZ domains. Proteins 2021; 89:899-914. [PMID: 33620761 DOI: 10.1002/prot.26067] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 02/02/2021] [Accepted: 02/18/2021] [Indexed: 12/21/2022]
Abstract
Deep mutational scanning (DMS) facilitates data-driven models of protein structure and function. Here, we adapted Saturated Programmable Insertion Engineering (SPINE) as a programmable DMS technique. We validate SPINE with a reference single mutant dataset in the PSD95 PDZ3 domain and then characterize most pairwise double mutants to study epistasis. We observe wide-spread proximal negative epistasis, which we attribute to mutations affecting thermodynamic stability, and strong long-range positive epistasis, which is enriched in an evolutionarily conserved and function-defining network of "sector" and clade-specifying residues. Conditional neutrality of mutations in clade-specifying residues compensates for deleterious mutations in sector positions. This suggests that epistatic interactions between these position pairs facilitated the evolutionary expansion and specialization of PDZ domains. We propose that SPINE provides easy experimental access to reveal epistasis signatures in proteins that will improve our understanding of the structural basis for protein function and adaptation.
Collapse
Affiliation(s)
- David Nedrud
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Willow Coyote-Maestas
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Daniel Schmidt
- Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
50
|
Hao JF, Chen P, Li HY, Li YJ, Zhang YL. Effects of LncRNA HCP5/miR-214-3p/MAPK1 Molecular Network on Renal Cell Carcinoma Cells. Cancer Manag Res 2021; 12:13347-13356. [PMID: 33380840 PMCID: PMC7769072 DOI: 10.2147/cmar.s274426] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 12/04/2020] [Indexed: 12/21/2022] Open
Abstract
Background Recent researches have shown that long non-coding RNA (LncRNA) is often disordered and acts in many carcinomas. Clear cell renal cell carcinoma (ccRCC) is the main reason for carcinoma-related deaths, which are mainly caused by the metastasis. HCP5 is a newly discovered LcnRNA. Early studies have found that HCP5 acts in neoplasm metastasis, but the mechanism of HCP5 in ccRCC is still unclear. Methods The expression of HCP5 in human renal cell carcinoma (RCC) was detected by real-time quantitative PCR. The biological effect of LncRNAs in proliferation, migration, invasion and metastasis of RCC cells was explored by gain-of-function and loss-of-function tests. The molecular mechanism of LncRNAs was explored by RNA immunoprecipitation and Western blot. Results qRT-PCR revealed that HCP5 was enhanced in neoplasm tissues of ccRCC patients and correlated with the metastatic characteristics of RCC. Over-expression of HCP5 promoted the proliferation, migration and invasion of renal carcinoma cells. The deletion of HCP5 inhibited the proliferation, migration and invasion of RCC in vitro and the metastasis of RCC in vivo. Mechanically, HCP5 inhibited the growth and metastasis of ccRCC cells by regulating miR-214-3p/MAPK1 axis. Conclusion HCP5, as a key LncRNA, can promote ccRCC metastasis by regulating miR-214-3p/MAPK1 axis and may be a biomarker and be helpful for judging the prognosis of ccRCC.
Collapse
Affiliation(s)
- Jun-Feng Hao
- Department of Nephrology and Blood Purification Center, Jin Qiu Hospital of Liaoning Province (Geriatric Hospital of Liaoning Province), Shenyang City, Liaoning Province 110000, People's Republic of China
| | - Pei Chen
- Department of Basic Medical Sciences, Jiangsu College of Nursing, Huai'an, Jiangsu Province 223000, People's Republic of China
| | - He-Yi Li
- Department of Ophthalmology, Jin Qiu Hospital of Liaoning Province (Geriatric Hospital of Liaoning Province), Shenyang City, Liaoning Province 110000, People's Republic of China
| | - Ya-Jing Li
- Department of Nephrology and Blood Purification Center, Jin Qiu Hospital of Liaoning Province (Geriatric Hospital of Liaoning Province), Shenyang City, Liaoning Province 110000, People's Republic of China
| | - Yu-Ling Zhang
- Department of Basic Medical Sciences, Jiangsu College of Nursing, Huai'an, Jiangsu Province 223000, People's Republic of China
| |
Collapse
|