1
|
Obolenski S, Olvera-León R, Sun D, Adams DJ, Waters AJ. Protocol for the functional evaluation of genetic variants using saturation genome editing. STAR Protoc 2025; 6:103710. [PMID: 40198219 PMCID: PMC12008576 DOI: 10.1016/j.xpro.2025.103710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2024] [Revised: 02/19/2025] [Accepted: 02/28/2025] [Indexed: 04/10/2025] Open
Abstract
Saturation genome editing (SGE) employs CRISPR-Cas9 and homology-directed repair (HDR) to introduce exhaustive nucleotide modifications at specific genomic sites in multiplex, enabling the functional analysis of genetic variants while preserving their native genomic context. Here, we present a protocol for SGE-based variant evaluation in HAP1-A5 cells. We describe the steps for designing variant libraries, single-guide RNAs (sgRNAs), and oligonucleotide primers for PCR. We also detail the sample preparation before the SGE screen, the cellular screening process, and subsequent next-generation sequencing (NGS) library preparation. For complete details on the use and execution of this protocol, please refer to Radford et al.,1 Waters et al.,2 and Olvera-León et al.3.
Collapse
Affiliation(s)
- Sofia Obolenski
- Experimental Cancer Genetics, Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Cambridge, UK; Department of Dermatology, Leiden University Medical Centre, 2333 ZA Leiden, the Netherlands
| | - Rebeca Olvera-León
- Experimental Cancer Genetics, Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Cambridge, UK
| | - Dijue Sun
- Experimental Cancer Genetics, Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Cambridge, UK
| | - David J Adams
- Experimental Cancer Genetics, Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Cambridge, UK.
| | - Andrew J Waters
- Experimental Cancer Genetics, Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Cambridge, UK.
| |
Collapse
|
2
|
Olvera-León R, Zhang F, Offord V, Zhao Y, Tan HK, Gupta P, Pal T, Robles-Espinoza CD, Arriaga-González FG, Matsuyama LSAS, Delage E, Dicks E, Ezquina S, Rowlands CF, Turnbull C, Pharoah P, Perry JRB, Jasin M, Waters AJ, Adams DJ. High-resolution functional mapping of RAD51C by saturation genome editing. Cell 2024; 187:5719-5734.e19. [PMID: 39299233 DOI: 10.1016/j.cell.2024.08.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 02/29/2024] [Accepted: 08/20/2024] [Indexed: 09/22/2024]
Abstract
Pathogenic variants in RAD51C confer an elevated risk of breast and ovarian cancer, while individuals homozygous for specific RAD51C alleles may develop Fanconi anemia. Using saturation genome editing (SGE), we functionally assess 9,188 unique variants, including >99.5% of all possible coding sequence single-nucleotide alterations. By computing changes in variant abundance and Gaussian mixture modeling (GMM), we functionally classify 3,094 variants to be disruptive and use clinical truth sets to reveal an accuracy/concordance of variant classification >99.9%. Cell fitness was the primary assay readout allowing us to observe a phenomenon where specific missense variants exhibit distinct depletion kinetics potentially suggesting that they represent hypomorphic alleles. We further explored our exhaustive functional map, revealing critical residues on the RAD51C structure and resolving variants found in cancer-segregating kindred. Furthermore, through interrogation of UK Biobank and a large multi-center ovarian cancer cohort, we find significant associations between SGE-depleted variants and cancer diagnoses.
Collapse
Affiliation(s)
- Rebeca Olvera-León
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK; Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Querétaro, Querétaro, Mexico
| | - Fang Zhang
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA; Developmental Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Victoria Offord
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Yajie Zhao
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
| | - Hong Kee Tan
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Prashant Gupta
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Tuya Pal
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University Medical Center (VUMC)/Vanderbilt-Ingram Cancer Center (VICC), Nashville, TN, USA
| | - Carla Daniela Robles-Espinoza
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK; Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Querétaro, Querétaro, Mexico
| | - Fernanda G Arriaga-González
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK; Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Querétaro, Querétaro, Mexico
| | | | - Erwan Delage
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Ed Dicks
- Department of Public Health and Primary Care, University of Cambridge, Robinson Way, Cambridge, UK
| | - Suzana Ezquina
- Department of Public Health and Primary Care, University of Cambridge, Robinson Way, Cambridge, UK
| | - Charlie F Rowlands
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
| | - Clare Turnbull
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK; National Cancer Registration and Analysis Service, National Health Service (NHS) England, London, UK; Cancer Genetics Unit, The Royal Marsden NHS Foundation Trust, London, UK
| | - Paul Pharoah
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - John R B Perry
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
| | - Maria Jasin
- Developmental Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Andrew J Waters
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
| | - David J Adams
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
| |
Collapse
|
3
|
Lin YJ, Menon AS, Hu Z, Brenner SE. Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors. Hum Genomics 2024; 18:90. [PMID: 39198917 PMCID: PMC11360829 DOI: 10.1186/s40246-024-00663-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Accepted: 08/19/2024] [Indexed: 09/01/2024] Open
Abstract
BACKGROUND Variant interpretation is essential for identifying patients' disease-causing genetic variants amongst the millions detected in their genomes. Hundreds of Variant Impact Predictors (VIPs), also known as Variant Effect Predictors (VEPs), have been developed for this purpose, with a variety of methodologies and goals. To facilitate the exploration of available VIP options, we have created the Variant Impact Predictor database (VIPdb). RESULTS The Variant Impact Predictor database (VIPdb) version 2 presents a collection of VIPs developed over the past three decades, summarizing their characteristics, ClinGen calibrated scores, CAGI assessment results, publication details, access information, and citation patterns. We previously summarized 217 VIPs and their features in VIPdb in 2019. Building upon this foundation, we identified and categorized an additional 190 VIPs, resulting in a total of 407 VIPs in VIPdb version 2. The majority of the VIPs have the capacity to predict the impacts of single nucleotide variants and nonsynonymous variants. More VIPs tailored to predict the impacts of insertions and deletions have been developed since the 2010s. In contrast, relatively few VIPs are dedicated to the prediction of splicing, structural, synonymous, and regulatory variants. The increasing rate of citations to VIPs reflects the ongoing growth in their use, and the evolving trends in citations reveal development in the field and individual methods. CONCLUSIONS VIPdb version 2 summarizes 407 VIPs and their features, potentially facilitating VIP exploration for various variant interpretation applications. VIPdb is available at https://genomeinterpretation.org/vipdb.
Collapse
Affiliation(s)
- Yu-Jen Lin
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA
- Center for Computational Biology, University of California, Berkeley, CA, 94720, USA
| | - Arul S Menon
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA
- College of Computing, Data Science, and Society, University of California, Berkeley, CA, 94720, USA
| | - Zhiqiang Hu
- Department of Plant and Microbial Biology, University of California, 111 Koshland Hall #3102, Berkeley, CA, 94720-3102, USA
- Illumina, Foster City, CA, 94404, USA
| | - Steven E Brenner
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.
- Center for Computational Biology, University of California, Berkeley, CA, 94720, USA.
- College of Computing, Data Science, and Society, University of California, Berkeley, CA, 94720, USA.
- Department of Plant and Microbial Biology, University of California, 111 Koshland Hall #3102, Berkeley, CA, 94720-3102, USA.
| |
Collapse
|
4
|
Waters AJ, Brendler-Spaeth T, Smith D, Offord V, Tan HK, Zhao Y, Obolenski S, Nielsen M, van Doorn R, Murphy JE, Gupta P, Rowlands CF, Hanson H, Delage E, Thomas M, Radford EJ, Gerety SS, Turnbull C, Perry JRB, Hurles ME, Adams DJ. Saturation genome editing of BAP1 functionally classifies somatic and germline variants. Nat Genet 2024; 56:1434-1445. [PMID: 38969833 PMCID: PMC11250367 DOI: 10.1038/s41588-024-01799-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 05/14/2024] [Indexed: 07/07/2024]
Abstract
Many variants that we inherit from our parents or acquire de novo or somatically are rare, limiting the precision with which we can associate them with disease. We performed exhaustive saturation genome editing (SGE) of BAP1, the disruption of which is linked to tumorigenesis and altered neurodevelopment. We experimentally characterized 18,108 unique variants, of which 6,196 were found to have abnormal functions, and then used these data to evaluate phenotypic associations in the UK Biobank. We also characterized variants in a large population-ascertained tumor collection, in cancer pedigrees and ClinVar, and explored the behavior of cancer-associated variants compared to that of variants linked to neurodevelopmental phenotypes. Our analyses demonstrated that disruptive germline BAP1 variants were significantly associated with higher circulating levels of the mitogen IGF-1, suggesting a possible pathological mechanism and therapeutic target. Furthermore, we built a variant classifier with >98% sensitivity and specificity and quantify evidence strengths to aid precision variant interpretation.
Collapse
Affiliation(s)
| | | | | | | | | | - Yajie Zhao
- Metabolic Research Laboratory, Wellcome-MRC Institute of Metabolic Science, University of Cambridge School of Clinical Medicine, Cambridge, UK
| | | | - Maartje Nielsen
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Remco van Doorn
- Department of Dermatology, Leiden University Medical Center, Leiden, the Netherlands
| | | | | | - Charlie F Rowlands
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
| | - Helen Hanson
- Department of Clinical Genetics, Royal Devon University Healthcare NHS Foundation Trust, Exeter, UK
- Faculty of Health and Life Sciences, University of Exeter, Exeter, UK
| | | | | | - Elizabeth J Radford
- Wellcome Sanger Institute, Hinxton, UK
- Department of Paediatrics, University of Cambridge, Cambridge, UK
| | | | - Clare Turnbull
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
- National Cancer Registration and Analysis Service, NHS England, London, UK
- Cancer Genetics Unit, The Royal Marsden NHS Foundation Trust, London, UK
| | - John R B Perry
- Metabolic Research Laboratory, Wellcome-MRC Institute of Metabolic Science, University of Cambridge School of Clinical Medicine, Cambridge, UK
| | | | | |
Collapse
|
5
|
Lin YJ, Menon AS, Hu Z, Brenner SE. Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.25.600283. [PMID: 38979289 PMCID: PMC11230257 DOI: 10.1101/2024.06.25.600283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Background Variant interpretation is essential for identifying patients' disease-causing genetic variants amongst the millions detected in their genomes. Hundreds of Variant Impact Predictors (VIPs), also known as Variant Effect Predictors (VEPs), have been developed for this purpose, with a variety of methodologies and goals. To facilitate the exploration of available VIP options, we have created the Variant Impact Predictor database (VIPdb). Results The Variant Impact Predictor database (VIPdb) version 2 presents a collection of VIPs developed over the past 25 years, summarizing their characteristics, ClinGen calibrated scores, CAGI assessment results, publication details, access information, and citation patterns. We previously summarized 217 VIPs and their features in VIPdb in 2019. Building upon this foundation, we identified and categorized an additional 186 VIPs, resulting in a total of 403 VIPs in VIPdb version 2. The majority of the VIPs have the capacity to predict the impacts of single nucleotide variants and nonsynonymous variants. More VIPs tailored to predict the impacts of insertions and deletions have been developed since the 2010s. In contrast, relatively few VIPs are dedicated to the prediction of splicing, structural, synonymous, and regulatory variants. The increasing rate of citations to VIPs reflects the ongoing growth in their use, and the evolving trends in citations reveal development in the field and individual methods. Conclusions VIPdb version 2 summarizes 403 VIPs and their features, potentially facilitating VIP exploration for various variant interpretation applications. Availability VIPdb version 2 is available at https://genomeinterpretation.org/vipdb.
Collapse
Affiliation(s)
- Yu-Jen Lin
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- Center for Computational Biology, University of California, Berkeley, California 94720, USA
| | - Arul S. Menon
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- College of Computing, Data Science, and Society, University of California, Berkeley, California 94720, USA
| | - Zhiqiang Hu
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
- Currently at: Illumina, Foster City, California 94404, USA
| | - Steven E. Brenner
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- Center for Computational Biology, University of California, Berkeley, California 94720, USA
- College of Computing, Data Science, and Society, University of California, Berkeley, California 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
| |
Collapse
|
6
|
Cooper S, Obolenski S, Waters AJ, Bassett AR, Coelho MA. Analyzing the functional effects of DNA variants with gene editing. CELL REPORTS METHODS 2024; 4:100776. [PMID: 38744287 PMCID: PMC11133854 DOI: 10.1016/j.crmeth.2024.100776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/01/2024] [Accepted: 04/22/2024] [Indexed: 05/16/2024]
Abstract
Continual advancements in genomics have led to an ever-widening disparity between the rate of discovery of genetic variants and our current understanding of their functions and potential roles in disease. Systematic methods for phenotyping DNA variants are required to effectively translate genomics data into improved outcomes for patients with genetic diseases. To make the biggest impact, these approaches must be scalable and accurate, faithfully reflect disease biology, and define complex disease mechanisms. We compare current methods to analyze the function of variants in their endogenous DNA context using genome editing strategies, such as saturation genome editing, base editing and prime editing. We discuss how these technologies can be linked to high-content readouts to gain deep mechanistic insights into variant effects. Finally, we highlight key challenges that need to be addressed to bridge the genotype to phenotype gap, and ultimately improve the diagnosis and treatment of genetic diseases.
Collapse
Affiliation(s)
- Sarah Cooper
- Cellular and Gene Editing Research, Wellcome Sanger Institute, Hinxton, UK
| | - Sofia Obolenski
- Experimental Cancer Genetics, Wellcome Sanger Institute, Hinxton, UK; Department of Dermatology, Leiden University Medical Center, Leiden, the Netherlands
| | - Andrew J Waters
- Experimental Cancer Genetics, Wellcome Sanger Institute, Hinxton, UK
| | - Andrew R Bassett
- Cellular and Gene Editing Research, Wellcome Sanger Institute, Hinxton, UK.
| | | |
Collapse
|
7
|
Radford EJ, Tan HK, Andersson MHL, Stephenson JD, Gardner EJ, Ironfield H, Waters AJ, Gitterman D, Lindsay S, Abascal F, Martincorena I, Kolesnik-Taylor A, Ng-Cordell E, Firth HV, Baker K, Perry JRB, Adams DJ, Gerety SS, Hurles ME. Saturation genome editing of DDX3X clarifies pathogenicity of germline and somatic variation. Nat Commun 2023; 14:7702. [PMID: 38057330 PMCID: PMC10700591 DOI: 10.1038/s41467-023-43041-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 10/30/2023] [Indexed: 12/08/2023] Open
Abstract
Loss-of-function of DDX3X is a leading cause of neurodevelopmental disorders (NDD) in females. DDX3X is also a somatically mutated cancer driver gene proposed to have tumour promoting and suppressing effects. We perform saturation genome editing of DDX3X, testing in vitro the functional impact of 12,776 nucleotide variants. We identify 3432 functionally abnormal variants, in three distinct classes. We train a machine learning classifier to identify functionally abnormal variants of NDD-relevance. This classifier has at least 97% sensitivity and 99% specificity to detect variants pathogenic for NDD, substantially out-performing in silico predictors, and resolving up to 93% of variants of uncertain significance. Moreover, functionally-abnormal variants can account for almost all of the excess nonsynonymous DDX3X somatic mutations seen in DDX3X-driven cancers. Systematic maps of variant effects generated in experimentally tractable cell types have the potential to transform clinical interpretation of both germline and somatic disease-associated variation.
Collapse
Affiliation(s)
- Elizabeth J Radford
- Wellcome Sanger Institute, Hinxton, CB10 1SA, UK
- Department of Paediatrics, University of Cambridge, Level 8, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK
| | - Hong-Kee Tan
- Wellcome Sanger Institute, Hinxton, CB10 1SA, UK
| | | | | | - Eugene J Gardner
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK
| | | | | | | | | | | | | | | | - Elise Ng-Cordell
- MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK
- Department of Psychology, University of British Columbia, Vancouver, Canada
| | - Helen V Firth
- Wellcome Sanger Institute, Hinxton, CB10 1SA, UK
- Department of Medical Genetics, University of Cambridge, Cambridge, UK
| | - Kate Baker
- MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK
- Department of Medical Genetics, University of Cambridge, Cambridge, UK
| | - John R B Perry
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK
| | | | | | | |
Collapse
|
8
|
Jeong BR, Jang J, Jin E. Genome engineering via gene editing technologies in microalgae. BIORESOURCE TECHNOLOGY 2023; 373:128701. [PMID: 36746216 DOI: 10.1016/j.biortech.2023.128701] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Revised: 01/27/2023] [Accepted: 02/01/2023] [Indexed: 06/18/2023]
Abstract
CRISPR-Cas has revolutionized genetic modification with its comparative simplicity and accuracy, and it can be used even at the genomic level. Microalgae are excellent feedstocks for biofuels and nutraceuticals because they contain high levels of fatty acids, carotenoids, and other metabolites; however, genome engineering for microalgae is not yet as developed as for other model organisms. Microalgal engineering at the genetic and metabolic levels is relatively well established, and a few genomic resources are available. Their genomic information was used for a "safe harbor" site for stable transgene expression in microalgae. This review proposes further genome engineering schemes including the construction of sgRNA libraries, pan-genomic and epigenomic resources, and mini-genomes, which can together be developed into synthetic biology for carbon-based engineering in microalgae. Acetyl-CoA is at the center of carbon metabolic pathways and is further reviewed for the production of molecules including terpenoids in microalgae.
Collapse
Affiliation(s)
- Byeong-Ryool Jeong
- Department of Life Science, Research Institute for Natural Sciences, Hanyang University, Seoul 04763, Korea
| | - Junhwan Jang
- Department of Life Science, Research Institute for Natural Sciences, Hanyang University, Seoul 04763, Korea
| | - EonSeon Jin
- Department of Life Science, Research Institute for Natural Sciences, Hanyang University, Seoul 04763, Korea; Hanyang Institute of Bioscience and Biotechnology, Hanyang University, Seoul 04763, Korea.
| |
Collapse
|