1
|
Poondi Krishnan V, Morone B, Toubiana S, Krzak M, Fioriniello S, Della Ragione F, Strazzullo M, Angelini C, Selig S, Matarazzo MR. The aberrant epigenome of DNMT3B-mutated ICF1 patient iPSCs is amenable to correction, with the exception of a subset of regions with H3K4me3- and/or CTCF-based epigenetic memory. Genome Res 2023; 33:169-183. [PMID: 36828588 PMCID: PMC10069469 DOI: 10.1101/gr.276986.122] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 01/12/2023] [Indexed: 02/26/2023]
Abstract
Bi-allelic hypomorphic mutations in DNMT3B disrupt DNA methyltransferase activity and lead to immunodeficiency, centromeric instability, facial anomalies syndrome, type 1 (ICF1). Although several ICF1 phenotypes have been linked to abnormally hypomethylated repetitive regions, the unique genomic regions responsible for the remaining disease phenotypes remain largely uncharacterized. Here we explored two ICF1 patient-derived induced pluripotent stem cells (iPSCs) and their CRISPR-Cas9-corrected clones to determine whether DNMT3B correction can globally overcome DNA methylation defects and related changes in the epigenome. Hypomethylated regions throughout the genome are highly comparable between ICF1 iPSCs carrying different DNMT3B variants, and significantly overlap with those in ICF1 patient peripheral blood and lymphoblastoid cell lines. These regions include large CpG island domains, as well as promoters and enhancers of several lineage-specific genes, in particular immune-related, suggesting that they are premarked during early development. CRISPR-corrected ICF1 iPSCs reveal that the majority of phenotype-related hypomethylated regions reacquire normal DNA methylation levels following editing. However, at the most severely hypomethylated regions in ICF1 iPSCs, which also display the highest increases in H3K4me3 levels and/or abnormal CTCF binding, the epigenetic memory persists, and hypomethylation remains uncorrected. Overall, we demonstrate that restoring the catalytic activity of DNMT3B can reverse the majority of the aberrant ICF1 epigenome. However, a small fraction of the genome is resilient to this rescue, highlighting the challenge of reverting disease states that are due to genome-wide epigenetic perturbations. Uncovering the basis for the persistent epigenetic memory will promote the development of strategies to overcome this obstacle.
Collapse
Affiliation(s)
- Varsha Poondi Krishnan
- Institute of Genetics and Biophysics Adriano Buzzati Traverso, (IGB-ABT) CNR, Naples 80131, Italy
| | - Barbara Morone
- Institute of Genetics and Biophysics Adriano Buzzati Traverso, (IGB-ABT) CNR, Naples 80131, Italy
| | - Shir Toubiana
- Department of Genetics and Developmental Biology, Rappaport Faculty of Medicine and Research Institute, Technion, Haifa 31096, Israel
| | - Monika Krzak
- Institute for Applied Computing (IAC) "Mauro Picone", CNR, Naples 80131 Italy
| | - Salvatore Fioriniello
- Institute of Genetics and Biophysics Adriano Buzzati Traverso, (IGB-ABT) CNR, Naples 80131, Italy
| | - Floriana Della Ragione
- Institute of Genetics and Biophysics Adriano Buzzati Traverso, (IGB-ABT) CNR, Naples 80131, Italy.,IRCCS Istituto Neurologico Mediterraneo Neuromed, Pozzilli, Isernia 86077, Italy
| | - Maria Strazzullo
- Institute of Genetics and Biophysics Adriano Buzzati Traverso, (IGB-ABT) CNR, Naples 80131, Italy;
| | - Claudia Angelini
- Institute for Applied Computing (IAC) "Mauro Picone", CNR, Naples 80131 Italy;
| | - Sara Selig
- Department of Genetics and Developmental Biology, Rappaport Faculty of Medicine and Research Institute, Technion, Haifa 31096, Israel; .,Laboratory of Molecular Medicine, Rambam Health Care Campus, Haifa 31096, Israel
| | - Maria R Matarazzo
- Institute of Genetics and Biophysics Adriano Buzzati Traverso, (IGB-ABT) CNR, Naples 80131, Italy
| |
Collapse
|
2
|
Thind AS, Monga I, Thakur PK, Kumari P, Dindhoria K, Krzak M, Ranson M, Ashford B. Demystifying emerging bulk RNA-Seq applications: the application and utility of bioinformatic methodology. Brief Bioinform 2021; 22:6330938. [PMID: 34329375 DOI: 10.1093/bib/bbab259] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Revised: 06/14/2021] [Accepted: 06/18/2021] [Indexed: 12/13/2022] Open
Abstract
Significant innovations in next-generation sequencing techniques and bioinformatics tools have impacted our appreciation and understanding of RNA. Practical RNA sequencing (RNA-Seq) applications have evolved in conjunction with sequence technology and bioinformatic tools advances. In most projects, bulk RNA-Seq data is used to measure gene expression patterns, isoform expression, alternative splicing and single-nucleotide polymorphisms. However, RNA-Seq holds far more hidden biological information including details of copy number alteration, microbial contamination, transposable elements, cell type (deconvolution) and the presence of neoantigens. Recent novel and advanced bioinformatic algorithms developed the capacity to retrieve this information from bulk RNA-Seq data, thus broadening its scope. The focus of this review is to comprehend the emerging bulk RNA-Seq-based analyses, emphasizing less familiar and underused applications. In doing so, we highlight the power of bulk RNA-Seq in providing biological insights.
Collapse
Affiliation(s)
- Amarinder Singh Thind
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| | - Isha Monga
- Columbia University, New York City, NY, USA
| | | | - Pallawi Kumari
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Kiran Dindhoria
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | | | - Marie Ranson
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| | - Bruce Ashford
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| |
Collapse
|
3
|
Krzak M, Raykov Y, Boukouvalas A, Cutillo L, Angelini C. Benchmark and Parameter Sensitivity Analysis of Single-Cell RNA Sequencing Clustering Methods. Front Genet 2019; 10:1253. [PMID: 31921297 PMCID: PMC6918801 DOI: 10.3389/fgene.2019.01253] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 11/13/2019] [Indexed: 01/04/2023] Open
Abstract
Single-cell RNA-seq (scRNAseq) is a powerful tool to study heterogeneity of cells. Recently, several clustering based methods have been proposed to identify distinct cell populations. These methods are based on different statistical models and usually require to perform several additional steps, such as preprocessing or dimension reduction, before applying the clustering algorithm. Individual steps are often controlled by method-specific parameters, permitting the method to be used in different modes on the same datasets, depending on the user choices. The large number of possibilities that these methods provide can intimidate non-expert users, since the available choices are not always clearly documented. In addition, to date, no large studies have invistigated the role and the impact that these choices can have in different experimental contexts. This work aims to provide new insights into the advantages and drawbacks of scRNAseq clustering methods and describe the ranges of possibilities that are offered to users. In particular, we provide an extensive evaluation of several methods with respect to different modes of usage and parameter settings by applying them to real and simulated datasets that vary in terms of dimensionality, number of cell populations or levels of noise. Remarkably, the results presented here show that great variability in the performance of the models is strongly attributed to the choice of the user-specific parameter settings. We describe several tendencies in the performance attributed to their modes of usage and different types of datasets, and identify which methods are strongly affected by data dimensionality in terms of computational time. Finally, we highlight some open challenges in scRNAseq data clustering, such as those related to the identification of the number of clusters.
Collapse
Affiliation(s)
- Monika Krzak
- Institute for Applied Mathematics "Mauro Picone", Naples, Italy
| | - Yordan Raykov
- Department of Mathematics, Aston University, Birmingham, United Kingdom
| | | | - Luisa Cutillo
- School of Mathematics, University of Leeds, Leeds, United Kingdom
| | | |
Collapse
|