1
|
Dragland JS, Liu G, Nilsen HL, Böttcher Y, Wang J. EpiMapper: A new tool for analyzing high-throughput sequencing from CUT&Tag. Comput Biol Med 2025; 186:109692. [PMID: 39832438 DOI: 10.1016/j.compbiomed.2025.109692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Revised: 01/09/2025] [Accepted: 01/13/2025] [Indexed: 01/22/2025]
Abstract
Since the invention of next-generation sequencing, new methods have been developed to understand the regulation of gene expression through epigenetic markers. Among these, CUT&Tag (Cleavage Under Targets and Tagmentation) analysis has emerged as an efficient epigenomic profiling technique with low input requirements, high sensitivity, and low background signals. Although wet-lab techniques are available, data analysis remains challenging for scientists without expert-level computational skills. Therefore, we developed EpiMapper, a new Python package that simplifies the data analysis of CUT&Tag sequencing and similar techniques, such as ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) or ChIP-seq (chromatin immunoprecipitation [ChIP] assays with sequencing), and allows biomedical scientists to easily interpret the results. This new package includes every necessary step, from quality control to annotation and differential peak analysis. In particular, EpiMapper has improved functionality (e.g., reproducibility assessment) compared to previous analysis protocols and visualization plots and provides new features, such as genome annotation and differential peak analysis. Using three case studies, two on CUT&Tag and one on ATAC-seq data, the EpiMapper Python package successfully reproduced previous results.
Collapse
Affiliation(s)
- Jenny Sofie Dragland
- Department of Pathology, Oslo University Hospital - Norwegian Radium Hospital, Oslo, Norway
| | - Gege Liu
- Department of Pathology, Oslo University Hospital - Norwegian Radium Hospital, Oslo, Norway
| | - Hilde Loge Nilsen
- Department of Microbiology, Oslo University Hospital, Oslo, Norway; Centre for Embryology and Healthy Development, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Yvonne Böttcher
- Department of Clinical Molecular Biology, Institute of Clinical Medicine, University of Oslo, Lørenskog, Norway; Medical Division (EpiGen), Akershus University Hospital, Lørenskog, Norway
| | - Junbai Wang
- Department of Clinical Molecular Biology, Institute of Clinical Medicine, University of Oslo, Lørenskog, Norway; Medical Division (EpiGen), Akershus University Hospital, Lørenskog, Norway.
| |
Collapse
|
2
|
Wang J, Yang M, Ali O, Dragland JS, Bjørås M, Farkas L. Predicting regulatory mutations and their target genes by new computational integrative analysis: A study of follicular lymphoma. Comput Biol Med 2024; 178:108787. [PMID: 38901187 DOI: 10.1016/j.compbiomed.2024.108787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 06/12/2024] [Accepted: 06/16/2024] [Indexed: 06/22/2024]
Abstract
Mutations in DNA regulatory regions are increasingly being recognized as important drivers of cancer and other complex diseases. These mutations can regulate gene expression by affecting DNA-protein binding and epigenetic profiles, such as DNA methylation in genome regulatory elements. However, identifying mutation hotspots associated with expression regulation and disease progression in non-coding DNA remains a challenge. Unlike most existing approaches that assign a mutation score to individual single nucleotide polymorphisms (SNP), a mutation block (MB)-based approach was introduced in this study to assess the collective impact of a cluster of SNPs on transcription factor-DNA binding affinity, differential gene expression (DEG), and nearby DNA methylation. Moreover, the long-distance target genes of functional MBs were identified using a new permutation-based algorithm that assessed the significance of correlations between DNA methylation at regulatory regions and target gene expression. Two new Python packages were developed. The Differential Methylation Region (DMR-analysis) analysis tool was used to detect DMR and map them to regulatory elements. The second tool, an integrated DMR, DEG, and SNP analysis tool (DDS-analysis), was used to combine the omics data to identify functional MBs and long-distance target genes. Both tools were validated in follicular lymphoma (FL) cohorts, where not only known functional MBs and their target genes (BCL2 and BCL6) were recovered, but also novel genes were found, including CDCA4 and JAG2, which may be associated with FL development. These genes are linked to target gene expression and are significantly correlated with the methylation of nearby DNA sequences in FL. The proposed computational integrative analysis of multiomics data holds promise for identifying regulatory mutations in cancer and other complex diseases.
Collapse
Affiliation(s)
- Junbai Wang
- Department of Clinical Molecular Biology (EpiGen), Akershus University Hospital and University of Oslo, Lørenskog, Norway; Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Campus AHUS/Oslo, Norway.
| | - Mingyi Yang
- Department of Microbiology, Oslo University Hospital, Oslo, Norway; Department of Medical Biochemistry, Oslo University Hospital, Oslo, Norway; Centre for Embryology and Healthy Development (CRESCO), University of Oslo, Oslo, 0373, Norway
| | - Omer Ali
- Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Campus AHUS/Oslo, Norway; Department of Pathology, Oslo University Hospital - Norwegian Radium Hospital, Oslo, Norway
| | - Jenny Sofie Dragland
- Department of Pathology, Oslo University Hospital - Norwegian Radium Hospital, Oslo, Norway
| | - Magnar Bjørås
- Department of Microbiology, Oslo University Hospital, Oslo, Norway; Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway; Centre for Embryology and Healthy Development (CRESCO), University of Oslo, Oslo, 0373, Norway
| | - Lorant Farkas
- Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Campus AHUS/Oslo, Norway; Department of Pathology, Oslo University Hospital - Norwegian Radium Hospital, Oslo, Norway
| |
Collapse
|
3
|
Yang M, Kaarbø M, Myhre V, Reims HM, Karlsen TH, Wang J, Rognes T, Halvorsen B, Fevang B, Lundin KEA, Aukrust P, Bjørås M, Jørgensen SF. Altered Genome-Wide DNA Methylation in the Duodenum of Common Variable Immunodeficiency Patients. J Clin Immunol 2024; 44:133. [PMID: 38780872 PMCID: PMC11116262 DOI: 10.1007/s10875-024-01726-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 04/29/2024] [Indexed: 05/25/2024]
Abstract
PURPOSE A large proportion of Common variable immunodeficiency (CVID) patients has duodenal inflammation with increased intraepithelial lymphocytes (IEL) of unknown aetiology. The histologic similarities to celiac disease, lead to confusion regarding treatment (gluten-free diet) of these patients. We aimed to elucidate the role of epigenetic DNA methylation in the aetiology of duodenal inflammation in CVID and differentiate it from true celiac disease. METHODS DNA was isolated from snap-frozen pieces of duodenal biopsies and analysed for differences in genome-wide epigenetic DNA methylation between CVID patients with increased IEL (CVID_IEL; n = 5) without IEL (CVID_N; n = 3), celiac disease (n = 3) and healthy controls (n = 3). RESULTS The DNA methylation data of 5-methylcytosine in CpG sites separated CVID and celiac diseases from healthy controls. Differential methylation in promoters of genes were identified as potential novel mediators in CVID and celiac disease. There was limited overlap of methylation associated genes between CVID_IEL and Celiac disease. High frequency of differentially methylated CpG sites was detected in over 100 genes nearby transcription start site (TSS) in both CVID_IEL and celiac disease, compared to healthy controls. Differential methylation of genes involved in regulation of TNF/cytokine production were enriched in CVID_IEL, compared to healthy controls. CONCLUSION This is the first study to reveal a role of epigenetic DNA methylation in the etiology of duodenal inflammation of CVID patients, distinguishing CVID_IEL from celiac disease. We identified potential biomarkers and therapeutic targets within gene promotors and in high-frequency differentially methylated CpG regions proximal to TSS in both CVID_IEL and celiac disease.
Collapse
Affiliation(s)
- Mingyi Yang
- Department of Microbiology, Oslo University Hospital and University of Oslo, Oslo, Norway
- Department of Medical Biochemistry, Oslo University Hospital and University of Oslo, Oslo, Norway
| | - Mari Kaarbø
- Department of Microbiology, Oslo University Hospital and University of Oslo, Oslo, Norway
| | - Vegard Myhre
- Research Institute of Internal Medicine, Division of Surgery, Inflammatory Diseases and Transplantation, Oslo University Hospital, Oslo, Norway
- Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Henrik M Reims
- Department of Pathology, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Tom H Karlsen
- Research Institute of Internal Medicine, Division of Surgery, Inflammatory Diseases and Transplantation, Oslo University Hospital, Oslo, Norway
- Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- Norwegian PSC Research Center, Department of Transplantation Medicine, Oslo University Hospital, Oslo, Norway
- Section of Gastroenterology, Department of Transplantation Medicine, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Junbai Wang
- Department of Clinical Molecular Biology (EpiGen), Akershus University Hospital and University of Oslo, Lørenskog, Norway
| | - Torbjørn Rognes
- Department of Microbiology, Oslo University Hospital and University of Oslo, Oslo, Norway
- Centre for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway
| | - Bente Halvorsen
- Research Institute of Internal Medicine, Division of Surgery, Inflammatory Diseases and Transplantation, Oslo University Hospital, Oslo, Norway
- Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Børre Fevang
- Section of Clinical Immunology and Infectious Diseases, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Knut E A Lundin
- Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- Section of Gastroenterology, Department of Transplantation Medicine, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Pål Aukrust
- Research Institute of Internal Medicine, Division of Surgery, Inflammatory Diseases and Transplantation, Oslo University Hospital, Oslo, Norway
- Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- Section of Clinical Immunology and Infectious Diseases, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Magnar Bjørås
- Department of Microbiology, Oslo University Hospital and University of Oslo, Oslo, Norway
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, NTNU, Trondheim, Norway
- The Proteomics and Modomics Experimental Core Facility (PROMEC) at Norwegian University of Science and Technology, Trondheim, Norway
| | - Silje F Jørgensen
- Research Institute of Internal Medicine, Division of Surgery, Inflammatory Diseases and Transplantation, Oslo University Hospital, Oslo, Norway.
- Section of Clinical Immunology and Infectious Diseases, Oslo University Hospital, Rikshospitalet, Oslo, Norway.
| |
Collapse
|
4
|
Yang M, Ali O, Bjørås M, Wang J. Identifying functional regulatory mutation blocks by integrating genome sequencing and transcriptome data. iScience 2023; 26:107266. [PMID: 37520692 PMCID: PMC10371843 DOI: 10.1016/j.isci.2023.107266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 04/05/2023] [Accepted: 06/28/2023] [Indexed: 08/01/2023] Open
Abstract
Millions of single nucleotide variants (SNVs) exist in the human genome; however, it remains challenging to identify functional SNVs associated with diseases. We propose a non-encoding SNVs analysis tool bpb3, BayesPI-BAR version 3, aiming to identify the functional mutation blocks (FMBs) by integrating genome sequencing and transcriptome data. The identified FMBs display high frequency SNVs, significant changes in transcription factors (TFs) binding affinity and are nearby the regulatory regions of differentially expressed genes. A two-level Bayesian approach with a biophysical model for protein-DNA interactions is implemented, to compute TF-DNA binding affinity changes based on clustered position weight matrices (PWMs) from over 1700 TF-motifs. The epigenetic data, such as the DNA methylome can also be integrated to scan FMBs. By testing the datasets from follicular lymphoma and melanoma, bpb3 automatically and robustly identifies FMBs, demonstrating that bpb3 can provide insight into patho-mechanisms, and therapeutic targets from transcriptomic and genomic data.
Collapse
Affiliation(s)
- Mingyi Yang
- Department of Microbiology, Oslo University Hospital and University of Oslo, Oslo, Norway
- Department of Medical Biochemistry, Oslo University Hospital and University of Oslo, Oslo, Norway
| | - Omer Ali
- Department of Pathology, Oslo University Hospital - Norwegian Radium Hospital, Oslo, Norway
- Faculty of Medicine, University of Oslo, Oslo, Norway
| | - Magnar Bjørås
- Department of Microbiology, Oslo University Hospital and University of Oslo, Oslo, Norway
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
| | - Junbai Wang
- Department of Clinical Molecular Biology (EpiGen), Akershus University Hospital and University of Oslo, Lørenskog, Norway
| |
Collapse
|
5
|
Farooq A, Trøen G, Delabie J, Wang J. Integrating whole genome sequencing, methylation, gene expression, topological associated domain information in regulatory mutation prediction: a study of follicular lymphoma. Comput Struct Biotechnol J 2022; 20:1726-1742. [PMID: 35495111 PMCID: PMC9024376 DOI: 10.1016/j.csbj.2022.03.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 03/22/2022] [Accepted: 03/22/2022] [Indexed: 11/13/2022] Open
Abstract
A major challenge in human genetics is of the analysis of the interplay between genetic and epigenetic factors in a multifactorial disease like cancer. Here, a novel methodology is proposed to investigate genome-wide regulatory mechanisms in cancer, as studied with the example of follicular Lymphoma (FL). In a first phase, a new machine-learning method is designed to identify Differentially Methylated Regions (DMRs) by computing six attributes. In a second phase, an integrative data analysis method is developed to study regulatory mutations in FL, by considering differential methylation information together with DNA sequence variation, differential gene expression, 3D organization of genome (e.g., topologically associated domains), and enriched biological pathways. Resulting mutation block-gene pairs are further ranked to find out the significant ones. By this approach, BCL2 and BCL6 were identified as top-ranking FL-related genes with several mutation blocks and DMRs acting on their regulatory regions. Two additional genes, CDCA4 and CTSO, were also found in top rank with significant DNA sequence variation and differential methylation in neighboring areas, pointing towards their potential use as biomarkers for FL. This work combines both genomic and epigenomic information to investigate genome-wide gene regulatory mechanisms in cancer and contribute to devising novel treatment strategies.
Collapse
|
6
|
He N, Wang W, Fang C, Tan Y, Li L, Hou C. Integration of Count Difference and Curve Similarity in Negative Regulatory Element Detection. Front Genet 2022; 13:818344. [PMID: 35251128 PMCID: PMC8896116 DOI: 10.3389/fgene.2022.818344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 01/20/2022] [Indexed: 12/05/2022] Open
Abstract
Negative regulatory elements (NREs) down-regulate gene expression by inhibiting the activities of promoters or enhancers. The repressing activity of NREs can be measured globally by massively parallel reporter assays (MPRAs). However, most existing algorithms are designed for the statistical detection of positively enriched signals in MPRA datasets. To identify reduced signals in MPRA experiments, we designed a NRE identification program, fast-NR, by integrating the count and graphic features of sequenced reads to detect NREs using datasets generated by experiments of self-transcribing active regulatory region sequencing (STARR-seq). Fast-NR identified hundreds of silencers in human K562 cells that can be validated by independent methods.
Collapse
Affiliation(s)
- Na He
- Harbin Institute of Technology, Harbin, China
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
- *Correspondence: Chunhui Hou, ; Na He,
| | - Wenjing Wang
- School of Life Science and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Chao Fang
- Cancer Centre, Faculty of Health Sciences, University of Macau, Macao, China
| | - Yongjian Tan
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Li Li
- Department of Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Chunhui Hou
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
- *Correspondence: Chunhui Hou, ; Na He,
| |
Collapse
|