1
|
Exploring DNA Methylation Diversity in the Honey Bee Brain by Ultra-Deep Amplicon Sequencing. EPIGENOMES 2020; 4:epigenomes4020010. [PMID: 34968244 PMCID: PMC8594699 DOI: 10.3390/epigenomes4020010] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Revised: 06/17/2020] [Accepted: 06/22/2020] [Indexed: 02/04/2023] Open
Abstract
Understanding methylation dynamics in organs or tissues containing many different cell types is a challenging task that cannot be efficiently addressed by the low-depth bisulphite sequencing of DNA extracted from such sources. Here we explored the feasibility of ultra-deep bisulphite sequencing of long amplicons to reveal the brain methylation patterns in three selected honey bee genes analysed across five distinct conditions on the Illumina MiSeq platform. By combing 15 libraries in one run we achieved a very high sequencing depth of 240,000–340,000 reads per amplicon, suggesting that most of the cell types in the honey bee brain, containing approximately 1 million neurons, are represented in this dataset. We found a small number of gene-specific patterns for each condition in individuals of different ages and performing distinct tasks with 80–90% of those were represented by no more than a dozen patterns. One possibility is that such a small number of frequent patterns is the result of differentially methylated epialleles, whereas the rare and less frequent patterns reflect activity-dependent modifications. The condition-specific methylation differences within each gene appear to be position-dependent with some CpGs showing significant changes and others remaining stable in a methylated or non-methylated state. Interestingly, no significant loss of methylation was detected in very old individuals. Our findings imply that these diverse patterns represent a special challenge in the analyses of DNA methylation in complex tissues and organs that cannot be investigated by low-depth genome-wide bisulphite sequencing. We conclude that ultra-deep sequencing of gene-specific amplicons combined with genotyping of differentially methylated epialleles is an effective way to facilitate more advanced neuro-epigenomic studies in honey bees and other insects.
Collapse
|
2
|
Scherer M, Nebel A, Franke A, Walter J, Lengauer T, Bock C, Müller F, List M. Quantitative comparison of within-sample heterogeneity scores for DNA methylation data. Nucleic Acids Res 2020; 48:e46. [PMID: 32103242 PMCID: PMC7192612 DOI: 10.1093/nar/gkaa120] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 02/14/2020] [Indexed: 12/13/2022] Open
Abstract
DNA methylation is an epigenetic mark with important regulatory roles in cellular identity and can be quantified at base resolution using bisulfite sequencing. Most studies are limited to the average DNA methylation levels of individual CpGs and thus neglect heterogeneity within the profiled cell populations. To assess this within-sample heterogeneity (WSH) several window-based scores that quantify variability in DNA methylation in sequencing reads have been proposed. We performed the first systematic comparison of four published WSH scores based on simulated and publicly available datasets. Moreover, we propose two new scores and provide guidelines for selecting appropriate scores to address cell-type heterogeneity, cellular contamination and allele-specific methylation. Most of the measures were sensitive in detecting DNA methylation heterogeneity in these scenarios, while we detected differences in susceptibility to technical bias. Using recently published DNA methylation profiles of Ewing sarcoma samples, we show that DNA methylation heterogeneity provides information complementary to the DNA methylation level. WSH scores are powerful tools for estimating variance in DNA methylation patterns and have the potential for detecting novel disease-associated genomic loci not captured by established statistics. We provide an R-package implementing the WSH scores for integration into analysis workflows.
Collapse
Affiliation(s)
- Michael Scherer
- Computational Biology, Max Planck Institute for Informatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Graduate School of Computer Science, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Department of Genetics/Epigenetics, Saarland University, 66123 Saarbrücken, Germany
| | - Almut Nebel
- Institute of Clinical Molecular Biology, Kiel University, 24105 Kiel, Germany
| | - Andre Franke
- Institute of Clinical Molecular Biology, Kiel University, 24105 Kiel, Germany
| | - Jörn Walter
- Department of Genetics/Epigenetics, Saarland University, 66123 Saarbrücken, Germany
| | - Thomas Lengauer
- Computational Biology, Max Planck Institute for Informatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany
| | - Christoph Bock
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090 Vienna, Austria
- Department of Laboratory Medicine, Medical University of Vienna, 1090 Vienna, Austria
| | - Fabian Müller
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Markus List
- Big Data in BioMedicine Group, Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| |
Collapse
|
3
|
Lakhal-Chaieb L, Greenwood CMT, Ouhourane M, Zhao K, Abdous B, Oualkacha K. A smoothed EM-algorithm for DNA methylation profiles from sequencing-based methods in cell lines or for a single cell type. Stat Appl Genet Mol Biol 2018; 16:333-347. [PMID: 29055941 DOI: 10.1515/sagmb-2016-0062] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
We consider the assessment of DNA methylation profiles for sequencing-derived data from a single cell type or from cell lines. We derive a kernel smoothed EM-algorithm, capable of analyzing an entire chromosome at once, and to simultaneously correct for experimental errors arising from either the pre-treatment steps or from the sequencing stage and to take into account spatial correlations between DNA methylation profiles at neighbouring CpG sites. The outcomes of our algorithm are then used to (i) call the true methylation status at each CpG site, (ii) provide accurate smoothed estimates of DNA methylation levels, and (iii) detect differentially methylated regions. Simulations show that the proposed methodology outperforms existing analysis methods that either ignore the correlation between DNA methylation profiles at neighbouring CpG sites or do not correct for errors. The use of the proposed inference procedure is illustrated through the analysis of a publicly available data set from a cell line of induced pluripotent H9 human embryonic stem cells and also a data set where methylation measures were obtained for a small genomic region in three different immune cell types separated from whole blood.
Collapse
|
4
|
Jenkinson G, Abante J, Feinberg AP, Goutsias J. An information-theoretic approach to the modeling and analysis of whole-genome bisulfite sequencing data. BMC Bioinformatics 2018. [PMID: 29514626 PMCID: PMC5842653 DOI: 10.1186/s12859-018-2086-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND DNA methylation is a stable form of epigenetic memory used by cells to control gene expression. Whole genome bisulfite sequencing (WGBS) has emerged as a gold-standard experimental technique for studying DNA methylation by producing high resolution genome-wide methylation profiles. Statistical modeling and analysis is employed to computationally extract and quantify information from these profiles in an effort to identify regions of the genome that demonstrate crucial or aberrant epigenetic behavior. However, the performance of most currently available methods for methylation analysis is hampered by their inability to directly account for statistical dependencies between neighboring methylation sites, thus ignoring significant information available in WGBS reads. RESULTS We present a powerful information-theoretic approach for genome-wide modeling and analysis of WGBS data based on the 1D Ising model of statistical physics. This approach takes into account correlations in methylation by utilizing a joint probability model that encapsulates all information available in WGBS methylation reads and produces accurate results even when applied on single WGBS samples with low coverage. Using the Shannon entropy, our approach provides a rigorous quantification of methylation stochasticity in individual WGBS samples genome-wide. Furthermore, it utilizes the Jensen-Shannon distance to evaluate differences in methylation distributions between a test and a reference sample. Differential performance assessment using simulated and real human lung normal/cancer data demonstrate a clear superiority of our approach over DSS, a recently proposed method for WGBS data analysis. Critically, these results demonstrate that marginal methods become statistically invalid when correlations are present in the data. CONCLUSIONS This contribution demonstrates clear benefits and the necessity of modeling joint probability distributions of methylation using the 1D Ising model of statistical physics and of quantifying methylation stochasticity using concepts from information theory. By employing this methodology, substantial improvement of DNA methylation analysis can be achieved by effectively taking into account the massive amount of statistical information available in WGBS data, which is largely ignored by existing methods.
Collapse
Affiliation(s)
- Garrett Jenkinson
- Whitaker Biomedical Engineering Institute, Johns Hopkins University, Baltimore, MD, USA.,Center for Epigenetics, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Jordi Abante
- Whitaker Biomedical Engineering Institute, Johns Hopkins University, Baltimore, MD, USA
| | - Andrew P Feinberg
- Center for Epigenetics, Johns Hopkins School of Medicine, Baltimore, MD, USA.,Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA.,Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - John Goutsias
- Whitaker Biomedical Engineering Institute, Johns Hopkins University, Baltimore, MD, USA.
| |
Collapse
|
5
|
Barrett JE, Feber A, Herrero J, Tanic M, Wilson GA, Swanton C, Beck S. Quantification of tumour evolution and heterogeneity via Bayesian epiallele detection. BMC Bioinformatics 2017; 18:354. [PMID: 28743252 PMCID: PMC5526259 DOI: 10.1186/s12859-017-1753-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 07/05/2017] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Epigenetic heterogeneity within a tumour can play an important role in tumour evolution and the emergence of resistance to treatment. It is increasingly recognised that the study of DNA methylation (DNAm) patterns along the genome - so-called 'epialleles' - offers greater insight into epigenetic dynamics than conventional analyses which examine DNAm marks individually. RESULTS We have developed a Bayesian model to infer which epialleles are present in multiple regions of the same tumour. We apply our method to reduced representation bisulfite sequencing (RRBS) data from multiple regions of one lung cancer tumour and a matched normal sample. The model borrows information from all tumour regions to leverage greater statistical power. The total number of epialleles, the epiallele DNAm patterns, and a noise hyperparameter are all automatically inferred from the data. Uncertainty as to which epiallele an observed sequencing read originated from is explicitly incorporated by marginalising over the appropriate posterior densities. The degree to which tumour samples are contaminated with normal tissue can be estimated and corrected for. By tracing the distribution of epialleles throughout the tumour we can infer the phylogenetic history of the tumour, identify epialleles that differ between normal and cancer tissue, and define a measure of global epigenetic disorder. CONCLUSIONS Detection and comparison of epialleles within multiple tumour regions enables phylogenetic analyses, identification of differentially expressed epialleles, and provides a measure of epigenetic heterogeneity. R code is available at github.com/james-e-barrett.
Collapse
Affiliation(s)
- James E Barrett
- UCL Cancer Institute, University College London, London, UK.
| | - Andrew Feber
- UCL Cancer Institute, University College London, London, UK
| | - Javier Herrero
- UCL Cancer Institute, University College London, London, UK
| | - Miljana Tanic
- UCL Cancer Institute, University College London, London, UK
| | - Gareth A Wilson
- UCL Cancer Institute, University College London, London, UK
- The Francis Crick Institute, London, UK
| | - Charles Swanton
- UCL Cancer Institute, University College London, London, UK
- The Francis Crick Institute, London, UK
- Cancer Research U.K. Lung Cancer Centre of Excellence, UCL Cancer Institute, London, UK
- University College London Hospitals NHS Foundation Trust, London, UK
| | - Stephan Beck
- UCL Cancer Institute, University College London, London, UK
| |
Collapse
|
6
|
EGFR gene methylation is not involved in Royalactin controlled phenotypic polymorphism in honey bees. Sci Rep 2015; 5:14070. [PMID: 26358539 PMCID: PMC4566103 DOI: 10.1038/srep14070] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 08/17/2015] [Indexed: 12/16/2022] Open
Abstract
The 2011 highly publicised Nature paper by Kamakura on honeybee phenotypic dimorphism, (also using Drosophila as an experimental surrogate), claims that a single protein in royal jelly, Royalactin, essentially acts as a master “on-off” switch in development via the epidermal growth factor receptor (AmEGFR), to seal the fate of queen or worker. One mechanism proposed in that study as important for the action of Royalactin is differential amegfr methylation in alternate organismal outcomes. According to the author differential methylation of amegfr was experimentally confirmed and shown in a supportive figure. Here we have conducted an extensive analysis of the honeybee egfr locus and show that this gene is never methylated. We discuss several lines of evidence casting serious doubts on the amegfr methylation result in the 2011 paper and consider possible origins of the author’s statement. In a broader context, we discuss the implication of our findings for contrasting context-dependent regulation of EGFR in three insect species, Apis mellifera, D. melanogaster and the carpenter ant, Camponotus floridanus, and argue that more adequate methylation data scrutiny measures are needed to avoid unwarranted conclusions.
Collapse
|