1
|
Multi-organ single-cell transcriptomics of immune cells uncovered organ-specific gene expression and functions. Sci Data 2024; 11:316. [PMID: 38538617 PMCID: PMC10973478 DOI: 10.1038/s41597-024-03152-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 03/18/2024] [Indexed: 04/01/2024] Open
Abstract
Despite the wealth of publicly available single-cell datasets, our understanding of distinct resident immune cells and their unique features in diverse human organs remains limited. To address this, we compiled a meta-analysis dataset of 114,275 CD45+ immune cells sourced from 14 organs in healthy donors. While the transcriptome of immune cells remains relatively consistent across organs, our analysis has unveiled organ-specific gene expression differences (GTPX3 in kidney, DNTT and ACVR2B in thymus). These alterations are linked to different transcriptional factor activities and pathways including metabolism. TNF-α signaling through the NFkB pathway was found in several organs and immune compartments. The presence of distinct expression profiles for NFkB family genes and their target genes, including cytokines, underscores their pivotal role in cell positioning. Taken together, immune cells serve a dual role: safeguarding the organs and dynamically adjusting to the intricacies of the host organ environment, thereby actively contributing to its functionality and overall homeostasis.
Collapse
|
2
|
Machine learning and multi-omics data in chronic lymphocytic leukemia: the future of precision medicine? Front Genet 2024; 14:1304661. [PMID: 38283149 PMCID: PMC10811210 DOI: 10.3389/fgene.2023.1304661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 12/27/2023] [Indexed: 01/30/2024] Open
Abstract
Chronic lymphocytic leukemia is a complex and heterogeneous hematological malignancy. The advance of high-throughput multi-omics technologies has significantly influenced chronic lymphocytic leukemia research and paved the way for precision medicine approaches. In this review, we explore the role of machine learning in the analysis of multi-omics data in this hematological malignancy. We discuss recent literature on different machine learning models applied to single omic studies in chronic lymphocytic leukemia, with a special focus on the potential contributions to precision medicine. Finally, we highlight the recently published machine learning applications in multi-omics data in this area of research as well as their potential and limitations.
Collapse
|
3
|
Convert-Pheno: A software toolkit for the interconversion of standard data models for phenotypic data. J Biomed Inform 2024; 149:104558. [PMID: 38035971 DOI: 10.1016/j.jbi.2023.104558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 11/24/2023] [Accepted: 11/27/2023] [Indexed: 12/02/2023]
Abstract
Efficient sharing and integration of phenotypic data is crucial for advancing biomedical research and enhancing patient outcomes in precision medicine and public health. To achieve this, the health data community has developed standards to promote the harmonization of variable names and values. However, the use of diverse standards across different research centers can hinder progress. Here we present Convert-Pheno, an open-source software toolkit that enables the interconversion of common data models for phenotypic data such as Beacon v2 Models, CDISC-ODM, OMOP-CDM, Phenopackets v2, and REDCap. Along with the software, we have created a detailed documentation that includes information on deployment and installation.
Collapse
|
4
|
Development of a novel streamlined workflow (AACRE) and database (inCREDBle) for genomic analysis of carbapenem-resistant Enterobacterales. Microb Genom 2023; 9. [PMID: 38010338 DOI: 10.1099/mgen.0.001132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2023] Open
Abstract
In response to the threat of increasing antimicrobial resistance, we must increase the amount of available high-quality genomic data gathered on antibiotic-resistant bacteria. To this end, we developed an integrated pipeline for high-throughput long-read sequencing, assembly, annotation and analysis of bacterial isolates and used it to generate a large genomic data set of carbapenemase-producing Enterobacterales (CPE) isolates collected in Spain. The set of 461 isolates were sequenced with a combination of both Illumina and Oxford Nanopore Technologies (ONT) DNA sequencing technologies in order to provide genomic context for chromosomal loci and, most importantly, structural resolution of plasmids, important determinants for transmission of antimicrobial resistance. We developed an informatics pipeline called Assembly and Annotation of Carbapenem-Resistant Enterobacteriaceae (AACRE) for the full assembly and annotation of the bacterial genomes and their complement of plasmids. To explore the resulting genomic data set, we developed a new database called inCREDBle that not only stores the genomic data, but provides unique ways to filter and compare data, enabling comparative genomic analyses at the level of chromosomes, plasmids and individual genes. We identified a new sequence type, ST5000, and discovered a genomic locus unique to ST15 that may be linked to its increased spread in the population. In addition to our major objective of generating a large regional data set, we took the opportunity to compare the effects of sample quality and sequencing methods, including R9 versus R10 nanopore chemistry, on genome assembly and annotation quality. We conclude that converting short-read and hybrid microbial sequencing and assembly workflows to the latest nanopore chemistry will further reduce processing time and cost, truly enabling the routine monitoring of resistance transmission patterns at the resolution of complete chromosomes and plasmids.
Collapse
|
5
|
Dissecting the Origin of Heterogeneity in Uterine and Ovarian Carcinosarcomas. CANCER RESEARCH COMMUNICATIONS 2023; 3:830-841. [PMID: 37377900 PMCID: PMC10171113 DOI: 10.1158/2767-9764.crc-22-0520] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 02/28/2023] [Accepted: 04/18/2023] [Indexed: 06/29/2023]
Abstract
Gynecologic carcinosarcomas (CS) are biphasic neoplasms composed of carcinomatous (C) and sarcomatous (S) malignant components. Because of their rarity and histologic complexity, genetic and functional studies on CS are scarce and the mechanisms of initiation and development remain largely unknown. Whole-genome analysis of the C and S components reveals shared genomic alterations, thus emphasizing the clonal evolution of CS. Reconstructions of the evolutionary history of each tumor further reveal that C and S samples are composed of both ancestral cell populations and component-specific subclones, supporting a common origin followed by distinct evolutionary trajectories. However, while we do not find any recurrent genomic features associated with phenotypic divergence, transcriptomic and methylome analyses identify a common mechanism across the cohort, the epithelial-to-mesenchymal transition (EMT), suggesting a role for nongenetic factors in inflicting changes to cellular fate. Altogether, these data accredit the hypothesis that CS tumors are driven by both clonal evolution and transcriptomic reprogramming, essential for susceptibility to transdifferentiation upon encountering environmental cues, thus linking CS heterogeneity to genetic, transcriptomic, and epigenetic influences. Significance We have provided a detailed characterization of the genomic landscape of CS and identified EMT as a common mechanism associated with phenotypic divergence, linking CS heterogeneity to genetic, transcriptomic, and epigenetic influences.
Collapse
|
6
|
|
7
|
Author Correction: Genomic basis for RNA alterations in cancer. Nature 2023; 614:E37. [PMID: 36697831 PMCID: PMC9931574 DOI: 10.1038/s41586-022-05596-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
8
|
Genomic evidence for recurrent genetic admixture during the domestication of Mediterranean olive trees (Olea europaea L.). BMC Biol 2020; 18:148. [PMID: 33100219 PMCID: PMC7586694 DOI: 10.1186/s12915-020-00881-6] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2020] [Accepted: 09/27/2020] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Olive tree (Olea europaea L. subsp. europaea, Oleaceae) has been the most emblematic perennial crop for Mediterranean countries since its domestication around 6000 years ago in the Levant. Two taxonomic varieties are currently recognized: cultivated (var. europaea) and wild (var. sylvestris) trees. However, it remains unclear whether olive cultivars derive from a single initial domestication event followed by secondary diversification, or whether cultivated lineages are the result of more than a single, independent primary domestication event. To shed light into the recent evolution and domestication of the olive tree, here we analyze a group of newly sequenced and available genomes using a phylogenomics and population genomics framework. RESULTS We improved the assembly and annotation of the reference genome, newly sequenced the genomes of twelve individuals: ten var. europaea, one var. sylvestris, and one outgroup taxon (subsp. cuspidata)-and assembled a dataset comprising whole genome data from 46 var. europaea and 10 var. sylvestris. Phylogenomic and population structure analyses support a continuous process of olive tree domestication, involving a major domestication event, followed by recurrent independent genetic admixture events with wild populations across the Mediterranean Basin. Cultivated olives exhibit only slightly lower levels of genetic diversity than wild forms, which can be partially explained by the occurrence of a mild population bottleneck 3000-14,000 years ago during the primary domestication period, followed by recurrent introgression from wild populations. Genes associated with stress response and developmental processes were positively selected in cultivars, but we did not find evidence that genes involved in fruit size or oil content were under positive selection. This suggests that complex selective processes other than directional selection of a few genes are in place. CONCLUSIONS Altogether, our results suggest that a primary domestication area in the eastern Mediterranean basin was followed by numerous secondary events across most countries of southern Europe and northern Africa, often involving genetic admixture with genetically rich wild populations, particularly from the western Mediterranean Basin.
Collapse
|
9
|
Framework for quality assessment of whole genome cancer sequences. Nat Commun 2020; 11:5040. [PMID: 33028839 PMCID: PMC7541455 DOI: 10.1038/s41467-020-18688-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2020] [Accepted: 07/22/2020] [Indexed: 11/25/2022] Open
Abstract
Bringing together cancer genomes from different projects increases power and allows the investigation of pan-cancer, molecular mechanisms. However, working with whole genomes sequenced over several years in different sequencing centres requires a framework to compare the quality of these sequences. We used the Pan-Cancer Analysis of Whole Genomes cohort as a test case to construct such a framework. This cohort contains whole cancer genomes of 2832 donors from 18 sequencing centres. We developed a non-redundant set of five quality control (QC) measurements to establish a star rating system. These QC measures reflect known differences in sequencing protocol and provide a guide to downstream analyses and allow for exclusion of samples of poor quality. We have found that this is an effective framework of quality measures. The implementation of the framework is available at: https://dockstore.org/containers/quay.io/jwerner_dkfz/pancanqc:1.2.2. Working with cancer genomes from multiple projects can increase investigative power, but quality of sequences can vary. Here, the authors present a framework for comparing whole genome sequencing quality to help researchers guide downstream analyses and exclude poor quality samples.
Collapse
|
10
|
Abstract
Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale1-3. Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4-5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter4; identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation5,6; analyses timings and patterns of tumour evolution7; describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity8,9; and evaluates a range of more-specialized features of cancer genomes8,10-18.
Collapse
|
11
|
gemBS: high throughput processing for DNA methylation data from bisulfite sequencing. Bioinformatics 2019; 35:737-742. [PMID: 30137223 DOI: 10.1093/bioinformatics/bty690] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2018] [Revised: 07/18/2018] [Accepted: 08/20/2018] [Indexed: 02/07/2023] Open
Abstract
MOTIVATION DNA methylation is essential for normal embryogenesis and development in mammals and can be captured at single base pair resolution by whole genome bisulfite sequencing (WGBS). Current available analysis tools are becoming rapidly outdated as they lack sensible functionality and efficiency to handle large amounts of data now commonly created. RESULTS We developed gemBS, a fast high-throughput bioinformatics pipeline specifically designed for large scale BS-Seq analysis that combines a high performance BS-mapper (GEM3) and a variant caller specifically for BS-Seq data (BScall). gemBS provides genotype information and methylation estimates for all genomic cytosines in different contexts (CpG and non-CpG) and a set of quality reports for comprehensive and reproducible analysis. gemBS is highly modular and can be easily automated, while producing robust and accurate results. AVAILABILITY AND IMPLEMENTATION gemBS is released under the GNU GPLv3+ license. Source code and documentation are freely available from www.statgen.cat/gemBS. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
12
|
High degree of polyclonality hinders somatic mutation calling in lung brush samples of COPD cases and controls. Sci Rep 2019; 9:20158. [PMID: 31882973 PMCID: PMC6934450 DOI: 10.1038/s41598-019-56618-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Accepted: 12/04/2019] [Indexed: 11/16/2022] Open
Abstract
Chronic obstructive pulmonary disease (COPD) is induced by cigarette smoking and characterized by inflammation of airway tissue. Since smokers with COPD have a higher risk of developing lung cancer than those without, we hypothesized that they carry more mutations in affected tissue. We called somatic mutations in airway brush samples from medium-coverage whole genome sequencing data from healthy never and ex-smokers (n = 8), as well as from ex-smokers with variable degrees of COPD (n = 4). Owing to the limited concordance of resulting calls between the applied tools we built a consensus, a strategy that was validated with high accuracy for cancer data. However, consensus calls showed little promise of representing true positives due to low mappability of corresponding sequence reads and high overlap with positions harbouring known genetic polymorphisms. A targeted re-sequencing approach suggested that only few mutations would survive stringent verification testing and that our data did not allow the inference of any difference in the mutational load of bronchial brush samples between former smoking COPD cases and controls. High polyclonality in airway brush samples renders medium-depth sequencing insufficient to provide the resolution to detect somatic mutations. Deep sequencing data of airway biopsies are needed to tackle the question.
Collapse
|
13
|
Labelled regulatory elements are pervasive features of the macrophage genome and are dynamically utilized by classical and alternative polarization signals. Nucleic Acids Res 2019; 47:2778-2792. [PMID: 30799488 PMCID: PMC6451134 DOI: 10.1093/nar/gkz118] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Accepted: 02/14/2019] [Indexed: 01/09/2023] Open
Abstract
The concept of tissue-specific gene expression posits that lineage-determining transcription factors (LDTFs) determine the open chromatin profile of a cell via collaborative binding, providing molecular beacons to signal-dependent transcription factors (SDTFs). However, the guiding principles of LDTF binding, chromatin accessibility and enhancer activity have not yet been systematically evaluated. We sought to study these features of the macrophage genome by the combination of experimental (ChIP-seq, ATAC-seq and GRO-seq) and computational approaches. We show that Random Forest and Support Vector Regression machine learning methods can accurately predict chromatin accessibility using the binding patterns of the LDTF PU.1 and four other key TFs of macrophages (IRF8, JUNB, CEBPA and RUNX1). Any of these TFs alone were not sufficient to predict open chromatin, indicating that TF binding is widespread at closed or weakly opened chromatin regions. Analysis of the PU.1 cistrome revealed that two-thirds of PU.1 binding occurs at low accessible chromatin. We termed these sites labelled regulatory elements (LREs), which may represent a dormant state of a future enhancer and contribute to macrophage cellular plasticity. Collectively, our work demonstrates the existence of LREs occupied by various key TFs, regulating specific gene expression programs triggered by divergent macrophage polarizing stimuli.
Collapse
|
14
|
Recurrent somatic mutations reveal new insights into consequences of mutagenic processes in cancer. PLoS Comput Biol 2019; 15:e1007496. [PMID: 31765368 PMCID: PMC6901237 DOI: 10.1371/journal.pcbi.1007496] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 12/09/2019] [Accepted: 10/22/2019] [Indexed: 12/12/2022] Open
Abstract
The sheer size of the human genome makes it improbable that identical somatic mutations at the exact same position are observed in multiple tumours solely by chance. The scarcity of cancer driver mutations also precludes positive selection as the sole explanation. Therefore, recurrent mutations may be highly informative of characteristics of mutational processes. To explore the potential, we use recurrence as a starting point to cluster >2,500 whole genomes of a pan-cancer cohort. We describe each genome with 13 recurrence-based and 29 general mutational features. Using principal component analysis we reduce the dimensionality and create independent features. We apply hierarchical clustering to the first 18 principal components followed by k-means clustering. We show that the resulting 16 clusters capture clinically relevant cancer phenotypes. High levels of recurrent substitutions separate the clusters that we link to UV-light exposure and deregulated activity of POLE from the one representing defective mismatch repair, which shows high levels of recurrent insertions/deletions. Recurrence of both mutation types characterizes cancer genomes with somatic hypermutation of immunoglobulin genes and the cluster of genomes exposed to gastric acid. Low levels of recurrence are observed for the cluster where tobacco-smoke exposure induces mutagenesis and the one linked to increased activity of cytidine deaminases. Notably, the majority of substitutions are recurrent in a single tumour type, while recurrent insertions/deletions point to shared processes between tumour types. Recurrence also reveals susceptible sequence motifs, including TT[C>A]TTT and AAC[T>G]T for the POLE and 'gastric-acid exposure' clusters, respectively. Moreover, we refine knowledge of mutagenesis, including increased C/G deletion levels in general for lung tumours and specifically in midsize homopolymer sequence contexts for microsatellite instable tumours. Our findings are an important step towards the development of a generic cancer diagnostic test for clinical practice based on whole-genome sequencing that could replace multiple diagnostics currently in use.
Collapse
|
15
|
Partially methylated domains are hypervariable in breast cancer and fuel widespread CpG island hypermethylation. Nat Commun 2019; 10:1749. [PMID: 30988298 PMCID: PMC6465362 DOI: 10.1038/s41467-019-09828-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Accepted: 03/27/2019] [Indexed: 12/21/2022] Open
Abstract
Global loss of DNA methylation and CpG island (CGI) hypermethylation are key epigenomic aberrations in cancer. Global loss manifests itself in partially methylated domains (PMDs) which extend up to megabases. However, the distribution of PMDs within and between tumor types, and their effects on key functional genomic elements including CGIs are poorly defined. We comprehensively show that loss of methylation in PMDs occurs in a large fraction of the genome and represents the prime source of DNA methylation variation. PMDs are hypervariable in methylation level, size and distribution, and display elevated mutation rates. They impose intermediate DNA methylation levels incognizant of functional genomic elements including CGIs, underpinning a CGI methylator phenotype (CIMP). Repression effects on tumor suppressor genes are negligible as they are generally excluded from PMDs. The genomic distribution of PMDs reports tissue-of-origin and may represent tissue-specific silent regions which tolerate instability at the epigenetic, transcriptomic and genetic level.
Collapse
|
16
|
Single cell RNA-seq identifies the origins of heterogeneity in efficient cell transdifferentiation and reprogramming. eLife 2019; 8:41627. [PMID: 30860479 PMCID: PMC6435319 DOI: 10.7554/elife.41627] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2018] [Accepted: 03/11/2019] [Indexed: 12/31/2022] Open
Abstract
Forced transcription factor expression can transdifferentiate somatic cells into other specialised cell types or reprogram them into induced pluripotent stem cells (iPSCs) with variable efficiency. To better understand the heterogeneity of these processes, we used single-cell RNA sequencing to follow the transdifferentation of murine pre-B cells into macrophages as well as their reprogramming into iPSCs. Even in these highly efficient systems, there was substantial variation in the speed and path of fate conversion. We predicted and validated that these differences are inversely coupled and arise in the starting cell population, with Mychigh large pre-BII cells transdifferentiating slowly but reprogramming efficiently and Myclow small pre-BII cells transdifferentiating rapidly but failing to reprogram. Strikingly, differences in Myc activity predict the efficiency of reprogramming across a wide range of somatic cell types. These results illustrate how single cell expression and computational analyses can identify the origins of heterogeneity in cell fate conversion processes.
Collapse
|
17
|
Systems Genetics of Hepatic Metabolome Reveals Octopamine as a Target for Non-Alcoholic Fatty Liver Disease Treatment. Sci Rep 2019; 9:3656. [PMID: 30842494 PMCID: PMC6403227 DOI: 10.1038/s41598-019-40153-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Accepted: 01/17/2019] [Indexed: 12/14/2022] Open
Abstract
Non-alcoholic fatty liver disease (NAFLD) is often associated with obesity and type 2 diabetes. To disentangle etiological relationships between these conditions and identify genetically-determined metabolites involved in NAFLD processes, we mapped 1H nuclear magnetic resonance (NMR) metabolomic and disease-related phenotypes in a mouse F2 cross derived from strains showing resistance (BALB/c) and increased susceptibility (129S6) to these diseases. Quantitative trait locus (QTL) analysis based on single nucleotide polymorphism (SNP) genotypes identified diet responsive QTLs in F2 mice fed control or high fat diet (HFD). In HFD fed F2 mice we mapped on chromosome 18 a QTL regulating liver micro- and macrovesicular steatosis and inflammation, independently from glucose intolerance and adiposity, which was linked to chromosome 4. Linkage analysis of liver metabolomic profiling data identified a QTL for octopamine, which co-localised with the QTL for liver histopathology in the cross. Functional relationship between these two QTLs was validated in vivo in mice chronically treated with octopamine, which exhibited reduction in liver histopathology and metabolic benefits, underlining its role as a mechanistic biomarker of fatty liver with potential therapeutic applications.
Collapse
|
18
|
Transcription Factors Drive Tet2-Mediated Enhancer Demethylation to Reprogram Cell Fate. Cell Stem Cell 2018; 23:905-906. [PMID: 30526885 PMCID: PMC6292969 DOI: 10.1016/j.stem.2018.11.001] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
19
|
Transcription Factors Drive Tet2-Mediated Enhancer Demethylation to Reprogram Cell Fate. Cell Stem Cell 2018; 23:727-741.e9. [DOI: 10.1016/j.stem.2018.08.016] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2017] [Revised: 07/07/2018] [Accepted: 08/23/2018] [Indexed: 10/28/2022]
|
20
|
Genomic and Phenotypic Variation in Morphogenetic Networks of Two Candida albicans Isolates Subtends Their Different Pathogenic Potential. Front Immunol 2018; 8:1997. [PMID: 29403478 PMCID: PMC5780349 DOI: 10.3389/fimmu.2017.01997] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2017] [Accepted: 12/22/2017] [Indexed: 01/29/2023] Open
Abstract
The transition from commensalism to pathogenicity of Candida albicans reflects both the host inability to mount specific immune responses and the microorganism’s dimorphic switch efficiency. In this study, we used whole genome sequencing and microarray analysis to investigate the genomic determinants of the phenotypic changes observed in two C. albicans clinical isolates (YL1 and YQ2). In vitro experiments employing epithelial, microglial, and peripheral blood mononuclear cells were thus used to evaluate C. albicans isolates interaction with first line host defenses, measuring adhesion, susceptibility to phagocytosis, and induction of secretory responses. Moreover, a murine model of peritoneal infection was used to compare the in vivo pathogenic potential of the two isolates. Genome sequence and gene expression analysis of C. albicans YL1 and YQ2 showed significant changes in cellular pathways involved in environmental stress response, adhesion, filamentous growth, invasiveness, and dimorphic transition. This was in accordance with the observed marked phenotypic differences in biofilm production, dimorphic switch efficiency, cell adhesion, invasion, and survival to phagocyte-mediated host defenses. The mutations in key regulators of the hyphal growth pathway in the more virulent strain corresponded to an overall greater number of budding yeast cells released. Compared to YQ2, YL1 consistently showed enhanced pathogenic potential, since in vitro, it was less susceptible to ingestion by phagocytic cells and more efficient in invading epithelial cells, while in vivo YL1 was more effective than YQ2 in recruiting inflammatory cells, eliciting IL-1β response and eluding phagocytic cells. Overall, these results indicate an unexpected isolate-specific variation in pathways important for host invasion and colonization, showing how the genetic background of C. albicans may greatly affect its behavior both in vitro and in vivo. Based on this approach, we propose that the co-occurrence of changes in sequence and expression in genes and pathways driving dimorphic transition and pathogenicity reflects a selective balance between traits favoring dissemination of the pathogen and traits involved in host defense evasion. This study highlights the importance of investigating strain-level, rather than species level, differences, when determining fungal–host interactions and defining commensal or pathogen behavior.
Collapse
|
21
|
Morphometric, Behavioral, and Genomic Evidence for a New Orangutan Species. Curr Biol 2017; 27:3487-3498.e10. [PMID: 29103940 DOI: 10.1016/j.cub.2017.09.047] [Citation(s) in RCA: 133] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2017] [Revised: 07/17/2017] [Accepted: 09/20/2017] [Indexed: 12/30/2022]
Abstract
Six extant species of non-human great apes are currently recognized: Sumatran and Bornean orangutans, eastern and western gorillas, and chimpanzees and bonobos [1]. However, large gaps remain in our knowledge of fine-scale variation in hominoid morphology, behavior, and genetics, and aspects of great ape taxonomy remain in flux. This is particularly true for orangutans (genus: Pongo), the only Asian great apes and phylogenetically our most distant relatives among extant hominids [1]. Designation of Bornean and Sumatran orangutans, P. pygmaeus (Linnaeus 1760) and P. abelii (Lesson 1827), as distinct species occurred in 2001 [1, 2]. Here, we show that an isolated population from Batang Toru, at the southernmost range limit of extant Sumatran orangutans south of Lake Toba, is distinct from other northern Sumatran and Bornean populations. By comparing cranio-mandibular and dental characters of an orangutan killed in a human-animal conflict to those of 33 adult male orangutans of a similar developmental stage, we found consistent differences between the Batang Toru individual and other extant Ponginae. Our analyses of 37 orangutan genomes provided a second line of evidence. Model-based approaches revealed that the deepest split in the evolutionary history of extant orangutans occurred ∼3.38 mya between the Batang Toru population and those to the north of Lake Toba, whereas both currently recognized species separated much later, about 674 kya. Our combined analyses support a new classification of orangutans into three extant species. The new species, Pongo tapanuliensis, encompasses the Batang Toru population, of which fewer than 800 individuals survive. VIDEO ABSTRACT.
Collapse
|
22
|
Increased DNA methylation variability in type 1 diabetes across three immune effector cell types. Nat Commun 2016; 7:13555. [PMID: 27898055 PMCID: PMC5141286 DOI: 10.1038/ncomms13555] [Citation(s) in RCA: 119] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2016] [Accepted: 10/04/2016] [Indexed: 02/06/2023] Open
Abstract
The incidence of type 1 diabetes (T1D) has substantially increased over the past decade, suggesting a role for non-genetic factors such as epigenetic mechanisms in disease development. Here we present an epigenome-wide association study across 406,365 CpGs in 52 monozygotic twin pairs discordant for T1D in three immune effector cell types. We observe a substantial enrichment of differentially variable CpG positions (DVPs) in T1D twins when compared with their healthy co-twins and when compared with healthy, unrelated individuals. These T1D-associated DVPs are found to be temporally stable and enriched at gene regulatory elements. Integration with cell type-specific gene regulatory circuits highlight pathways involved in immune cell metabolism and the cell cycle, including mTOR signalling. Evidence from cord blood of newborns who progress to overt T1D suggests that the DVPs likely emerge after birth. Our findings, based on 772 methylomes, implicate epigenetic changes that could contribute to disease pathogenesis in T1D. The incidence of type 1 diabetes is increasing, potentially implicating non-genetic factors. Here the authors conduct an epigenome-wide association study in disease-discordant twins and find increased DNA methylation variability at genes associated with immune cell metabolism and the cell cycle.
Collapse
|
23
|
Decoding the DNA Methylome of Mantle Cell Lymphoma in the Light of the Entire B Cell Lineage. Cancer Cell 2016; 30:806-821. [PMID: 27846393 PMCID: PMC5805090 DOI: 10.1016/j.ccell.2016.09.014] [Citation(s) in RCA: 90] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Revised: 07/18/2016] [Accepted: 09/19/2016] [Indexed: 01/17/2023]
Abstract
We analyzed the in silico purified DNA methylation signatures of 82 mantle cell lymphomas (MCL) in comparison with cell subpopulations spanning the entire B cell lineage. We identified two MCL subgroups, respectively carrying epigenetic imprints of germinal-center-inexperienced and germinal-center-experienced B cells, and we found that DNA methylation profiles during lymphomagenesis are largely influenced by the methylation dynamics in normal B cells. An integrative epigenomic approach revealed 10,504 differentially methylated regions in regulatory elements marked by H3K27ac in MCL primary cases, including a distant enhancer showing de novo looping to the MCL oncogene SOX11. Finally, we observed that the magnitude of DNA methylation changes per case is highly variable and serves as an independent prognostic factor for MCL outcome.
Collapse
|
24
|
Abstract
Background The Mediterranean olive tree (Olea europaea subsp. europaea) was one of the first trees to be domesticated and is currently of major agricultural importance in the Mediterranean region as the source of olive oil. The molecular bases underlying the phenotypic differences among domesticated cultivars, or between domesticated olive trees and their wild relatives, remain poorly understood. Both wild and cultivated olive trees have 46 chromosomes (2n). Findings A total of 543 Gb of raw DNA sequence from whole genome shotgun sequencing, and a fosmid library containing 155,000 clones from a 1,000+ year-old olive tree (cv. Farga) were generated by Illumina sequencing using different combinations of mate-pair and pair-end libraries. Assembly gave a final genome with a scaffold N50 of 443 kb, and a total length of 1.31 Gb, which represents 95 % of the estimated genome length (1.38 Gb). In addition, the associated fungus Aureobasidiumpullulans was partially sequenced. Genome annotation, assisted by RNA sequencing from leaf, root, and fruit tissues at various stages, resulted in 56,349 unique protein coding genes, suggesting recent genomic expansion. Genome completeness, as estimated using the CEGMA pipeline, reached 98.79 %. Conclusions The assembled draft genome of O. europaea will provide a valuable resource for the study of the evolution and domestication processes of this important tree, and allow determination of the genetic bases of key phenotypic traits. Moreover, it will enhance breeding programs and the formation of new varieties.
Collapse
|
25
|
A comprehensive assessment of somatic mutation detection in cancer using whole-genome sequencing. Nat Commun 2015; 6:10001. [PMID: 26647970 PMCID: PMC4682041 DOI: 10.1038/ncomms10001] [Citation(s) in RCA: 205] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Accepted: 10/23/2015] [Indexed: 12/13/2022] Open
Abstract
As whole-genome sequencing for cancer genome analysis becomes a clinical tool, a full understanding of the variables affecting sequencing analysis output is required. Here using tumour-normal sample pairs from two different types of cancer, chronic lymphocytic leukaemia and medulloblastoma, we conduct a benchmarking exercise within the context of the International Cancer Genome Consortium. We compare sequencing methods, analysis pipelines and validation methods. We show that using PCR-free methods and increasing sequencing depth to ∼ 100 × shows benefits, as long as the tumour:control coverage ratio remains balanced. We observe widely varying mutation call rates and low concordance among analysis pipelines, reflecting the artefact-prone nature of the raw data and lack of standards for dealing with the artefacts. However, we show that, using the benchmark mutation set we have created, many issues are in fact easy to remedy and have an immediate positive impact on mutation detection accuracy.
Collapse
|
26
|
New technologies for DNA analysis--a review of the READNA Project. N Biotechnol 2015; 33:311-30. [PMID: 26514324 DOI: 10.1016/j.nbt.2015.10.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Accepted: 10/17/2015] [Indexed: 01/09/2023]
Abstract
The REvolutionary Approaches and Devices for Nucleic Acid analysis (READNA) project received funding from the European Commission for 41/2 years. The objectives of the project revolved around technological developments in nucleic acid analysis. The project partners have discovered, created and developed a huge body of insights into nucleic acid analysis, ranging from improvements and implementation of current technologies to the most promising sequencing technologies that constitute a 3(rd) and 4(th) generation of sequencing methods with nanopores and in situ sequencing, respectively.
Collapse
|
27
|
Genomic characterization of mutant laboratory mouse strains by exome sequencing and annotation lift-over. BMC Genomics 2015; 16:351. [PMID: 25943197 PMCID: PMC4422528 DOI: 10.1186/s12864-015-1548-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Accepted: 04/17/2015] [Indexed: 11/16/2022] Open
Abstract
Background Exome sequencing has become a popular method to evaluate undirected mutagenesis experiments in mice. However, the most suitable mouse strain for the biological model may be relatively distant from the standard mouse reference genome. For pinpointing causative variants, a matching reference with gene annotations is essential, but not always readily available. Results We present an approach that allows to use murine Ensembl annotations on alternative mouse strain assemblies. We resolved ENU-induced mutation screening for 8 phenotypic mutant lines generated on C3HeB/FeJ background aligning the sequences against the closely related, but not annotated reference of C3H/HeJ. Variants occurring in all strains were filtered out as specific for the C3HeB/FeJ strain but unrelated to mutagenesis. Variants occurring exclusively in all individuals of one mutant line and matching the inheritance model were selected as mutagenesis-related. These variants were annotated with gene and exon names lifted over from the standard murine reference mm9 to C3H/HeJ using megablast. For each mutant line, we could restrict the results to exonic variants in between 1 and 23 genes. Conclusions The presented method of exonic annotation lift-over proved to be a valuable tool in the search for mutagenesis-derived coding genomic variants and the assessment of genotype-phenotype relationships.
Collapse
|
28
|
Involvement of a citrus meiotic recombination TTC-repeat motif in the formation of gross deletions generated by ionizing radiation and MULE activation. BMC Genomics 2015; 16:69. [PMID: 25758634 PMCID: PMC4334395 DOI: 10.1186/s12864-015-1280-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2014] [Accepted: 01/26/2015] [Indexed: 02/07/2023] Open
Abstract
Background Transposable-element mediated chromosomal rearrangements require the involvement of two transposons and two double-strand breaks (DSB) located in close proximity. In radiobiology, DSB proximity is also a major factor contributing to rearrangements. However, the whole issue of DSB proximity remains virtually unexplored. Results Based on DNA sequencing analysis we show that the genomes of 2 derived mutations, Arrufatina (sport) and Nero (irradiation), share a similar 2 Mb deletion of chromosome 3. A 7 kb Mutator-like element found in Clemenules was present in Arrufatina in inverted orientation flanking the 5′ end of the deletion. The Arrufatina Mule displayed “dissimilar” 9-bp target site duplications separated by 2 Mb. Fine-scale single nucleotide variant analyses of the deleted fragments identified a TTC-repeat sequence motif located in the center of the deletion responsible of a meiotic crossover detected in the citrus reference genome. Conclusions Taken together, this information is compatible with the proposal that in both mutants, the TTC-repeat motif formed a triplex DNA structure generating a loop that brought in close proximity the originally distinct reactive ends. In Arrufatina, the loop brought the Mule ends nearby the 2 distinct insertion target sites and the inverted insertion of the transposable element between these target sites provoked the release of the in-between fragment. This proposal requires the involvement of a unique transposon and sheds light on the unresolved question of how two distinct sites become located in close proximity. These observations confer a crucial role to the TTC-repeats in fundamental plant processes as meiotic recombination and chromosomal rearrangements. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1280-3) contains supplementary material, which is available to authorized users.
Collapse
|
29
|
Whole-epigenome analysis in multiple myeloma reveals DNA hypermethylation of B cell-specific enhancers. Genome Res 2015; 25:478-87. [PMID: 25644835 PMCID: PMC4381520 DOI: 10.1101/gr.180240.114] [Citation(s) in RCA: 99] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2014] [Accepted: 01/22/2015] [Indexed: 12/20/2022]
Abstract
While analyzing the DNA methylome of multiple myeloma (MM), a plasma cell neoplasm, by whole-genome bisulfite sequencing and high-density arrays, we observed a highly heterogeneous pattern globally characterized by regional DNA hypermethylation embedded in extensive hypomethylation. In contrast to the widely reported DNA hypermethylation of promoter-associated CpG islands (CGIs) in cancer, hypermethylated sites in MM, as opposed to normal plasma cells, were located outside CpG islands and were unexpectedly associated with intronic enhancer regions defined in normal B cells and plasma cells. Both RNA-seq and in vitro reporter assays indicated that enhancer hypermethylation is globally associated with down-regulation of its host genes. ChIP-seq and DNase-seq further revealed that DNA hypermethylation in these regions is related to enhancer decommissioning. Hypermethylated enhancer regions overlapped with binding sites of B cell-specific transcription factors (TFs) and the degree of enhancer methylation inversely correlated with expression levels of these TFs in MM. Furthermore, hypermethylated regions in MM were methylated in stem cells and gradually became demethylated during normal B-cell differentiation, suggesting that MM cells either reacquire epigenetic features of undifferentiated cells or maintain an epigenetic signature of a putative myeloma stem cell progenitor. Overall, we have identified DNA hypermethylation of developmentally regulated enhancers as a new type of epigenetic modification associated with the pathogenesis of MM.
Collapse
|
30
|
Abstract
Cactophilic Drosophila species provide a valuable model to study gene–environment interactions and ecological adaptation. Drosophila buzzatii and Drosophila mojavensis are two cactophilic species that belong to the repleta group, but have very different geographical distributions and primary host plants. To investigate the genomic basis of ecological adaptation, we sequenced the genome and developmental transcriptome of D. buzzatii and compared its gene content with that of D. mojavensis and two other noncactophilic Drosophila species in the same subgenus. The newly sequenced D. buzzatii genome (161.5 Mb) comprises 826 scaffolds (>3 kb) and contains 13,657 annotated protein-coding genes. Using RNA sequencing data of five life-stages we found expression of 15,026 genes, 80% protein-coding genes, and 20% noncoding RNA genes. In total, we detected 1,294 genes putatively under positive selection. Interestingly, among genes under positive selection in the D. mojavensis lineage, there is an excess of genes involved in metabolism of heterocyclic compounds that are abundant in Stenocereus cacti and toxic to nonresident Drosophila species. We found 117 orphan genes in the shared D. buzzatii–D. mojavensis lineage. In addition, gene duplication analysis identified lineage-specific expanded families with functional annotations associated with proteolysis, zinc ion binding, chitin binding, sensory perception, ethanol tolerance, immunity, physiology, and reproduction. In summary, we identified genetic signatures of adaptation in the shared D. buzzatii–D. mojavensis lineage, and in the two separate D. buzzatii and D. mojavensis lineages. Many of the novel lineage-specific genomic features are promising candidates for explaining the adaptation of these species to their distinct ecological niches.
Collapse
|
31
|
Variation in genomic landscape of clear cell renal cell carcinoma across Europe. Nat Commun 2014; 5:5135. [PMID: 25351205 DOI: 10.1038/ncomms6135] [Citation(s) in RCA: 132] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Accepted: 09/03/2014] [Indexed: 12/31/2022] Open
Abstract
The incidence of renal cell carcinoma (RCC) is increasing worldwide, and its prevalence is particularly high in some parts of Central Europe. Here we undertake whole-genome and transcriptome sequencing of clear cell RCC (ccRCC), the most common form of the disease, in patients from four different European countries with contrasting disease incidence to explore the underlying genomic architecture of RCC. Our findings support previous reports on frequent aberrations in the epigenetic machinery and PI3K/mTOR signalling, and uncover novel pathways and genes affected by recurrent mutations and abnormal transcriptome patterns including focal adhesion, components of extracellular matrix (ECM) and genes encoding FAT cadherins. Furthermore, a large majority of patients from Romania have an unexpected high frequency of A:T>T:A transversions, consistent with exposure to aristolochic acid (AA). These results show that the processes underlying ccRCC tumorigenesis may vary in different populations and suggest that AA may be an important ccRCC carcinogen in Romania, a finding with major public health implications.
Collapse
|
32
|
Abstract
Epithelial-to-mesenchymal transition (EMT) is a plastic process in which fully differentiated epithelial cells are converted into poorly differentiated, migratory and invasive mesenchymal cells, and it has been related to the metastasis potential of tumors. This is a reversible process and cells can also eventually undergo mesenchymal-to-epithelial transition. The existence of a dynamic EMT process suggests the involvement of epigenetic shifts in the phenotype. Herein, we obtained the DNA methylomes at single-base resolution of Madin-Darby canine kidney cells undergoing EMT and translated the identified differentially methylated regions to human breast cancer cells undergoing a gain of migratory and invasive capabilities associated with the EMT phenotype. We noticed dynamic and reversible changes of DNA methylation, both on promoter sequences and gene-bodies in association with transcription regulation of EMT-related genes. Most importantly, the identified DNA methylation markers of EMT were present in primary mammary tumors in association with the epithelial or the mesenchymal phenotype of the studied breast cancer samples.
Collapse
|
33
|
RD-Connect: an integrated platform connecting databases, registries, biobanks and clinical bioinformatics for rare disease research. J Gen Intern Med 2014; 29 Suppl 3:S780-7. [PMID: 25029978 PMCID: PMC4124112 DOI: 10.1007/s11606-014-2908-8] [Citation(s) in RCA: 104] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Research into rare diseases is typically fragmented by data type and disease. Individual efforts often have poor interoperability and do not systematically connect data across clinical phenotype, genomic data, biomaterial availability, and research/trial data sets. Such data must be linked at both an individual-patient and whole-cohort level to enable researchers to gain a complete view of their disease and patient population of interest. Data access and authorization procedures are required to allow researchers in multiple institutions to securely compare results and gain new insights. Funded by the European Union's Seventh Framework Programme under the International Rare Diseases Research Consortium (IRDiRC), RD-Connect is a global infrastructure project initiated in November 2012 that links genomic data with registries, biobanks, and clinical bioinformatics tools to produce a central research resource for rare diseases.
Collapse
|
34
|
A revisit of high collision energy effects on collision-induced dissociation spectra using matrix-assisted laser desorption/ionization tandem time-of-flight mass spectrometry (MALDI-LIFT-TOF/TOF): application to the sequencing of RNA/DNA chimeras. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2014; 28:1433-1443. [PMID: 24861592 DOI: 10.1002/rcm.6913] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2013] [Revised: 04/01/2014] [Accepted: 04/01/2014] [Indexed: 06/03/2023]
Abstract
RATIONALE High-energy collision-induced dissociation (CID) spectra of isomeric RNA/DNA chimeras using matrix-assisted laser desorption/ionization time-of-flight LIFT mass spectrometry (MALDI-LIFT-TOF/TOF) can potentially be applied for an exhaustive fragment characterization in a nucleic acid sequencing scheme. These chimeras contain deoxynucleotides and at the 3'-end a ribonucleotide with a 3'-phosphate group. METHODS Deprotonated RNA/DNA chimeras of 4-, 5-, 7- and 10-mers are analyzed by CID. This enhances consecutive dissociations from both the precursor and prompt product anions generated by MALDI and metastable fragmentations prior to entering the LIFT cell. RESULTS Gas-phase fragmentations of 4- and 5-mers produced many fragment ions, from base release prior to consecutive cleavage of the nucleotide phosphate bond linkage phosphate. The unusual a4(-) product ion is a specific and diagnostic dissociation of the 4-mer if the ribonucleotide contains cytosine. As the size of RNA/DNA chimeras increase, several abundant product ions are generated mainly from zwitterionic forms (deprotonated phosphate ester and protonated base sites): [(M-H)-BiH](-), [ai-BiH](-), wj(-), [wj, (ai-BiH)](-) (if Bi ≠ T) as internal product ion, and more rarely [wj-BiH](-). The absence of the majority of the [ai-BiH](-) series although the wj (-) series suggested that the higher critical energy processes with a loose transition state are favored yielding the wj(-) series. A large number of abundant fragment ions are detected which enable each isomer to be sequenced. CONCLUSIONS This sequencing method is high-throughput, accurate and could be used to sequence isomers of up to 10-mers and also oligonucleotides of unknown sequence. However, RNA/DNA chimeras without thymine must be sufficiently concentrated to reach desorption of deprotonated molecular species to be selected in LIFT to produce all fragment ions within measurable abundances.
Collapse
|
35
|
Structural studies of the sBBI/trypsin non-covalent complex using covalent modification and mass spectrometry. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2014; 28:413-429. [PMID: 24497279 DOI: 10.1002/rcm.6797] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Revised: 11/21/2013] [Accepted: 12/01/2013] [Indexed: 06/03/2023]
Abstract
RATIONALE The study of protein recognition sites is crucial for understanding the mechanisms of protein interaction. Mass spectrometry can be a method of choice for the investigation of the contact surface within the protein non-covalent complexes. METHODS Probing the reactivity of essential amino acid residues of soybean Bowman-Birk inhibitor (sBBI) within the non-covalent sBBI/bovine trypsin complex was performed using covalent labeling by the BS3 cross-linker and charge tag with a quaternary ammonium group in combination with matrix-assisted laser desorption/ionization (MALDI) mass spectrometry (MS) and tandem mass spectrometry (MS/MS) analysis. RESULTS Significant modulation of the reactivity of essential K16 and S17 residues in the sBBI molecule upon binding to trypsin was established. The studies of sBBI proteolytic peptides with the same structure but carrying different labels using metastable dissociation in LIFT mode demonstrated that fragmentation pathways were oriented by used modification (BS3 cross-linker or charge tag). CONCLUSIONS The effectiveness of the mass spectrometric approach including covalent modification for exploring protein-protein interaction sites has been demonstrated. The alteration of the reactivity of functionally important amino acid residues in the sBBI molecule is most likely related to changes in their microenvironment. It has been suggested that in the presence of charge tags fragmentation in LIFT mode proceeds through the formation of salt bridges between quaternary ammonium groups and acidic residues due to the occurrence of zwitterions (including basic and acidic residues). Despite the presence of one or several charge tags, fragmentation takes place yielding modulated bi /yj ion series depending on the positions of the tags.
Collapse
|
36
|
Abstract
Deciphering the information encoded in the human genome is key for the further understanding of human biology, physiology and evolution. With the draft sequence of the human genome completed, elucidation of the epigenetic information layer of the human genome becomes accessible. Epigenetic mechanisms are mediated by either chemical modifications of the DNA itself or by modifications of proteins that are closely associated with DNA. Defects of the epigenetic regulation involved in processes such as imprinting, X chromosome inactivation, transcriptional control of genes, as well as mutations affecting DNA methylation enzymes, contribute fundamentally to the etiology of many human diseases. Headed by the Human Epigenome Consortium, the Human Epigenome Project is a joint effort by an international collaboration that aims to identify, catalog and interpret genome-wide DNA methylation patterns of all human genes in all major tissues. Methylation variable positions are thought to reflect gene activity, tissue type and disease state, and are useful epigenetic markers revealing the dynamic state of the genome. Like single nucleotide polymorphisms, methylation variable positions will greatly advance our ability to elucidate and diagnose the molecular basis of human diseases.
Collapse
|
37
|
ERBB2 in cat mammary neoplasias disclosed a positive correlation between RNA and protein low expression levels: a model for erbB-2 negative human breast cancer. PLoS One 2013; 8:e83673. [PMID: 24386251 PMCID: PMC3873372 DOI: 10.1371/journal.pone.0083673] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 11/06/2013] [Indexed: 02/01/2023] Open
Abstract
Human ERBB2 is a proto-oncogene that codes for the erbB-2 epithelial growth factor receptor. In human breast cancer (HBC), erbB-2 protein overexpression has been repeatedly correlated with poor prognosis. In more recent works, underexpression of this gene has been described in HBC. Moreover, it is also recognised that oncogenes that are commonly amplified or deleted encompass point mutations, and some of these are associated with HBC. In cat mammary lesions (CMLs), the overexpression of ERBB2 (27%-59.6%) has also been described, mostly at the protein level and although cat mammary neoplasias are considered to be a natural model of HBC, molecular information is still scarce. In the present work, a cat ERBB2 fragment, comprising exons 10 to 15 (ERBB2_10-15) was achieved for the first time. Allelic variants and genomic haplotype analyses were also performed, and differences between normal and CML populations were observed. Three amino acid changes, corresponding to 3 non-synonymous genomic sequence variants that were only detected in CMLs, were proposed to damage the 3D structure of the protein. We analysed the cat ERBB2 gene at the DNA (copy number determination), mRNA (expression levels assessment) and protein levels (in extra- and intra protein domains) in CML samples and correlated the last two evaluations with clinicopathological features. We found a positive correlation between the expression levels of the ERBB2 RNA and erbB-2 protein, corresponding to the intracellular region. Additionally, we detected a positive correlation between higher mRNA expression and better clinical outcome. Our results suggest that the ERBB2 gene is post-transcriptionally regulated and that proteins with truncations and single point mutations are present in cat mammary neoplastic lesions. We would like to emphasise that the recurrent occurrence of low erbB-2 expression levels in cat mammary tumours, suggests the cat mammary neoplasias as a valuable model for erbB-2 negative HBC.
Collapse
|
38
|
Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories. Nat Biotechnol 2013; 31:1015-22. [DOI: 10.1038/nbt.2702] [Citation(s) in RCA: 193] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2013] [Accepted: 08/21/2013] [Indexed: 02/07/2023]
|
39
|
Transcriptome and genome sequencing uncovers functional variation in humans. Nature 2013; 501:506-11. [PMID: 24037378 PMCID: PMC3918453 DOI: 10.1038/nature12531] [Citation(s) in RCA: 1350] [Impact Index Per Article: 122.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 08/05/2013] [Indexed: 02/06/2023]
Abstract
Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of mRNA and miRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project – the first uniformly processed RNA-seq data from multiple human populations with high-quality genome sequences. We discovered extremely widespread genetic variation affecting regulation of the majority of genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on cellular mechanisms of regulatory and loss-of-function variation, and allowed us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome.
Collapse
|
40
|
Whole-genome bisulfite sequencing of two distinct interconvertible DNA methylomes of mouse embryonic stem cells. Cell Stem Cell 2013; 13:360-9. [PMID: 23850244 DOI: 10.1016/j.stem.2013.06.002] [Citation(s) in RCA: 336] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2013] [Revised: 06/03/2013] [Accepted: 06/07/2013] [Indexed: 12/22/2022]
Abstract
The use of two kinase inhibitors (2i) enables derivation of mouse embryonic stem cells (ESCs) in the pluripotent ground state. Using whole-genome bisulfite sequencing (WGBS), we show that male 2i ESCs are globally hypomethylated compared to conventional ESCs maintained in serum. In serum, female ESCs are hypomethyated similarly to male ESCs in 2i, and DNA methylation is further reduced in 2i. Regions with elevated DNA methylation in 2i strongly correlate with the presence of H3K9me3 on endogenous retroviruses (ERVs) and imprinted loci. The methylome of male ESCs in serum parallels postimplantation blastocyst cells, while 2i stalls ESCs in a hypomethylated, ICM-like state. WGBS analysis during adaptation of 2i ESCs to serum suggests that deposition of DNA methylation is largely random, while loss of DNA methylation during reversion to 2i occurs passively, initiating at TET1 binding sites. Together, our analysis provides insight into DNA methylation dynamics in cultured ESCs paralleling early developmental processes.
Collapse
|
41
|
The genome sequencing of an albino Western lowland gorilla reveals inbreeding in the wild. BMC Genomics 2013; 14:363. [PMID: 23721540 PMCID: PMC3673836 DOI: 10.1186/1471-2164-14-363] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2013] [Accepted: 05/23/2013] [Indexed: 11/28/2022] Open
Abstract
Background The only known albino gorilla, named Snowflake, was a male wild born individual from Equatorial Guinea who lived at the Barcelona Zoo for almost 40 years. He was diagnosed with non-syndromic oculocutaneous albinism, i.e. white hair, light eyes, pink skin, photophobia and reduced visual acuity. Despite previous efforts to explain the genetic cause, this is still unknown. Here, we study the genetic cause of his albinism and making use of whole genome sequencing data we find a higher inbreeding coefficient compared to other gorillas. Results We successfully identified the causal genetic variant for Snowflake’s albinism, a non-synonymous single nucleotide variant located in a transmembrane region of SLC45A2. This transporter is known to be involved in oculocutaneous albinism type 4 (OCA4) in humans. We provide experimental evidence that shows that this amino acid replacement alters the membrane spanning capability of this transmembrane region. Finally, we provide a comprehensive study of genome-wide patterns of autozygogosity revealing that Snowflake’s parents were related, being this the first report of inbreeding in a wild born Western lowland gorilla. Conclusions In this study we demonstrate how the use of whole genome sequencing can be extended to link genotype and phenotype in non-model organisms and it can be a powerful tool in conservation genetics (e.g., inbreeding and genetic diversity) with the expected decrease in sequencing cost.
Collapse
|
42
|
High-specificity single-tube multiplex genotyping using Ribo-PAP PCR, tag primers, alkali cleavage of RNA/DNA chimeras and MALDI-TOF MS. Hum Mutat 2012; 34:266-73. [PMID: 23132774 DOI: 10.1002/humu.22227] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2012] [Accepted: 09/18/2012] [Indexed: 11/09/2022]
Abstract
Here, we describe a high-throughput, single-tube, allele-specific ribonucleotide analog pyrophosphorolysis-activated polymerization (ribo-PAP) PCR multiplex genotyping and resequencing method. An RNA/DNA chimeric PCR product is generated using genomic DNA as starting template, a panel of allele-selective 5'-tagged primers, a reverse primer, one nucleotide in the ribo-form (90-100%), the other nucleotides in the deoxy-form, a DNA polymerase capable of incorporating ribonucleotides, a suitable buffer and thermal cycling. The RNA/DNA chimeric PCR products are fragmented by treatment with alkali and analyzed by mass spectrometry. All allele-selective primers have a 5' repetitive motif where each repeat unit has a unique, distinct mass upon reverse copying and alkali fragmentation. The mass of the complement repeat fragment or flag identifies the primer or primers that were recruited in the ribo-PAP PCR. The method readily identifies homozygous and heterozygous positions in simplex or duplex ribo-PAP PCR. Many different tags can be analyzed simultaneously. The assay can genotype several SNPs in a single tube. It thus constitutes the simplest genotyping protocol with multiplex analysis. This novel genotyping and resequencing protocol was applied to different genomic loci: NOS1 and H19 in 30 individuals in simplex ribo-PAP PCR and at two SLCO1B1 loci in 95 individuals in duplex ribo-PAP PCR.
Collapse
|
43
|
Ribo-polymerase chain reaction--a facile method for the preparation of chimeric RNA/DNA applied to DNA sequencing. Hum Mutat 2012; 33:1010-5. [PMID: 22354560 DOI: 10.1002/humu.22061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2011] [Accepted: 02/14/2012] [Indexed: 11/06/2022]
Abstract
We describe ribo-polymerase chain reaction (PCR), a method for the preparation of chimeric RNA/DNA. The RNA/DNA chimeric nucleic acids are generated directly from genomic DNA starting templates with two locus-specific primers, three nucleotides in their deoxy form and the fourth in its ribo form, a DNA polymerase capable of incorporating ribo bases, a suitable buffer, and thermal cycling. We have applied ribo-PCR to resequence DNA by directly fragmenting the RNA/DNA chimeras with alkali and analyzing the fragments by mass spectrometry (MS). Mass fingerprint is used to identify deviations from the reference sequence. This method readily detects homozygous sequence deviations as well as heterozygous positions directly from genomic DNA samples. With the high-throughput capability of MS, this facile method is well suited for screening DNA sequences of limited regions of the genome in a large number of individuals. It can also be used to sequence multiple distant genomic loci in a single reaction. This novel ribo-PCR resequencing protocol was applied to different genomic loci involving nitric oxide synthase 1 (NOS1) and H19 in 30 individuals and SLCO1B1 in 95 individuals.
Collapse
|
44
|
Identification of IL7RA Risk Alleles for Rapid Progression During HIV-1 Infection: A Comprehensive Study in the GRIV Cohort. Curr HIV Res 2012; 10:143-50. [DOI: 10.2174/157016212799937209] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2011] [Revised: 11/22/2011] [Accepted: 12/23/2011] [Indexed: 11/22/2022]
|
45
|
Tuning of natural killer cell reactivity by NKp46 and Helios calibrates T cell responses. Science 2012; 335:344-8. [PMID: 22267813 DOI: 10.1126/science.1215621] [Citation(s) in RCA: 154] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Natural killer (NK) cells are lymphocytes involved in antimicrobial and antitumoral immune responses. Using N-ethyl-N-nitrosourea mutagenesis in mice, we identified a mutant with increased resistance to viral infections because of the presence of hyperresponsive NK cells. Whole-genome sequencing and functional analysis revealed a loss-of-function mutation in the Ncr1 gene encoding the activating receptor NKp46. The down-regulation of NK cell activity by NKp46 was associated with the silencing of the Helios transcription factor in NK cells. NKp46 was critical for the subsequent development of antiviral and antibacterial T cell responses, which suggests that the regulation of NK cell function by NKp46 allows for the optimal development of adaptive immune responses. NKp46 blockade enhanced NK cell reactivity in vivo, which could enable the design of immunostimulation strategies in humans.
Collapse
|
46
|
TWIST1 Gene: First Insights in Felis catus. Curr Genomics 2011; 11:212-20. [PMID: 21037858 PMCID: PMC2878985 DOI: 10.2174/138920210791110933] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2009] [Revised: 02/22/2010] [Accepted: 02/24/2010] [Indexed: 11/22/2022] Open
Abstract
TWIST1 is thought to be a novel oncogene. Understanding the molecular mechanisms regulating the TWIST1 gene expression profiles in tumor cells may give new insights regarding prognostic factors and novel therapeutic targets in veterinary oncology. In the present study we partially isolated the TWIST1 gene in Felis catus and performed comparative studies. Several primer combinations were used based on the alignments of homologous DNA sequences. After PCR amplification, three bands were obtained, purified and sequenced. Several bioinformatic tools were utilized to carry out the comparative studies. Higher similarity was found between the isolated TWIST1 gene in Felis catus and Homo sapiens (86%) than between Homo sapiens and Rattus norvegicus or Mus musculus (75%). Partial amino acid sequence showed no change in the four species analyzed. This confirmed that coding sequences presented high similarity (~96%) between man and cat. These results give the first insights regarding the TWIST1 gene in cat but further studies are required in order to establish, or not, its role in tumor formation and progression in veterinary oncology.
Collapse
|
47
|
A genome-wide association study of upper aerodigestive tract cancers conducted within the INHANCE consortium. PLoS Genet 2011; 7:e1001333. [PMID: 21437268 PMCID: PMC3060072 DOI: 10.1371/journal.pgen.1001333] [Citation(s) in RCA: 136] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2010] [Accepted: 02/11/2011] [Indexed: 11/18/2022] Open
Abstract
Genome-wide association studies (GWAS) have been successful in identifying common genetic variation involved in susceptibility to etiologically complex disease. We conducted a GWAS to identify common genetic variation involved in susceptibility to upper aero-digestive tract (UADT) cancers. Genome-wide genotyping was carried out using the Illumina HumanHap300 beadchips in 2,091 UADT cancer cases and 3,513 controls from two large European multi-centre UADT cancer studies, as well as 4,821 generic controls. The 19 top-ranked variants were investigated further in an additional 6,514 UADT cancer cases and 7,892 controls of European descent from an additional 13 UADT cancer studies participating in the INHANCE consortium. Five common variants presented evidence for significant association in the combined analysis (p ≤ 5 × 10⁻⁷). Two novel variants were identified, a 4q21 variant (rs1494961, p = 1×10⁻⁸) located near DNA repair related genes HEL308 and FAM175A (or Abraxas) and a 12q24 variant (rs4767364, p =2 × 10⁻⁸) located in an extended linkage disequilibrium region that contains multiple genes including the aldehyde dehydrogenase 2 (ALDH2) gene. Three remaining variants are located in the ADH gene cluster and were identified previously in a candidate gene study involving some of these samples. The association between these three variants and UADT cancers was independently replicated in 5,092 UADT cancer cases and 6,794 controls non-overlapping samples presented here (rs1573496-ADH7, p = 5 × 10⁻⁸); rs1229984-ADH1B, p = 7 × 10⁻⁹; and rs698-ADH1C, p = 0.02). These results implicate two variants at 4q21 and 12q24 and further highlight three ADH variants in UADT cancer susceptibility.
Collapse
|
48
|
Abstract
BACKGROUND Susceptibility to asthma is influenced by genes and environment; implicated genes may indicate pathways for therapeutic intervention. Genetic risk factors may be useful in identifying subtypes of asthma and determining whether intermediate phenotypes, such as elevation of the total serum IgE level, are causally linked to disease. METHODS We carried out a genomewide association study by genotyping 10,365 persons with physician-diagnosed asthma and 16,110 unaffected persons, all of whom were matched for ancestry. We used random-effects pooled analysis to test for association in the overall study population and in subgroups of subjects with childhood-onset asthma (defined as asthma developing before 16 years of age), later-onset asthma, severe asthma, and occupational asthma. RESULTS We observed associations of genomewide significance between asthma and the following single-nucleotide polymorphisms: rs3771166 on chromosome 2, implicating IL1RL1/IL18R1 (P=3×10(−9)); rs9273349 on chromosome 6, implicating HLA-DQ (P=7×10(−14)); rs1342326 on chromosome 9, flanking IL33 (P=9×10(−10)); rs744910 on chromosome 15 in SMAD3 (P=4×10(−9)); and rs2284033 on chromosome 22 in IL2RB (P=1.1×10(−8)). Association with the ORMDL3/GSDMB locus on chromosome 17q21 was specific to childhood-onset disease (rs2305480, P=6×10(−23)). Only HLA-DR showed a significant genomewide association with the total serum IgE concentration, and loci strongly associated with IgE levels were not associated with asthma. CONCLUSIONS Asthma is genetically heterogeneous. A few common alleles are associated with disease risk at all ages. Implicated genes suggest a role for communication of epithelial damage to the adaptive immune system and activation of airway inflammation. Variants at the ORMDL3/GSDMB locus are associated only with childhood-onset disease. Elevation of total serum IgE levels has a minor role in the development of asthma. (Funded by the European Commission and others.)
Collapse
|
49
|
2nd generation DNA sequencing meets functional genomics. Aging (Albany NY) 2010; 2:541. [PMID: 20864776 PMCID: PMC2984602 DOI: 10.18632/aging.100199] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2010] [Accepted: 09/11/2010] [Indexed: 05/29/2023]
|
50
|
DNA methylation profiling in doxorubicin treated primary locally advanced breast tumours identifies novel genes associated with survival and treatment response. Mol Cancer 2010; 9:68. [PMID: 20338046 PMCID: PMC2861056 DOI: 10.1186/1476-4598-9-68] [Citation(s) in RCA: 95] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2009] [Accepted: 03/25/2010] [Indexed: 12/24/2022] Open
Abstract
Background Breast cancer is the most frequent cancer in women and consists of a heterogeneous collection of diseases with distinct histopathological, genetic and epigenetic characteristics. In this study, we aimed to identify DNA methylation based biomarkers to distinguish patients with locally advanced breast cancer who may benefit from neoadjuvant doxorubicin treatment. Results We investigated quantitatively the methylation patterns in the promoter regions of 14 genes (ABCB1, ATM, BRCA1, CDH3, CDKN2A, CXCR4, ESR1, FBXW7, FOXC1, GSTP1, IGF2, HMLH1, PPP2R2B, and PTEN) in 75 well-described pre-treatment samples from locally advanced breast cancer and correlated the results to the available clinical and molecular parameters. Six normal breast tissues were used as controls and 163 unselected breast cancer cases were used to validate associations with histopathological and clinical parameters. Aberrant methylation was detected in 9 out of the 14 genes including the discovery of methylation at the FOXC1 promoter. Absence of methylation at the ABCB1 promoter correlated with progressive disease during doxorubicin treatment. Most importantly, the DNA methylation status at the promoters of GSTP1, FOXC1 and ABCB1 correlated with survival, whereby the combination of methylated genes improved the subdivision with respect to the survival of the patients. In multivariate analysis GSTP1 and FOXC1 methylation status proved to be independent prognostic markers associated with survival. Conclusions Quantitative DNA methylation profiling is a powerful tool to identify molecular changes associated with specific phenotypes. Methylation at the ABCB1 or GSTP1 promoter improved overall survival probably due to prolonged availability and activity of the drug in the cell while FOXC1 methylation might be a protective factor against tumour invasiveness. FOXC1 proved to be general prognostic factor, while ABCB1 and GSTP1 might be predictive factors for the response to and efficacy of doxorubicin treatment. Pharmacoepigenetic effects such as the reported associations in this study provide molecular explanations for differential responses to chemotherapy and it might prove valuable to take the methylation status of selected genes into account for patient management and treatment decisions.
Collapse
|