1
|
|
2
|
Author Correction: Genomic basis for RNA alterations in cancer. Nature 2023; 614:E37. [PMID: 36697831 PMCID: PMC9931574 DOI: 10.1038/s41586-022-05596-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
3
|
Organ-specific metabolic pathways distinguish prediabetes, type 2 diabetes, and normal tissues. CELL REPORTS MEDICINE 2022; 3:100763. [PMID: 36198307 PMCID: PMC9589007 DOI: 10.1016/j.xcrm.2022.100763] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/15/2021] [Revised: 07/02/2022] [Accepted: 09/13/2022] [Indexed: 11/28/2022]
Abstract
Environmental and genetic factors cause defects in pancreatic islets driving type 2 diabetes (T2D) together with the progression of multi-tissue insulin resistance. Mass spectrometry proteomics on samples from five key metabolic tissues of a cross-sectional cohort of 43 multi-organ donors provides deep coverage of their proteomes. Enrichment analysis of Gene Ontology terms provides a tissue-specific map of altered biological processes across healthy, prediabetes (PD), and T2D subjects. We find widespread alterations in several relevant biological pathways, including increase in hemostasis in pancreatic islets of PD, increase in the complement cascade in liver and pancreatic islets of PD, and elevation in cholesterol biosynthesis in liver of T2D. Our findings point to inflammatory, immune, and vascular alterations in pancreatic islets in PD that are hypotheses to be tested for potential contributions to hormonal perturbations such as impaired insulin and increased glucagon production. This multi-tissue proteomic map suggests tissue-specific metabolic dysregulations in T2D.
Collapse
|
4
|
Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data. Sci Rep 2022; 12:7433. [PMID: 35523803 PMCID: PMC9076598 DOI: 10.1038/s41598-022-10853-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Accepted: 04/13/2022] [Indexed: 11/25/2022] Open
Abstract
Transcriptomic analyses are commonly used to identify differentially expressed genes between patients and controls, or within individuals across disease courses. These methods, whilst effective, cannot encompass the combinatorial effects of genes driving disease. We applied rule-based machine learning (RBML) models and rule networks (RN) to an existing paediatric Systemic Lupus Erythematosus (SLE) blood expression dataset, with the goal of developing gene networks to separate low and high disease activity (DA1 and DA3). The resultant model had an 81% accuracy to distinguish between DA1 and DA3, with unsupervised hierarchical clustering revealing additional subgroups indicative of the immune axis involved or state of disease flare. These subgroups correlated with clinical variables, suggesting that the gene sets identified may further the understanding of gene networks that act in concert to drive disease progression. This included roles for genes (i) induced by interferons (IFI35 and OTOF), (ii) key to SLE cell types (KLRB1 encoding CD161), or (iii) with roles in autophagy and NF-κB pathway responses (CKAP4). As demonstrated here, RBML approaches have the potential to reveal novel gene patterns from within a heterogeneous disease, facilitating patient clinical and therapeutic stratification.
Collapse
|
5
|
Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment. Cancers (Basel) 2022; 14:1014. [PMID: 35205761 PMCID: PMC8870250 DOI: 10.3390/cancers14041014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 02/09/2022] [Accepted: 02/14/2022] [Indexed: 02/04/2023] Open
Abstract
Gliomas develop and grow in the brain and central nervous system. Examining glioma grading processes is valuable for improving therapeutic challenges. One of the most extensive repositories storing transcriptomics data for gliomas is The Cancer Genome Atlas (TCGA). However, such big cohorts should be processed with caution and evaluated thoroughly as they can contain batch and other effects. Furthermore, biological mechanisms of cancer contain interactions among biomarkers. Thus, we applied an interpretable machine learning approach to discover such relationships. This type of transparent learning provides not only good predictability, but also reveals co-predictive mechanisms among features. In this study, we corrected the strong and confounded batch effect in the TCGA glioma data. We further used the corrected datasets to perform comprehensive machine learning analysis applied on single-sample gene set enrichment scores using collections from the Molecular Signature Database. Furthermore, using rule-based classifiers, we displayed networks of co-enrichment related to glioma grades. Moreover, we validated our results using the external glioma cohorts. We believe that utilizing corrected glioma cohorts from TCGA may improve the application and validation of any future studies. Finally, the co-enrichment and survival analysis provided detailed explanations for glioma progression and consequently, it should support the targeted treatment.
Collapse
|
6
|
MetaFetcheR: An R Package for Complete Mapping of Small-Compound Data. Metabolites 2021; 11:metabo11110743. [PMID: 34822401 PMCID: PMC8620779 DOI: 10.3390/metabo11110743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 10/22/2021] [Accepted: 10/27/2021] [Indexed: 11/20/2022] Open
Abstract
Small-compound databases contain a large amount of information for metabolites and metabolic pathways. However, the plethora of such databases and the redundancy of their information lead to major issues with analysis and standardization. A lack of preventive establishment of means of data access at the infant stages of a project might lead to mislabelled compounds, reduced statistical power, and large delays in delivery of results. We developed MetaFetcheR, an open-source R package that links metabolite data from several small-compound databases, resolves inconsistencies, and covers a variety of use-cases of data fetching. We showed that the performance of MetaFetcheR was superior to existing approaches and databases by benchmarking the performance of the algorithm in three independent case studies based on two published datasets.
Collapse
|
7
|
The Thioesterase ACOT1 as a Regulator of Lipid Metabolism in Type 2 Diabetes Detected in a Multi-Omics Study of Human Liver. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2021; 25:652-659. [PMID: 34520261 PMCID: PMC8812507 DOI: 10.1089/omi.2021.0093] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Type 2 diabetes (T2D) is characterized by pathophysiological alterations in lipid metabolism. One strategy to understand the molecular mechanisms behind these abnormalities is to identify cis-regulatory elements (CREs) located in chromatin-accessible regions of the genome that regulate key genes. In this study we integrated assay for transposase-accessible chromatin followed by sequencing (ATAC-seq) data, widely used to decode chromatin accessibility, with multi-omics data and publicly available CRE databases to identify candidate CREs associated with T2D for further experimental validations. We performed high-sensitive ATAC-seq in nine human liver samples from normal and T2D donors, and identified a set of differentially accessible regions (DARs). We identified seven DARs including a candidate enhancer for the ACOT1 gene that regulates the balance of acyl-CoA and free fatty acids (FFAs) in the cytoplasm. The relevance of ACOT1 regulation in T2D was supported by the analysis of transcriptomics and proteomics data in liver tissue. Long-chain acyl-CoA thioesterases (ACOTs) are a group of enzymes that hydrolyze acyl-CoA esters to FFAs and coenzyme A. ACOTs have been associated with regulation of triglyceride levels, fatty acid oxidation, mitochondrial function, and insulin signaling, linking their regulation to the pathogenesis of T2D. Our strategy integrating chromatin accessibility with DNA binding and other types of omics provides novel insights on the role of genetic regulation in T2D and is extendable to other complex multifactorial diseases.
Collapse
|
8
|
Mapping chromatin accessibility and active regulatory elements reveals pathological mechanisms in human gliomas. Nat Commun 2021; 12:3621. [PMID: 34131149 PMCID: PMC8206121 DOI: 10.1038/s41467-021-23922-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 05/19/2021] [Indexed: 12/15/2022] Open
Abstract
Chromatin structure and accessibility, and combinatorial binding of transcription factors to regulatory elements in genomic DNA control transcription. Genetic variations in genes encoding histones, epigenetics-related enzymes or modifiers affect chromatin structure/dynamics and result in alterations in gene expression contributing to cancer development or progression. Gliomas are brain tumors frequently associated with epigenetics-related gene deregulation. We perform whole-genome mapping of chromatin accessibility, histone modifications, DNA methylation patterns and transcriptome analysis simultaneously in multiple tumor samples to unravel epigenetic dysfunctions driving gliomagenesis. Based on the results of the integrative analysis of the acquired profiles, we create an atlas of active enhancers and promoters in benign and malignant gliomas. We explore these elements and intersect with Hi-C data to uncover molecular mechanisms instructing gene expression in gliomas.
Collapse
|
9
|
Multifaceted regulation of hepatic lipid metabolism by YY1. Life Sci Alliance 2021; 4:4/7/e202000928. [PMID: 34099540 PMCID: PMC8200296 DOI: 10.26508/lsa.202000928] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 05/25/2021] [Accepted: 05/25/2021] [Indexed: 12/20/2022] Open
Abstract
This study shows that YY1 regulates hepatic lipid metabolism by directly or indirectly regulating the expression of several key upstream transcription factors and their coactivators. Recent studies suggested that dysregulated YY1 plays a pivotal role in many liver diseases. To obtain a detailed view of genes and pathways regulated by YY1 in the liver, we carried out RNA sequencing in HepG2 cells after YY1 knockdown. A rigid set of 2,081 differentially expressed genes was identified by comparing the YY1-knockdown samples (n = 8) with the control samples (n = 14). YY1 knockdown significantly decreased the expression of several key transcription factors and their coactivators in lipid metabolism. This is illustrated by YY1 regulating PPARA expression through binding to its promoter and enhancer regions. Our study further suggest that down-regulation of the key transcription factors together with YY1 knockdown significantly decreased the cooperation between YY1 and these transcription factors at various regulatory regions, which are important in regulating the expression of genes in hepatic lipid metabolism. This was supported by the finding that the expression of SCD and ELOVL6, encoding key enzymes in lipogenesis, were regulated by the cooperation between YY1 and PPARA/RXRA complex over their promoters.
Collapse
|
10
|
Abstract
BACKGROUND Machine learning involves strategies and algorithms that may assist bioinformatics analyses in terms of data mining and knowledge discovery. In several applications, viz. in Life Sciences, it is often more important to understand how a prediction was obtained rather than knowing what prediction was made. To this end so-called interpretable machine learning has been recently advocated. In this study, we implemented an interpretable machine learning package based on the rough set theory. An important aim of our work was provision of statistical properties of the models and their components. RESULTS We present the R.ROSETTA package, which is an R wrapper of ROSETTA framework. The original ROSETTA functions have been improved and adapted to the R programming environment. The package allows for building and analyzing non-linear interpretable machine learning models. R.ROSETTA gathers combinatorial statistics via rule-based modelling for accessible and transparent results, well-suited for adoption within the greater scientific community. The package also provides statistics and visualization tools that facilitate minimization of analysis bias and noise. The R.ROSETTA package is freely available at https://github.com/komorowskilab/R.ROSETTA . To illustrate the usage of the package, we applied it to a transcriptome dataset from an autism case-control study. Our tool provided hypotheses for potential co-predictive mechanisms among features that discerned phenotype classes. These co-predictors represented neurodevelopmental and autism-related genes. CONCLUSIONS R.ROSETTA provides new insights for interpretable machine learning analyses and knowledge-based systems. We demonstrated that our package facilitated detection of dependencies for autism-related genes. Although the sample application of R.ROSETTA illustrates transcriptome data analysis, the package can be used to analyze any data organized in decision tables.
Collapse
|
11
|
A non-coding cancer mutation disrupting an HNF4α binding motif affects an enhancer regulating genes associated to the progression of liver cancer. Exp Oncol 2021; 43:2-6. [PMID: 33785712 DOI: 10.32471/exp-oncology.2312-8852.vol-43-no-1.15925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
BACKGROUND Somatic mutations in coding regions of the genome may result in non-functional proteins that can lead to cancer or other diseases, however cancer mutations in the non-coding regions have rarely been studied and the interpretation of their effects is difficult. Non-coding mutations might act by breaking or creating transcription factor binding motifs in promoters, enhancers or silencers resulting in altered expression of target gene(s). A high number of mutations have been reported in coding and non-coding regions in cells of liver cancer. Hepatocyte nuclear factor 4α is a transcription factor that regulates the expression of several genes in liver cells, while the motifs it binds are frequently mutated in promoters and enhancers in liver cancer. AIM The aim of the study is to evaluate the genetic effects of a non-coding somatic mutation frequently observed in liver cancer. MATERIALS AND METHODS We evaluated experimentally the effects of a somatic mutation frequently reported in liver cancer as a motif-breaker for the binding of hepatocyte nuclear factor 4α. The effects of the mutation on protein binding and enhancer activity were studied in HepG2 cells via electrophoresis mobility shift assay and dual luciferase reporter assays. We also studied genome-wide promoter-enhancer interactions performing targeted chromosome conformation capture in liver tissue to identify putative target genes whose expression could be altered by the mutation. RESULTS We found that the mutation leads to reduced protein binding and a decrease in enhancer activity. The enhancer harboring the mutation interacts with the promoters of ANAPC13, MAP6D1 and MUC13, which have been implicated in liver cancer. CONCLUSIONS The study highlights the importance of non-coding somatic mutations, vastly understudied, but likely to contribute to cancer development and progression.
Collapse
|
12
|
Nucleolar rDNA folds into condensed foci with a specific combination of epigenetic marks. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 105:1534-1548. [PMID: 33314374 DOI: 10.1111/tpj.15130] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 11/30/2020] [Indexed: 05/15/2023]
Abstract
Arabidopsis thaliana 45S ribosomal genes (rDNA) are located in tandem arrays called nucleolus organizing regions on the termini of chromosomes 2 and 4 (NOR2 and NOR4) and encode rRNA, a crucial structural element of the ribosome. The current model of rDNA organization suggests that inactive rRNA genes accumulate in the condensed chromocenters in the nucleus and at the nucleolar periphery, while the nucleolus delineates active genes. We challenge the perspective that all intranucleolar rDNA is active by showing that a subset of nucleolar rDNA assembles into condensed foci marked by H3.1 and H3.3 histones that also contain the repressive H3K9me2 histone mark. By using plant lines containing a low number of rDNA copies, we further found that the condensed foci relate to the folding of rDNA, which appears to be a common mechanism of rDNA regulation inside the nucleolus. The H3K9me2 histone mark found in condensed foci represents a typical modification of bulk inactive rDNA, as we show by genome-wide approaches, similar to the H2A.W histone variant. The euchromatin histone marks H3K27me3 and H3K4me3, in contrast, do not colocalize with nucleolar foci and their overall levels in the nucleolus are very low. We further demonstrate that the rDNA promoter is an important regulatory region of the rDNA, where the distribution of histone variants and histone modifications are modulated in response to rDNA activity.
Collapse
|
13
|
Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder. Front Genet 2021; 12:618277. [PMID: 33719335 PMCID: PMC7946989 DOI: 10.3389/fgene.2021.618277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 01/12/2021] [Indexed: 01/16/2023] Open
Abstract
Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear methods that provide interpretable solutions. Interpretable machine learning provides legible models that allow explaining biological mechanisms and support analysis of clinical subgroups. In this work, we investigated several case-control studies of gene expression measurements of ASD individuals. We constructed a rule-based learning model from three independent datasets that we further visualized as a nonlinear gene-gene co-predictive network. To find dissimilarities between ASD subtypes, we scrutinized a topological structure of the network and estimated a centrality distance. Our analysis revealed that autism is the most severe subtype of ASD, while pervasive developmental disorder-not otherwise specified and Asperger syndrome are closely related and milder ASD subtypes. Furthermore, we analyzed the most important ASD-related features that were described in terms of gene co-predictors. Among others, we found a strong co-predictive mechanism between EMC4 and TMEM30A, which may suggest a co-regulation between these genes. The present study demonstrates the potential of applying interpretable machine learning in bioinformatics analyses. Although the proposed methodology was designed for transcriptomics data, it can be applied to other omics disciplines.
Collapse
|
14
|
Single nucleus transcriptomics data integration recapitulates the major cell types in human liver. Hepatol Res 2021; 51:233-238. [PMID: 33119937 DOI: 10.1111/hepr.13585] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 10/02/2020] [Accepted: 10/20/2020] [Indexed: 12/13/2022]
Abstract
AIM The aim of this study was to explore the benefits of data integration from different platforms for single nucleus transcriptomics profiling to characterize cell populations in human liver. METHODS We generated single-nucleus RNA sequencing data from Chromium 10X Genomics and Drop-seq for a human liver sample. We utilized state of the art bioinformatics tools to undertake a rigorous quality control and to integrate the data into a common space summarizing the gene expression variation from the respective platforms, while accounting for known and unknown confounding factors. RESULTS Analysis of single nuclei transcriptomes from both 10X and Drop-seq allowed identification of the major liver cell types, while the integrated set obtained enough statistical power to separate a small population of inactive hepatic stellate cells that was not characterized in either of the platforms. CONCLUSIONS Integration of droplet-based single nucleus transcriptomics data enabled identification of a small cluster of inactive hepatic stellate cells that highlights the potential of our approach. We suggest single-nucleus RNA sequencing integrative approaches could be utilized to design larger and cost-effective studies.
Collapse
|
15
|
Decreased melanocyte counts in the upper hair follicle in frontal fibrosing alopecia compared to lichen planopilaris: a retrospective histopathologic study. J Eur Acad Dermatol Venereol 2021; 35:e343-e345. [PMID: 33332678 DOI: 10.1111/jdv.17093] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
16
|
A Multi-Omics Approach to Liver Diseases: Integration of Single Nuclei Transcriptomics with Proteomics and HiCap Bulk Data in Human Liver. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2020; 24:180-194. [PMID: 32181701 PMCID: PMC7185313 DOI: 10.1089/omi.2019.0215] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The liver is the largest solid organ and a primary metabolic hub. In recent years, intact cell nuclei were used to perform single-nuclei RNA-seq (snRNA-seq) for tissues difficult to dissociate and for flash-frozen archived tissue samples to discover unknown and rare cell subpopulations. In this study, we performed snRNA-seq of a liver sample to identify subpopulations of cells based on nuclear transcriptomics. In 4282 single nuclei, we detected, on average, 1377 active genes and we identified seven major cell types. We integrated data from 94,286 distal interactions (p < 0.05) for 7682 promoters from a targeted chromosome conformation capture technique (HiCap) and mass spectrometry proteomics for the same liver sample. We observed a reasonable correlation between proteomics and in silico bulk snRNA-seq (r = 0.47) using tissue-independent gene-specific protein abundancy estimation factors. We specifically looked at genes of medical importance. The DPYD gene is involved in the pharmacogenetics of fluoropyrimidine toxicity and some of its variants are analyzed for clinical purposes. We identified a new putative polymorphic regulatory element, which may contribute to variation in toxicity. Hepatocellular carcinoma (HCC) is the most common type of primary liver cancer and we investigated all known risk genes. We identified a complex regulatory landscape for the SLC2A2 gene with 16 candidate enhancers. Three of them harbor somatic motif breaking and other mutations in HCC in the Pan Cancer Analysis of Whole Genomes dataset and are candidates to contribute to malignancy. Our results highlight the potential of a multi-omics approach in the study of human diseases.
Collapse
|
17
|
Abstract
Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale1-3. Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4-5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter4; identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation5,6; analyses timings and patterns of tumour evolution7; describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity8,9; and evaluates a range of more-specialized features of cancer genomes8,10-18.
Collapse
|
18
|
Abstract
The discovery of drivers of cancer has traditionally focused on protein-coding genes1-4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium5 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers6,7, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available.
Collapse
|
19
|
Intra- and inter-individual metabolic profiling highlights carnitine and lysophosphatidylcholine pathways as key molecular defects in type 2 diabetes. Sci Rep 2019; 9:9653. [PMID: 31273253 PMCID: PMC6609645 DOI: 10.1038/s41598-019-45906-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Accepted: 06/07/2019] [Indexed: 01/22/2023] Open
Abstract
Type 2 diabetes (T2D) mellitus is a complex metabolic disease commonly caused by insulin resistance in several tissues. We performed a matched two-dimensional metabolic screening in tissue samples from 43 multi-organ donors. The intra-individual analysis was assessed across five key metabolic tissues (serum, visceral adipose tissue, liver, pancreatic islets and skeletal muscle), and the inter-individual across three different groups reflecting T2D progression. We identified 92 metabolites differing significantly between non-diabetes and T2D subjects. In diabetes cases, carnitines were significantly higher in liver, while lysophosphatidylcholines were significantly lower in muscle and serum. We tracked the primary tissue of origin for multiple metabolites whose alterations were reflected in serum. An investigation of three major stages spanning from controls, to pre-diabetes and to overt T2D indicated that a subset of lysophosphatidylcholines was significantly lower in the muscle of pre-diabetes subjects. Moreover, glycodeoxycholic acid was significantly higher in liver of pre-diabetes subjects while additional increase in T2D was insignificant. We confirmed many previously reported findings and substantially expanded on them with altered markers for early and overt T2D. Overall, the analysis of this unique dataset can increase the understanding of the metabolic interplay between organs in the development of T2D.
Collapse
|
20
|
Is there a pathogenetic link between frontal fibrosing alopecia, androgenetic alopecia and fibrosing alopecia in a pattern distribution? J Eur Acad Dermatol Venereol 2018; 32:e218-e220. [DOI: 10.1111/jdv.14748] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
21
|
Maps of context-dependent putative regulatory regions and genomic signal interactions. Nucleic Acids Res 2016; 44:9110-9120. [PMID: 27625394 PMCID: PMC5100580 DOI: 10.1093/nar/gkw800] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Accepted: 08/31/2016] [Indexed: 12/24/2022] Open
Abstract
Gene transcription is regulated mainly by transcription factors (TFs). ENCODE and Roadmap Epigenomics provide global binding profiles of TFs, which can be used to identify regulatory regions. To this end we implemented a method to systematically construct cell-type and species-specific maps of regulatory regions and TF-TF interactions. We illustrated the approach by developing maps for five human cell-lines and two other species. We detected ∼144k putative regulatory regions among the human cell-lines, with the majority of them being ∼300 bp. We found ∼20k putative regulatory elements in the ENCODE heterochromatic domains suggesting a large regulatory potential in the regions presumed transcriptionally silent. Among the most significant TF interactions identified in the heterochromatic regions were CTCF and the cohesin complex, which is in agreement with previous reports. Finally, we investigated the enrichment of the obtained putative regulatory regions in the 3D chromatin domains. More than 90% of the regions were discovered in the 3D contacting domains. We found a significant enrichment of GWAS SNPs in the putative regulatory regions. These significant enrichments provide evidence that the regulatory regions play a crucial role in the genomic structural stability. Additionally, we generated maps of putative regulatory regions for prostate and colorectal cancer human cell-lines.
Collapse
|
22
|
A Significant Regulatory Mutation Burden at a High-Affinity Position of the CTCF Motif in Gastrointestinal Cancers. Hum Mutat 2016; 37:904-13. [PMID: 27174533 DOI: 10.1002/humu.23014] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Accepted: 05/03/2016] [Indexed: 12/22/2022]
Abstract
Somatic mutations drive cancer and there are established ways to study those in coding sequences. It has been shown that some regulatory mutations are over-represented in cancer. We develop a new strategy to find putative regulatory mutations based on experimentally established motifs for transcription factors (TFs). In total, we find 1,552 candidate regulatory mutations predicted to significantly reduce binding affinity of many TFs in hepatocellular carcinoma and affecting binding of CTCF also in esophagus, gastric, and pancreatic cancers. Near mutated motifs, there is a significant enrichment of (1) genes mutated in cancer, (2) tumor-suppressor genes, (3) genes in KEGG cancer pathways, and (4) sets of genes previously associated to cancer. Experimental and functional validations support the findings. The strategy can be applied to identify regulatory mutations in any cell type with established TF motifs and will aid identifications of genes contributing to cancer.
Collapse
|
23
|
Discovering Networks of Interdependent Features in High-Dimensional Problems. STUDIES IN BIG DATA 2016. [DOI: 10.1007/978-3-319-26989-4_12] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
24
|
Metabolic syndrome and Mediterranean dietary pattern in a sample of young, male, Greek navy recruits. Nutr Metab Cardiovasc Dis 2009; 19:e7-e8. [PMID: 19477625 DOI: 10.1016/j.numecd.2009.03.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/02/2009] [Revised: 03/08/2009] [Accepted: 03/09/2009] [Indexed: 11/21/2022]
|