1
|
Transcriptional signals of transformation in human cancer. Genome Med 2024; 16:8. [PMID: 38195504 PMCID: PMC10775554 DOI: 10.1186/s13073-023-01279-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 12/18/2023] [Indexed: 01/11/2024] Open
Abstract
BACKGROUND As normal cells transform into cancers, their cell state changes, which may drive cancer cells into a stem-like or more primordial, foetal, or embryonic cell state. The transcriptomic profile of this final state may encode information about cancer's origin and how cancers relate to their normal cell counterparts. METHODS Here, we used single-cell atlases to study cancer transformation in transcriptional terms. We utilised bulk transcriptomes across a wide spectrum of adult and childhood cancers, using a previously established method to interrogate their relationship to normal cell states. We extend and validate these findings using single-cell cancer transcriptomes and organ-specific atlases of colorectal and liver cancer. RESULTS Our bulk transcriptomic data reveals that adult cancers rarely return to an embryonic state, but that a foetal state is a near-universal feature of childhood cancers. This finding was confirmed with single-cell cancer transcriptomes. CONCLUSIONS Our findings provide a nuanced picture of transformation in human cancer, indicating cancer-specific rather than universal patterns of transformation pervade adult epithelial cancers.
Collapse
|
2
|
Yolk sac cell atlas reveals multiorgan functions during human early development. Science 2023; 381:eadd7564. [PMID: 37590359 PMCID: PMC7614978 DOI: 10.1126/science.add7564] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 07/03/2023] [Indexed: 08/19/2023]
Abstract
The extraembryonic yolk sac (YS) ensures delivery of nutritional support and oxygen to the developing embryo but remains ill-defined in humans. We therefore assembled a comprehensive multiomic reference of the human YS from 3 to 8 postconception weeks by integrating single-cell protein and gene expression data. Beyond its recognized role as a site of hematopoiesis, we highlight roles in metabolism, coagulation, vascular development, and hematopoietic regulation. We reconstructed the emergence and decline of YS hematopoietic stem and progenitor cells from hemogenic endothelium and revealed a YS-specific accelerated route to macrophage production that seeds developing organs. The multiorgan functions of the YS are superseded as intraembryonic organs develop, effecting a multifaceted relay of vital functions as pregnancy proceeds.
Collapse
|
3
|
Author Correction: Mapping the temporal and spatial dynamics of the human endometrium in vivo and in vitro. Nat Genet 2023; 55:165. [PMID: 36539619 PMCID: PMC9839447 DOI: 10.1038/s41588-022-01287-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
4
|
Myeloid cell interferon secretion restricts Zika flavivirus infection of developing and malignant human neural progenitor cells. Neuron 2022; 110:3936-3951.e10. [PMID: 36174572 PMCID: PMC7615581 DOI: 10.1016/j.neuron.2022.09.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 08/10/2022] [Accepted: 09/01/2022] [Indexed: 02/02/2023]
Abstract
Zika virus (ZIKV) can infect human developing brain (HDB) progenitors resulting in epidemic microcephaly, whereas analogous cellular tropism offers treatment potential for the adult brain cancer, glioblastoma (GBM). We compared productive ZIKV infection in HDB and GBM primary tissue explants that both contain SOX2+ neural progenitors. Strikingly, although the HDB proved uniformly vulnerable to ZIKV infection, GBM was more refractory, and this correlated with an innate immune expression signature. Indeed, GBM-derived CD11b+ microglia/macrophages were necessary and sufficient to protect progenitors against ZIKV infection in a non-cell autonomous manner. Using SOX2+ GBM cell lines, we found that CD11b+-conditioned medium containing type 1 interferon beta (IFNβ) promoted progenitor resistance to ZIKV, whereas inhibition of JAK1/2 signaling restored productive infection. Additionally, CD11b+ conditioned medium, and IFNβ treatment rendered HDB progenitor lines and explants refractory to ZIKV. These findings provide insight into neuroprotection for HDB progenitors as well as enhanced GBM oncolytic therapies.
Collapse
|
5
|
Abstract
Gonadal development is a complex process that involves sex determination followed by divergent maturation into either testes or ovaries1. Historically, limited tissue accessibility, a lack of reliable in vitro models and critical differences between humans and mice have hampered our knowledge of human gonadogenesis, despite its importance in gonadal conditions and infertility. Here, we generated a comprehensive map of first- and second-trimester human gonads using a combination of single-cell and spatial transcriptomics, chromatin accessibility assays and fluorescent microscopy. We extracted human-specific regulatory programmes that control the development of germline and somatic cell lineages by profiling equivalent developmental stages in mice. In both species, we define the somatic cell states present at the time of sex specification, including the bipotent early supporting population that, in males, upregulates the testis-determining factor SRY and sPAX8s, a gonadal lineage located at the gonadal-mesonephric interface. In females, we resolve the cellular and molecular events that give rise to the first and second waves of granulosa cells that compartmentalize the developing ovary to modulate germ cell differentiation. In males, we identify human SIGLEC15+ and TREM2+ fetal testicular macrophages, which signal to somatic cells outside and inside the developing testis cords, respectively. This study provides a comprehensive spatiotemporal map of human and mouse gonadal differentiation, which can guide in vitro gonadogenesis.
Collapse
|
6
|
Correction: MiR-277/4989 regulate transcriptional landscape during juvenile to adult transition in the parasitic helminth Schistosoma mansoni. PLoS Negl Trop Dis 2022; 16:e0010521. [PMID: 35666710 PMCID: PMC9170109 DOI: 10.1371/journal.pntd.0010521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
|
7
|
Abstract
The cellular landscape of the human intestinal tract is dynamic throughout life, developing in utero and changing in response to functional requirements and environmental exposures. Here, to comprehensively map cell lineages, we use single-cell RNA sequencing and antigen receptor analysis of almost half a million cells from up to 5 anatomical regions in the developing and up to 11 distinct anatomical regions in the healthy paediatric and adult human gut. This reveals the existence of transcriptionally distinct BEST4 epithelial cells throughout the human intestinal tract. Furthermore, we implicate IgG sensing as a function of intestinal tuft cells. We describe neural cell populations in the developing enteric nervous system, and predict cell-type-specific expression of genes associated with Hirschsprung's disease. Finally, using a systems approach, we identify key cell players that drive the formation of secondary lymphoid tissue in early human development. We show that these programs are adopted in inflammatory bowel disease to recruit and retain immune cells at the site of inflammation. This catalogue of intestinal cells will provide new insights into cellular programs in development, homeostasis and disease.
Collapse
|
8
|
Flexible comparison of batch correction methods for single-cell RNA-seq using BatchBench. Nucleic Acids Res 2021; 49:e42. [PMID: 33524142 PMCID: PMC8053088 DOI: 10.1093/nar/gkab004] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 12/11/2020] [Accepted: 01/29/2021] [Indexed: 01/02/2023] Open
Abstract
As the cost of single-cell RNA-seq experiments has decreased, an increasing number of datasets are now available. Combining newly generated and publicly accessible datasets is challenging due to non-biological signals, commonly known as batch effects. Although there are several computational methods available that can remove batch effects, evaluating which method performs best is not straightforward. Here, we present BatchBench (https://github.com/cellgeni/batchbench), a modular and flexible pipeline for comparing batch correction methods for single-cell RNA-seq data. We apply BatchBench to eight methods, highlighting their methodological differences and assess their performance and computational requirements through a compendium of well-studied datasets. This systematic comparison guides users in the choice of batch correction tool, and the pipeline makes it easy to evaluate other datasets.
Collapse
|
9
|
Abstract
Analysis of human blood immune cells provides insights into the coordinated response to viral infections such as severe acute respiratory syndrome coronavirus 2, which causes coronavirus disease 2019 (COVID-19). We performed single-cell transcriptome, surface proteome and T and B lymphocyte antigen receptor analyses of over 780,000 peripheral blood mononuclear cells from a cross-sectional cohort of 130 patients with varying severities of COVID-19. We identified expansion of nonclassical monocytes expressing complement transcripts (CD16+C1QA/B/C+) that sequester platelets and were predicted to replenish the alveolar macrophage pool in COVID-19. Early, uncommitted CD34+ hematopoietic stem/progenitor cells were primed toward megakaryopoiesis, accompanied by expanded megakaryocyte-committed progenitors and increased platelet activation. Clonally expanded CD8+ T cells and an increased ratio of CD8+ effector T cells to effector memory T cells characterized severe disease, while circulating follicular helper T cells accompanied mild disease. We observed a relative loss of IgA2 in symptomatic disease despite an overall expansion of plasmablasts and plasma cells. Our study highlights the coordinated immune response that contributes to COVID-19 pathogenesis and reveals discrete cellular components that can be targeted for therapy.
Collapse
|
10
|
Mapping the temporal and spatial dynamics of the human endometrium in vivo and in vitro. Nat Genet 2021; 53:1698-1711. [PMID: 34857954 PMCID: PMC8648563 DOI: 10.1038/s41588-021-00972-2] [Citation(s) in RCA: 173] [Impact Index Per Article: 57.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Accepted: 10/18/2021] [Indexed: 12/24/2022]
Abstract
The endometrium, the mucosal lining of the uterus, undergoes dynamic changes throughout the menstrual cycle in response to ovarian hormones. We have generated dense single-cell and spatial reference maps of the human uterus and three-dimensional endometrial organoid cultures. We dissect the signaling pathways that determine cell fate of the epithelial lineages in the lumenal and glandular microenvironments. Our benchmark of the endometrial organoids reveals the pathways and cell states regulating differentiation of the secretory and ciliated lineages both in vivo and in vitro. In vitro downregulation of WNT or NOTCH pathways increases the differentiation efficiency along the secretory and ciliated lineages, respectively. We utilize our cellular maps to deconvolute bulk data from endometrial cancers and endometriotic lesions, illuminating the cell types dominating in each of these disorders. These mechanistic insights provide a platform for future development of treatments for common conditions including endometriosis and endometrial carcinoma.
Collapse
|
11
|
Visualization and analysis of RNA-Seq assembly graphs. Nucleic Acids Res 2019; 47:7262-7275. [PMID: 31305886 PMCID: PMC6698738 DOI: 10.1093/nar/gkz599] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 05/31/2019] [Accepted: 07/10/2019] [Indexed: 01/20/2023] Open
Abstract
RNA-Seq is a powerful transcriptome profiling technology enabling transcript discovery and quantification. Whilst most commonly used for gene-level quantification, the data can be used for the analysis of transcript isoforms. However, when the underlying transcript assemblies are complex, current visualization approaches can be limiting, with splicing events a challenge to interpret. Here, we report on the development of a graph-based visualization method as a complementary approach to understanding transcript diversity from short-read RNA-Seq data. Following the mapping of reads to a reference genome, a read-to-read comparison is performed on all reads mapping to a given gene, producing a weighted similarity matrix between reads. This is used to produce an RNA assembly graph, where nodes represent reads and edges similarity scores between them. The resulting graphs are visualized in 3D space to better appreciate their sometimes large and complex topology, with other information being overlaid on to nodes, e.g. transcript models. Here we demonstrate the utility of this approach, including the unusual structure of these graphs and how they can be used to identify issues in assembly, repetitive sequences within transcripts and splice variants. We believe this approach has the potential to significantly improve our understanding of transcript complexity.
Collapse
|
12
|
Mirnovo: genome-free prediction of microRNAs from small RNA sequencing data and single-cells using decision forests. Nucleic Acids Res 2017; 45:e177. [PMID: 29036314 PMCID: PMC5716205 DOI: 10.1093/nar/gkx836] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Accepted: 09/20/2017] [Indexed: 12/21/2022] Open
Abstract
The discovery of microRNAs (miRNAs) remains an important problem, particularly given the growth of high-throughput sequencing, cell sorting and single cell biology. While a large number of miRNAs have already been annotated, there may well be large numbers of miRNAs that are expressed in very particular cell types and remain elusive. Sequencing allows us to quickly and accurately identify the expression of known miRNAs from small RNA-Seq data. The biogenesis of miRNAs leads to very specific characteristics observed in their sequences. In brief, miRNAs usually have a well-defined 5′ end and a more flexible 3′ end with the possibility of 3′ tailing events, such as uridylation. Previous approaches to the prediction of novel miRNAs usually involve the analysis of structural features of miRNA precursor hairpin sequences obtained from genome sequence. We surmised that it may be possible to identify miRNAs by using these biogenesis features observed directly from sequenced reads, solely or in addition to structural analysis from genome data. To this end, we have developed mirnovo, a machine learning based algorithm, which is able to identify known and novel miRNAs in animals and plants directly from small RNA-Seq data, with or without a reference genome. This method performs comparably to existing tools, however is simpler to use with reduced run time. Its performance and accuracy has been tested on multiple datasets, including species with poorly assembled genomes, RNaseIII (Drosha and/or Dicer) deficient samples and single cells (at both embryonic and adult stage).
Collapse
|
13
|
Many si/shRNAs can kill cancer cells by targeting multiple survival genes through an off-target mechanism. eLife 2017; 6. [PMID: 29063830 PMCID: PMC5655136 DOI: 10.7554/elife.29702] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2017] [Accepted: 09/13/2017] [Indexed: 12/27/2022] Open
Abstract
Over 80% of multiple-tested siRNAs and shRNAs targeting CD95 or CD95 ligand (CD95L) induce a form of cell death characterized by simultaneous activation of multiple cell death pathways preferentially killing transformed and cancer stem cells. We now show these si/shRNAs kill cancer cells through canonical RNAi by targeting the 3’UTR of critical survival genes in a unique form of off-target effect we call DISE (death induced by survival gene elimination). Drosha and Dicer-deficient cells, devoid of most miRNAs, are hypersensitive to DISE, suggesting cellular miRNAs protect cells from this form of cell death. By testing 4666 shRNAs derived from the CD95 and CD95L mRNA sequences and an unrelated control gene, Venus, we have identified many toxic sequences - most of them located in the open reading frame of CD95L. We propose that specific toxic RNAi-active sequences present in the genome can kill cancer cells. Cells store their genetic code within molecules of DNA. Some of this information will be copied into chemically similar molecules called RNAs, from which the sequence of letters in the genetic code can be translated to build proteins. However, these messenger RNAs are not the only RNA molecules that cells can make. MicroRNAs are other short pieces of RNA that closely match sequences in parts of certain messenger RNAs. The messenger RNAs targeted by microRNAs are broken down inside the cell, which reduces how much protein can be produced from them. Since its discovery, scientists have exploited this process – called RNA interference (or RNAi for short) – and designed microRNA-like small interfering RNAs (siRNAs) to target particular messenger RNAs and decrease the levels of the corresponding proteins in countless experiments. Two proteins that have been studied in RNAi experiments are CD95 and its interaction partner CD95L. Both of these proteins are important in human cancer cells, and targeting them via RNAi killed cancer cells in an unknown mechanism that the cancer cells were unable to resist. RNAi experiments are designed to be specific, but sometimes they can accidently target other non-target messenger RNAs. Putzbach, Gao, Patel et al. have now analyzed all of the siRNAs that can be made from the messenger RNAs for CD95 and CD95L to mediate RNAi in cancer cells. This revealed that several messenger RNAs, other than those for CD95 and CD95L, were unintentionally being targeted, including many that code for proteins that cells need to survive. Further examination of the messenger RNA for CD95 and CD95L showed that they contain short sequences that are similar to those in the messenger RNAs of the genes that encode these survival proteins. Putzbach et al. were able to study and then predict which siRNA sequences would be toxic to cancer cells. These findings indicate that an RNAi off-target effect may actually be used to kill cancer cells. Future studies will determine whether this effect could be exploited to shrink tumors in animal models of cancer. If successful, this in turn could lead to new treatments for cancer patients.
Collapse
|
14
|
Large-scale analysis of microRNA expression, epi-transcriptomic features and biogenesis. Nucleic Acids Res 2017; 45:1079-1090. [PMID: 28180281 PMCID: PMC5388392 DOI: 10.1093/nar/gkw1031] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2016] [Revised: 10/17/2016] [Accepted: 10/18/2016] [Indexed: 01/09/2023] Open
Abstract
MicroRNAs are important genetic regulators in both animals and plants. They have a range of functions spanning development, differentiation, growth, metabolism and disease. The advent of next-generation sequencing technologies has made it a relatively straightforward task to detect these molecules and their relative expression via sequencing. There are a large number of published studies with deposited datasets. However, there are currently few resources that capitalize on these data to better understand the features, distribution and biogenesis of miRNAs. Herein, we focus on Human and Mouse for which the majority of data are available. We reanalyse sequencing data from 461 samples into a coordinated catalog of microRNA expression. We use this to perform large-scale analyses of miRNA function and biogenesis. These analyses include global expression comparison, co-expression of miRNA clusters and the prediction of miRNA strand-specificity and underlying constraints. Additionally, we report for the first time a global analysis of miRNA epi-transcriptomic modifications and assess their prevalence across tissues, samples and families. Finally, we report a list of potentially mis-annotated miRNAs in miRBase based on their aggregated modification profiles. The results have been collated into a comprehensive online repository of miRNA expression and features such as modifications and RNA editing events, which is available at: http://wwwdev.ebi.ac.uk/enright-dev/miratlas. We believe these findings will further contribute to our understanding of miRNA function in animals and benefit the miRNA community in general.
Collapse
|
15
|
MiR-277/4989 regulate transcriptional landscape during juvenile to adult transition in the parasitic helminth Schistosoma mansoni. PLoS Negl Trop Dis 2017; 11:e0005559. [PMID: 28542189 PMCID: PMC5459504 DOI: 10.1371/journal.pntd.0005559] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2016] [Revised: 06/05/2017] [Accepted: 04/05/2017] [Indexed: 01/06/2023] Open
Abstract
Schistosomes are parasitic helminths that cause schistosomiasis, a disease affecting circa 200 million people, primarily in underprivileged regions of the world. Schistosoma mansoni is the most experimentally tractable schistosome species due to its ease of propagation in the laboratory and the high quality of its genome assembly and annotation. Although there is growing interest in microRNAs (miRNAs) in trematodes, little is known about the role these molecules play in the context of developmental processes. We use the completely unaware "miRNA-blind" bioinformatics tool Sylamer to analyse the 3'-UTRs of transcripts differentially expressed between the juvenile and adult stages. We show that the miR-277/4989 family target sequence is the only one significantly enriched in the transition from juvenile to adult worms. Further, we describe a novel miRNA, sma-miR-4989 showing that its proximal genomic location to sma-miR-277 suggests that they form a miRNA cluster, and we propose hairpin folds for both miRNAs compatible with the miRNA pathway. In addition, we found that expression of sma-miR-277/4989 miRNAs are up-regulated in adults while their predicted targets are characterised by significant down-regulation in paired adult worms but remain largely undisturbed in immature "virgin" females. Finally, we show that sma-miR-4989 is expressed in tegumental cells located proximal to the oesophagus gland and also distributed throughout the male worms' body. Our results indicate that sma-miR-277/4989 might play a dominant role in post-transcriptional regulation during development of juvenile worms and suggest an important role in the sexual development of female schistosomes.
Collapse
|
16
|
Transposon-driven transcription is a conserved feature of vertebrate spermatogenesis and transcript evolution. EMBO Rep 2017; 18:1231-1247. [PMID: 28500258 PMCID: PMC5494522 DOI: 10.15252/embr.201744059] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Revised: 03/29/2017] [Accepted: 04/11/2017] [Indexed: 01/09/2023] Open
Abstract
Spermatogenesis is associated with major and unique changes to chromosomes and chromatin. Here, we sought to understand the impact of these changes on spermatogenic transcriptomes. We show that long terminal repeats (LTRs) of specific mouse endogenous retroviruses (ERVs) drive the expression of many long non‐coding transcripts (lncRNA). This process occurs post‐mitotically predominantly in spermatocytes and round spermatids. We demonstrate that this transposon‐driven lncRNA expression is a conserved feature of vertebrate spermatogenesis. We propose that transposon promoters are a mechanism by which the genome can explore novel transcriptional substrates, increasing evolutionary plasticity and allowing for the genesis of novel coding and non‐coding genes. Accordingly, we show that a small fraction of these novel ERV‐driven transcripts encode short open reading frames that produce detectable peptides. Finally, we find that distinct ERV elements from the same subfamilies act as differentially activated promoters in a tissue‐specific context. In summary, we demonstrate that LTRs can act as tissue‐specific promoters and contribute to post‐mitotic spermatogenic transcriptome diversity.
Collapse
|
17
|
Exploring regulatory networks of miR-96 in the developing inner ear. Sci Rep 2016; 6:23363. [PMID: 26988146 PMCID: PMC4796898 DOI: 10.1038/srep23363] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 02/24/2016] [Indexed: 12/16/2022] Open
Abstract
Mutations in the microRNA Mir96 cause deafness in mice and humans. In the diminuendo mouse, which carries a single base pair change in the seed region of miR-96, the sensory hair cells crucial for hearing fail to develop fully and retain immature characteristics, suggesting that miR-96 is important for coordinating hair cell maturation. Our previous transcriptional analyses show that many genes are misregulated in the diminuendo inner ear and we report here further misregulated genes. We have chosen three complementary approaches to explore potential networks controlled by miR-96 using these transcriptional data. Firstly, we used regulatory interactions manually curated from the literature to construct a regulatory network incorporating our transcriptional data. Secondly, we built a protein-protein interaction network using the InnateDB database. Thirdly, gene set enrichment analysis was used to identify gene sets in which the misregulated genes are enriched. We have identified several candidates for mediating some of the expression changes caused by the diminuendo mutation, including Fos, Myc, Trp53 and Nr3c1, and confirmed our prediction that Fos is downregulated in diminuendo homozygotes. Understanding the pathways regulated by miR-96 could lead to potential therapeutic targets for treating hearing loss due to perturbation of any component of the network.
Collapse
|
18
|
Depletion of HPV16 early genes induces autophagy and senescence in a cervical carcinogenesis model, regardless of viral physical state. J Pathol 2013; 231:354-66. [DOI: 10.1002/path.4244] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2013] [Revised: 06/25/2013] [Accepted: 07/27/2013] [Indexed: 01/07/2023]
|
19
|
Kraken: a set of tools for quality control and analysis of high-throughput sequence data. Methods 2013; 63:41-9. [PMID: 23816787 PMCID: PMC3991327 DOI: 10.1016/j.ymeth.2013.06.027] [Citation(s) in RCA: 274] [Impact Index Per Article: 24.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Revised: 06/21/2013] [Accepted: 06/22/2013] [Indexed: 12/18/2022] Open
Abstract
New sequencing technologies pose significant challenges in terms of data complexity and magnitude. It is essential that efficient software is developed with performance that scales with this growth in sequence information. Here we present a comprehensive and integrated set of tools for the analysis of data from large scale sequencing experiments. It supports adapter detection and removal, demultiplexing of barcodes, paired-end data, a range of read architectures and the efficient removal of sequence redundancy. Sequences can be trimmed and filtered based on length, quality and complexity. Quality control plots track sequence length, composition and summary statistics with respect to genomic annotation. Several use cases have been integrated into a single streamlined pipeline, including both mRNA and small RNA sequencing experiments. This pipeline interfaces with existing tools for genomic mapping and differential expression analysis.
Collapse
|
20
|
LIN28 Expression in malignant germ cell tumors downregulates let-7 and increases oncogene levels. Cancer Res 2013; 73:4872-84. [PMID: 23774216 DOI: 10.1158/0008-5472.can-12-2085] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Despite their clinicopathologic heterogeneity, malignant germ cell tumors (GCT) share molecular abnormalities that are likely to be functionally important. In this study, we investigated the potential significance of downregulation of the let-7 family of tumor suppressor microRNAs in malignant GCTs. Microarray results from pediatric and adult samples (n = 45) showed that LIN28, the negative regulator of let-7 biogenesis, was abundant in malignant GCTs, regardless of patient age, tumor site, or histologic subtype. Indeed, a strong negative correlation existed between LIN28 and let-7 levels in specimens with matched datasets. Low let-7 levels were biologically significant, as the sequence complementary to the 2 to 7 nt common let-7 seed "GAGGUA" was enriched in the 3' untranslated regions of mRNAs upregulated in pediatric and adult malignant GCTs, compared with normal gonads (a mixture of germ cells and somatic cells). We identified 27 mRNA targets of let-7 that were upregulated in malignant GCT cells, confirming significant negative correlations with let-7 levels. Among 16 mRNAs examined in a largely independent set of specimens by quantitative reverse transcription PCR, we defined negative-associations with let-7e levels for six oncogenes, including MYCN, AURKB, CCNF, RRM2, MKI67, and C12orf5 (when including normal control tissues). Importantly, LIN28 depletion in malignant GCT cells restored let-7 levels and repressed all of these oncogenic let-7 mRNA targets, with LIN28 levels correlating with cell proliferation and MYCN levels. Conversely, ectopic expression of let-7e was sufficient to reduce proliferation and downregulate MYCN, AURKB, and LIN28, the latter via a double-negative feedback loop. We conclude that the LIN28/let-7 pathway has a critical pathobiologic role in malignant GCTs and therefore offers a promising target for therapeutic intervention.
Collapse
|
21
|
Extent, causes, and consequences of small RNA expression variation in human adipose tissue. PLoS Genet 2012; 8:e1002704. [PMID: 22589741 PMCID: PMC3349731 DOI: 10.1371/journal.pgen.1002704] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2012] [Accepted: 03/27/2012] [Indexed: 12/12/2022] Open
Abstract
Small RNAs are functional molecules that modulate mRNA transcripts and have been implicated in the aetiology of several common diseases. However, little is known about the extent of their variability within the human population. Here, we characterise the extent, causes, and effects of naturally occurring variation in expression and sequence of small RNAs from adipose tissue in relation to genotype, gene expression, and metabolic traits in the MuTHER reference cohort. We profiled the expression of 15 to 30 base pair RNA molecules in subcutaneous adipose tissue from 131 individuals using high-throughput sequencing, and quantified levels of 591 microRNAs and small nucleolar RNAs. We identified three genetic variants and three RNA editing events. Highly expressed small RNAs are more conserved within mammals than average, as are those with highly variable expression. We identified 14 genetic loci significantly associated with nearby small RNA expression levels, seven of which also regulate an mRNA transcript level in the same region. In addition, these loci are enriched for variants significant in genome-wide association studies for body mass index. Contrary to expectation, we found no evidence for negative correlation between expression level of a microRNA and its target mRNAs. Trunk fat mass, body mass index, and fasting insulin were associated with more than twenty small RNA expression levels each, while fasting glucose had no significant associations. This study highlights the similar genetic complexity and shared genetic control of small RNA and mRNA transcripts, and gives a quantitative picture of small RNA expression variation in the human population. Genetic information is transmitted to the cell only through RNA molecules. A special class of RNAs is comprised of the small (up to 30 nucleotide) ones, known to be potent regulators of various cellular processes. At the same time, they have not been as widely studied as messenger RNAs—we do not know how much variation in their sequence and expression level occurs naturally in human populations or how this variability influences other traits. We measured small RNA levels and genetic variability in fat tissue from 131 individuals by high-throughput sequencing. We could associate the expression levels with genetic background of the individuals, as well as changes in metabolic traits. Surprisingly, we found no large scale influence of small RNA variation on mRNA levels, their main regulatory target. Overall, our study is the first to give a quantitative picture of the naturally occurring variation in these important regulatory molecules in human fat tissue.
Collapse
|
22
|
Abstract
BACKGROUND Delineating the role of microRNAs (miRNAs) in the posttranscriptional gene regulation offers new insights into how the heart adapts to pathological stress. We developed a knockout of miR-22 in mice and investigated its function in the heart. METHODS AND RESULTS Here, we show that miR-22-deficient mice are impaired in inotropic and lusitropic response to acute stress by dobutamine. Furthermore, the absence of miR-22 sensitized mice to cardiac decompensation and left ventricular dilation after long-term stimulation by pressure overload. Calcium transient analysis revealed reduced sarcoplasmic reticulum Ca(2+) load in association with repressed sarcoplasmic reticulum Ca(2+) ATPase activity in mutant myocytes. Genetic ablation of miR-22 also led to a decrease in cardiac expression levels for Serca2a and muscle-restricted genes encoding proteins in the vicinity of the cardiac Z disk/titin cytoskeleton. These phenotypes were attributed in part to inappropriate repression of serum response factor activity in stressed hearts. Global analysis revealed increased expression of the transcriptional/translational repressor purine-rich element binding protein B, a highly conserved miR-22 target implicated in the negative control of muscle expression. CONCLUSION These data indicate that miR-22 functions as an integrator of Ca(2+) homeostasis and myofibrillar protein content during stress in the heart and shed light on the mechanisms that enhance propensity toward heart failure.
Collapse
|
23
|
Abstract
MCL is a general purpose cluster algorithm for both weighted and unweighted networks. The algorithm utilises network topology as well as edge weights, is highly scalable and has been applied in a wide variety of bioinformatic methods. In this chapter, we give protocols and case studies for clustering of networks derived from, respectively, protein sequence similarities and gene expression profile correlations.
Collapse
|
24
|
The two most common histological subtypes of malignant germ cell tumour are distinguished by global microRNA profiles, associated with differential transcription factor expression. Mol Cancer 2010; 9:290. [PMID: 21059207 PMCID: PMC2993676 DOI: 10.1186/1476-4598-9-290] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2010] [Accepted: 11/08/2010] [Indexed: 11/29/2022] Open
Abstract
Background We hypothesised that differences in microRNA expression profiles contribute to the contrasting natural history and clinical outcome of the two most common types of malignant germ cell tumour (GCT), yolk sac tumours (YSTs) and germinomas. Results By direct comparison, using microarray data for paediatric GCT samples and published qRT-PCR data for adult samples, we identified microRNAs significantly up-regulated in YSTs (n = 29 paediatric, 26 adult, 11 overlapping) or germinomas (n = 37 paediatric). By Taqman qRT-PCR we confirmed differential expression of 15 of 16 selected microRNAs and further validated six of these (miR-302b, miR-375, miR-200b, miR-200c, miR-122, miR-205) in an independent sample set. Interestingly, the miR-302 cluster, which is over-expressed in all malignant GCTs, showed further over-expression in YSTs versus germinomas, representing six of the top eight microRNAs over-expressed in paediatric YSTs and seven of the top 11 in adult YSTs. To explain this observation, we used mRNA expression profiles of paediatric and adult malignant GCTs to identify 10 transcription factors (TFs) consistently over-expressed in YSTs versus germinomas, followed by linear regression to confirm associations between TF and miR-302 cluster expression levels. Using the sequence motif analysis environment iMotifs, we identified predicted binding sites for four of the 10 TFs (GATA6, GATA3, TCF7L2 and MAF) in the miR-302 cluster promoter region. Finally, we showed that miR-302 family over-expression in YST is likely to be functionally significant, as mRNAs down-regulated in YSTs were enriched for 3' untranslated region sequences complementary to the common seed of miR-302a~miR-302d. Such mRNAs included mediators of key cancer-associated processes, including tumour suppressor genes, apoptosis regulators and TFs. Conclusions Differential microRNA expression is likely to contribute to the relatively aggressive behaviour of YSTs and may enable future improvements in clinical diagnosis and/or treatment.
Collapse
|
25
|
Malignant germ cell tumors display common microRNA profiles resulting in global changes in expression of messenger RNA targets. Cancer Res 2010; 70:2911-23. [PMID: 20332240 DOI: 10.1158/0008-5472.can-09-3301] [Citation(s) in RCA: 199] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Despite their extensive clinical and pathologic heterogeneity, all malignant germ cell tumors (GCT) are thought to originate from primordial germ cells. However, no common biological abnormalities have been identified to date. We profiled 615 microRNAs (miRNA) in pediatric malignant GCTs, controls, and GCT cell lines (48 samples in total) and re-analyzed available miRNA expression data in adult gonadal malignant GCTs. We applied the bioinformatic algorithm Sylamer to identify miRNAs that are of biological importance by inducing global shifts in mRNA levels. The most significant differentially expressed miRNAs in malignant GCTs were all from the miR-371-373 and miR-302 clusters (adjusted P < 0.00005), which were overexpressed regardless of histologic subtype [yolk sac tumor (YST)/seminoma/embryonal carcinoma (EC)], site (gonadal/extragonadal), or patient age (pediatric/adult). Sylamer revealed that the hexamer GCACTT, complementary to the 2- to 7-nucleotide miRNA seed AAGUGC shared by six members of the miR-371-373 and miR-302 clusters, was the only sequence significantly enriched in the 3'-untranslated region of mRNAs downregulated in pediatric malignant GCTs (as a group), YSTs and ECs, and in adult YSTs (all versus nonmalignant tissue controls; P < 0.05). For the pediatric samples, downregulated genes containing the 3'-untranslated region GCACTT showed significant overrepresentation of Gene Ontology terms related to cancer-associated processes, whereas for downregulated genes lacking GCACTT, Gene Ontology terms generally represented metabolic processes only, with few genes per term (adjusted P < 0.05). We conclude that the miR-371-373 and miR-302 clusters are universally overexpressed in malignant GCTs and coordinately downregulate mRNAs involved in biologically significant pathways.
Collapse
|
26
|
An ENU-induced mutation of miR-96 associated with progressive hearing loss in mice. Nat Genet 2009; 41:614-8. [PMID: 19363478 PMCID: PMC2705913 DOI: 10.1038/ng.369] [Citation(s) in RCA: 240] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2008] [Accepted: 02/13/2009] [Indexed: 12/25/2022]
Abstract
Progressive hearing loss is common in the human population, but little is known about the molecular basis. We report a new ENU-induced mouse mutant, diminuendo, with a single base change in the seed region of Mirn96. Heterozygotes show progressive loss of hearing and hair cell anomalies, while homozygotes have no cochlear responses. Most microRNAs are believed to downregulate target genes by binding to specific sites on their mRNAs, so mutation of the seed should lead to target gene upregulation. Microarray analysis revealed 96 transcripts with significantly altered expression in homozygotes; notably, Slc26a5, oncomodulin, Gfi1, Ptprq and Pitpnm1 were downregulated. Hypergeometric p-value analysis showed hundreds of genes were upregulated in mutants. Different genes, with target sites complementary to the mutant seed, were downregulated. This is the first microRNA found associated with deafness, and diminuendo represents a model for understanding and potentially moderating progressive hair cell degeneration in hearing loss more generally.
Collapse
|
27
|
Detecting microRNA binding and siRNA off-target effects from expression data. Nat Methods 2008; 5:1023-5. [PMID: 18978784 PMCID: PMC2635553 DOI: 10.1038/nmeth.1267] [Citation(s) in RCA: 224] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2008] [Accepted: 10/07/2008] [Indexed: 12/23/2022]
Abstract
Sylamer is a method for detecting microRNA target and small interfering RNA off-target signals in 3' untranslated regions from a ranked gene list, sorted from upregulated to downregulated, after a microRNA perturbation or RNA interference experiment. The output is a landscape plot that tracks occurrence biases using hypergeometric P-values for all words across the gene ranking. We demonstrated the utility, speed and accuracy of this approach on several datasets.
Collapse
|
28
|
Abstract
miRBase is the central online repository for microRNA (miRNA) nomenclature, sequence data, annotation and target prediction. The current release (10.0) contains 5071 miRNA loci from 58 species, expressing 5922 distinct mature miRNA sequences: a growth of over 2000 sequences in the past 2 years. miRBase provides a range of data to facilitate studies of miRNA genomics: all miRNAs are mapped to their genomic coordinates. Clusters of miRNA sequences in the genome are highlighted, and can be defined and retrieved with any inter-miRNA distance. The overlap of miRNA sequences with annotated transcripts, both protein- and non-coding, are described. Finally, graphical views of the locations of a wide range of genomic features in model organisms allow for the first time the prediction of the likely boundaries of many miRNA primary transcripts. miRBase is available at http://microrna.sanger.ac.uk/.
Collapse
|
29
|
Abstract
MicroRNAs are a class of small RNAs that are increasingly being recognized as important regulators of gene expression. Although hundreds of microRNAs are present in the mammalian genome, genetic studies addressing their physiological roles are at an early stage. We have shown that mice deficient for bic/microRNA-155 are immunodeficient and display increased lung airway remodeling. We demonstrate a requirement of bic/microRNA-155 for the function of B and T lymphocytes and dendritic cells. Transcriptome analysis of bic/microRNA-155-deficient CD4+ T cells identified a wide spectrum of microRNA-155-regulated genes, including cytokines, chemokines, and transcription factors. Our work suggests that bic/microRNA-155 plays a key role in the homeostasis and function of the immune system.
Collapse
|
30
|
Abstract
The role of developmental instability (DI), as measured by fluctuating asymmetry (FA), in evolutionary biology has been the focus of a wealth of research for more than half a century. In spite of this long period and many published papers, our current state of knowledge reviewed here only allows us to conclude that patterns are heterogeneous and that very little is known about the underlying causes of this heterogeneity. In addition, the statistical properties of FA as a measure of DI are only poorly grasped because of a general lack of understanding of the underlying mechanisms that drive DI. If we want to avoid that this area of research becomes abandoned, more efforts should be made to understand the observed heterogeneity, and attempts should be made to develop a unifying statistical protocol. More specifically, and perhaps most importantly, it is argued here that more attention should be paid to the usefulness of FA as a measure of DI since many factors might blur this relationship. Furthermore, the genetic architecture, associations with fitness and the importance of compensatory growth should be investigated under a variety of stress situations. In addition, more focus should be directed to the underlying mechanisms of DI as well as how these processes map to the observable phenotype. These insights could yield more efficient statistical models and a unified approach to the analysis of patterns in FA and DI. The study of both DI and canalization is indispensable to obtain better insights in their possible common origin, especially because both have been suggested to play a role in both micro- and macro-evolutionary processes.
Collapse
|
31
|
Abstract
A survey of RNA editing of miRNAs from ten human tissues indicates that RNA editing increases the diversity of miRNAs and their targets. Background MicroRNAs (miRNAs) are short RNAs of around 22 nucleotides that regulate gene expression. The primary transcripts of miRNAs contain double-stranded RNA and are therefore potential substrates for adenosine to inosine (A-to-I) RNA editing. Results We have conducted a survey of RNA editing of miRNAs from ten human tissues by sequence comparison of PCR products derived from matched genomic DNA and total cDNA from the same individual. Six out of 99 (6%) miRNA transcripts from which data were obtained were subject to A-to-I editing in at least one tissue. Four out of seven edited adenosines were in the mature miRNA and were predicted to change the target sites in 3' untranslated regions. For a further six miRNAs, we identified A-to-I editing of transcripts derived from the opposite strand of the genome to the annotated miRNA. These miRNAs may have been annotated to the wrong genomic strand. Conclusion Our results indicate that RNA editing increases the diversity of miRNAs and their targets, and hence may modulate miRNA function.
Collapse
|
32
|
Abstract
The miRBase database aims to provide integrated interfaces to comprehensive microRNA sequence data, annotation and predicted gene targets. miRBase takes over functionality from the microRNA Registry and fulfils three main roles: the miRBase Registry acts as an independent arbiter of microRNA gene nomenclature, assigning names prior to publication of novel miRNA sequences. miRBase Sequences is the primary online repository for miRNA sequence data and annotation. miRBase Targets is a comprehensive new database of predicted miRNA target genes. miRBase is available at .
Collapse
|
33
|
miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res 2006. [PMID: 16381832 DOI: 10.1093/nar/gk112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/14/2023] Open
Abstract
The miRBase database aims to provide integrated interfaces to comprehensive microRNA sequence data, annotation and predicted gene targets. miRBase takes over functionality from the microRNA Registry and fulfils three main roles: the miRBase Registry acts as an independent arbiter of microRNA gene nomenclature, assigning names prior to publication of novel miRNA sequences. miRBase Sequences is the primary online repository for miRNA sequence data and annotation. miRBase Targets is a comprehensive new database of predicted miRNA target genes. miRBase is available at http://microrna.sanger.ac.uk/.
Collapse
|
34
|
RNA editing of human microRNAs. Genome Biol 2006. [PMID: 16594986 DOI: 10.1186/gb-2006-7-4-r17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2023] Open
Abstract
BACKGROUND MicroRNAs (miRNAs) are short RNAs of around 22 nucleotides that regulate gene expression. The primary transcripts of miRNAs contain double-stranded RNA and are therefore potential substrates for adenosine to inosine (A-to-I) RNA editing. RESULTS We have conducted a survey of RNA editing of miRNAs from ten human tissues by sequence comparison of PCR products derived from matched genomic DNA and total cDNA from the same individual. Six out of 99 (6%) miRNA transcripts from which data were obtained were subject to A-to-I editing in at least one tissue. Four out of seven edited adenosines were in the mature miRNA and were predicted to change the target sites in 3' untranslated regions. For a further six miRNAs, we identified A-to-I editing of transcripts derived from the opposite strand of the genome to the annotated miRNA. These miRNAs may have been annotated to the wrong genomic strand. CONCLUSION Our results indicate that RNA editing increases the diversity of miRNAs and their targets, and hence may modulate miRNA function.
Collapse
|
35
|
Abstract
BACKGROUND MicroRNAs (miRNAs) are short RNAs of around 22 nucleotides that regulate gene expression. The primary transcripts of miRNAs contain double-stranded RNA and are therefore potential substrates for adenosine to inosine (A-to-I) RNA editing. RESULTS We have conducted a survey of RNA editing of miRNAs from ten human tissues by sequence comparison of PCR products derived from matched genomic DNA and total cDNA from the same individual. Six out of 99 (6%) miRNA transcripts from which data were obtained were subject to A-to-I editing in at least one tissue. Four out of seven edited adenosines were in the mature miRNA and were predicted to change the target sites in 3' untranslated regions. For a further six miRNAs, we identified A-to-I editing of transcripts derived from the opposite strand of the genome to the annotated miRNA. These miRNAs may have been annotated to the wrong genomic strand. CONCLUSION Our results indicate that RNA editing increases the diversity of miRNAs and their targets, and hence may modulate miRNA function.
Collapse
|
36
|
Abstract
Accurately and reliably identifying the actual number of clusters present with a dataset of gene expression profiles, when no additional information on cluster structure is available, is a problem addressed by few algorithms. GeneMCL transforms microarray analysis data into a graph consisting of nodes connected by edges, where the nodes represent genes, and the edges represent the similarity in expression of those genes, as given by a proximity measurement. This measurement is taken to be the Pearson correlation coefficient combined with a local non-linear rescaling step. The resulting graph is input to the Markov Cluster (MCL) algorithm, which is an elegant, deterministic, non-specific and scalable method, which models stochastic flow through the graph. The algorithm is inherently affected by any cluster structure present, and rapidly decomposes a graph into cohesive clusters. The potential of the GeneMCL algorithm is demonstrated with a 5,730 gene subset (IGS) of the Van't Veer breast cancer database, for which the clusterings are shown to reflect underlying biological mechanisms.
Collapse
|