1
|
Developmental origins shape the paediatric cancer genome. Nat Rev Cancer 2024:10.1038/s41568-024-00684-9. [PMID: 38698126 DOI: 10.1038/s41568-024-00684-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/18/2024] [Indexed: 05/05/2024]
Abstract
In the past two decades, technological advances have brought unprecedented insights into the paediatric cancer genome revealing characteristics distinct from those of adult cancer. Originating from developing tissues, paediatric cancers generally have low mutation burden and are driven by variants that disrupt the transcriptional activity, chromatin state, non-coding cis-regulatory regions and other biological functions. Within each tumour, there are multiple populations of cells with varying states, and the lineages of some can be tracked to their fetal origins. Genome-wide genetic screening has identified vulnerabilities associated with both the cell of origin and transcription deregulation in paediatric cancer, which have become a valuable resource for designing new therapeutic approaches including those for small molecules, immunotherapy and targeted protein degradation. In this Review, we present recent findings on these facets of paediatric cancer from a pan-cancer perspective and provide an outlook on future investigations.
Collapse
|
2
|
HYENA detects oncogenes activated by distal enhancers in cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.01.09.523321. [PMID: 38076958 PMCID: PMC10705271 DOI: 10.1101/2023.01.09.523321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Somatic structural variations (SVs) in cancer can shuffle DNA content in the genome, relocate regulatory elements, and alter genome organization. Enhancer hijacking occurs when SVs relocate distal enhancers to activate proto-oncogenes. However, most enhancer hijacking studies have only focused on protein-coding genes. Here, we develop a computational algorithm "HYENA" to identify candidate oncogenes (both protein-coding and non-coding) activated by enhancer hijacking based on tumor whole-genome and transcriptome sequencing data. HYENA detects genes whose elevated expression is associated with somatic SVs by using a rank-based regression model. We systematically analyze 1,146 tumors across 25 types of adult tumors and identify a total of 108 candidate oncogenes including many non-coding genes. A long non-coding RNA TOB1-AS1 is activated by various types of SVs in 10% of pancreatic cancers through altered 3-dimensional genome structure. We find that high expression of TOB1-AS1 can promote cell invasion and metastasis. Our study highlights the contribution of genetic alterations in non-coding regions to tumorigenesis and tumor progression.
Collapse
|
3
|
The prolactin receptor: A cross-species comparison of gene structure, transcriptional regulation, tissue-specificity, and genetic variation. J Neuroendocrinol 2024:e13385. [PMID: 38586906 DOI: 10.1111/jne.13385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 02/25/2024] [Accepted: 03/13/2024] [Indexed: 04/09/2024]
Abstract
The conserved and multifaceted functions of prolactin (PRL) are coordinated through varied distribution and expression of its cell-surface receptor (PRLR) across a range of tissues and physiological states. The resultant heterogeneous expression of PRLR mRNA and protein across different organs and cell types supports a wide range of PRL-regulated processes including reproduction, lactation, development, and homeostasis. Genetic variation within the PRLR gene also accounts for several phenotypes impacting agricultural production and human pathology. The goal of this review is to highlight the many elements that control differential expression of the PRLR across tissues, and the various phenotypes that exist across species due to variation in the PRLR gene.
Collapse
|
4
|
Noncoding mutations drive persistence of a founder preleukemic clone which initiates late relapse in T-ALL. Blood 2024; 143:933-937. [PMID: 38194681 DOI: 10.1182/blood.2023021906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 11/14/2023] [Accepted: 12/01/2023] [Indexed: 01/11/2024] Open
Abstract
ABSTRACT T-ALL relapse usually occurs early but can occur much later, which has been suggested to represent a de novo leukemia. However, we conclusively demonstrate late relapse can evolve from a pre-leukemic subclone harbouring a non-coding mutation that evades initial chemotherapy.
Collapse
|
5
|
Genetic and epigenetic features of bilateral Wilms tumor predisposition in patients from the Children's Oncology Group AREN18B5-Q. Nat Commun 2023; 14:8006. [PMID: 38110397 PMCID: PMC10728430 DOI: 10.1038/s41467-023-43730-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 11/17/2023] [Indexed: 12/20/2023] Open
Abstract
Developing synchronous bilateral Wilms tumor suggests an underlying (epi)genetic predisposition. Here, we evaluate this predisposition in 68 patients using whole exome or genome sequencing (n = 85 tumors from 61 patients with matched germline blood DNA), RNA-seq (n = 99 tumors), and DNA methylation analysis (n = 61 peripheral blood, n = 29 non-diseased kidney, n = 99 tumors). We determine the predominant events for bilateral Wilms tumor predisposition: 1)pre-zygotic germline genetic variants readily detectable in blood DNA [WT1 (14.8%), NYNRIN (6.6%), TRIM28 (5%), and BRCA-related genes (5%)] or 2)post-zygotic epigenetic hypermethylation at 11p15.5 H19/ICR1 that may require analysis of multiple tissue types for diagnosis. Of 99 total tumor specimens, 16 (16.1%) have 11p15.5 normal retention of imprinting, 25 (25.2%) have 11p15.5 copy neutral loss of heterozygosity, and 58 (58.6%) have 11p15.5 H19/ICR1 epigenetic hypermethylation (loss of imprinting). Here, we ascertain the epigenetic and genetic modes of bilateral Wilms tumor predisposition.
Collapse
|
6
|
Chromatin accessibility landscape of relapsed pediatric B-lineage acute lymphoblastic leukemia. Nat Commun 2023; 14:6792. [PMID: 37880218 PMCID: PMC10600232 DOI: 10.1038/s41467-023-42565-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 10/13/2023] [Indexed: 10/27/2023] Open
Abstract
For around half of the pediatric B-lineage acute lymphoblastic leukemia (B-ALL) patients, the molecular mechanism of relapse remains unclear. To fill this gap in knowledge, here we characterize the chromatin accessibility landscape in pediatric relapsed B-ALL. We observe rewired accessible chromatin regions (ACRs) associated with transcription dysregulation in leukemia cells as compared with normal B-cell progenitors. We show that over a quarter of the ACRs in B-ALL are in quiescent regions with high heterogeneity among B-ALLs. We identify subtype-specific and allele-imbalanced chromatin accessibility by integrating multi-omics data. By characterizing the differential ACRs between diagnosis and relapse in B-ALL, we identify alterations in chromatin accessibility during drug treatment. Further analysis of ACRs associated with relapse free survival leads to the identification of a subgroup of B-ALL which show early relapse. These data provide an advanced and integrative portrait of the importance of chromatin accessibility alterations in tumorigenesis and drug responses.
Collapse
|
7
|
Transcriptome sequencing identifies novel EVX fusions involved in transcriptional activation of HOX family genes in pediatric immature T-cell acute lymphoblastic leukemia: two cases reports and a literature review. Int J Hematol 2023; 118:508-513. [PMID: 37243888 DOI: 10.1007/s12185-023-03619-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 05/09/2023] [Accepted: 05/15/2023] [Indexed: 05/29/2023]
Abstract
Driver genomic alterations in pediatric immature T-cell acute lymphoblastic leukemia are not fully known. We report two cases of novel EVX fusions involved in the transcriptional activation of HOX family genes, ETV6::EVX2 and MSI2::EVX1/HOXA13, which activate HOXD and HOXA cluster genes transcription through enhancer hijacking. HOXA and HOXD were the only key transcription factors activated in these cases, which indicates their important roles in leukemogenesis. Our findings elucidate potential drivers for development of T-cell lymphoblastic leukemia, and are valuable for diagnosis and risk stratification of pediatric T-ALL in the era of precision medicine.
Collapse
|
8
|
Allele-specific RNA N 6-methyladenosine modifications reveal functional genetic variants in human tissues. Genome Res 2023; 33:1369-1380. [PMID: 37714712 PMCID: PMC10547253 DOI: 10.1101/gr.277704.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Accepted: 06/13/2023] [Indexed: 09/17/2023]
Abstract
An intricate network of cis- and trans-elements acts on RNA N 6-methyladenosine (m6A), which in turn may affect gene expression and, ultimately, human health. A complete understanding of this network requires new approaches to accurately measure the subtle m6A differences arising from genetic variants, many of which have been associated with common diseases. To address this gap, we developed a method to accurately and sensitively detect transcriptome-wide allele-specific m6A (ASm6A) from MeRIP-seq data and applied it to uncover 12,056 high-confidence ASm6A modifications from 25 human tissues. We also identified 1184 putative functional variants for ASm6A regulation, a subset of which we experimentally validated. Importantly, we found that many of these ASm6A-associated genetic variants were enriched for common disease-associated and complex trait-associated risk loci, and verified that two disease risk variants can change m6A modification status. Together, this work provides a tool to detangle the dynamic network of RNA modifications at the allelic level and highlights the interplay of m6A and genetics in human health and disease.
Collapse
|
9
|
Etiology of oncogenic fusions in 5,190 childhood cancers and its clinical and therapeutic implication. Nat Commun 2023; 14:1739. [PMID: 37019972 PMCID: PMC10076316 DOI: 10.1038/s41467-023-37438-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 03/16/2023] [Indexed: 04/07/2023] Open
Abstract
Oncogenic fusions formed through chromosomal rearrangements are hallmarks of childhood cancer that define cancer subtype, predict outcome, persist through treatment, and can be ideal therapeutic targets. However, mechanistic understanding of the etiology of oncogenic fusions remains elusive. Here we report a comprehensive detection of 272 oncogenic fusion gene pairs by using tumor transcriptome sequencing data from 5190 childhood cancer patients. We identify diverse factors, including translation frame, protein domain, splicing, and gene length, that shape the formation of oncogenic fusions. Our mathematical modeling reveals a strong link between differential selection pressure and clinical outcome in CBFB-MYH11. We discover 4 oncogenic fusions, including RUNX1-RUNX1T1, TCF3-PBX1, CBFA2T3-GLIS2, and KMT2A-AFDN, with promoter-hijacking-like features that may offer alternative strategies for therapeutic targeting. We uncover extensive alternative splicing in oncogenic fusions including KMT2A-MLLT3, KMT2A-MLLT10, C11orf95-RELA, NUP98-NSD1, KMT2A-AFDN and ETV6-RUNX1. We discover neo splice sites in 18 oncogenic fusion gene pairs and demonstrate that such splice sites confer therapeutic vulnerability for etiology-based genome editing. Our study reveals general principles on the etiology of oncogenic fusions in childhood cancer and suggests profound clinical implications including etiology-based risk stratification and genome-editing-based therapeutics.
Collapse
|
10
|
Genomic, transcriptomic, and metabolomic analysis of Oldenlandia corymbosa reveals the biosynthesis and mode of action of anti-cancer metabolites. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023. [PMID: 36807520 DOI: 10.1111/jipb.13469] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 02/18/2023] [Indexed: 06/18/2023]
Abstract
Plants accumulate a vast array of secondary metabolites, which constitute a natural resource for pharmaceuticals. Oldenlandia corymbosa belongs to the Rubiaceae family, and has been used in traditional medicine to treat different diseases, including cancer. However, the active metabolites of the plant, their biosynthetic pathway and mode of action in cancer are unknown. To fill these gaps, we exposed this plant to eight different stress conditions and combined different omics data capturing gene expression, metabolic profiles, and anti-cancer activity. Our results show that O. corymbosa extracts are active against breast cancer cell lines and that ursolic acid is responsible for this activity. Moreover, we assembled a high-quality genome and uncovered two genes involved in the biosynthesis of ursolic acid. Finally, we also revealed that ursolic acid causes mitotic catastrophe in cancer cells and identified three high-confidence protein binding targets by Cellular Thermal Shift Assay (CETSA) and reverse docking. Altogether, these results constitute a valuable resource to further characterize the biosynthesis of active metabolites in the Oldenlandia group, while the mode of action of ursolic acid will allow us to further develop this valuable compound.
Collapse
|
11
|
Revana: a comprehensive tool for regulatory variant analysis and visualization of cancer genomes. Bioinformatics 2022; 39:6964378. [PMID: 36576005 PMCID: PMC9822537 DOI: 10.1093/bioinformatics/btac831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 12/19/2022] [Indexed: 12/29/2022] Open
Abstract
MOTIVATION As non-coding driver mutations move more into the focus of cancer research, a comprehensive and easy-to-use software solution for regulatory variant analysis and data visualization is highly relevant. The interpretation of regulatory variants in large tumor genome cohorts requires specialized analysis and visualization of multiple layers of data, including for example breakpoints of structural variants, enhancer elements and additional available gene locus annotation, in the context of changes in gene expression. RESULTS We introduce a user-friendly tool, Revana (REgulatory Variant ANAlysis), that can aggregate and visually represent regulatory variants from cancer genomes in a gene-centric manner. It requires whole-genome and RNA sequencing data of a cohort of tumor samples and creates interactive HTML reports summarizing the most important regulatory events. AVAILABILITY AND IMPLEMENTATION Revana is implemented in R and JavaScript. It is available for download as an R package under <https://github.com/KiTZ-Heidelberg/revana>. Sample results can be viewed under <https://github.com/KiTZ-Heidelberg/revana-demo-report> and a short walkthrough is available under <https://github.com/KiTZ-Heidelberg/revana-demo-data>. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
12
|
Functional analysis of structural variants in single cells using Strand-seq. Nat Biotechnol 2022:10.1038/s41587-022-01551-4. [PMID: 36424487 DOI: 10.1038/s41587-022-01551-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 10/07/2022] [Indexed: 11/27/2022]
Abstract
Somatic structural variants (SVs) are widespread in cancer, but their impact on disease evolution is understudied due to a lack of methods to directly characterize their functional consequences. We present a computational method, scNOVA, which uses Strand-seq to perform haplotype-aware integration of SV discovery and molecular phenotyping in single cells by using nucleosome occupancy to infer gene expression as a readout. Application to leukemias and cell lines identifies local effects of copy-balanced rearrangements on gene deregulation, and consequences of SVs on aberrant signaling pathways in subclones. We discovered distinct SV subclones with dysregulated Wnt signaling in a chronic lymphocytic leukemia patient. We further uncovered the consequences of subclonal chromothripsis in T cell acute lymphoblastic leukemia, which revealed c-Myb activation, enrichment of a primitive cell state and informed successful targeting of the subclone in cell culture, using a Notch inhibitor. By directly linking SVs to their functional effects, scNOVA enables systematic single-cell multiomic studies of structural variation in heterogeneous cell populations.
Collapse
|
13
|
Interpretation of the role of germline and somatic non-coding mutations in cancer: expression and chromatin conformation informed analysis. Clin Epigenetics 2022; 14:120. [PMID: 36171609 PMCID: PMC9520844 DOI: 10.1186/s13148-022-01342-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 09/21/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND There has been extensive scrutiny of cancer driving mutations within the exome (especially amino acid altering mutations) as these are more likely to have a clear impact on protein functions, and thus on cell biology. However, this has come at the neglect of systematic identification of regulatory (non-coding) variants, which have recently been identified as putative somatic drivers and key germline risk factors for cancer development. Comprehensive understanding of non-coding mutations requires understanding their role in the disruption of regulatory elements, which then disrupt key biological functions such as gene expression. MAIN BODY We describe how advancements in sequencing technologies have led to the identification of a large number of non-coding mutations with uncharacterized biological significance. We summarize the strategies that have been developed to interpret and prioritize the biological mechanisms impacted by non-coding mutations, focusing on recent annotation of cancer non-coding variants utilizing chromatin states, eQTLs, and chromatin conformation data. CONCLUSION We believe that a better understanding of how to apply different regulatory data types into the study of non-coding mutations will enhance the discovery of novel mechanisms driving cancer.
Collapse
|
14
|
Abstract
Structural variations (SVs) affect more of the cancer genome than any other type of somatic genetic alteration but difficulties in detecting and interpreting them have limited our understanding. Clinical cancer sequencing also increasingly aims to detect SVs, leading to a widespread necessity to interpret their biological and clinical relevance. Recently, analyses of large whole-genome sequencing data sets revealed features that impact rates of SVs across the genome in different cancers. A striking feature has been the extent to which, in both their generation and their influence on the selective fitness of cancer cells, SVs are more specific to individual cancer types than other genetic alterations such as single-nucleotide variants. This Perspective discusses how the folding of the 3D genome, and differences in its folding across cell types, affect observed SV rates in different cancer types as well as how SVs can impact cancer cell fitness.
Collapse
|
15
|
Single base-pair resolution analysis of DNA binding motif with MoMotif reveals an oncogenic function of CTCF zinc-finger 1 mutation. Nucleic Acids Res 2022; 50:8441-8458. [PMID: 35947648 PMCID: PMC9410893 DOI: 10.1093/nar/gkac658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 07/21/2022] [Indexed: 12/24/2022] Open
Abstract
Defining the impact of missense mutations on the recognition of DNA motifs is highly dependent on bioinformatic tools that define DNA binding elements. However, classical motif analysis tools remain limited in their capacity to identify subtle changes in complex binding motifs between distinct conditions. To overcome this limitation, we developed a new tool, MoMotif, that facilitates a sensitive identification, at the single base-pair resolution, of complex, or subtle, alterations to core binding motifs, discerned from ChIP-seq data. We employed MoMotif to define the previously uncharacterized recognition motif of CTCF zinc-finger 1 (ZF1), and to further define the impact of CTCF ZF1 mutation on its association with chromatin. Mutations of CTCF ZF1 are exclusive to breast cancer and are associated with metastasis and therapeutic resistance, but the underlying mechanisms are unclear. Using MoMotif, we identified an extension of the CTCF core binding motif, necessitating a functional ZF1 to bind appropriately. Using a combination of ChIP-Seq and RNA-Seq, we discover that the inability to bind this extended motif drives an altered transcriptional program associated with the oncogenic phenotypes observed clinically. Our study demonstrates that MoMotif is a powerful new tool for comparative ChIP-seq analysis and characterising DNA-protein contacts.
Collapse
|
16
|
Abstract
The translational challenges in the field of precision oncology are in part related to the biological complexity and diversity of this disease. Technological advances in genomics have facilitated large sequencing efforts and discoveries that have further supported this notion. In this review, we reflect on the impact of these discoveries on our understanding of several concepts: cancer initiation, cancer prevention, early detection, adjuvant therapy and minimal residual disease monitoring, cancer drug resistance, and cancer evolution in metastasis. We discuss key areas of focus for improving cancer outcomes, from biological insights to clinical application, and suggest where the development of these technologies will lead us. Finally, we discuss practical challenges to the wider adoption of molecular profiling in the clinic and the need for robust translational infrastructure.
Collapse
|
17
|
Integrated Genomic Analysis Identifies UBTF Tandem Duplications as a Recurrent Lesion in Pediatric Acute Myeloid Leukemia. Blood Cancer Discov 2022; 3:194-207. [PMID: 35176137 PMCID: PMC9780084 DOI: 10.1158/2643-3230.bcd-21-0160] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 08/27/2021] [Accepted: 01/24/2022] [Indexed: 01/21/2023] Open
Abstract
The genetics of relapsed pediatric acute myeloid leukemia (AML) has yet to be comprehensively defined. Here, we present the spectrum of genomic alterations in 136 relapsed pediatric AMLs. We identified recurrent exon 13 tandem duplications (TD) in upstream binding transcription factor (UBTF) in 9% of relapsed AML cases. UBTF-TD AMLs commonly have normal karyotype or trisomy 8 with cooccurring WT1 mutations or FLT3-ITD but not other known oncogenic fusions. These UBTF-TD events are stable during disease progression and are present in the founding clone. In addition, we observed that UBTF-TD AMLs account for approximately 4% of all de novo pediatric AMLs, are less common in adults, and are associated with poor outcomes and MRD positivity. Expression of UBTF-TD in primary hematopoietic cells is sufficient to enhance serial clonogenic activity and to drive a similar transcriptional program to UBTF-TD AMLs. Collectively, these clinical, genomic, and functional data establish UBTF-TD as a new recurrent mutation in AML. SIGNIFICANCE We defined the spectrum of mutations in relapsed pediatric AML and identified UBTF-TDs as a new recurrent genetic alteration. These duplications are more common in children and define a group of AMLs with intermediate-risk cytogenetic abnormalities, FLT3-ITD and WT1 alterations, and are associated with poor outcomes. See related commentary by Hasserjian and Nardi, p. 173. This article is highlighted in the In This Issue feature, p. 171.
Collapse
|
18
|
Abstract
Distilling biologically meaningful information from cancer genome sequencing data requires comprehensive identification of somatic alterations using rigorous computational methods. As the amount and complexity of sequencing data have increased, so has the number of tools for analysing them. Here, we describe the main steps involved in the bioinformatic analysis of cancer genomes, review key algorithmic developments and highlight popular tools and emerging technologies. These tools include those that identify point mutations, copy number alterations, structural variations and mutational signatures in cancer genomes. We also discuss issues in experimental design, the strengths and limitations of sequencing modalities and methodological challenges for the future.
Collapse
|
19
|
Allele-specific expression reveals genes with recurrent cis-regulatory alterations in high-risk neuroblastoma. Genome Biol 2022; 23:71. [PMID: 35246212 PMCID: PMC8896304 DOI: 10.1186/s13059-022-02640-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 02/23/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Neuroblastoma is a pediatric malignancy with a high frequency of metastatic disease at initial diagnosis. Neuroblastoma tumors have few recurrent protein-coding mutations but contain extensive somatic copy number alterations (SCNAs) suggesting that mutations that alter gene dosage are important drivers of tumorigenesis. Here, we analyze allele-specific expression in 96 high-risk neuroblastoma tumors to discover genes impacted by cis-acting mutations that alter dosage. RESULTS We identify 1043 genes with recurrent, neuroblastoma-specific allele-specific expression. While most of these genes lie within common SCNA regions, many of them exhibit allele-specific expression in copy neutral samples and these samples are enriched for mutations that are predicted to cause nonsense-mediated decay. Thus, both SCNA and non-SCNA mutations frequently alter gene expression in neuroblastoma. We focus on genes with neuroblastoma-specific allele-specific expression in the absence of SCNAs and find 26 such genes that have reduced expression in stage 4 disease. At least two of these genes have evidence for tumor suppressor activity including the transcription factor TFAP2B and the protein tyrosine phosphatase PTPRH. CONCLUSIONS In summary, our allele-specific expression analysis discovers genes that are recurrently dysregulated by both large SCNAs and other cis-acting mutations in high-risk neuroblastoma.
Collapse
|
20
|
Leveraging Allele-Specific Expression for Therapeutic Response Gene Discovery in Glioblastoma. Cancer Res 2021; 82:377-390. [PMID: 34903607 DOI: 10.1158/0008-5472.can-21-0810] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 09/13/2021] [Accepted: 12/02/2021] [Indexed: 11/16/2022]
Abstract
Glioblastoma is the most prevalent primary malignant brain tumor in adults and is characterized by poor prognosis and universal tumor recurrence. Effective glioblastoma treatments are lacking, in part due to somatic mutations and epigenetic reprogramming that alter gene expression and confer drug resistance. To investigate recurrently dysregulated genes in glioblastoma we interrogated allele-specific expression (ASE), the difference in expression between two alleles of a gene, in glioblastoma stem cells (GSC) derived from 43 patients. A total of 118 genes were found with recurrent ASE preferentially in GSCs compared to normal tissues. These genes were enriched for apoptotic regulators, including schlafen family member 11 (SLFN11). Loss of SLFN11 gene expression was associated with aberrant promoter methylation and conferred resistance to chemotherapy and PARP inhibition. Conversely, low SLFN11 expression rendered GSCs susceptible to the oncolytic flavivirus Zika. This discovery effort based upon ASE revealed novel points of vulnerability in GSCs, suggesting a potential alternative treatment strategy for chemotherapy resistant glioblastoma.
Collapse
|
21
|
Redefining the biological basis of lineage-ambiguous leukemia through genomics: BCL11B deregulation in acute leukemias of ambiguous lineage. Best Pract Res Clin Haematol 2021; 34:101329. [PMID: 34865701 DOI: 10.1016/j.beha.2021.101329] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Acute leukemias of ambiguous lineage (ALAL), including mixed phenotype acute leukemia (MPAL) and related entities such as early T-cell precursor acute leukemia (ETP-ALL), remain diagnostic and clinical challenges due to limited understanding of pathogenesis, reliance of immunophenotyping to classify disease, and the lack of a rational approach to guide selection of appropriate therapy. Recent studies utilizing genomic sequencing and complementary approaches have provided key insights that are changing the way in which such leukemias are classified, and potentially, treated. Several recurrent genomic alterations define leukemias that straddle immunophenotypic entities, such as ZNF384-rearranged childhood B-ALL and B/myeloid MPAL, and BCL11B-rearranged T/myeloid MPAL, ETP-ALL and AML. In contrast, some cases of MPAL represent canonical ALL/AML entities exhibiting lineage aberrancy. For many cases of ALAL, experimental approaches indicate lineage aberrancy arises from acquisition of a founding genetic alteration into a hematopoietic stem or progenitor cell. Determination of optimal therapeutic approach requires genomic characterization of uniformly treated ALAL patients in prospective studies, but several approaches, including kinase inhibitors and BH3 mimetics may be efficacious in subsets of ALAL.
Collapse
|
22
|
Integrative Genomic Analysis of Pediatric Myeloid-Related Acute Leukemias Identifies Novel Subtypes and Prognostic Indicators. Blood Cancer Discov 2021; 2:586-599. [PMID: 34778799 PMCID: PMC8580615 DOI: 10.1158/2643-3230.bcd-21-0049] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 07/04/2021] [Accepted: 09/01/2021] [Indexed: 12/17/2022] Open
Abstract
Integrating somatic mutation analysis and gene expression profiling distinguishes pediatric AML subtypes with differential prognoses and clinical risks. Genomic characterization of pediatric patients with acute myeloid leukemia (AML) has led to the discovery of somatic mutations with prognostic implications. Although gene-expression profiling can differentiate subsets of pediatric AML, its clinical utility in risk stratification remains limited. Here, we evaluate gene expression, pathogenic somatic mutations, and outcome in a cohort of 435 pediatric patients with a spectrum of pediatric myeloid-related acute leukemias for biological subtype discovery. This analysis revealed 63 patients with varying immunophenotypes that span a T-lineage and myeloid continuum designated as acute myeloid/T-lymphoblastic leukemia (AMTL). Within AMTL, two patient subgroups distinguished by FLT3-ITD and PRC2 mutations have different outcomes, demonstrating the impact of mutational composition on survival. Across the cohort, variability in outcomes of patients within isomutational subsets is influenced by transcriptional identity and the presence of a stem cell–like gene-expression signature. Integration of gene expression and somatic mutations leads to improved risk stratification.
Collapse
|
23
|
Enhancer Hijacking Drives Oncogenic BCL11B Expression in Lineage-Ambiguous Stem Cell Leukemia. Cancer Discov 2021; 11:2846-2867. [PMID: 34103329 PMCID: PMC8563395 DOI: 10.1158/2159-8290.cd-21-0145] [Citation(s) in RCA: 73] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 04/27/2021] [Accepted: 06/01/2021] [Indexed: 11/16/2022]
Abstract
Lineage-ambiguous leukemias are high-risk malignancies of poorly understood genetic basis. Here, we describe a distinct subgroup of acute leukemia with expression of myeloid, T lymphoid, and stem cell markers driven by aberrant allele-specific deregulation of BCL11B, a master transcription factor responsible for thymic T-lineage commitment and specification. Mechanistically, this deregulation was driven by chromosomal rearrangements that juxtapose BCL11B to superenhancers active in hematopoietic progenitors, or focal amplifications that generate a superenhancer from a noncoding element distal to BCL11B. Chromatin conformation analyses demonstrated long-range interactions of rearranged enhancers with the expressed BCL11B allele and association of BCL11B with activated hematopoietic progenitor cell cis-regulatory elements, suggesting BCL11B is aberrantly co-opted into a gene regulatory network that drives transformation by maintaining a progenitor state. These data support a role for ectopic BCL11B expression in primitive hematopoietic cells mediated by enhancer hijacking as an oncogenic driver of human lineage-ambiguous leukemia. SIGNIFICANCE: Lineage-ambiguous leukemias pose significant diagnostic and therapeutic challenges due to a poorly understood molecular and cellular basis. We identify oncogenic deregulation of BCL11B driven by diverse structural alterations, including de novo superenhancer generation, as the driving feature of a subset of lineage-ambiguous leukemias that transcend current diagnostic boundaries.This article is highlighted in the In This Issue feature, p. 2659.
Collapse
|
24
|
Mutational patterns and clonal evolution from diagnosis to relapse in pediatric acute lymphoblastic leukemia. Sci Rep 2021; 11:15988. [PMID: 34362951 PMCID: PMC8346595 DOI: 10.1038/s41598-021-95109-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Accepted: 07/20/2021] [Indexed: 02/07/2023] Open
Abstract
The mechanisms driving clonal heterogeneity and evolution in relapsed pediatric acute lymphoblastic leukemia (ALL) are not fully understood. We performed whole genome sequencing of samples collected at diagnosis, relapse(s) and remission from 29 Nordic patients. Somatic point mutations and large-scale structural variants were called using individually matched remission samples as controls, and allelic expression of the mutations was assessed in ALL cells using RNA-sequencing. We observed an increased burden of somatic mutations at relapse, compared to diagnosis, and at second relapse compared to first relapse. In addition to 29 known ALL driver genes, of which nine genes carried recurrent protein-coding mutations in our sample set, we identified putative non-protein coding mutations in regulatory regions of seven additional genes that have not previously been described in ALL. Cluster analysis of hundreds of somatic mutations per sample revealed three distinct evolutionary trajectories during ALL progression from diagnosis to relapse. The evolutionary trajectories provide insight into the mutational mechanisms leading relapse in ALL and could offer biomarkers for improved risk prediction in individual patients.
Collapse
|
25
|
Non-Coding Variants in Cancer: Mechanistic Insights and Clinical Potential for Personalized Medicine. Noncoding RNA 2021; 7:47. [PMID: 34449663 PMCID: PMC8395730 DOI: 10.3390/ncrna7030047] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 07/26/2021] [Accepted: 08/01/2021] [Indexed: 12/11/2022] Open
Abstract
The cancer genome is characterized by extensive variability, in the form of Single Nucleotide Polymorphisms (SNPs) or structural variations such as Copy Number Alterations (CNAs) across wider genomic areas. At the molecular level, most SNPs and/or CNAs reside in non-coding sequences, ultimately affecting the regulation of oncogenes and/or tumor-suppressors in a cancer-specific manner. Notably, inherited non-coding variants can predispose for cancer decades prior to disease onset. Furthermore, accumulation of additional non-coding driver mutations during progression of the disease, gives rise to genomic instability, acting as the driving force of neoplastic development and malignant evolution. Therefore, detection and characterization of such mutations can improve risk assessment for healthy carriers and expand the diagnostic and therapeutic toolbox for the patient. This review focuses on functional variants that reside in transcribed or not transcribed non-coding regions of the cancer genome and presents a collection of appropriate state-of-the-art methodologies to study them.
Collapse
|
26
|
Abstract
Tumour formation involves random mutagenic events and positive evolutionary selection acting on a subset of such events, referred to as driver mutations. A decade of careful surveying of tumour DNA using exome-based analyses has revealed a multitude of protein-coding somatic driver mutations, some of which are clinically actionable. Today, a transition towards whole-genome analysis is well under way, technically enabling the discovery of potential driver mutations occurring outside protein-coding sequences. Mutations are abundant in this vast non-coding space, which is more than 50 times larger than the coding exome, but reliable identification of selection signals in non-coding DNA remains a challenge. In this Review, we discuss recent findings in the field, where the emerging landscape is one in which non-coding driver mutations appear to be relatively infrequent. Nevertheless, we highlight several notable discoveries. We consider possible reasons for the relative absence of non-coding driver events, as well as the difficulties associated with detecting signals of positive selection in non-coding DNA.
Collapse
|
27
|
Genomes for Kids: The scope of pathogenic mutations in pediatric cancer revealed by comprehensive DNA and RNA sequencing. Cancer Discov 2021; 11:3008-3027. [PMID: 34301788 DOI: 10.1158/2159-8290.cd-20-1631] [Citation(s) in RCA: 78] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Revised: 03/21/2021] [Accepted: 06/23/2021] [Indexed: 11/16/2022]
Abstract
Genomic studies of pediatric cancer have primarily focused on specific tumor types or high-risk disease. Here, we used a three-platform sequencing approach, including whole genome (WGS), exome, and RNA sequencing, to examine tumor and germline genomes from 309 prospectively identified children with newly diagnosed (85%) or relapsed/refractory (15%) cancers, unselected for tumor type. Eighty-six percent of patients harbored diagnostic (53%), prognostic (57%), therapeutically-relevant (25%), and/or cancer predisposing (18%) variants. Inclusion of WGS enabled detection of activating gene fusions and enhancer hijacks (36% and 8% of tumors, respectively), small intragenic deletions (15% of tumors) and mutational signatures revealing of pathogenic variant effects. Evaluation of paired tumor-normal data revealed relevance to tumor development for 55% of pathogenic germline variants. This study demonstrates the power of a three-platform approach that incorporates WGS to interrogate and interpret the full range of genomic variants across newly diagnosed as well as relapsed/refractory pediatric cancers.
Collapse
|
28
|
Regulatory SNPs: Altered Transcription Factor Binding Sites Implicated in Complex Traits and Diseases. Int J Mol Sci 2021; 22:6454. [PMID: 34208629 PMCID: PMC8235176 DOI: 10.3390/ijms22126454] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 06/15/2021] [Accepted: 06/15/2021] [Indexed: 12/19/2022] Open
Abstract
The vast majority of the genetic variants (mainly SNPs) associated with various human traits and diseases map to a noncoding part of the genome and are enriched in its regulatory compartment, suggesting that many causal variants may affect gene expression. The leading mechanism of action of these SNPs consists in the alterations in the transcription factor binding via creation or disruption of transcription factor binding sites (TFBSs) or some change in the affinity of these regulatory proteins to their cognate sites. In this review, we first focus on the history of the discovery of regulatory SNPs (rSNPs) and systematized description of the existing methodical approaches to their study. Then, we brief the recent comprehensive examples of rSNPs studied from the discovery of the changes in the TFBS sequence as a result of a nucleotide substitution to identification of its effect on the target gene expression and, eventually, to phenotype. We also describe state-of-the-art genome-wide approaches to identification of regulatory variants, including both making molecular sense of genome-wide association studies (GWAS) and the alternative approaches the primary goal of which is to determine the functionality of genetic variants. Among these approaches, special attention is paid to expression quantitative trait loci (eQTLs) analysis and the search for allele-specific events in RNA-seq (ASE events) as well as in ChIP-seq, DNase-seq, and ATAC-seq (ASB events) data.
Collapse
|
29
|
SVExpress: identifying gene features altered recurrently in expression with nearby structural variant breakpoints. BMC Bioinformatics 2021; 22:135. [PMID: 33743584 PMCID: PMC7981925 DOI: 10.1186/s12859-021-04072-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 03/15/2021] [Indexed: 12/29/2022] Open
Abstract
Background Combined whole-genome sequencing (WGS) and RNA sequencing of cancers offer the opportunity to identify genes with altered expression due to genomic rearrangements. Somatic structural variants (SVs), as identified by WGS, can involve altered gene cis-regulation, gene fusions, copy number alterations, or gene disruption. The absence of computational tools to streamline integrative analysis steps may represent a barrier in identifying genes recurrently altered by genomic rearrangement. Results Here, we introduce SVExpress, a set of tools for carrying out integrative analysis of SV and gene expression data. SVExpress enables systematic cataloging of genes that consistently show increased or decreased expression in conjunction with the presence of nearby SV breakpoints. SVExpress can evaluate breakpoints in proximity to genes for potential enhancer translocation events or disruption of topologically associated domains, two mechanisms by which SVs may deregulate genes. The output from any commonly used SV calling algorithm may be easily adapted for use with SVExpress. SVExpress can readily analyze genomic datasets involving hundreds of cancer sample profiles. Here, we used SVExpress to analyze SV and expression data across 327 cancer cell lines with combined SV and expression data in the Cancer Cell Line Encyclopedia (CCLE). In the CCLE dataset, hundreds of genes showed altered gene expression in relation to nearby SV breakpoints. Altered genes involved TAD disruption, enhancer hijacking, and gene fusions. When comparing the top set of SV-altered genes from cancer cell lines with the top SV-altered genes previously reported for human tumors from The Cancer Genome Atlas and the Pan-Cancer Analysis of Whole Genomes datasets, a significant number of genes overlapped in the same direction for both cell lines and tumors, while some genes were significant for cell lines but not for human tumors and vice versa. Conclusion Our SVExpress tools allow computational biologists with a working knowledge of R to integrate gene expression with SV breakpoint data to identify recurrently altered genes. SVExpress is freely available for academic or commercial use at https://github.com/chadcreighton/SVExpress. SVExpress is implemented as a set of Excel macros and R code. All source code (R and Visual Basic for Applications) is available. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04072-0.
Collapse
|
30
|
The acquisition of molecular drivers in pediatric therapy-related myeloid neoplasms. Nat Commun 2021; 12:985. [PMID: 33579957 PMCID: PMC7880998 DOI: 10.1038/s41467-021-21255-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Accepted: 01/15/2021] [Indexed: 12/21/2022] Open
Abstract
Pediatric therapy-related myeloid neoplasms (tMN) occur in children after exposure to cytotoxic therapy and have a dismal prognosis. The somatic and germline genomic alterations that drive these myeloid neoplasms in children and how they arise have yet to be comprehensively described. We use whole exome, whole genome, and/or RNA sequencing to characterize the genomic profile of 84 pediatric tMN cases (tMDS: n = 28, tAML: n = 56). Our data show that Ras/MAPK pathway mutations, alterations in RUNX1 or TP53, and KMT2A rearrangements are frequent somatic drivers, and we identify cases with aberrant MECOM expression secondary to enhancer hijacking. Unlike adults with tMN, we find no evidence of pre-existing minor tMN clones (including those with TP53 mutations), but rather the majority of cases are unrelated clones arising as a consequence of cytotoxic therapy. These studies also uncover rare cases of lineage switch disease rather than true secondary neoplasms. Paediatric therapy-related myeloid neoplasms (tMN) have a dismal prognosis and have not been comprehensively profiled. Here the authors characterise the molecular landscape of 84 paediatric tMN patients, and find that, unlike adult tMNs, these do not emerge from pre-existing clones and that MECOM dysregulation is frequent.
Collapse
|
31
|
A pediatric brain tumor atlas of genes deregulated by somatic genomic rearrangement. Nat Commun 2021; 12:937. [PMID: 33568653 PMCID: PMC7876141 DOI: 10.1038/s41467-021-21081-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Accepted: 01/13/2021] [Indexed: 02/08/2023] Open
Abstract
The global impact of somatic structural variants (SSVs) on gene expression in pediatric brain tumors has not been thoroughly characterised. Here, using whole-genome and RNA sequencing from 854 tumors of more than 30 different types from the Children's Brain Tumor Tissue Consortium, we report the altered expression of hundreds of genes in association with the presence of nearby SSV breakpoints. SSV-mediated expression changes involve gene fusions, altered cis-regulation, or gene disruption. SSVs considerably extend the numbers of patients with tumors somatically altered for critical pathways, including receptor tyrosine kinases (KRAS, MET, EGFR, NF1), Rb pathway (CDK4), TERT, MYC family (MYC, MYCN, MYB), and HIPPO (NF2). Compared to initial tumors, progressive or recurrent tumors involve a distinct set of SSV-gene associations. High overall SSV burden associates with TP53 mutations, histone H3.3 gene H3F3C mutations, and the transcription of DNA damage response genes. Compared to adult cancers, pediatric brain tumors would involve a different set of genes with SSV-altered cis-regulation. Our comprehensive and pan-histology genomic analyses reveal SSVs to play a major role in shaping the transcriptome of pediatric brain tumors.
Collapse
|
32
|
Cancer regulatory variation. Curr Opin Genet Dev 2021; 66:41-49. [DOI: 10.1016/j.gde.2020.11.010] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 11/17/2020] [Accepted: 11/26/2020] [Indexed: 12/20/2022]
|
33
|
Exploration of Coding and Non-coding Variants in Cancer Using GenomePaint. Cancer Cell 2021; 39:83-95.e4. [PMID: 33434514 PMCID: PMC7884056 DOI: 10.1016/j.ccell.2020.12.011] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 10/13/2020] [Accepted: 12/10/2020] [Indexed: 12/14/2022]
Abstract
GenomePaint (https://genomepaint.stjude.cloud/) is an interactive visualization platform for whole-genome, whole-exome, transcriptome, and epigenomic data of tumor samples. Its design captures the inter-relatedness between DNA variations and RNA expression, supporting in-depth exploration of both individual cancer genomes and full cohorts. Regulatory non-coding variants can be inspected and analyzed along with coding variants, and their functional impact further explored by examining 3D genome data from cancer cell lines. Further, GenomePaint correlates mutation and expression patterns with patient outcomes, and supports custom data upload. We used GenomePaint to unveil aberrant splicing that disrupts the RING domain of CREBBP, discover cis activation of the MYC oncogene by duplication of the NOTCH1-MYC enhancer in B-lineage acute lymphoblastic leukemia, and explore the inter- and intra-tumor heterogeneity at EGFR in adult glioblastomas. These examples demonstrate that deep multi-omics exploration of individual cancer genomes enabled by GenomePaint can lead to biological insights for follow-up validation.
Collapse
|
34
|
St. Jude Cloud: A Pediatric Cancer Genomic Data-Sharing Ecosystem. Cancer Discov 2021; 11:1082-1099. [PMID: 33408242 DOI: 10.1158/2159-8290.cd-20-1230] [Citation(s) in RCA: 87] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 11/17/2020] [Accepted: 12/14/2020] [Indexed: 02/07/2023]
Abstract
Effective data sharing is key to accelerating research to improve diagnostic precision, treatment efficacy, and long-term survival in pediatric cancer and other childhood catastrophic diseases. We present St. Jude Cloud (https://www.stjude.cloud), a cloud-based data-sharing ecosystem for accessing, analyzing, and visualizing genomic data from >10,000 pediatric patients with cancer and long-term survivors, and >800 pediatric sickle cell patients. Harmonized genomic data totaling 1.25 petabytes are freely available, including 12,104 whole genomes, 7,697 whole exomes, and 2,202 transcriptomes. The resource is expanding rapidly, with regular data uploads from St. Jude's prospective clinical genomics programs. Three interconnected apps within the ecosystem-Genomics Platform, Pediatric Cancer Knowledgebase, and Visualization Community-enable simultaneously performing advanced data analysis in the cloud and enhancing the Pediatric Cancer knowledgebase. We demonstrate the value of the ecosystem through use cases that classify 135 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 pediatric cancer subtypes. SIGNIFICANCE: To advance research and treatment of pediatric cancer, we developed St. Jude Cloud, a data-sharing ecosystem for accessing >1.2 petabytes of raw genomic data from >10,000 pediatric patients and survivors, innovative analysis workflows, integrative multiomics visualizations, and a knowledgebase of published data contributed by the global pediatric cancer community.This article is highlighted in the In This Issue feature, p. 995.
Collapse
|
35
|
Emerging molecular subtypes and therapeutic targets in B-cell precursor acute lymphoblastic leukemia. Front Med 2021; 15:347-371. [PMID: 33400146 DOI: 10.1007/s11684-020-0821-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Accepted: 09/04/2020] [Indexed: 12/13/2022]
Abstract
B-cell precursor acute lymphoblastic leukemia (BCP-ALL) is characterized by genetic alterations with high heterogeneity. Precise subtypes with distinct genomic and/or gene expression patterns have been recently revealed using high-throughput sequencing technology. Most of these profiles are associated with recurrent non-overlapping rearrangements or hotspot point mutations that are analogous to the established subtypes, such as DUX4 rearrangements, MEF2D rearrangements, ZNF384/ZNF362 rearrangements, NUTM1 rearrangements, BCL2/MYC and/or BCL6 rearrangements, ETV6-RUNX1-like gene expression, PAX5alt (diverse PAX5 alterations, including rearrangements, intragenic amplifications, or mutations), and hotspot mutations PAX5 (p.Pro80Arg) with biallelic PAX5 alterations, IKZF1 (p.Asn159Tyr), and ZEB2 (p.His1038Arg). These molecular subtypes could be classified by gene expression patterns with RNA-seq technology. Refined molecular classification greatly improved the treatment strategy. Multiagent therapy regimens, including target inhibitors (e.g., imatinib), immunomodulators, monoclonal antibodies, and chimeric antigen receptor T-cell (CAR-T) therapy, are transforming the clinical practice from chemotherapy drugs to personalized medicine in the field of risk-directed disease management. We provide an update on our knowledge of emerging molecular subtypes and therapeutic targets in BCP-ALL.
Collapse
|
36
|
Aberrant GATA2 Activation in Pediatric B-Cell Acute Lymphoblastic Leukemia. Front Pediatr 2021; 9:795529. [PMID: 35087776 PMCID: PMC8787225 DOI: 10.3389/fped.2021.795529] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 12/02/2021] [Indexed: 11/16/2022] Open
Abstract
GATA2 is a transcription factor that is critical for the generation and survival of hematopoietic stem cells (HSCs). It also plays an important role in the regulation of myeloid differentiation. Accordingly, GATA2 expression is restricted to HSCs and hematopoietic progenitors as well as early erythroid cells and megakaryocytic cells. Here we identified aberrant GATA2 expression in B-cell acute lymphoblastic leukemia (B-ALL) by analyzing transcriptome sequencing data obtained from St. Jude Cloud. Differentially expressed genes upon GATA2 activation showed significantly myeloid-like transcription signature. Further analysis identified several tumor-associated genes as targets of GATA2 activation including BAG3 and EPOR. In addition, the correlation between KMT2A-USP2 fusion and GATA2 activation not only indicates a potential trans-activating mechanism of GATA2 but also suggests that GATA2 is a target of KMT2A-USP2. Furthermore, by integrating whole-genome and transcriptome sequencing data, we showed that GATA2 is also cis activated. A somatic focal deletion located in the GATA2 neighborhood that disrupts the boundaries of topologically associating domains was identified in one B-ALL patient with GATA2 activation. These evidences support the hypothesis that GATA2 could be involved in leukemogenesis of B-ALL and can be transcriptionally activated through multiple mechanisms. The findings of aberrant activation of GATA2 and its molecular function extend our understanding of transcriptional factor dysregulation in B-ALL.
Collapse
|
37
|
The Genome in a Three-Dimensional Context: Deciphering the Contribution of Noncoding Mutations at Enhancers to Blood Cancer. Front Immunol 2020; 11:592087. [PMID: 33117405 PMCID: PMC7575776 DOI: 10.3389/fimmu.2020.592087] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Accepted: 09/21/2020] [Indexed: 11/13/2022] Open
Abstract
Associations between blood cancer and genetic predisposition, including both inherited variants and acquired mutations and epimutations, have been well characterized. However, the majority of these variants affect noncoding regions, making their mechanisms difficult to hypothesize and hindering the translation of these insights into patient benefits. Fueled by unprecedented progress in next-generation sequencing and computational integrative analysis, studies have started applying combinations of epigenetic, genome architecture, and functional assays to bridge the gap between noncoding variants and blood cancer. These complementary tools have not only allowed us to understand the potential malignant role of these variants but also to differentiate key variants, cell-types, and conditions from misleading ones. Here, we briefly review recent studies that have provided fundamental insights into our understanding of how noncoding mutations at enhancers predispose and promote blood malignancies in the context of spatial genome architecture.
Collapse
|
38
|
Pan-neuroblastoma analysis reveals age- and signature-associated driver alterations. Nat Commun 2020; 11:5183. [PMID: 33056981 PMCID: PMC7560655 DOI: 10.1038/s41467-020-18987-4] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Accepted: 08/27/2020] [Indexed: 02/06/2023] Open
Abstract
Neuroblastoma is a pediatric malignancy with heterogeneous clinical outcomes. To better understand neuroblastoma pathogenesis, here we analyze whole-genome, whole-exome and/or transcriptome data from 702 neuroblastoma samples. Forty percent of samples harbor at least one recurrent driver gene alteration and most aberrations, including MYCN, ATRX, and TERT alterations, differ in frequency by age. MYCN alterations occur at median 2.3 years of age, TERT at 3.8 years, and ATRX at 5.6 years. COSMIC mutational signature 18, previously associated with reactive oxygen species, is the most common cause of driver point mutations in neuroblastoma, including most ALK and Ras-activating variants. Signature 18 appears early and is continuous throughout disease evolution. Signature 18 is enriched in neuroblastomas with MYCN amplification, 17q gain, and increased expression of mitochondrial ribosome and electron transport-associated genes. Recurrent FGFR1 variants in six patients, and ALK N-terminal structural alterations in five samples, identify additional patients potentially amenable to precision therapy.
Collapse
|