1
|
|
2
|
Author Correction: Genomic basis for RNA alterations in cancer. Nature 2023; 614:E37. [PMID: 36697831 PMCID: PMC9931574 DOI: 10.1038/s41586-022-05596-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
3
|
Abstract
Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale1-3. Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4-5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter4; identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation5,6; analyses timings and patterns of tumour evolution7; describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity8,9; and evaluates a range of more-specialized features of cancer genomes8,10-18.
Collapse
|
4
|
From trainees to trainers to instructors: Sustainably building a national capacity in bioinformatics training. PLoS Comput Biol 2019; 15:e1006923. [PMID: 31246949 PMCID: PMC6597034 DOI: 10.1371/journal.pcbi.1006923] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Demand for training life scientists in bioinformatics skills led to the development of a train-the-trainer collaboration between the European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI) and 2 Australian organisations, Bioplatforms Australia and Commonwealth Scientific and Industrial Research Organisation (CSIRO) in 2012. The goal of the collaboration was to establish a group of trained instructors who could develop and deliver short bioinformatics courses nationally. A train-the-trainer course introduces instructors to aspects of andragogy and evidence-based learning principles to help them better design, develop, and deliver high-quality training. Since then, both the number of trainers in the network and the course portfolio have grown. Best practises have been developed and shared between the Australian cohort and EMBL-EBI to address common challenges in bioinformatics training. The Australian trainer cohort undertook a train-the-trainer instructor course, again with EMBL-EBI, and subsequently successfully delivered train-the-trainer courses to interested bioinformatics trainers within Australia. We conclude that a train-the-trainer approach can help build national capacity and maintain a critical mass of trained instructors.
Collapse
|
5
|
Abstract
The Bioinformatics Training Platform (BTP) has been developed to provide access to the computational infrastructure required to deliver sophisticated hands-on bioinformatics training courses. The BTP is a cloud-based solution that is in active use for delivering next-generation sequencing training to Australian researchers at geographically dispersed locations. The BTP was built to provide an easy, accessible, consistent and cost-effective approach to delivering workshops at host universities and organizations with a high demand for bioinformatics training but lacking the dedicated bioinformatics training suites required. To support broad uptake of the BTP, the platform has been made compatible with multiple cloud infrastructures. The BTP is an open-source and open-access resource. To date, 20 training workshops have been delivered to over 700 trainees at over 10 venues across Australia using the BTP.
Collapse
|
6
|
Towards an open, collaborative, reusable framework for sharing hands-on bioinformatics training workshops. Brief Bioinform 2017; 18:348-355. [PMID: 26984618 PMCID: PMC5444239 DOI: 10.1093/bib/bbw013] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2015] [Indexed: 11/13/2022] Open
Abstract
There is a clear demand for hands-on bioinformatics training. The development of bioinformatics workshop content is both time-consuming and expensive. Therefore, enabling trainers to develop bioinformatics workshops in a way that facilitates reuse is becoming increasingly important. The most widespread practice for sharing workshop content is through making PDF, PowerPoint and Word documents available online. While this effort is to be commended, such content is usually not so easy to reuse or repurpose and does not capture all the information required for a third party to rerun a workshop. We present an open, collaborative framework for developing and maintaining, reusable and shareable hands-on training workshop content.
Collapse
|
7
|
Whole-genome landscapes of major melanoma subtypes. Nature 2017; 545:175-180. [PMID: 28467829 DOI: 10.1038/nature22071] [Citation(s) in RCA: 879] [Impact Index Per Article: 125.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2015] [Accepted: 03/15/2017] [Indexed: 12/16/2022]
Abstract
Melanoma of the skin is a common cancer only in Europeans, whereas it arises in internal body surfaces (mucosal sites) and on the hands and feet (acral sites) in people throughout the world. Here we report analysis of whole-genome sequences from cutaneous, acral and mucosal subtypes of melanoma. The heavily mutated landscape of coding and non-coding mutations in cutaneous melanoma resolved novel signatures of mutagenesis attributable to ultraviolet radiation. However, acral and mucosal melanomas were dominated by structural changes and mutation signatures of unknown aetiology, not previously identified in melanoma. The number of genes affected by recurrent mutations disrupting non-coding sequences was similar to that affected by recurrent mutations to coding sequences. Significantly mutated genes included BRAF, CDKN2A, NRAS and TP53 in cutaneous melanoma, BRAF, NRAS and NF1 in acral melanoma and SF3B1 in mucosal melanoma. Mutations affecting the TERT promoter were the most frequent of all; however, neither they nor ATRX mutations, which correlate with alternative telomere lengthening, were associated with greater telomere length. Most melanomas had potentially actionable mutations, most in components of the mitogen-activated protein kinase and phosphoinositol kinase pathways. The whole-genome mutation landscape of melanoma reveals diverse carcinogenic processes across its subtypes, some unrelated to sun exposure, and extends potential involvement of the non-coding genome in its pathogenesis.
Collapse
|
8
|
Tumour procurement, DNA extraction, coverage analysis and optimisation of mutation-detection algorithms for human melanoma genomes. Pathology 2016; 47:683-93. [PMID: 26517638 DOI: 10.1097/pat.0000000000000324] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Whole genome sequencing (WGS) of cancer patients' tumours offers the most comprehensive method of identifying both novel and known clinically-actionable genomic targets. However, the practicalities of performing WGS on clinical samples are poorly defined.This study was designed to test sample preparation, sequencing specifications and bioinformatic algorithms for their effect on accuracy and cost-efficiency in a large WGS analysis of human melanoma samples.WGS was performed on melanoma cell lines (n = 15) and melanoma fresh frozen tumours (n = 222). The appropriate level of coverage and the optimal mutation detection algorithm for the project pipeline were determined.An incremental increase in sequencing coverage from 36X to 132X in melanoma tissue samples and 30X to 103X for cell lines only resulted in a small increase (1-2%) in the number of mutations detected, and the quality scores of the additional mutations indicated a low probability that the mutations were real. The results suggest that 60X coverage for melanoma tissue and 40X for melanoma cell lines empower the detection of 98-99% of informative single nucleotide variants (SNVs), a sensitivity level at which clinical decision making or landscape research projects can be carried out with a high degree of confidence in the results. Likewise the bioinformatic mutation analysis methodology strongly influenced the number and quality of SNVs detected. Detecting mutations in the blood genomes separate to the tumour genomes generated 41% more SNVs than if the blood and melanoma tissue genomes were analysed simultaneously. Therefore, simultaneous analysis should be employed on matched melanoma tissue and blood genomes to reduce errors in mutation detection.This study provided valuable insights into the accuracy of SNV with WGS at various coverage levels in human clinical cancer specimens. Additionally, we investigated the accuracy of the publicly available mutation detection algorithms to detect cancer specific SNVs which will aid researchers and clinicians in study design and implementation of WGS for the identification of somatic mutations in other cancers.
Collapse
|
9
|
The cardiac-restricted protein ADP-ribosylhydrolase-like 1 is essential for heart chamber outgrowth and acts on muscle actin filament assembly. Dev Biol 2016; 416:373-88. [PMID: 27217161 PMCID: PMC4990356 DOI: 10.1016/j.ydbio.2016.05.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2015] [Revised: 04/29/2016] [Accepted: 05/06/2016] [Indexed: 12/22/2022]
Abstract
Adprhl1, a member of the ADP-ribosylhydrolase protein family, is expressed exclusively in the developing heart of all vertebrates. In the amphibian Xenopus laevis, distribution of its mRNA is biased towards actively growing chamber myocardium. Morpholino oligonucleotide-mediated knockdown of all Adprhl1 variants inhibits striated myofibril assembly and prevents outgrowth of the ventricle. The resulting ventricles retain normal electrical conduction and express markers of chamber muscle differentiation but are functionally inert. Using a cardiac-specific Gal4 binary expression system, we show that the abundance of Adprhl1 protein in tadpole hearts is tightly controlled through a negative regulatory mechanism targeting the 5′-coding sequence of Xenopus adprhl1. Over-expression of full length (40 kDa) Adprhl1 variants modified to escape such repression, also disrupts cardiac myofibrillogenesis. Disarrayed myofibrils persist that show extensive branching, with sarcomere division occurring at the actin-Z-disc boundary. Ultimately, Adprhl1-positive cells contain thin actin threads, connected to numerous circular branch points. Recombinant Adprhl1 can localize to stripes adjacent to the Z-disc, suggesting a direct role for Adprhl1 in modifying Z-disc and actin dynamics as heart chambers grow. Modelling the structure of Adprhl1 suggests this cardiac-specific protein is a pseudoenzyme, lacking key residues necessary for ADP-ribosylhydrolase catalytic activity. Adprhl1 is expressed exclusively in the heart of all vertebrates. Morpholino knockdown of Adprhl1 prevents outgrowth of the ventricle. Elevated 40 kDa Adprhl1 produces disarrayed myofibrils that show extensive branching. The 5′-coding sequence of Xenopus adprhl1 influences the synthesis of Adprhl1 protein. Two Adprhl1 proteins, 40+23 kDa exist in Xenopus embryos and are conserved in mouse.
Collapse
|
10
|
Next-generation sequencing: a challenge to meet the increasing demand for training workshops in Australia. Brief Bioinform 2013; 14:563-74. [PMID: 23543352 PMCID: PMC3771231 DOI: 10.1093/bib/bbt022] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The widespread adoption of high-throughput next-generation sequencing (NGS) technology among the Australian life science research community is highlighting an urgent need to up-skill biologists in tools required for handling and analysing their NGS data. There is currently a shortage of cutting-edge bioinformatics training courses in Australia as a consequence of a scarcity of skilled trainers with time and funding to develop and deliver training courses. To address this, a consortium of Australian research organizations, including Bioplatforms Australia, the Commonwealth Scientific and Industrial Research Organisation and the Australian Bioinformatics Network, have been collaborating with EMBL-EBI training team. A group of Australian bioinformaticians attended the train-the-trainer workshop to improve training skills in developing and delivering bioinformatics workshop curriculum. A 2-day NGS workshop was jointly developed to provide hands-on knowledge and understanding of typical NGS data analysis workflows. The road show–style workshop was successfully delivered at five geographically distant venues in Australia using the newly established Australian NeCTAR Research Cloud. We highlight the challenges we had to overcome at different stages from design to delivery, including the establishment of an Australian bioinformatics training network and the computing infrastructure and resource development. A virtual machine image, workshop materials and scripts for configuring a machine with workshop contents have all been made available under a Creative Commons Attribution 3.0 Unported License. This means participants continue to have convenient access to an environment they had become familiar and bioinformatics trainers are able to access and reuse these resources.
Collapse
|
11
|
Bread matters: a national initiative to profile the genetic diversity of Australian wheat. PLANT BIOTECHNOLOGY JOURNAL 2012; 10:703-8. [PMID: 22681313 DOI: 10.1111/j.1467-7652.2012.00717.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
The large and complex genome of wheat makes genetic and genomic analysis in this important species both expensive and resource intensive. The application of next-generation sequencing technologies is particularly resource intensive, with at least 17 Gbp of sequence data required to obtain minimal (1×) coverage of the genome. A similar volume of data would represent almost 40× coverage of the rice genome. Progress can be made through the establishment of consortia to produce shared genomic resources. Australian wheat genome researchers, working with Bioplatforms Australia, have collaborated in a national initiative to establish a genetic diversity dataset representing Australian wheat germplasm based on whole genome next-generation sequencing data. Here, we describe the establishment and validation of this resource which can provide a model for broader international initiatives for the analysis of large and complex genomes.
Collapse
|
12
|
Abstract
To make full use of research data, the bioscience community needs to adopt technologies and reward mechanisms that support interoperability and promote the growth of an open 'data commoning' culture. Here we describe the prerequisites for data commoning and present an established and growing ecosystem of solutions using the shared 'Investigation-Study-Assay' framework to support that vision.
Collapse
|
13
|
Constitutively active signal transducer and activator of transcription 5 can replace the requirement for growth hormone in adipogenesis of 3T3-F442A preadipocytes. Mol Endocrinol 2003; 17:2494-508. [PMID: 12970402 DOI: 10.1210/me.2003-0139] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Although it is the best characterized in vitro model of GH action, the mechanisms used by GH to induce differentiation of murine 3T3-F442A preadipocytes remain unclear. Here we have examined the role of three transcriptional regulators in adipogenesis. These regulators are either rapidly induced in response to GH [Stra13, signal transducer and activator of transcription (Stat)3] or of central importance to GH signaling (Stat5). Retroviral transfection of 3T3-F442A preadipocytes was used to increase expression of Stra13, Stat3, and Stat5a. Only Stat5a transfection increased the expression of adipogenic markers peroxisome proliferator-activated receptor gamma, CCAAT enhancer binding protein (C/EBP)alpha, and adipose protein 2/fatty acid-binding protein in response to GH, as determined by quantitative RT-PCR. Transfection with constitutively active Stat3 and Stat5a revealed that constitutively active Stat5a but not Stat3 was able to replace the GH requirement for adipogenesis. Constitutively active Stat5a but not Stat3 was able to increase the formation of lipid droplets and expression of alpha-glycerol phosphate dehydrogenase toward levels seen in mature adipocytes. Constitutively active Stat5a was also able to increase the expression of transcripts for C/EBPalpha to similar levels as GH, and of C/EBPbeta, peroxisome proliferator-activated receptor gamma, and adipose protein 2/fatty acid-binding protein transcripts to a lesser extent. An in vivo role for GH in murine adipogenesis is supported by significantly decreased epididymal fat depot size in young GH receptor-deleted mice, before manifestation of the lipolytic actions of GH. We conclude that Stat5 is a critical factor in GH-induced, and potentially prolactin-induced, murine adipogenesis.
Collapse
|
14
|
Physiology of normal growth hormone receptor function. J Pediatr Endocrinol Metab 2002; 15 Suppl 5:1443-6. [PMID: 12511005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/28/2023]
|
15
|
Abstract
A substantial number of GH regulated genes have been reported in mature hepatocytes, but genes involved in GH-initiated cell differentiation have not yet been identified. Here we have studied a well-characterised model of GH-dependent differentiation, adipogenesis of 3T3-F442A preadipocytes, to identify genes rapidly induced by GH. Using the suppression subtractive hybridisation technique, we have identified eight genes induced within 60 min of GH treatment, and verified these by northern analysis. Six were identifiable as Stat 2, Stat 3, thrombospondin-1, oncostatin M receptor beta chain, a DEAD box RNA helicase, and muscleblind, a developmental transcription factor. Bioinformatic approaches assigned one of the two remaining unknown genes as a novel 436 residue serine/threonine kinase. As each of the identified genes have important developmental roles, they may be important in initiating GH-induced adipogenesis.
Collapse
|
16
|
Abstract
Growth hormone (GH) is a pleiotropic cytokine that acts upon its target cells to regulate their growth, differentiation and metabolism. GH is thought to act by altering gene expression in target cells, but few GH-regulated genes are known. In this study, we used cDNA array analysis to identify genes rapidly induced in the liver of GH-deficient dwarf rats following a single systemic injection of GH. Eight genes were found to upregulate their mRNA expression within 1-3 hours of GH administration, results which were confirmed by northern analysis. The identity of these genes suggests GH may influence a diversity of cellular processes. A role for GH in regulating cytokine and growth factor signalling is suggested by upregulation of mRNAs encoding three signal transducers: a subunit of the receptor for IL-6-type cytokines (gp130), STAT3 (signal transducer and activator of transcription) and p38MAPK (mitogen activated protein kinase). Two genes involved in DNA repair and cell cycle control, APEN (apurinic endonuclease) and GADD45 (growth arrest and DNA damage 45) were upregulated. Other induced genes include those encoding a lactate transporter (MCT-1), an extracellular matrix remodelling enzyme, MTI-MMP (membrane type 1 matrix metalloproteinase) and an acute phase protein (fibrinogen beta). In summary, this work is the first to apply cDNA arrays to the study of peptide hormone action in vivo and has identified 8 novel GH target genes.
Collapse
|
17
|
Abstract
1. The growth hormone (GH) receptor was the first of the class 1 cytokine receptors to be cloned. It shares a number of structural characteristics with other family members and common signalling mechanisms based on common usage of the Janus kinase 2 (JAK2). 2. Growth hormone receptor activation is initiated by GH-induced homodimerization of receptor molecules. This has enabled the creation of specific hormone antagonists that block receptor dimerization. 3. The details of the transcription factors used by the activated receptor are being revealed as a result of promoter analyses and electrophoretic mobility gelshift analysis. 4. Growth hormone receptors are widespread and their discovery in certain tissues has led to the assignment of new physiological roles for GH. Some of these involve local or paracrine roles for GH, as befits its cytokine status. 5. Four examples of such novel roles are discussed. These are: (i) the brain GH axis; (ii) GH and the vitamin B12 axis; (iii) GH in early pre-implantation development; and (iv) GH in development of the tooth. 6. We propose that the view that GH acts through the intermediacy of insulin-like growth factor-1 is simplistic; rather, GH acts to induce an array of growth factors and their receptors and the composition of this array varies with tissue type and, probably, stage of development.
Collapse
|
18
|
Ternary complex factors Elk-1 and Sap-1a mediate growth hormone-induced transcription of egr-1 (early growth response factor-1) in 3T3-F442A preadipocytes. Mol Endocrinol 1999; 13:619-31. [PMID: 10194767 DOI: 10.1210/mend.13.4.0266] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
In our search for transcription factors induced by GH, we have analyzed immediate early gene activation in a model of GH-dependent differentiation. Here we describe the activation of early growth response factor-1 (egr-1) in GH-stimulated 3T3-F442A preadipocytes and the transcription factors responsible for its transactivation. Binding activity of egr-1 in electrophoretic mobility shift assay (EMSA) increased transiently 1 h after GH stimulation, accompanied by a concomitant increase in egr-1 mRNA. egr-1 induction appeared not to be related to proliferation since it was amplified in quiescent preadipocytes at a time when cells were refractive to GH-stimulated DNA synthesis. Truncations of the proximal 1 kb of the egr-1 promoter revealed that a 374-bp region (-624 to -250) contributes about 80% of GH inducibility in 3T3-F442A cells and approximately 90% inducibility in CHO-K1 cells. This region contains three juxtaposed SRE (serum response element)/Ets site pairs known to be important for egr-1 activity in response to exogenous stimuli. Site-specific mutations of individual SRE and Ets sites within this region each reduced GH inducibility of the promoter. Use of these site-specific mutations in EMSA showed that disruption of either Ets or SRE sites abrogated ternary complex formation at the composite sites. DNA binding of ternary complexes, but not binary complexes, in EMSA was rapidly and transiently increased by GH. EMSA supershifts indicated these ternary complexes contained serum response factor (SRF) and the Ets factors Elk-1 and Sap-1a. Coexpression of Sap-1a and Elk-1 resulted in a marked increase in GH induction of egr-1 promoter activity, although transfection with expression vectors for either Ets factor alone did not significantly enhance the GH response. We conclude that GH stimulates transcription of egr-1 primarily through activation of these Ets factors at multiple sites on the promoter and that stabilization of ternary complexes with SRF at these sites maximizes this response.
Collapse
|