51
|
Abstract
Hepatitis B virus (HBV) integration in hepatocellular carcinoma (HCC) is a poorly understood event. In a recent Cancer Cell paper, Lau et al. (2014) describe a HBV-human fusion transcript (HBx-LINE1) that functions as a lncRNA, influences the epithelial-mesenchymal transition, and correlates with reduced patient survival and tumor formation in mice.
Collapse
Affiliation(s)
- Mathias Heikenwalder
- Institute of Virology, Technische Universität München/Helmholtz Zentrum München, 81675 Munich, Germany.
| | - Ulrike Protzer
- Institute of Virology, Technische Universität München/Helmholtz Zentrum München, 81675 Munich, Germany.
| |
Collapse
|
52
|
Katz JP, Pipas JM. SummonChimera infers integrated viral genomes with nucleotide precision from NGS data. BMC Bioinformatics 2014; 15:348. [PMID: 25331652 PMCID: PMC4210586 DOI: 10.1186/s12859-014-0348-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2014] [Accepted: 10/09/2014] [Indexed: 01/03/2023] Open
Abstract
Background Viral integration into a host genome is defined by two chimeric junctions that join viral and host DNA. Recently, computational tools have been developed that utilize NGS data to detect chimeric junctions. These methods identify individual viral-host junctions but do not associate chimeric pairs as an integration event. Without knowing the chimeric boundaries of an integration, its genetic content cannot be determined. Results Summonchimera is a Perl program that associates chimera pairs to infer the complete viral genomic integration event to the nucleotide level within single or paired-end NGS data. SummonChimera integration prediction was verified on a set of single-end IonTorrent reads from a purified Salmonella bacterium with an integrated bacteriophage. Furthermore, SummonChimera predicted integrations from experimentally verified Hepatitis B Virus chimeras within a paired-end Whole Genome Sequencing hepatocellular carcinoma tumor database. Conclusions SummonChimera identified all experimentally verified chimeras detected by current computational methods. Further, SummonChimera integration inference precisely predicted bacteriophage integration. The application of SummonChimera to cancer NGS accurately identifies deletion of host and viral sequence during integration. The precise nucleotide determination of an integration allows prediction of viral and cellular gene transcription patterns. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0348-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Joshua P Katz
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, 15260, USA.
| | - James M Pipas
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, 15260, USA.
| |
Collapse
|
53
|
Ho T, Tzanetakis IE. Development of a virus detection and discovery pipeline using next generation sequencing. Virology 2014; 471-473:54-60. [PMID: 25461531 DOI: 10.1016/j.virol.2014.09.019] [Citation(s) in RCA: 105] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2014] [Revised: 08/28/2014] [Accepted: 09/22/2014] [Indexed: 12/13/2022]
Abstract
Next generation sequencing (NGS) has revolutionized virus discovery. Notwithstanding, a vertical pipeline, from sample preparation to data analysis, has not been available to the plant virology community. We developed a degenerate oligonucleotide primed RT-PCR method with multiple barcodes for NGS, and constructed VirFind, a bioinformatics tool specifically for virus detection and discovery able to: (i) map and filter out host reads, (ii) deliver files of virus reads with taxonomic information and corresponding Blastn and Blastx reports, and (iii) perform conserved domain search for reads of unknown origin. The pipeline was used to process more than 30 samples resulting in the detection of all viruses known to infect the processed samples, the extension of the genomic sequences of others, and the discovery of several novel viruses. VirFind was tested by four external users with datasets from plants or insects, demonstrating its potential as a universal virus detection and discovery tool.
Collapse
Affiliation(s)
- Thien Ho
- Department of Plant Pathology, Division of Agriculture, University of Arkansas System, Fayetteville, AR, USA.
| | - Ioannis E Tzanetakis
- Department of Plant Pathology, Division of Agriculture, University of Arkansas System, Fayetteville, AR, USA.
| |
Collapse
|
54
|
Han L, Vickers KC, Samuels DC, Guo Y. Alternative applications for distinct RNA sequencing strategies. Brief Bioinform 2014; 16:629-39. [PMID: 25246237 DOI: 10.1093/bib/bbu032] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2014] [Accepted: 08/19/2014] [Indexed: 12/30/2022] Open
Abstract
Recent advances in RNA library preparation methods, platform accessibility and cost efficiency have allowed high-throughput RNA sequencing (RNAseq) to replace conventional hybridization microarray platforms as the method of choice for mRNA profiling and transcriptome analyses. RNAseq is a powerful technique to profile both long and short RNA expression, and the depth of information gained from distinct RNAseq methods is striking and facilitates discovery. In addition to expression analysis, distinct RNAseq approaches also allow investigators the ability to assess transcriptional elongation, DNA variance and exogenous RNA content. Here we review the current state of the art in transcriptome sequencing and address epigenetic regulation, quantification of transcription activation, RNAseq output and a diverse set of applications for RNAseq data. We detail how RNAseq can be used to identify allele-specific expression, single-nucleotide polymorphisms and somatic mutations and discuss the benefits and limitations of using RNAseq to monitor DNA characteristics. Moreover, we highlight the power of combining RNA- and DNAseq methods for genomic analysis. In summary, RNAseq provides the opportunity to gain greater insight into transcriptional regulation and output than simply miRNA and mRNA profiling.
Collapse
|
55
|
Lau CC, Sun T, Ching AKK, He M, Li JW, Wong AM, Co NN, Chan AWH, Li PS, Lung RWM, Tong JHM, Lai PBS, Chan HLY, To KF, Chan TF, Wong N. Viral-human chimeric transcript predisposes risk to liver cancer development and progression. Cancer Cell 2014; 25:335-49. [PMID: 24582836 DOI: 10.1016/j.ccr.2014.01.030] [Citation(s) in RCA: 224] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/06/2013] [Revised: 12/07/2013] [Accepted: 01/31/2014] [Indexed: 02/06/2023]
Abstract
The mutagenic effect of hepatitis B (HBV) integration in predisposing risk to hepatocellular carcinoma (HCC) remains elusive. In this study, we performed transcriptome sequencing of HBV-positive HCC cell lines and showed transcription of viral-human gene fusions from the site of genome integrations. We discovered tumor-promoting properties of a chimeric HBx-LINE1 that, intriguingly, functions as a hybrid RNA. HBx-LINE1 can be detected in 23.3% of HBV-associated HCC tumors and correlates with poorer patient survival. HBx-LINE1 transgenic mice showed heightened susceptibility to diethylnitrosamine-induced tumor formation. We further show that HBx-LINE1 expression affects β-catenin transactivity, which underlines a role in activating Wnt signaling. Thus, this study identifies a viral-human chimeric fusion transcript that functions like a long noncoding RNA to promote HCC.
Collapse
Affiliation(s)
- Chi-Chiu Lau
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Tingting Sun
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Arthur K K Ching
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Mian He
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Jing-Woei Li
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China; School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Alissa M Wong
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Ngai Na Co
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Anthony W H Chan
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Pik-Shan Li
- Transgenic Facility, School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Raymond W M Lung
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Joanna H M Tong
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Paul B S Lai
- Department of Surgery, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Henry L Y Chan
- Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Ka-Fai To
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China; State Key Laboratory in Oncology in South China, The Chinese University of Hong Kong, Shatin, Hong Kong, China; State Key Laboratory of Digestive Disease, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Ting-Fung Chan
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong, China.
| | - Nathalie Wong
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong, China; State Key Laboratory in Oncology in South China, The Chinese University of Hong Kong, Shatin, Hong Kong, China; State Key Laboratory of Digestive Disease, The Chinese University of Hong Kong, Shatin, Hong Kong, China.
| |
Collapse
|
56
|
Sensitive detection of viral transcripts in human tumor transcriptomes. PLoS Comput Biol 2013; 9:e1003228. [PMID: 24098097 PMCID: PMC3789765 DOI: 10.1371/journal.pcbi.1003228] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Accepted: 06/04/2013] [Indexed: 02/07/2023] Open
Abstract
In excess of % of human cancer incidents have a viral cofactor. Epidemiological studies of idiopathic human cancers indicate that additional tumor viruses remain to be discovered. Recent advances in sequencing technology have enabled systematic screenings of human tumor transcriptomes for viral transcripts. However, technical problems such as low abundances of viral transcripts in large volumes of sequencing data, viral sequence divergence, and homology between viral and human factors significantly confound identification of tumor viruses. We have developed a novel computational approach for detecting viral transcripts in human cancers that takes the aforementioned confounding factors into account and is applicable to a wide variety of viruses and tumors. We apply the approach to conducting the first systematic search for viruses in neuroblastoma, the most common cancer in infancy. The diverse clinical progression of this disease as well as related epidemiological and virological findings are highly suggestive of a pathogenic cofactor. However, a viral etiology of neuroblastoma is currently contested. We mapped transcriptomes of neuroblastoma as well as positive and negative controls to the human and all known viral genomes in order to detect both known and unknown viruses. Analysis of controls, comparisons with related methods, and statistical estimates demonstrate the high sensitivity of our approach. Detailed investigation of putative viral transcripts within neuroblastoma samples did not provide evidence for the existence of any known human viruses. Likewise, de-novo assembly and analysis of chimeric transcripts did not result in expression signatures associated with novel human pathogens. While confounding factors such as sample dilution or viral clearance in progressed tumors may mask viral cofactors in the data, in principle, this is rendered less likely by the high sensitivity of our approach and the number of biological replicates analyzed. Therefore, our results suggest that frequent viral cofactors of metastatic neuroblastoma are unlikely. Many human cancers are caused by infections with tumor viruses and identification of these pathogens is considered a critical contribution to cancer prevention. Deep sequencing enables us to systematically investigate viral nucleotide signatures in order to either verify or exclude the existence of viruses in idiopathic human cancers. We have developed Virana, a novel computational approach for identifying tumor viruses in human cancers that is applicable to a wide variety of tumors and viruses. Virana firstly addresses several important biological confounding factors that may hinder successful detection of these pathogens. We applied our approach in the first systematic search for cancer-causing viruses in metastatic neuroblastoma, the most common form of cancer in infancy. Although the heterogeneous clinical progression of this disease as well as epidemiological and virological findings are suggestive of a pathogenic cofactor, the viral etiology of neuroblastoma is currently contested. We conducted an analysis of experimental controls, comparisons with related approaches, as well as statistical analyses in order to validate our method. In spite of the high sensitivity of our approach, analyses of neuroblastoma transcriptomes did not provide evidence for the existence of any known or unknown human viruses. Our results therefore suggest that frequent viral cofactors of metastatic neuroblastoma are unlikely.
Collapse
|
57
|
Samuels DC, Han L, Li J, Quanghu S, Clark TA, Shyr Y, Guo Y. Finding the lost treasures in exome sequencing data. Trends Genet 2013; 29:593-9. [PMID: 23972387 DOI: 10.1016/j.tig.2013.07.006] [Citation(s) in RCA: 103] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2013] [Revised: 07/05/2013] [Accepted: 07/24/2013] [Indexed: 01/04/2023]
Abstract
Exome sequencing is one of the most cost-efficient sequencing approaches for conducting genome research on coding regions. However, significant portions of the reads obtained in exome sequencing come from outside of the designed target regions. These additional reads are generally ignored, potentially wasting an important source of genomic data. There are three major types of unintentionally sequenced read that can be found in exome sequencing data: reads in introns and intergenic regions, reads in the mitochondrial genome, and reads originating in viral genomes. All of these can be used for reliable data mining, extending the utility of exome sequencing. Large-scale exome sequencing data repositories, such as The Cancer Genome Atlas (TCGA), the 1000 Genomes Project, National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project, and The Sequence Reads Archive, provide researchers with excellent secondary data-mining opportunities to study genomic data beyond the intended target regions.
Collapse
Affiliation(s)
- David C Samuels
- Center for Human Genetics Research, Vanderbilt University, Nashville, TN, 37232, USA
| | | | | | | | | | | | | |
Collapse
|
58
|
Wang Q, Jia P, Zhao Z. VirusFinder: software for efficient and accurate detection of viruses and their integration sites in host genomes through next generation sequencing data. PLoS One 2013; 8:e64465. [PMID: 23717618 PMCID: PMC3663743 DOI: 10.1371/journal.pone.0064465] [Citation(s) in RCA: 105] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2013] [Accepted: 04/08/2013] [Indexed: 11/23/2022] Open
Abstract
Next generation sequencing (NGS) technologies allow us to explore virus interactions with host genomes that lead to carcinogenesis or other diseases; however, this effort is largely hindered by the dearth of efficient computational tools. Here, we present a new tool, VirusFinder, for the identification of viruses and their integration sites in host genomes using NGS data, including whole transcriptome sequencing (RNA-Seq), whole genome sequencing (WGS), and targeted sequencing data. VirusFinder’s unique features include the characterization of insertion loci of virus of arbitrary type in the host genome and high accuracy and computational efficiency as a result of its well-designed pipeline. The source code as well as additional data of VirusFinder is publicly available at http://bioinfo.mc.vanderbilt.edu/VirusFinder/.
Collapse
Affiliation(s)
- Qingguo Wang
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Peilin Jia
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Center for Quantitative Sciences, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Zhongming Zhao
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Center for Quantitative Sciences, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- * E-mail:
| |
Collapse
|