1
|
Identification and confirmation via in situ hybridization of Merkel cell polyomavirus in rare cases of posttransplant cutaneous T-cell lymphoma. J Cutan Pathol 2023; 50:835-844. [PMID: 37394808 DOI: 10.1111/cup.14486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 05/25/2023] [Accepted: 06/06/2023] [Indexed: 07/04/2023]
Abstract
BACKGROUND Viral infection is an oncogenic factor in many hematolymphoid malignancies. We sought to determine the diagnostic yield of aligning off-target reads incidentally obtained during targeted hematolymphoid next-generation sequencing to a large database of viral genomes to screen for viral sequences within tumor specimens. METHODS Alignment of off-target reads to viral genomes was performed using magicBLAST. Localization of Merkel cell polyomavirus (MCPyV) RNA was confirmed by RNAScope in situ hybridization. Integration analysis was performed using Virus-Clip. RESULTS Four cases of post-cardiac-transplant folliculotropic mycosis fungoides (fMF) and one case of peripheral T-cell lymphoma (PTCL) were positive in off-target reads for MCPyV DNA. Two of the four cases of posttransplant fMF and the case of PTCL showed localization of MCPyV RNA to malignant lymphocytes, whereas the remaining two cases of posttransplant fMF showed MCPyV RNA in keratinocytes. CONCLUSIONS Our findings raise the question of whether MCPyV may play a role in rare cases of T-lymphoproliferative disorders, particularly in the skin and in the heavily immunosuppressed posttransplant setting.
Collapse
|
2
|
Human papillomavirus integration transforms chromatin to drive oncogenesis. Genome Biol 2023; 24:142. [PMID: 37365652 DOI: 10.1186/s13059-023-02926-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 04/07/2023] [Indexed: 06/28/2023] Open
Abstract
BACKGROUND Human papillomavirus (HPV) drives almost all cervical cancers and up to 70% of head and neck cancers. Frequent integration into the host genome occurs predominantly in tumorigenic types of HPV. We hypothesize that changes in chromatin state at the location of integration can result in changes in gene expression that contribute to the tumorigenicity of HPV. RESULTS We find that viral integration events often occur along with changes in chromatin state and expression of genes near the integration site. We investigate whether introduction of new transcription factor binding sites due to HPV integration could invoke these changes. Some regions within the HPV genome, particularly the position of a conserved CTCF binding site, show enriched chromatin accessibility signal. ChIP-seq reveals that the conserved CTCF binding site within the HPV genome binds CTCF in 4 HPV+ cancer cell lines. Significant changes in CTCF binding pattern and increases in chromatin accessibility occur exclusively within 100 kbp of HPV integration sites. The chromatin changes co-occur with out-sized changes in transcription and alternative splicing of local genes. Analysis of The Cancer Genome Atlas (TCGA) HPV+ tumors indicates that HPV integration upregulates genes which have significantly higher essentiality scores compared to randomly selected upregulated genes from the same tumors. CONCLUSIONS Our results suggest that introduction of a new CTCF binding site due to HPV integration reorganizes chromatin state and upregulates genes essential for tumor viability in some HPV+ tumors. These findings emphasize a newly recognized role of HPV integration in oncogenesis.
Collapse
|
3
|
VIS Atlas: A Database of Virus Integration Sites in Human Genome from NGS Data to Explore Integration Patterns. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:300-310. [PMID: 36804047 PMCID: PMC10626058 DOI: 10.1016/j.gpb.2023.02.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 01/08/2023] [Accepted: 02/10/2023] [Indexed: 02/17/2023]
Abstract
Integration of oncogenic DNA viruses into the human genome is a key step in most virus-induced carcinogenesis. Here, we constructed a virus integration site (VIS) Atlas database, an extensive collection of integration breakpoints for three most prevalent oncoviruses, human papillomavirus, hepatitis B virus, and Epstein-Barr virus based on the next-generation sequencing (NGS) data, literature, and experimental data. There are 63,179 breakpoints and 47,411 junctional sequences with full annotations deposited in the VIS Atlas database, comprising 47 virus genotypes and 17 disease types. The VIS Atlas database provides (1) a genome browser for NGS breakpoint quality check, visualization of VISs, and the local genomic context; (2) a novel platform to discover integration patterns; and (3) a statistics interface for a comprehensive investigation of genotype-specific integration features. Data collected in the VIS Atlas aid to provide insights into virus pathogenic mechanisms and the development of novel antitumor drugs. The VIS Atlas database is available at https://www.vis-atlas.tech/.
Collapse
|
4
|
Abstract
Currently, both pathogenic and commensal viruses are continuously being discovered and acknowledged as ubiquitous components of microbial communities. The advancements of systems microbiological approaches have changed the face of virome research. Here, we focus on viral metagenomic approach to study virus community and their interactions with other microbial members as well as their hosts. This review also summarizes challenges, limitations, and benefits of the current virome approaches. Potentially, the studies of virome can be further applied in various biological and clinical fields.
Collapse
|
5
|
FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA. NAR Genom Bioinform 2022; 4:lqac032. [PMID: 35493723 PMCID: PMC9041341 DOI: 10.1093/nargab/lqac032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 03/04/2022] [Accepted: 03/06/2022] [Indexed: 11/13/2022] Open
Abstract
DNA viruses are important infectious agents known to mediate a large number of human diseases, including cancer. Viral integration into the host genome and the formation of hybrid transcripts are also associated with increased pathogenicity. The high variability of viral genomes, however requires the use of sensitive ensemble hidden Markov models that add to the computational complexity, often requiring > 40 CPU-hours per sample. Here, we describe FastViFi, a fast 2-stage filtering method that reduces the computational burden. On simulated and cancer genomic data, FastViFi improved the running time by 2 orders of magnitude with comparable accuracy on challenging data sets. Recently published methods have focused on identification of location of viral integration into the human host genome using local assembly, but do not extend to RNA. To identify human viral hybrid transcripts, we additionally developed ensemble Hidden Markov Models for the Epstein Barr virus (EBV) to add to the models for Hepatitis B (HBV), Hepatitis C (HCV) viruses and the Human Papillomavirus (HPV), and used FastViFi to query RNA-seq data from Gastric cancer (EBV) and liver cancer (HBV/HCV). FastViFi ran in <10 minutes per sample and identified multiple hybrids that fuse viral and human genes suggesting new mechanisms for oncoviral pathogenicity. FastViFi is available at https://github.com/sara-javadzadeh/FastViFi.
Collapse
|
6
|
Comprehensive characterization of viral integrations and genomic aberrations in HBV-infected intrahepatic cholangiocarcinomas. Hepatology 2022; 75:997-1011. [PMID: 34478159 DOI: 10.1002/hep.32135] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 08/08/2021] [Accepted: 08/21/2021] [Indexed: 12/15/2022]
Abstract
BACKGROUND AND AIMS Despite the epidemiological association between intrahepatic cholangiocarcinoma (iCCA) and HBV infection, little is known about the relevant oncogenic effects. We sought to identify the landscape and mechanism of HBV integration, along with the genomic architecture of HBV-infected iCCA (HBV-iCCA) tumors. APPROACH AND RESULTS We profiled a cohort of 108 HBV-iCCAs using whole-genome sequencing, deep sequencing, and RNA sequencing, together with preconstructed data sets of HBV-infected HCC (HBV-HCC; n = 167) and combined hepatocellular cholangiocarcinoma (HBV-cHCC/CCA; n = 59), and conventional (n = 154) and fluke-related iCCAs (n = 16). Platforms based on primary iCCA cell lines to evaluate the functional effects of chimeric transcripts were also used. We found that HBV had inserted at multiple sites in the iCCA genomes in 45 (41.7%) of the tumors. Recurrent viral integration breakpoints were found at nine different sites. The most common insertional hotspot (7 tumors) was in the TERT (telomerase reverse transcriptase) promoter, where insertions and mutations (11 tumors) were mutually exclusive, and were accompanied by promoter hyperactivity. Recurrent HBV integration events (5 tumors) were also detected in FAT2 (FAT atypical cadherin 2), and were associated with enrichment of epithelial-mesenchymal transition-related genes. A distinctive intergenic insertion (chr9p21.3), between DMRTA1 (DMRT like family A1) and LINC01239 (long intergenic non-protein coding RNA 1239), had oncogenic effects through activation of the mammalian target of rapamycin (mTOR)/4EBP/S6K pathway. Regarding the mutational profiles of primary liver cancers, the overall landscape of HBV-iCCA was closer to that of nonviral conventional iCCA, than to HBV-HCC and HBV-cHCC/CCA. CONCLUSIONS Our findings provide insight into the behavior of iCCAs driven by various pathogenic mechanisms involving HBV integration events and associated genomic aberrations. This knowledge should be of use in managing HBV carriers.
Collapse
|
7
|
Isling: a tool for detecting integration of wild-type viruses and clinical vectors. J Mol Biol 2021; 434:167408. [PMID: 34929203 DOI: 10.1016/j.jmb.2021.167408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 12/09/2021] [Accepted: 12/13/2021] [Indexed: 10/19/2022]
Abstract
Detecting viral and vector integration events is a key step when investigating interactions between viral and host genomes. This is relevant in several fields, including virology, cancer research and gene therapy. For example, investigating integrations of wild-type viruses such as human papillomavirus and hepatitis B virus has proven to be crucial for understanding the role of these integrations in cancer. Furthermore, identifying the extent of vector integration is vital for determining the potential for genotoxicity in gene therapies. To address these questions, we developed isling, the first tool specifically designed for identifying viral integrations in both wild-type and vector from next-generation sequencing data. Isling addresses complexities in integration behaviour including integration of fragmented genomes and integration junctions with ambiguous locations in a host or vector genome, and can also flag possible vector recombinations. We show that isling is up to 1.6-fold faster and up to 170% more accurate than other viral integration tools, and performs well on both simulated and real datasets. Isling is therefore an efficient and application-agnostic tool that will enable a broad range of investigations into viral and vector integration. These include comparisons between integrations of wild-type viruses and gene therapy vectors, as well as assessing the genotoxicity of vectors and understanding the role of viruses in cancer.
Collapse
|
8
|
Unraveling the functional role of DNA demethylation at specific promoters by targeted steric blockage of DNA methyltransferase with CRISPR/dCas9. Nat Commun 2021; 12:5711. [PMID: 34588447 PMCID: PMC8481236 DOI: 10.1038/s41467-021-25991-9] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 09/07/2021] [Indexed: 01/10/2023] Open
Abstract
Despite four decades of research to support the association between DNA methylation and gene expression, the causality of this relationship remains unresolved. Here, we reaffirm that experimental confounds preclude resolution of this question with existing strategies, including recently developed CRISPR/dCas9 and TET-based epigenetic editors. Instead, we demonstrate a highly effective method using only nuclease-dead Cas9 and guide RNA to physically block DNA methylation at specific targets in the absence of a confounding flexibly-tethered enzyme, thereby enabling the examination of the role of DNA demethylation per se in living cells, with no evidence of off-target activity. Using this method, we probe a small number of inducible promoters and find the effect of DNA demethylation to be small, while demethylation of CpG-rich FMR1 produces larger changes in gene expression. This method could be used to reveal the extent and nature of the contribution of DNA methylation to gene regulation.
Collapse
|
9
|
Exogene: A performant workflow for detecting viral integrations from paired-end next-generation sequencing data. PLoS One 2021; 16:e0250915. [PMID: 34550971 PMCID: PMC8457494 DOI: 10.1371/journal.pone.0250915] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 07/08/2021] [Indexed: 01/14/2023] Open
Abstract
The integration of viruses into the human genome is known to be associated with tumorigenesis in many cancers, but the accurate detection of integration breakpoints from short read sequencing data is made difficult by human-viral homologies, viral genome heterogeneity, coverage limitations, and other factors. To address this, we present Exogene, a sensitive and efficient workflow for detecting viral integrations from paired-end next generation sequencing data. Exogene’s read filtering and breakpoint detection strategies yield integration coordinates that are highly concordant with long read validation. We demonstrate this concordance across 6 TCGA Hepatocellular carcinoma (HCC) tumor samples, identifying integrations of hepatitis B virus that are also supported by long reads. Additionally, we applied Exogene to targeted capture data from 426 previously studied HCC samples, achieving 98.9% concordance with existing methods and identifying 238 high-confidence integrations that were not previously reported. Exogene is applicable to multiple types of paired-end sequence data, including genome, exome, RNA-Seq and targeted capture.
Collapse
|
10
|
Causes and Consequences of HPV Integration in Head and Neck Squamous Cell Carcinomas: State of the Art. Cancers (Basel) 2021; 13:cancers13164089. [PMID: 34439243 PMCID: PMC8394665 DOI: 10.3390/cancers13164089] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 08/10/2021] [Accepted: 08/11/2021] [Indexed: 12/29/2022] Open
Abstract
A constantly increasing incidence in high-risk Human Papillomaviruses (HPV)s driven head and neck squamous cell carcinomas (HNSCC)s, especially of oropharyngeal origin, is being observed. During persistent infections, viral DNA integration into the host genome may occur. Studies are examining if the physical status of the virus (episomal vs. integration) affects carcinogenesis and eventually has further-reaching consequences on disease progression and outcome. Here, we review the literature of the most recent five years focusing on the impact of HPV integration in HNSCCs, covering aspects of detection techniques used (from PCR up to NGS approaches), integration loci identified, and associations with genomic and clinical data. The consequences of HPV integration in the human genome, including the methylation status and deregulation of genes involved in cell signaling pathways, immune evasion, and response to therapy, are also summarized.
Collapse
|
11
|
The effect of stress on the transcriptomes of circulating immune cells in patients with Gulf War Illness. Life Sci 2021; 281:119719. [PMID: 34144055 DOI: 10.1016/j.lfs.2021.119719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 06/07/2021] [Accepted: 06/08/2021] [Indexed: 11/29/2022]
Abstract
AIMS In an effort to gain further insight into the underlying mechanisms tied to disease onset and progression of Gulf War Illness (GWI), our team evaluated GWI patient response to stress utilizing RNA-Seq. MAIN METHODS The protocol included blood collection before exercise challenge (baseline), at maximal exertion, and after exercise challenge (recovery - four hours post-exercise challenge). Peripheral blood mononuclear cell (PBMC) transcriptomics data were analyzed to understand why GWI patients process stressors differently from their healthy counterparts. KEY FINDINGS Our findings validate previously identified dysregulation of immune and inflammatory pathways among GWI patients as well as highlight novel immune and inflammatory markers of disease activity. These results provide a foundation for future research efforts in understanding GWI pathophysiology and creating targeted treatments. SIGNIFICANCE Gulf War Illness is a complex, chronic, and debilitating multi-system illness impacting 25%-30% of the U.S. troops deployed to the 1990-1991 Gulf War. The condition is characterized by medically unexplained fatigue and affects multiple organ systems. Because the underlying mechanisms are largely unknown, patients receive symptom-based treatment, rather than targeting fundamental biological processes. To the best of our knowledge, this is the first study that applies RNA-Seq to analyze the effect of GWI, and the response to stressors in GWI, on the transcriptomic changes in circulating immune cells.
Collapse
|
12
|
AAV9-mediated FIG4 delivery prolongs life span in Charcot-Marie-Tooth disease type 4J mouse model. J Clin Invest 2021; 131:137159. [PMID: 33878035 PMCID: PMC8159684 DOI: 10.1172/jci137159] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Accepted: 04/15/2021] [Indexed: 12/23/2022] Open
Abstract
Charcot-Marie-Tooth disease type 4J (CMT4J) is caused by recessive, loss-of-function mutations in FIG4, encoding a phosphoinositol(3,5)P2-phosphatase. CMT4J patients have both neuron loss and demyelination in the peripheral nervous system, with vacuolization indicative of endosome/lysosome trafficking defects. Although the disease is highly variable, the onset is often in childhood and FIG4 mutations can dramatically shorten life span. There is currently no treatment for CMT4J. Here, we present the results of preclinical studies testing a gene-therapy approach to restoring FIG4 expression. A mouse model of CMT4J, the Fig4-pale tremor (plt) allele, was dosed with a single-stranded adeno-associated virus serotype 9 (AAV9) to deliver a codon-optimized human FIG4 sequence. Untreated, Fig4plt/plt mice have a median survival of approximately 5 weeks. When treated with the AAV9-FIG4 vector at P1 or P4, mice survived at least 1 year, with largely normal gross motor performance and little sign of neuropathy by neurophysiological or histopathological evaluation. When mice were treated at P7 or P11, life span was still significantly prolonged and peripheral nerve function was improved, but rescue was less complete. No unanticipated adverse effects were observed. Therefore, AAV9-mediated delivery of FIG4 is a well-tolerated and efficacious strategy in a mouse model of CMT4J.
Collapse
|
13
|
SurVirus: a repeat-aware virus integration caller. Nucleic Acids Res 2021; 49:e33. [PMID: 33444454 PMCID: PMC8034624 DOI: 10.1093/nar/gkaa1237] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2020] [Revised: 12/01/2020] [Accepted: 01/12/2021] [Indexed: 01/01/2023] Open
Abstract
A significant portion of human cancers are due to viruses integrating into human genomes. Therefore, accurately predicting virus integrations can help uncover the mechanisms that lead to many devastating diseases. Virus integrations can be called by analysing second generation high-throughput sequencing datasets. Unfortunately, existing methods fail to report a significant portion of integrations, while predicting a large number of false positives. We observe that the inaccuracy is caused by incorrect alignment of reads in repetitive regions. False alignments create false positives, while missing alignments create false negatives. This paper proposes SurVirus, an improved virus integration caller that corrects the alignment of reads which are crucial for the discovery of integrations. We use publicly available datasets to show that existing methods predict hundreds of thousands of false positives; SurVirus, on the other hand, is significantly more precise while it also detects many novel integrations previously missed by other tools, most of which are in repetitive regions. We validate a subset of these novel integrations, and find that the majority are correct. Using SurVirus, we find that HPV and HBV integrations are enriched in LINE and Satellite regions which had been overlooked, as well as discover recurrent HBV and HPV breakpoints in human genome-virus fusion transcripts.
Collapse
|
14
|
VIRUSBreakend: Viral Integration Recognition Using Single Breakends. Bioinformatics 2021; 37:3115-3119. [PMID: 33973999 PMCID: PMC8504616 DOI: 10.1093/bioinformatics/btab343] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 03/25/2021] [Accepted: 05/03/2021] [Indexed: 12/17/2022] Open
Abstract
Motivation Integration of viruses into infected host cell DNA can cause DNA damage and disrupt genes. Recent cost reductions and growth of whole genome sequencing has produced a wealth of data in which viral presence and integration detection is possible. While key research and clinically relevant insights can be uncovered, existing software has not achieved widespread adoption, limited in part due to high computational costs, the inability to detect a wide range of viruses, as well as precision and sensitivity. Results Here, we describe VIRUSBreakend, a high-speed tool that identifies viral DNA presence and genomic integration. It utilizes single breakends, breakpoints in which only one side can be unambiguously placed, in a novel virus-centric variant calling and assembly approach to identify viral integrations with high sensitivity and a near-zero false discovery rate. VIRUSBreakend detects viral integrations anywhere in the host genome including regions such as centromeres and telomeres unable to be called by existing tools. Applying VIRUSBreakend to a large metastatic cancer cohort, we demonstrate that it can reliably detect clinically relevant viral presence and integration including HPV, HBV, MCPyV, EBV and HHV-8. Availability and implementation VIRUSBreakend is part of the Genomic Rearrangement IDentification Software Suite (GRIDSS). It is available under a GPLv3 license from https://github.com/PapenfussLab/VIRUSBreakend. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
15
|
Presence of complete murine viral genome sequences in patient-derived xenografts. Nat Commun 2021; 12:2031. [PMID: 33795676 PMCID: PMC8017013 DOI: 10.1038/s41467-021-22200-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2019] [Accepted: 03/01/2021] [Indexed: 12/13/2022] Open
Abstract
Patient-derived xenografts are crucial for drug development but their use is challenged by issues such as murine viral infection. We evaluate the scope of viral infection and its impact on patient-derived xenografts by taking an unbiased data-driven approach to analyze unmapped RNA-Seq reads from 184 experiments. We find and experimentally validate the extensive presence of murine viral sequence reads covering entire viral genomes in patient-derived xenografts. The existence of viral sequences inside tumor cells is further confirmed by single cell sequencing data. Extensive chimeric reads containing both viral and human sequences are also observed. Furthermore, we find significantly changed expression levels of many cancer-, immune-, and drug metabolism-related genes in samples with high virus load. Our analyses indicate a need to carefully evaluate the impact of viral infection on patient-derived xenografts for drug development. They also point to a need for attention to quality control of patient-derived xenograft experiments.
Collapse
|
16
|
HIVID2: an accurate tool to detect virus integrations in the host genome. Bioinformatics 2021; 37:1821-1827. [PMID: 33453108 DOI: 10.1093/bioinformatics/btab031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Revised: 12/27/2020] [Accepted: 01/12/2021] [Indexed: 12/11/2022] Open
Abstract
MOTIVATION Virus integration in the host genome is frequently reported to be closely associated with many human diseases, and the detection of virus integration is a critically challenging task. However, most existing tools show limited specificity and sensitivity. Therefore, the objective of this study is to develop a method for accurate detection of virus integration into host genomes. RESULTS Herein, we report a novel method termed HIVID2 that is a significant upgrade of HIVID. HIVID2 performs a paired-end combination (PE-combination) for potentially integrated reads. The resulting sequences are then remapped onto the reference genomes, and both split and discordant chimeric reads are used to identify accurate integration breakpoints with high confidence. HIVID2 represents a great improvement in specificity and sensitivity, and predicts breakpoints closer to the real integrations, compared with existing methods. The advantage of our method was demonstrated using both simulated and real data sets. HIVID2 uncovered novel integration breakpoints in well-known cervical cancer-related genes, including FHIT and LRP1B, which was verified using protein expression data. In addition, HIVID2 allows the user to decide whether to automatically perform advanced analysis using the identified virus integrations. By analyzing the simulated data and real data tests, we demonstrated that HIVID2 is not only more accurate than HIVID but also better than other existing programs with respect to both sensitivity and specificity. We believe that HIVID2 will help in enhancing future research associated with virus integration. AVAILABILITY HIVID2 can be accessed at https://github.com/zengxi-hada/HIVID2/. CONTACT Xi Zeng (zengxi@mail.hzau.edu.cn), Linghao Zhao (michael_yifan@126.com). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
17
|
Hepatitis B Virus-Telomerase Reverse Transcriptase Promoter Integration Harnesses Host ELF4, Resulting in Telomerase Reverse Transcriptase Gene Transcription in Hepatocellular Carcinoma. Hepatology 2021; 73:23-40. [PMID: 32170761 PMCID: PMC7898544 DOI: 10.1002/hep.31231] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Revised: 01/17/2020] [Accepted: 02/27/2020] [Indexed: 12/27/2022]
Abstract
BACKGROUND AND AIMS Hepatitis B virus (HBV) integrations are common in hepatocellular carcinoma (HCC). In particular, alterations of the telomerase reverse transcriptase (TERT) gene by HBV integrations are frequent; however, the molecular mechanism and functional consequence underlying TERT HBV integration are unclear. APPROACH AND RESULTS We adopted a targeted sequencing strategy to survey HBV integrations in human HBV-associated HCCs (n = 95). HBV integration at the TERT promoter was frequent (35.8%, n = 34/95) in HCC tumors and was associated with increased TERT mRNA expression and more aggressive tumor behavior. To investigate the functional importance of various integrated HBV components, we employed different luciferase reporter constructs and found that HBV enhancer I (EnhI) was the key viral component leading to TERT activation on integration at the TERT promoter. In addition, the orientation of the HBV integration at the TERT promoter further modulated the degree of TERT transcription activation in HCC cell lines and patients' HCCs. Furthermore, we performed array-based small interfering RNA library functional screening to interrogate the potential major transcription factors that physically interacted with HBV and investigated the cis-activation of host TERT gene transcription on viral integration. We identified a molecular mechanism of TERT activation through the E74 like ETS transcription factor 4 (ELF4), which normally could drive HBV gene transcription. ELF4 bound to the chimeric HBV EnhI at the TERT promoter, resulting in telomerase activation. Stable knockdown of ELF4 significantly reduced the TERT expression and sphere-forming ability in HCC cells. CONCLUSIONS Our results reveal a cis-activating mechanism harnessing host ELF4 and HBV integrated at the TERT promoter and uncover how TERT HBV-integrated HCCs may achieve TERT activation in hepatocarcinogenesis.
Collapse
|
18
|
Comprehensive comparative analysis of methods and software for identifying viral integrations. Brief Bioinform 2020; 20:2088-2097. [PMID: 30102374 DOI: 10.1093/bib/bby070] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Revised: 07/02/2018] [Accepted: 07/12/2018] [Indexed: 12/13/2022] Open
Abstract
Many viruses are capable of integrating in the human genome, particularly viruses involved in tumorigenesis. Viral integrations can be considered genetic markers for discovering virus-caused cancers and inferring cancer cell development. Next-generation sequencing (NGS) technologies have been widely used to screen for viral integrations in cancer genomes, and a number of bioinformatics tools have been developed to detect viral integrations using NGS data. However, there has been no systematic comparison of the methods or software. In this study, we performed a comprehensive comparative analysis of the designs, performance, functionality and limitations among the existing methods and software for detecting viral integrations. We further compared the sensitivity, precision and runtime of integration detection of four representative tools. Our analyses showed that each of the existing software had its own merits; however, none of them were sufficient for parallel or accurate virome-wide detection. After carefully evaluating the limitations shared by the existing methods, we proposed strategies and directions for developing virome-wide integration detection.
Collapse
|
19
|
A First NGS Investigation Suggests No Association Between Viruses and Canine Cancers. Front Vet Sci 2020; 7:365. [PMID: 32766289 PMCID: PMC7380080 DOI: 10.3389/fvets.2020.00365] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Accepted: 05/26/2020] [Indexed: 12/16/2022] Open
Abstract
Approximately 10–15% of worldwide human cancers are attributable to viral infection. When operating as carcinogenic elements, viruses may act with various mechanisms, but the most important is represented by viral integration into the host genome, causing chromosome instability, genomic mutations, and aberrations. In canine species, few reports have described an association between viral integration and canine cancers, but more comprehensive studies are needed. The advancement of next-generation sequencing and the cost reduction have resulted in a progressive increasing of sequencing data in veterinary oncology offering an opportunity to study virome in canine cancers. In this study, we have performed viral detection and integration analyses using VirusFinder2 software tool on available whole-genome and whole-exome sequencing data of different canine cancers. Several viral sequences were detected in lymphomas, hemangiosarcomas, melanomas, and osteosarcomas, but no reliable integration sites were identified. Even if with some limitations such as the depth and type of sequencing, a restricted number of available nonhuman genomes software, and a limited knowledge on endogenous retroviruses in the canine genome, results are compelling. However, further experiments are needed, and similarly to feline species, dedicated analysis tools for the identification of viral integration sites in canine cancers are required.
Collapse
|
20
|
VSeq-Toolkit: Comprehensive Computational Analysis of Viral Vectors in Gene Therapy. MOLECULAR THERAPY-METHODS & CLINICAL DEVELOPMENT 2020; 17:752-757. [PMID: 32346552 PMCID: PMC7177155 DOI: 10.1016/j.omtm.2020.03.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 03/25/2020] [Indexed: 11/17/2022]
Abstract
Viral vector characterization and analysis are important components for the development of safe gene therapeutic products, elucidating the potential genotoxic and immunogenic effects of vectors and establishing their safety profiles. Here, we present VSeq-Toolkit, which offers varying analysis modes for viral gene therapy data. The first mode determines the undesirable known contaminants and their frequency in viral preparations or other sequencing data. The second mode is designed for the analysis of intra-vector fusion breakpoints and the third mode for unraveling the viral-host fusion events distribution. Analysis modes of our toolkit can be executed independently or together and allow the analysis of multiple viral vectors concurrently. It has been designed and evaluated for the analysis of short read high-throughput sequencing data, including whole-genome or targeted sequencing. VSeq-Toolkit is developed in Perl and Bash programming languages and is available at https://github.com/CompMeth/VSeq-Toolkit.
Collapse
|
21
|
Microbiome signatures in prostate cancer. Carcinogenesis 2020; 40:749-764. [PMID: 30794288 DOI: 10.1093/carcin/bgz008] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Revised: 11/21/2018] [Accepted: 02/01/2019] [Indexed: 12/20/2022] Open
Abstract
We have established a microbiome signature for prostate cancer using an array-based metagenomic and capture-sequencing approach. A diverse microbiome signature (viral, bacterial, fungal and parasitic) was observed in the prostate cancer samples compared with benign prostate hyperplasia controls. Hierarchical clustering analysis identified three distinct prostate cancer-specific microbiome signatures. The three signatures correlated with different grades, stages and scores of the cancer. Thus, microbiome signature analysis potentially provides clinical diagnosis and outcome predictions. The array data were validated by PCR and targeted next-generation sequencing (NGS). Specific NGS data suggested that certain viral genomic sequences were inserted into the host somatic chromosomes of the prostate cancer samples. A randomly selected group of these was validated by direct PCR and sequencing. In addition, PCR validation of Helicobacter showed that Helicobacter cagA sequences integrated within specific chromosomes of prostate tumor cells. The viral and Helicobacter integrations are predicted to affect the expression of several cellular genes associated with oncogenic processes.
Collapse
|
22
|
Abstract
Adeno-associated virus (AAV) vectors have shown promising results in preclinical models, but the genomic consequences of transduction with AAV vectors encoding CRISPR-Cas nucleases is still being examined. In this study, we observe high levels of AAV integration (up to 47%) into Cas9-induced double-strand breaks (DSBs) in therapeutically relevant genes in cultured murine neurons, mouse brain, muscle and cochlea. Genome-wide AAV mapping in mouse brain shows no overall increase of AAV integration except at the CRISPR/Cas9 target site. To allow detailed characterization of integration events we engineer a miniature AAV encoding a 465 bp lambda bacteriophage DNA (AAV-λ465), enabling sequencing of the entire integrated vector genome. The integration profile of AAV-465λ in cultured cells display both full-length and fragmented AAV genomes at Cas9 on-target sites. Our data indicate that AAV integration should be recognized as a common outcome for applications that utilize AAV for genome editing.
Collapse
|
23
|
Abstract
Liver cancers are highly heterogeneous with poor prognosis and drug response. A better understanding between genetic alterations and drug responses would facilitate precision treatment for liver cancers. To characterize the landscape of pharmacogenomic interactions in liver cancers, we developed a protocol to establish human liver cancer cell models at a success rate of around 50% and generated the Liver Cancer Model Repository (LIMORE) with 81 cell models. LIMORE represented genomic and transcriptomic heterogeneity of primary cancers. Interrogation of the pharmacogenomic landscape of LIMORE discovered unexplored gene-drug associations, including synthetic lethalities to prevalent alterations in liver cancers. Moreover, predictive biomarker candidates were suggested for the selection of sorafenib-responding patients. LIMORE provides a rich resource facilitating drug discovery in liver cancers.
Collapse
MESH Headings
- Animals
- Antineoplastic Agents/pharmacology
- Asian People/genetics
- Biomarkers, Tumor/genetics
- Carcinoma, Hepatocellular/drug therapy
- Carcinoma, Hepatocellular/ethnology
- Carcinoma, Hepatocellular/genetics
- Carcinoma, Hepatocellular/pathology
- Cell Line, Tumor
- Clinical Decision-Making
- Databases, Genetic
- Drug Resistance, Neoplasm/genetics
- Female
- Genetic Heterogeneity
- Genetic Predisposition to Disease
- High-Throughput Nucleotide Sequencing
- Humans
- Liver Neoplasms/drug therapy
- Liver Neoplasms/ethnology
- Liver Neoplasms/genetics
- Liver Neoplasms/pathology
- Male
- Mice, Inbred BALB C
- Mice, Inbred NOD
- Mice, Nude
- Mice, SCID
- Patient Selection
- Pharmacogenomic Testing
- Pharmacogenomic Variants
- Phenotype
- Precision Medicine
- Protein Kinase Inhibitors/pharmacology
- Sorafenib/pharmacology
- Xenograft Model Antitumor Assays
Collapse
|
24
|
A virome-wide clonal integration analysis platform for discovering cancer viral etiology. Genome Res 2019; 29:819-830. [PMID: 30872350 PMCID: PMC6499315 DOI: 10.1101/gr.242529.118] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 03/11/2019] [Indexed: 12/31/2022]
Abstract
Oncoviral infection is responsible for 12%–15% of cancer in humans. Convergent evidence from epidemiology, pathology, and oncology suggests that new viral etiologies for cancers remain to be discovered. Oncoviral profiles can be obtained from cancer genome sequencing data; however, widespread viral sequence contamination and noncausal viruses complicate the process of identifying genuine oncoviruses. Here, we propose a novel strategy to address these challenges by performing virome-wide screening of early-stage clonal viral integrations. To implement this strategy, we developed VIcaller, a novel platform for identifying viral integrations that are derived from any characterized viruses and shared by a large proportion of tumor cells using whole-genome sequencing (WGS) data. The sensitivity and precision were confirmed with simulated and benchmark cancer data sets. By applying this platform to cancer WGS data sets with proven or speculated viral etiology, we newly identified or confirmed clonal integrations of hepatitis B virus (HBV), human papillomavirus (HPV), Epstein-Barr virus (EBV), and BK Virus (BKV), suggesting the involvement of these viruses in early stages of tumorigenesis in affected tumors, such as HBV in TERT and KMT2B (also known as MLL4) gene loci in liver cancer, HPV and BKV in bladder cancer, and EBV in non-Hodgkin's lymphoma. We also showed the capacity of VIcaller to identify integrations from some uncharacterized viruses. This is the first study to systematically investigate the strategy and method of virome-wide screening of clonal integrations to identify oncoviruses. Searching clonal viral integrations with our platform has the capacity to identify virus-caused cancers and discover cancer viral etiologies.
Collapse
|
25
|
Abstract
Background Since tumor often has a high level of intra-tumor heterogeneity, multiple tumor samples from the same patient at different locations or different time points are often sequenced to study tumor intra-heterogeneity or tumor evolution. In virus-related tumors such as human papillomavirus- and Hepatitis B Virus-related tumors, virus genome integrations can be critical driving events. It is thus important to investigate the integration sites of the virus genomes. Currently, a few algorithms for detecting virus integration sites based on high-throughput sequencing have been developed, but their insufficient performance in their sensitivity, specificity and computational complexity hinders their applications in multiple related tumor sequencing. Results We develop VirTect for detecting virus integration sites simultaneously from multiple related-sample data. This algorithm is mainly based on the joint analysis of short reads spanning breakpoints of integration sites from multiple samples. To achieve high specificity and breakpoint accuracy, a local precise sandwich alignment algorithm is used. Simulation and real data analyses show that, compared with other algorithms, VirTect is significantly more sensitive and has a similar or lower false discovery rate. Conclusions VirTect can provide more accurate breakpoint position and is computationally much more efficient in terms both memory requirement and computational time. Electronic supplementary material The online version of this article (10.1186/s12920-018-0461-8) contains supplementary material, which is available to authorized users.
Collapse
|
26
|
VIpower: Simulation-based tool for estimating power of viral integration detection via high-throughput sequencing. Genomics 2019; 112:207-211. [PMID: 30710609 DOI: 10.1016/j.ygeno.2019.01.015] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Revised: 12/31/2018] [Accepted: 01/22/2019] [Indexed: 12/12/2022]
Abstract
Viral sequence integrations in the human genome have been implicated in various human diseases. Viral integrations remain among the most challenging-to-detect structural changes of the human genome. No studies have systematically analyzed how molecular and bioinformatics factors affect the power (sensitivity) to detect viral integrations using high-throughput sequencing (HTS). We selected a wide-range of molecular and bioinformatics factors covering genome sequence characteristics, HTS features, and viral integration detection. We designed a fast simulation-based framework to model the process of detecting variable viral integration events in the human genome. We then examined the associations of selected factors with viral integration detection power. We identified six factors that significantly affected viral integration detection power (P < 2 × 10-16). The strongest factors associated with detection power included proportion of sample cells with clonal viral integrations (Pearson's ρ = 0.64), sequencing depth (ρ = 0.37), length of viral integration (ρ = 0.37), paired-end read insert size (ρ = 0.23), user-defined threshold (number of supporting reads) to claim successful identification of integrations (ρ = -0.19), and read length (when sequence volume was fixed) (ρ = -0.09). As the first tool of its kind, VIpower incorporates all these factors, which can be manipulated in concert with each other to optimize the detection power. This tool may be used to estimate viral integration detection power for various combinations of sequencing or analytic parameters. It may also be used to estimate the parameters required to achieve a specific power when designing new sequencing experiments.
Collapse
|
27
|
BS-virus-finder: virus integration calling using bisulfite sequencing data. Gigascience 2018; 7:1-7. [PMID: 29267855 PMCID: PMC5788064 DOI: 10.1093/gigascience/gix123] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Accepted: 11/30/2017] [Indexed: 01/10/2023] Open
Abstract
Background DNA methylation plays a key role in the regulation of gene expression and carcinogenesis. Bisulfite sequencing studies mainly focus on calling single nucleotide polymorphism, different methylation region, and find allele-specific DNA methylation. Until now, only a few software tools have focused on virus integration using bisulfite sequencing data. Findings We have developed a new and easy-to-use software tool, named BS-virus-finder (BSVF, RRID:SCR_015727), to detect viral integration breakpoints in whole human genomes. The tool is hosted at https://github.com/BGI-SZ/BSVF. Conclusions BS-virus-finder demonstrates high sensitivity and specificity. It is useful in epigenetic studies and to reveal the relationship between viral integration and DNA methylation. BS-virus-finder is the first software tool to detect virus integration loci by using bisulfite sequencing data.
Collapse
|
28
|
The precision prevention and therapy of HPV-related cervical cancer: new concepts and clinical implications. Cancer Med 2018; 7:5217-5236. [PMID: 30589505 PMCID: PMC6198240 DOI: 10.1002/cam4.1501] [Citation(s) in RCA: 147] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2017] [Revised: 02/14/2018] [Accepted: 03/21/2018] [Indexed: 12/14/2022] Open
Abstract
Cervical cancer is the third most common cancer in women worldwide, with concepts and knowledge about its prevention and treatment evolving rapidly. Human papillomavirus (HPV) has been identified as a major factor that leads to cervical cancer, although HPV infection alone cannot cause the disease. In fact, HPV-driven cancer is a small probability event because most infections are transient and could be cleared spontaneously by host immune system. With persistent HPV infection, decades are required for progression to cervical cancer. Therefore, this long time window provides golden opportunity for clinical intervention, and the fundament here is to elucidate the carcinogenic pattern and applicable targets during HPV-host interaction. In this review, we discuss the key factors that contribute to the persistence of HPV and cervical carcinogenesis, emerging new concepts and technologies for cancer interventions, and more urgently, how these concepts and technologies might lead to clinical precision medicine which could provide prediction, prevention, and early treatment for patients.
Collapse
|
29
|
Relative Abundance of Integrant-Derived Viral RNAs in Infected Tissues Harvested from Chronic Hepatitis B Virus Carriers. J Virol 2018; 92:e02221-17. [PMID: 29491161 PMCID: PMC5923063 DOI: 10.1128/jvi.02221-17] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Accepted: 02/17/2018] [Indexed: 02/07/2023] Open
Abstract
Five matching sets of nonmalignant liver tissues and hepatocellular carcinoma (HCC) samples from individuals chronically infected with hepatitis B virus (HBV) were examined. The HBV genomic sequences were determined by using overlapping PCR amplicons covering the entire viral genome. Four pairs of tissues were infected with HBV genotype C, while one pair was infected with HBV genotype B. HBV replication markers were found in all tissues. In the majority of HCC samples, the levels of pregenomic/precore RNA (pgRNA) and covalently closed circular DNA (cccDNA) were lower than those in liver tissue counterparts. Regardless of the presence of HBV replication markers, (i) integrant-derived HBV RNAs (id-RNAs) were found in all tissues by reverse transcription-PCR (RT-PCR) analysis and were considerably abundant or predominant in 6/10 tissue samples (2 liver and 4 HCC samples), (ii) RNAs that were polyadenylated using the cryptic HBV polyadenylation signal and therefore could be produced by HBV replication or derived from integrated HBV DNA were found in 5/10 samples (3 liver and 2 HCC samples) and were considerably abundant species in 3/10 tissues (2 livers and 1 HCC), and (iii) cccDNA-transcribed RNAs polyadenylated near position 1931 were not abundant in 7/10 tissues (2 liver and 5 HCC samples) and were predominant in only two liver samples. Subsequent RNA sequencing analysis of selected liver/HCC samples also showed relative abundance of id-RNAs in most of the examined tissues. Our findings suggesting that id-RNAs could represent a significant source of HBV envelope proteins, which is independent of viral replication, are discussed in the context of the possible contribution of id-RNAs to the HBV life cycle.IMPORTANCE The relative abundance of integrant-derived HBV RNAs (id-RNAs) in chronically infected tissues suggest that id-RNAs coding for the envelope proteins may facilitate the production of a considerable fraction of surface antigens (HBsAg) in infected cells bearing HBV integrants. If the same cells support HBV replication, then a significant fraction of assembled HBV virions could bear id-RNA-derived HBsAg as a major component of their envelopes. Therefore, the infectivity of these HBV virions and their ability to facilitate virus cell-to-cell spread could be determined mainly by the properties of id-RNA-derived envelope proteins and not by the properties of replication-derived HBsAg. These interpretations suggest that id-RNAs may play a role in the maintenance of chronic HBV infection and therefore contribute to the HBV life cycle. Furthermore, the production of HBsAg from id-RNAs independently of viral replication may explain at least in part why treatment with interferon or nucleos(t)ides in most cases fails to achieve a loss of serum HBsAg.
Collapse
|
30
|
ViFi: accurate detection of viral integration and mRNA fusion reveals indiscriminate and unregulated transcription in proximal genomic regions in cervical cancer. Nucleic Acids Res 2018; 46:3309-3325. [PMID: 29579309 PMCID: PMC6283451 DOI: 10.1093/nar/gky180] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Revised: 02/12/2018] [Accepted: 03/05/2018] [Indexed: 12/20/2022] Open
Abstract
The integration of viral sequences into the host genome is an important driver of tumorigenesis in many viral mediated cancers, notably cervical cancer and hepatocellular carcinoma. We present ViFi, a computational method that combines phylogenetic methods with reference-based read mapping to detect viral integrations. In contrast with read-based reference mapping approaches, ViFi is faster, and shows high precision and sensitivity on both simulated and biological data, even when the integrated virus is a novel strain or highly mutated. We applied ViFi to matched genomic and mRNA data from 68 cervical cancer samples from TCGA and found high concordance between the two. Surprisingly, viral integration resulted in a dramatic transcriptional upregulation in all proximal elements, including LINEs and LTRs that are not normally transcribed. This upregulation is highly correlated with the presence of a viral gene fused with a downstream human element. Moreover, genomic rearrangements suggest the formation of apparent circular extrachromosomal (ecDNA) human-viral structures. Our results suggest the presence of apparent small circular fusion viral/human ecDNA, which correlates with indiscriminate and unregulated expression of proximal genomic elements, potentially contributing to the pathogenesis of HPV-associated cervical cancers. ViFi is available at https://github.com/namphuon/ViFi.
Collapse
|
31
|
Abstract
Humans and other mammals are colonized by microbial agents across the kingdom which can represent a unique microbiome pattern. Dysbiosis of the microbiome has been associated with pathology including cancer. We have identified a microbiome signature unique to ovarian cancers, one of the most lethal malignancies of the female reproductive system, primarily because of its asymptomatic nature during the early stages in development. We screened ovarian cancer samples along with matched, and non-matched control samples using our pan-pathogen array (PathoChip), combined with capture-next generation sequencing. The results show a distinct group of viral, bacterial, fungal and parasitic signatures of high significance in ovarian cases. Further analysis shows specific viral integration sites within the host genome of tumor samples, which may contribute to the carcinogenic process. The ovarian cancer microbiome signature provides insights for the development of targeted therapeutics against ovarian cancers.
Collapse
|
32
|
Utility of high-throughput DNA sequencing in the study of the human papillomaviruses. Virus Genes 2017; 54:17-24. [PMID: 29282656 DOI: 10.1007/s11262-017-1530-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 12/19/2017] [Indexed: 11/28/2022]
Abstract
The Papillomaviridae family is probably the most diverse group of viruses that affect vertebrates. The study of the relationship between infection by certain types of human papillomavirus (HPV) and the development of neoplastic epithelial lesions is of particular interest because of the high prevalence of HPV-related carcinomas in populations of developing countries. To understand the mechanisms of infection and their association with different clinical manifestations, molecular tools play an important role in the description of new types of HPV, the characterization of effector properties of the viral factors, the specific diagnosis and monitoring of HPV types, and the alteration patterns at genetic level in the host. Technological advances in the field of DNA sequencing have led to the development of different next-generation sequencing systems, allowing obtaining a large amount of data and broadening the applications to study viral diseases. In this review, we summarize the main approaches and their perspectives where the use of massively parallel sequencing has been proved as a useful tool in the research of the HPV infection.
Collapse
|
33
|
ChimericSeq: An open-source, user-friendly interface for analyzing NGS data to identify and characterize viral-host chimeric sequences. PLoS One 2017; 12:e0182843. [PMID: 28829778 PMCID: PMC5567911 DOI: 10.1371/journal.pone.0182843] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Accepted: 07/25/2017] [Indexed: 11/18/2022] Open
Abstract
Identification of viral integration sites has been important in understanding the pathogenesis and progression of diseases associated with particular viral infections. The advent of next-generation sequencing (NGS) has enabled researchers to understand the impact that viral integration has on the host, such as tumorigenesis. Current computational methods to analyze NGS data of virus-host junction sites have been limited in terms of their accessibility to a broad user base. In this study, we developed a software application (named ChimericSeq), that is the first program of its kind to offer a graphical user interface, compatibility with both Windows and Mac operating systems, and optimized for effectively identifying and annotating virus-host chimeric reads within NGS data. In addition, ChimericSeq’s pipeline implements custom filtering to remove artifacts and detect reads with quantitative analytical reporting to provide functional significance to discovered integration sites. The improved accessibility of ChimericSeq through a GUI interface in both Windows and Mac has potential to expand NGS analytical support to a broader spectrum of the scientific community.
Collapse
|
34
|
Microbial Signatures Associated with Oropharyngeal and Oral Squamous Cell Carcinomas. Sci Rep 2017; 7:4036. [PMID: 28642609 PMCID: PMC5481414 DOI: 10.1038/s41598-017-03466-6] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 04/26/2017] [Indexed: 12/18/2022] Open
Abstract
The microbiome is fundamentally one of the most unique organs in the human body. Dysbiosis can result in critical inflammatory responses and result in pathogenesis contributing to neoplastic events. We used a pan-pathogen array technology (PathoChip) coupled with next-generation sequencing to establish microbial signatures unique to human oral and oropharyngeal squamous cell carcinomas (OCSCC/OPSCC). Signatures for DNA and RNA viruses including oncogenic viruses, gram positive and negative bacteria, fungi and parasites were detected. Cluster and topological analyses identified 2 distinct groups of microbial signatures related to OCSCCs/OPSCCs. Results were validated by probe capture next generation sequencing; the data from which also provided a comprehensive map of integration sites and chromosomal hotspots for micro-organism genomic insertions. Identification of these microbial signatures and their integration sites may provide biomarkers for OCSCC/OPSCC diagnosis and prognosis as well as novel avenues for study of their potential role in OCSCCs/OPSCCs.
Collapse
|
35
|
GENE-IS: Time-Efficient and Accurate Analysis of Viral Integration Events in Large-Scale Gene Therapy Data. MOLECULAR THERAPY-NUCLEIC ACIDS 2016; 6:133-139. [PMID: 28325279 PMCID: PMC5363413 DOI: 10.1016/j.omtn.2016.12.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2016] [Revised: 11/24/2016] [Accepted: 12/01/2016] [Indexed: 12/22/2022]
Abstract
Integration site profiling and clonality analysis of viral vector distribution in gene therapy is a key factor to monitor the fate of gene-corrected cells, assess the risk of malignant transformation, and establish vector biosafety. We developed the Genome Integration Site Analysis Pipeline (GENE-IS) for highly time-efficient and accurate detection of next-generation sequencing (NGS)-based viral vector integration sites (ISs) in gene therapy data. It is the first available tool with dual analysis mode that allows IS analysis both in data generated by PCR-based methods, such as linear amplification method PCR (LAM-PCR), and by rapidly evolving targeted sequencing (e.g., Agilent SureSelect) technologies. GENE-IS makes use of trimming strategies, customized reference genome, and soft-clipped information with sequential filtering steps to provide annotated IS with clonality information. It is a scalable, robust, precise, and reliable tool for large-scale pre-clinical and clinical data analysis that provides users complete flexibility and control over analysis with a broad range of configurable parameters. GENE-IS is available at https://github.com/G100DKFZ/gene-is.
Collapse
|
36
|
Abstract
The pathogenesis of hepatocellular carcinoma (HCC) is a multistep process involving the progressive accumulation of molecular alterations pinpointing different molecular and cellular events. The next-generation sequencing technology is facilitating the global and systematic evaluation of molecular landscapes in HCC. There is emerging evidence supporting the importance of cancer metabolism and tumor microenvironment in providing a favorable and supportive niche to expedite HCC development. Moreover, recent studies have identified distinct surface markers of cancer stem cell (CSC) in HCC, and they also put forward the profound involvement of altered signaling pathways and epigenetic modifications in CSCs, in addition to the concomitant drug resistance and metastasis. Taken together, multiple key genetic and non-genetic factors, as well as liver CSCs, result in the development and progression of HCC.
Collapse
|
37
|
Hepatocellular carcinoma cell lines retain the genomic and transcriptomic landscapes of primary human cancers. Sci Rep 2016; 6:27411. [PMID: 27273737 PMCID: PMC4895220 DOI: 10.1038/srep27411] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2016] [Accepted: 05/18/2016] [Indexed: 01/03/2023] Open
Abstract
Hepatocellular carcinoma (HCC) cell lines are useful in vitro models for the study of primary HCCs. Because cell lines acquire additional mutations in culture, it is important to understand to what extent HCC cell lines retain the genetic landscapes of primary HCCs. Most HCC cell lines were established during the last century, precluding comparison between cell lines and primary cancers. In this study, 9 Chinese HCC cell lines with matched patient-derived cells at low passages (PDCs) were established in the defined culture condition. Whole genome analyses of 4 HCC cell lines showed that genomic mutation landscapes, including mutations, copy number alterations (CNAs) and HBV integrations, were highly stable during cell line establishment. Importantly, genetic alterations in cancer drivers and druggable genes were reserved in cell lines. HCC cell lines also retained gene expression patterns of primary HCCs during in vitro culture. Finally, sequential analysis of HCC cell lines and PDCs at different passages revealed their comparable and stable genomic and transcriptomic levels if maintained within proper passages. These results show that HCC cell lines largely retain the genomic and transcriptomic landscapes of primary HCCs, thus laying the rationale for testing HCC cell lines as preclinical models in precision medicine.
Collapse
|