Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Borozan I, Wilson S, Blanchette P, Laflamme P, Watt SN, Krzyzanowski PM, Sircoulomb F, Rottapel R, Branton PE, Ferretti V. CaPSID: a bioinformatics platform for computational pathogen sequence identification in human genomes and transcriptomes. BMC Bioinformatics 2012;13:206. [PMID: 22901030 PMCID: PMC3464663 DOI: 10.1186/1471-2105-13-206] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Received: 02/01/2012] [Accepted: 07/18/2012] [Indexed: 01/05/2023] Open

Cited by Other Article(s)

Zhao Y, Huang F, Wang W, Gao R, Fan L, Wang A, Gao SH. Application of high-throughput sequencing technologies and analytical tools for pathogen detection in urban water systems: Progress and future perspectives. Sci Total Environ 2023;900:165867. [PMID: 37516185 DOI: 10.1016/j.scitotenv.2023.165867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/01/2023] [Revised: 07/25/2023] [Accepted: 07/26/2023] [Indexed: 07/31/2023]

Goubet AG. Could the tumor-associated microbiota be the new multi-faceted player in the tumor microenvironment? Front Oncol 2023;13:1185163. [PMID: 37287916 PMCID: PMC10242102 DOI: 10.3389/fonc.2023.1185163] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/13/2023] [Accepted: 05/02/2023] [Indexed: 06/09/2023] Open

Turan H, Vitale SG, Kahramanoglu I, Della Corte L, Giampaolino P, Azemi A, Durmus S, Sal V, Tokgozoglu N, Bese T, Arvas M, Demirkiran F, Gelisgen R, Ilvan S, Uzun H. Diagnostic and prognostic role of TFF3, Romo-1, NF-κB and SFRP4 as biomarkers for endometrial and ovarian cancers: a prospective observational translational study. Arch Gynecol Obstet 2022;306:2105-2114. [PMID: 35461390 PMCID: PMC9633503 DOI: 10.1007/s00404-022-06563-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 03/02/2022] [Accepted: 04/01/2022] [Indexed: 12/24/2022]

Yu D, Wang T, Liang D, Mei Y, Zou W, Guo S. The Landscape of Microbial Composition and Associated Factors in Pancreatic Ductal Adenocarcinoma Using RNA-Seq Data. Front Oncol 2021;11:651350. [PMID: 34136388 PMCID: PMC8202409 DOI: 10.3389/fonc.2021.651350] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 01/09/2021] [Accepted: 03/30/2021] [Indexed: 01/14/2023] Open

Rodriguez RM, Khadka VS, Menor M, Hernandez BY, Deng Y. Tissue-associated microbial detection in cancer using human sequencing data. BMC Bioinformatics 2020;21:523. [PMID: 33272199 PMCID: PMC7713026 DOI: 10.1186/s12859-020-03831-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 10/19/2020] [Accepted: 10/21/2020] [Indexed: 12/19/2022] Open

Chen X, Kost J, Li D. Comprehensive comparative analysis of methods and software for identifying viral integrations. Brief Bioinform 2020;20:2088-2097. [PMID: 30102374 DOI: 10.1093/bib/bby070] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 05/18/2018] [Revised: 07/02/2018] [Accepted: 07/12/2018] [Indexed: 12/13/2022] Open

Robitaille A, Brancaccio RN, Dutta S, Rollison DE, Leja M, Fischer N, Grundhoff A, Gheit T, Tommasino M, Olivier M. PVAmpliconFinder: a workflow for the identification of human papillomaviruses from high-throughput amplicon sequencing. BMC Bioinformatics 2020;21:233. [PMID: 32513098 PMCID: PMC7282039 DOI: 10.1186/s12859-020-03573-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 10/28/2019] [Accepted: 05/28/2020] [Indexed: 01/06/2023] Open

Rodriguez RM, Hernandez BY, Menor M, Deng Y, Khadka VS. The landscape of bacterial presence in tumor and adjacent normal tissue across 9 major cancer types using TCGA exome sequencing. Comput Struct Biotechnol J 2020;18:631-641. [PMID: 32257046 PMCID: PMC7109368 DOI: 10.1016/j.csbj.2020.03.003] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Received: 10/01/2019] [Revised: 03/02/2020] [Accepted: 03/06/2020] [Indexed: 12/26/2022] Open

Zapatka M, Borozan I, Brewer DS, Iskar M, Grundhoff A, Alawi M, Desai N, Sültmann H, Moch H, Cooper CS, Eils R, Ferretti V, Lichter P. The landscape of viral associations in human cancers. Nat Genet 2020;52:320-330. [PMID: 32025001 PMCID: PMC8076016 DOI: 10.1038/s41588-019-0558-9] [Citation(s) in RCA: 220] [Impact Index Per Article: 55.0] [Received: 11/30/2018] [Accepted: 11/22/2019] [Indexed: 12/30/2022]

Affiliation(s)

Marc Zapatka Division of Molecular Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Ivan Borozan Informatics and Bio-computing Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
Daniel S Brewer Norwich Medical School, University of East Anglia, Norwich, UK; Earlham Institute, Norwich, UK
Murat Iskar Division of Molecular Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Adam Grundhoff Heinrich-Pette-Institute, Leibniz Institute for Experimental Virology, Hamburg, Germany; German Center for Infection Research (DZIF), Partner Site Hamburg-Borstel-Lübeck-Riems, Hamburg, Germany
Malik Alawi Heinrich-Pette-Institute, Leibniz Institute for Experimental Virology, Hamburg, Germany; Bioinformatics Core, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
Nikita Desai Bioinformatics Group, Department of Computer Science, University College London, London, UK; Biomedical Data Science Laboratory, Francis Crick Institute, London, UK
Holger Sültmann National Center for Tumor Diseases (NCT) Heidelberg, Heidelberg, Germany; Division of Cancer Genome Research, German Cancer Research Center (DKFZ), Heidelberg, Germany; German Cancer Consortium (DKTK), Heidelberg, Germany
Holger Moch Department of Pathology and Molecular Pathology, University and University Hospital Zürich, Zurich, Switzerland
Colin S Cooper Norwich Medical School, University of East Anglia, Norwich, UK; Earlham Institute, Norwich, UK; Institute of Cancer Research, London, UK; University of East Anglia, Norwich, UK
Roland Eils Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany; Department of Bioinformatics and Functional Genomics, Institute of Pharmacy and Molecular Biotechnology, Heidelberg University and BioQuant Center, Heidelberg, Germany; Center for Digital Health, Berlin Institute of Health and Charité Universitätsmedizin Berlin, Berlin, Germany
Vincent Ferretti Ontario Institute for Cancer Research, MaRS Centre, Toronto, Ontario, Canada; Department of Biochemistry and Molecular Medicine, University of Montreal, Montreal, Québec, Canada
Peter Lichter Division of Molecular Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany; German Cancer Consortium (DKTK), Heidelberg, Germany

Sangiovanni M, Granata I, Thind AS, Guarracino MR. From trash to treasure: detecting unexpected contamination in unmapped NGS data. BMC Bioinformatics 2019;20:168. [PMID: 30999839 PMCID: PMC6472186 DOI: 10.1186/s12859-019-2684-x] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Indexed: 02/08/2023] Open

Abstract

Background

Next Generation Sequencing (NGS) experiments produce millions of short sequences that, mapped to a reference genome, provide biological insights at genomic, transcriptomic and epigenomic level. Typically the number of reads that correctly map to the reference genome ranges between 70% and 90%, leaving in some cases a consistent fraction of unmapped sequences. This ‘misalignment’ can be ascribed to low quality bases or sequence differences between the sample reads and the reference genome. Investigating the source of the unmapped reads is important to better assess the quality of the whole experiment and to check for possible downstream or upstream ‘contamination’ from exogenous nucleic acids.

Results

Here we propose DecontaMiner, a tool to unravel the presence of contaminating sequences among the unmapped reads. It uses a subtraction approach to identify bacteria, fungi and viruses genome contamination. DecontaMiner generates several output files to track all the processed reads, and to provide a complete report of their characteristics. The good quality matches on microorganism genomes are counted and compared among samples. DecontaMiner builds an offline HTML page containing summary statistics and plots. The latter are obtained using the state-of-the-art D3 javascript libraries. DecontaMiner has been mainly used to detect contamination in human RNA-Seq data. The software is freely available at http://www-labgtp.na.icar.cnr.it/decontaminer.

Conclusions

DecontaMiner is a tool designed and developed to investigate the presence of contaminating sequences in unmapped NGS data. It can suggest the presence of contaminating organisms in sequenced samples, that might derive either from laboratory contamination or from their biological source, and in both cases can be considered as worthy of further investigation and experimental validation. The novelty of DecontaMiner is mainly represented by its easy integration with the standard procedures of NGS data analysis, while providing a complete, reliable, and automatic pipeline.

Electronic supplementary material

The online version of this article (10.1186/s12859-019-2684-x) contains supplementary material, which is available to authorized users.

Chen X, Kost J, Sulovari A, Wong N, Liang WS, Cao J, Li D. A virome-wide clonal integration analysis platform for discovering cancer viral etiology. Genome Res 2019;29:819-830. [PMID: 30872350 PMCID: PMC6499315 DOI: 10.1101/gr.242529.118] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Received: 07/31/2018] [Accepted: 03/11/2019] [Indexed: 12/31/2022]

Nooij S, Schmitz D, Vennema H, Kroneman A, Koopmans MPG. Overview of Virus Metagenomic Classification Methods and Their Biological Applications. Front Microbiol 2018;9:749. [PMID: 29740407 PMCID: PMC5924777 DOI: 10.3389/fmicb.2018.00749] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Received: 12/08/2017] [Accepted: 04/03/2018] [Indexed: 12/20/2022] Open

Tang KW, Larsson E. Tumour virology in the era of high-throughput genomics. Philos Trans R Soc Lond B Biol Sci 2018;372:rstb.2016.0265. [PMID: 28893932 PMCID: PMC5597732 DOI: 10.1098/rstb.2016.0265] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Accepted: 06/09/2017] [Indexed: 12/12/2022] Open

Analysis of Epstein-Barr Virus Genomes and Expression Profiles in Gastric Adenocarcinoma. J Virol 2018;92:JVI.01239-17. [PMID: 29093097 DOI: 10.1128/jvi.01239-17] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Received: 07/19/2017] [Accepted: 10/05/2017] [Indexed: 01/10/2023] Open

Abstract

Epstein-Barr virus (EBV) is a causative agent of a variety of lymphomas, nasopharyngeal carcinoma (NPC), and ∼9% of gastric carcinomas (GCs). An important question is whether particular EBV variants are more oncogenic than others, but conclusions are currently hampered by the lack of sequenced EBV genomes. Here, we contribute to this question by mining whole-genome sequences of 201 GCs to identify 13 EBV-positive GCs and by assembling 13 new EBV genome sequences, almost doubling the number of available GC-derived EBV genome sequences and providing the first non-Asian EBV genome sequences from GC. Whole-genome sequence comparisons of all EBV isolates sequenced to date (85 from tumors and 57 from healthy individuals) showed that most GC and NPC EBV isolates were closely related although American Caucasian GC samples were more distant, suggesting a geographical component. However, EBV GC isolates were found to contain some consistent changes in protein sequences regardless of geographical origin. In addition, transcriptome data available for eight of the EBV-positive GCs were analyzed to determine which EBV genes are expressed in GC. In addition to the expected latency proteins (EBNA1, LMP1, and LMP2A), specific subsets of lytic genes were consistently expressed that did not reflect a typical lytic or abortive lytic infection, suggesting a novel mechanism of EBV gene regulation in the context of GC. These results are consistent with a model in which a combination of specific latent and lytic EBV proteins promotes tumorigenesis.

IMPORTANCE

Epstein-Barr virus (EBV) is a widespread virus that causes cancer, including gastric carcinoma (GC), in a small subset of individuals. An important question is whether particular EBV variants are more cancer associated than others, but more EBV sequences are required to address this question. Here, we have generated 13 new EBV genome sequences from GC, almost doubling the number of EBV sequences from GC isolates and providing the first EBV sequences from non-Asian GC. We further identify sequence changes in some EBV proteins common to GC isolates. In addition, gene expression analysis of eight of the EBV-positive GCs showed consistent expression of both the expected latency proteins and a subset of lytic proteins that was not consistent with typical lytic or abortive lytic expression. These results suggest that novel mechanisms activate expression of some EBV lytic proteins and that their expression may contribute to oncogenesis.

Cantalupo PG, Katz JP, Pipas JM. Viral sequences in human cancer. Virology 2017;513:208-216. [PMID: 29107929 DOI: 10.1016/j.virol.2017.10.017] [Citation(s) in RCA: 83] [Impact Index Per Article: 11.9] [Received: 08/10/2017] [Revised: 10/10/2017] [Accepted: 10/19/2017] [Indexed: 01/14/2023]

Brhelova E, Antonova M, Pardy F, Kocmanova I, Mayer J, Racil Z, Lengerova M. Investigation of next-generation sequencing data of Klebsiella pneumoniae using web-based tools. J Med Microbiol 2017;66:1673-1683. [PMID: 29068275 DOI: 10.1099/jmm.0.000624] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Indexed: 12/20/2022] Open

Doggett NA, Mukundan H, Lefkowitz EJ, Slezak TR, Chain PS, Morse S, Anderson K, Hodge DR, Pillai S. Culture-Independent Diagnostics for Health Security. Health Secur 2017;14:122-42. [PMID: 27314653 DOI: 10.1089/hs.2015.0074] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Indexed: 12/20/2022] Open

VirusSeeker, a computational pipeline for virus discovery and virome composition analysis. Virology 2017;503:21-30. [PMID: 28110145 DOI: 10.1016/j.virol.2017.01.005] [Citation(s) in RCA: 87] [Impact Index Per Article: 12.4] [Received: 11/18/2016] [Revised: 01/07/2017] [Accepted: 01/10/2017] [Indexed: 01/21/2023]

Bullman S, Meyerson M, Kostic AD. Emerging Concepts and Technologies for the Discovery of Microorganisms Involved in Human Disease. Annu Rev Pathol 2016;12:217-244. [PMID: 27959634 DOI: 10.1146/annurev-pathol-012615-044305] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Indexed: 11/09/2022]

Karapiperis C, Kempf SJ, Quintens R, Azimzadeh O, Vidal VL, Pazzaglia S, Bazyka D, Mastroberardino PG, Scouras ZG, Tapio S, Benotmane MA, Ouzounis CA. Brain Radiation Information Data Exchange (BRIDE): integration of experimental data from low-dose ionising radiation research for pathway discovery. BMC Bioinformatics 2016;17:212. [PMID: 27170263 PMCID: PMC4865096 DOI: 10.1186/s12859-016-1068-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 03/27/2015] [Accepted: 04/21/2016] [Indexed: 11/10/2022] Open

Affiliation(s)

Christos Karapiperis Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessalonica, 54124, Thessalonica, Greece
Stefan J Kempf Institute of Radiation Biology, Helmholtz Zentrum München, German Research Center for Environmental Health GmbH, 85764, Neuherberg, Germany; Present address: Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230, Odense M, Denmark
Roel Quintens Radiobiology Unit, Belgian Nuclear Research Centre (SCK•CEN), B-2400, Mol, Belgium
Omid Azimzadeh Institute of Radiation Biology, Helmholtz Zentrum München, German Research Center for Environmental Health GmbH, 85764, Neuherberg, Germany
Victoria Linares Vidal School of Medicine, IISPV, "Rovira i Virgili" University, Sant Llorens 21, 43201, Reus, Spain
Simonetta Pazzaglia Laboratory of Radiation Biology & Biomedicine, Agenzia Nazionale per le Nuove Tecnologie, l'Energia e lo Sviluppo Economico Sostenibile (ENEA) Centro Ricerche Casaccia, 00123, Rome, Italy
Dimitry Bazyka National Research Center for Radiation Medicine of the National Academy of Medical Sciences of Ukraine, Melnykov str. 53, Kyiv, 04050, Ukraine
Pier G Mastroberardino Erasmus Medical Center, 3015GE, Rotterdam, The Netherlands
Zacharias G Scouras Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessalonica, 54124, Thessalonica, Greece
Soile Tapio Institute of Radiation Biology, Helmholtz Zentrum München, German Research Center for Environmental Health GmbH, 85764, Neuherberg, Germany.
Mohammed Abderrafi Benotmane Radiobiology Unit, Belgian Nuclear Research Centre (SCK•CEN), B-2400, Mol, Belgium.
Christos A Ouzounis Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessalonica, 54124, Thessalonica, Greece. Biological Process & Computation Laboratory (BCPL), Chemical Process & Energy Resources Institute (CPERI), Centre for Research & Technology Hellas (CERTH), Thessalonica, 57001, Greece.

Friis-Nielsen J, Kjartansdóttir KR, Mollerup S, Asplund M, Mourier T, Jensen RH, Hansen TA, Rey-Iglesia A, Richter SR, Nielsen IB, Alquezar-Planas DE, Olsen PVS, Vinner L, Fridholm H, Nielsen LP, Willerslev E, Sicheritz-Pontén T, Lund O, Hansen AJ, Izarzugaza JMG, Brunak S. Identification of Known and Novel Recurrent Viral Sequences in Data from Multiple Patients and Multiple Cancers. Viruses 2016;8:E53. [PMID: 26907326 PMCID: PMC4776208 DOI: 10.3390/v8020053] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Received: 10/30/2015] [Revised: 01/29/2016] [Accepted: 02/05/2016] [Indexed: 12/17/2022] Open

Affiliation(s)

Jens Friis-Nielsen Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark.
Kristín Rós Kjartansdóttir Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Sarah Mollerup Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Maria Asplund Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Tobias Mourier Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Randi Holm Jensen Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Thomas Arn Hansen Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Alba Rey-Iglesia Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Stine Raith Richter Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Ida Broman Nielsen Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
David E Alquezar-Planas Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Pernille V S Olsen Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Lasse Vinner Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Helena Fridholm Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Lars Peter Nielsen Department of Autoimmunology and Biomarkers, Statens Serum Institut, DK-2300 Copenhagen S, Denmark.
Eske Willerslev Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Thomas Sicheritz-Pontén Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark.
Ole Lund Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark.
Anders Johannes Hansen Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.
Jose M G Izarzugaza Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark.
Søren Brunak Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark. NNF Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, DK-2200 Copenhagen, Denmark.

Reisman S, Hatzopoulos T, Läufer K, Thiruvathukal GK, Putonti C. A Polyglot Approach to Bioinformatics Data Integration: A Phylogenetic Analysis of HIV-1. Evol Bioinform Online 2016;12:23-7. [PMID: 26819543 PMCID: PMC4718148 DOI: 10.4137/ebo.s32757] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 08/09/2015] [Revised: 10/18/2015] [Accepted: 10/25/2015] [Indexed: 02/04/2023] Open

Possible Human Papillomavirus 38 Contamination of Endometrial Cancer RNA Sequencing Samples in The Cancer Genome Atlas Database. J Virol 2015;89:8967-73. [PMID: 26085148 DOI: 10.1128/jvi.00822-15] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Received: 03/27/2015] [Accepted: 06/09/2015] [Indexed: 12/17/2022] Open

Abstract

Viruses are causally associated with a number of human malignancies. In this study, we sought to identify new virus-cancer associations by searching RNA sequencing data sets from >2,000 patients, encompassing 21 cancers from The Cancer Genome Atlas (TCGA), for the presence of viral sequences. In agreement with previous studies, we found human papillomavirus 16 (HPV16) and HPV18 in oropharyngeal cancer and hepatitis B and C viruses in liver cancer. Unexpectedly, however, we found HPV38, a cutaneous form of HPV associated with skin cancer, in 32 of 168 samples from endometrial cancer. In 12 of the HPV38-positive (HPV38(+)) samples, we observed at least one paired read that mapped to both human and HPV38 genomes, indicative of viral integration into the host DNA, something not previously demonstrated for HPV38. The expression levels of HPV38 transcripts were relatively low, and all 32 HPV38(+) samples belonged to the same experimental batch of 40 samples, whereas none of the other 128 endometrial carcinoma samples were HPV38(+), raising doubts about the significance of the HPV38 association. Moreover, the HPV38(+) samples contained the same 10 novel single nucleotide variations (SNVs), leading us to hypothesize that one patient was infected with this new isolate of HPV38, which was integrated into his/her genome and may have cross-contaminated other TCGA samples within batch 228. Based on our analysis, we propose guidelines to examine the batch effect, virus expression level, and SNVs as part of next-generation sequencing (NGS) data analysis for evaluating the significance of viral/pathogen sequences in clinical samples.

IMPORTANCE

High-throughput RNA sequencing (RNA-Seq), followed by computational analysis, has vastly accelerated the identification of viral and other pathogenic sequences in clinical samples, but cross-contamination during the processing of the samples remains a major problem that can lead to erroneous conclusions. We found HPV38 sequences specifically present in RNA-Seq samples from endometrial cancer patients from TCGA, a virus not previously associated with this type of cancer. However, multiple lines of evidence suggest possible cross-contamination in these samples, which were processed together in the same batch. Despite this potential cross-contamination, our data indicate that we have detected a new isolate of HPV38 that appears to be integrated into the human genome. We also provide general guidelines for computational detection and interpretation of pathogen-disease associations.

HeLa nucleic acid contamination in the cancer genome atlas leads to the misidentification of human papillomavirus 18. J Virol 2015;89:4051-7. [PMID: 25631090 DOI: 10.1128/jvi.03365-14] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Indexed: 12/22/2022] Open

Wang Q, Jia P, Zhao Z. VERSE: a novel approach to detect virus integration in host genomes through reference genome customization. Genome Med 2015;7:2. [PMID: 25699093 PMCID: PMC4333248 DOI: 10.1186/s13073-015-0126-6] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Received: 12/11/2014] [Accepted: 01/05/2015] [Indexed: 12/28/2022] Open

Calistri A, Palu G. Editorial Commentary: Unbiased Next-Generation Sequencing and New Pathogen Discovery: Undeniable Advantages and Still-Existing Drawbacks. Clin Infect Dis 2015;60:889-91. [DOI: 10.1093/cid/ciu913] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Indexed: 11/12/2022] Open

Naccache SN, Federman S, Veeraraghavan N, Zaharia M, Lee D, Samayoa E, Bouquet J, Greninger AL, Luk KC, Enge B, Wadford DA, Messenger SL, Genrich GL, Pellegrino K, Grard G, Leroy E, Schneider BS, Fair JN, Martínez MA, Isa P, Crump JA, DeRisi JL, Sittler T, Hackett J, Miller S, Chiu CY. A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples. Genome Res 2014;24:1180-92. [PMID: 24899342 PMCID: PMC4079973 DOI: 10.1101/gr.171934.113] [Citation(s) in RCA: 311] [Impact Index Per Article: 31.1] [Indexed: 12/20/2022]

Affiliation(s)

Samia N Naccache Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California 94107, USA
Scot Federman Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California 94107, USA
Narayanan Veeraraghavan Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California 94107, USA
Matei Zaharia Department of Computer Science, University of California, Berkeley, California 94720, USA
Deanna Lee Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California 94107, USA
Erik Samayoa Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California 94107, USA
Jerome Bouquet Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California 94107, USA
Alexander L Greninger Department of Biochemistry, UCSF, San Francisco, California 94107, USA
Ka-Cheung Luk Abbott Diagnostics, Abbott Park, Illinois 60064, USA
Barryett Enge Viral and Rickettsial Disease Laboratory, California Department of Public Health, Richmond, California 94804, USA
Debra A Wadford Viral and Rickettsial Disease Laboratory, California Department of Public Health, Richmond, California 94804, USA
Sharon L Messenger Viral and Rickettsial Disease Laboratory, California Department of Public Health, Richmond, California 94804, USA
Gillian L Genrich Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA
Kristen Pellegrino Department of Family and Community Medicine, UCSF, San Francisco, California 94143, USA
Gilda Grard Viral Emergent Diseases Unit, Centre International de Recherches Médicales de Franceville, Franceville, BP 769, Gabon
Eric Leroy Viral Emergent Diseases Unit, Centre International de Recherches Médicales de Franceville, Franceville, BP 769, Gabon
Bradley S Schneider Metabiota, Inc., San Francisco, California 94104, USA
Joseph N Fair Metabiota, Inc., San Francisco, California 94104, USA
Miguel A Martínez Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, 62260, Mexico
Pavel Isa Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, 62260, Mexico
John A Crump Division of Infectious Diseases and International Health and the Duke Global Health Institute, Duke University Medical Center, Durham, North Carolina 27708, USA; Kilimanjaro Christian Medical Centre, Moshi, Kilimanjaro, 7393, Tanzania; Centre for International Health, University of Otago, Dunedin, 9054, New Zealand
Joseph L DeRisi Department of Biochemistry, UCSF, San Francisco, California 94107, USA
Taylor Sittler Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA
John Hackett Abbott Diagnostics, Abbott Park, Illinois 60064, USA
Steve Miller Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California 94107, USA
Charles Y Chiu Department of Laboratory Medicine, UCSF, San Francisco, California 94107, USA; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California 94107, USA; Department of Medicine, Division of Infectious Diseases, UCSF, San Francisco, California 94143, USA

Caboche S, Audebert C, Hot D. High-Throughput Sequencing, a Versatile Weapon to Support Genome-Based Diagnosis in Infectious Diseases: Applications to Clinical Bacteriology. Pathogens 2014;3:258-79. [PMID: 25437800 PMCID: PMC4243446 DOI: 10.3390/pathogens3020258] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Revised: 02/28/2014] [Accepted: 03/20/2014] [Indexed: 12/19/2022] Open

Borozan I, Watt SN, Ferretti V. Evaluation of alignment algorithms for discovery and identification of pathogens using RNA-Seq. PLoS One 2013;8:e76935. [PMID: 24204709 PMCID: PMC3813700 DOI: 10.1371/journal.pone.0076935] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Accepted: 09/04/2013] [Indexed: 01/02/2023] Open

Sensitive detection of viral transcripts in human tumor transcriptomes. PLoS Comput Biol 2013;9:e1003228. [PMID: 24098097 PMCID: PMC3789765 DOI: 10.1371/journal.pcbi.1003228] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Accepted: 06/04/2013] [Indexed: 02/07/2023] Open

Abstract

In excess of % of human cancer incidents have a viral cofactor. Epidemiological studies of idiopathic human cancers indicate that additional tumor viruses remain to be discovered. Recent advances in sequencing technology have enabled systematic screenings of human tumor transcriptomes for viral transcripts. However, technical problems such as low abundances of viral transcripts in large volumes of sequencing data, viral sequence divergence, and homology between viral and human factors significantly confound identification of tumor viruses. We have developed a novel computational approach for detecting viral transcripts in human cancers that takes the aforementioned confounding factors into account and is applicable to a wide variety of viruses and tumors. We apply the approach to conducting the first systematic search for viruses in neuroblastoma, the most common cancer in infancy. The diverse clinical progression of this disease as well as related epidemiological and virological findings are highly suggestive of a pathogenic cofactor. However, a viral etiology of neuroblastoma is currently contested. We mapped transcriptomes of neuroblastoma as well as positive and negative controls to the human and all known viral genomes in order to detect both known and unknown viruses. Analysis of controls, comparisons with related methods, and statistical estimates demonstrate the high sensitivity of our approach. Detailed investigation of putative viral transcripts within neuroblastoma samples did not provide evidence for the existence of any known human viruses. Likewise, de-novo assembly and analysis of chimeric transcripts did not result in expression signatures associated with novel human pathogens. While confounding factors such as sample dilution or viral clearance in progressed tumors may mask viral cofactors in the data, in principle, this is rendered less likely by the high sensitivity of our approach and the number of biological replicates analyzed. Therefore, our results suggest that frequent viral cofactors of metastatic neuroblastoma are unlikely.

Many human cancers are caused by infections with tumor viruses and identification of these pathogens is considered a critical contribution to cancer prevention. Deep sequencing enables us to systematically investigate viral nucleotide signatures in order to either verify or exclude the existence of viruses in idiopathic human cancers. We have developed Virana, a novel computational approach for identifying tumor viruses in human cancers that is applicable to a wide variety of tumors and viruses. Virana firstly addresses several important biological confounding factors that may hinder successful detection of these pathogens. We applied our approach in the first systematic search for cancer-causing viruses in metastatic neuroblastoma, the most common form of cancer in infancy. Although the heterogeneous clinical progression of this disease as well as epidemiological and virological findings are suggestive of a pathogenic cofactor, the viral etiology of neuroblastoma is currently contested. We conducted an analysis of experimental controls, comparisons with related approaches, as well as statistical analyses in order to validate our method. In spite of the high sensitivity of our approach, analyses of neuroblastoma transcriptomes did not provide evidence for the existence of any known or unknown human viruses. Our results therefore suggest that frequent viral cofactors of metastatic neuroblastoma are unlikely.

Viral pathogen discovery. Curr Opin Microbiol 2013;16:468-78. [PMID: 23725672 PMCID: PMC5964995 DOI: 10.1016/j.mib.2013.05.001] [Citation(s) in RCA: 146] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Revised: 04/29/2013] [Accepted: 05/01/2013] [Indexed: 12/16/2022]

Naeem R, Rashid M, Pain A. READSCAN: a fast and scalable pathogen discovery program with accurate genome relative abundance estimation. Bioinformatics 2012. [PMID: 23193222 PMCID: PMC3562070 DOI: 10.1093/bioinformatics/bts684] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]