1
|
Patel Y, Zhu C, Yamaguchi TN, Bugh YZ, Tian M, Holmes A, Fitz-Gibbon ST, Boutros PC. NFTest: automated testing of Nextflow pipelines. Bioinformatics 2024; 40:btae081. [PMID: 38341660 PMCID: PMC10881102 DOI: 10.1093/bioinformatics/btae081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 01/18/2024] [Accepted: 02/08/2024] [Indexed: 02/12/2024] Open
Abstract
MOTIVATION The ongoing expansion in the volume of biomedical data has contributed to a growing complexity in the tools and technologies used in research with an increased reliance on complex workflows written in orchestration languages such as Nextflow to integrate algorithms into processing pipelines. The growing use of workflows involving various tools and algorithms has led to increased scrutiny of software development practices to avoid errors in individual tools and in the connections between them. RESULTS To facilitate test-driven development of Nextflow pipelines, we created NFTest, a framework for automated pipeline testing and validation with customizability options for Nextflow features. It is open-source, easy to initialize and use, and customizable to allow for testing of complex workflows with test success configurable through a broad range of assertions. NFTest simplifies the testing burden on developers by automating tests once defined and providing a flexible interface for running tests to validate workflows. This reduces the barrier to rigorous biomedical workflow testing and paves the way toward reducing computational errors in biomedicine. AVAILABILITY AND IMPLEMENTATION NFTest is an open-source Python framework under the GPLv2 license and is freely available at https://github.com/uclahs-cds/tool-NFTest. The call-sSNV Nextflow pipeline is available at: https://github.com/uclahs-cds/pipeline-call-sSNV.
Collapse
Affiliation(s)
- Yash Patel
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - Chenghao Zhu
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - Takafumi N Yamaguchi
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - Yuan Zhe Bugh
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - Mao Tian
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - Aaron Holmes
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - Sorel T Fitz-Gibbon
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - Paul C Boutros
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Department of Urology, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Broad Stem Cell Research Center, University of California, Los Angeles, Los Angeles, CA 90095, United States
| |
Collapse
|
2
|
Lim WC, Marques Da Costa ME, Godefroy K, Jacquet E, Gragert L, Rondof W, Marchais A, Nhiri N, Dalfovo D, Viard M, Labaied N, Khan AM, Dessen P, Romanel A, Pasqualini C, Schleiermacher G, Carrington M, Zitvogel L, Scoazec JY, Geoerger B, Salmon J. Divergent HLA variations and heterogeneous expression but recurrent HLA loss-of- heterozygosity and common HLA-B and TAP transcriptional silencing across advanced pediatric solid cancers. Front Immunol 2024; 14:1265469. [PMID: 38318504 PMCID: PMC10839790 DOI: 10.3389/fimmu.2023.1265469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Accepted: 11/06/2023] [Indexed: 02/07/2024] Open
Abstract
The human leukocyte antigen (HLA) system is a major factor controlling cancer immunosurveillance and response to immunotherapy, yet its status in pediatric cancers remains fragmentary. We determined high-confidence HLA genotypes in 576 children, adolescents and young adults with recurrent/refractory solid tumors from the MOSCATO-01 and MAPPYACTS trials, using normal and tumor whole exome and RNA sequencing data and benchmarked algorithms. There was no evidence for narrowed HLA allelic diversity but discordant homozygosity and allele frequencies across tumor types and subtypes, such as in embryonal and alveolar rhabdomyosarcoma, neuroblastoma MYCN and 11q subtypes, and high-grade glioma, and several alleles may represent protective or susceptibility factors to specific pediatric solid cancers. There was a paucity of somatic mutations in HLA and antigen processing and presentation (APP) genes in most tumors, except in cases with mismatch repair deficiency or genetic instability. The prevalence of loss-of-heterozygosity (LOH) ranged from 5.9 to 7.7% in HLA class I and 8.0 to 16.7% in HLA class II genes, but was widely increased in osteosarcoma and glioblastoma (~15-25%), and for DRB1-DQA1-DQB1 in Ewing sarcoma (~23-28%) and low-grade glioma (~33-50%). HLA class I and HLA-DR antigen expression was assessed in 194 tumors and 44 patient-derived xenografts (PDXs) by immunochemistry, and class I and APP transcript levels quantified in PDXs by RT-qPCR. We confirmed that HLA class I antigen expression is heterogeneous in advanced pediatric solid tumors, with class I loss commonly associated with the transcriptional downregulation of HLA-B and transporter associated with antigen processing (TAP) genes, whereas class II antigen expression is scarce on tumor cells and occurs on immune infiltrating cells. Patients with tumors expressing sufficient HLA class I and TAP levels such as some glioma, osteosarcoma, Ewing sarcoma and non-rhabdomyosarcoma soft-tissue sarcoma cases may more likely benefit from T cell-based approaches, whereas strategies to upregulate HLA expression, to expand the immunopeptidome, and to target TAP-independent epitopes or possibly LOH might provide novel therapeutic opportunities in others. The consequences of HLA class II expression by immune cells remain to be established. Immunogenetic profiling should be implemented in routine to inform immunotherapy trials for precision medicine of pediatric cancers.
Collapse
Affiliation(s)
- Wan Ching Lim
- INSERM U1015, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
- Bioinformatics Platform, AMMICA, INSERM US23/CNRS UMS3655, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
- School of Data Sciences, Perdana University, Kuala Lumpur, Malaysia
| | | | - Karine Godefroy
- Department of Pathology and Laboratory Medicine, Translational Research Laboratory and Biobank, AMMICA, INSERM US23/CNRS UMS3655, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Eric Jacquet
- Institut de Chimie des Substances Naturelles, CNRS UPR2301, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Loren Gragert
- Department of Pathology and Laboratory Medicine, Tulane University School of Medicine, New Orleans, LA, United States
| | - Windy Rondof
- INSERM U1015, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
- Bioinformatics Platform, AMMICA, INSERM US23/CNRS UMS3655, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Antonin Marchais
- INSERM U1015, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
- Bioinformatics Platform, AMMICA, INSERM US23/CNRS UMS3655, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Naima Nhiri
- Institut de Chimie des Substances Naturelles, CNRS UPR2301, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Davide Dalfovo
- Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, Trento, Italy
| | - Mathias Viard
- Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD, United States
- Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States
| | - Nizar Labaied
- Department of Pathology and Laboratory Medicine, Translational Research Laboratory and Biobank, AMMICA, INSERM US23/CNRS UMS3655, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Asif M. Khan
- School of Data Sciences, Perdana University, Kuala Lumpur, Malaysia
| | - Philippe Dessen
- Bioinformatics Platform, AMMICA, INSERM US23/CNRS UMS3655, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Alessandro Romanel
- Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, Trento, Italy
| | - Claudia Pasqualini
- INSERM U1015, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
- Department of Pediatric and Adolescent Oncology, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Gudrun Schleiermacher
- INSERM U830, Recherche Translationnelle en Oncologie Pédiatrique (RTOP), and SIREDO Oncology Center (Care, Innovation and Research for Children and AYA with Cancer), PSL Research University, Institut Curie, Paris, France
| | - Mary Carrington
- Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD, United States
- Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States
- Ragon Institute of Massachusetts General Hospital, MIT and Harvard University, Cambridge, MA, United States
| | - Laurence Zitvogel
- INSERM U1015, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Jean-Yves Scoazec
- Department of Pathology and Laboratory Medicine, Translational Research Laboratory and Biobank, AMMICA, INSERM US23/CNRS UMS3655, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Birgit Geoerger
- INSERM U1015, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
- Department of Pediatric and Adolescent Oncology, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| | - Jerome Salmon
- INSERM U1015, Gustave Roussy Cancer Campus, Université Paris-Saclay, Villejuif, France
| |
Collapse
|
3
|
Díaz de Ståhl T, Shamikh A, Mayrhofer M, Juhos S, Basmaci E, Prochazka G, Garcia M, Somarajan PR, Zielinska-Chomej K, Illies C, Øra I, Siesjö P, Sandström PE, Stenman J, Sabel M, Gustavsson B, Kogner P, Pfeifer S, Ljungman G, Sandgren J, Nistér M. The Swedish childhood tumor biobank: systematic collection and molecular characterization of all pediatric CNS and other solid tumors in Sweden. J Transl Med 2023; 21:342. [PMID: 37221626 DOI: 10.1186/s12967-023-04178-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 05/02/2023] [Indexed: 05/25/2023] Open
Abstract
The Swedish Childhood Tumor Biobank (BTB) is a nonprofit national infrastructure for collecting tissue samples and genomic data from pediatric patients diagnosed with central nervous system (CNS) and other solid tumors. The BTB is built on a multidisciplinary network established to provide the scientific community with standardized biospecimens and genomic data, thereby improving knowledge of the biology, treatment and outcome of childhood tumors. As of 2022, over 1100 fresh-frozen tumor samples are available for researchers. We present the workflow of the BTB from sample collection and processing to the generation of genomic data and services offered. To determine the research and clinical utility of the data, we performed bioinformatics analyses on next-generation sequencing (NGS) data obtained from a subset of 82 brain tumors and patient blood-derived DNA combined with methylation profiling to enhance the diagnostic accuracy and identified germline and somatic alterations with potential biological or clinical significance. The BTB procedures for collection, processing, sequencing, and bioinformatics deliver high-quality data. We observed that the findings could impact patient management by confirming or clarifying the diagnosis in 79 of the 82 tumors and detecting known or likely driver mutations in 68 of 79 patients. In addition to revealing known mutations in a broad spectrum of genes implicated in pediatric cancer, we discovered numerous alterations that may represent novel driver events and specific tumor entities. In summary, these examples reveal the power of NGS to identify a wide number of actionable gene alterations. Making the power of NGS available in healthcare is a challenging task requiring the integration of the work of clinical specialists and cancer biologists; this approach requires a dedicated infrastructure, as exemplified here by the BTB.
Collapse
Affiliation(s)
- Teresita Díaz de Ståhl
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden.
- Clinical Pathology and Cancer Diagnostics, Karolinska University Hospital, Stockholm, Sweden.
| | - Alia Shamikh
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
- Clinical Pathology and Cancer Diagnostics, Karolinska University Hospital, Stockholm, Sweden
| | - Markus Mayrhofer
- Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Szilvester Juhos
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
| | - Elisa Basmaci
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
| | - Gabriela Prochazka
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
| | - Maxime Garcia
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
| | | | | | - Christopher Illies
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
- Clinical Pathology and Cancer Diagnostics, Karolinska University Hospital, Stockholm, Sweden
| | - Ingrid Øra
- Department of Paediatric Haematology Oncology and Immunology, Skåne University Hospital Lund, Lund, Sweden
| | - Peter Siesjö
- Department of Clinical Sciences Lund, Department of Neurosurgery, Lund University, Skåne University Hospital, Lund, Sweden
| | - Per-Erik Sandström
- Department of Clinical Sciences, Pediatrics, Umeå University, Umeå, Sweden
| | - Jakob Stenman
- Childhood Cancer Research Unit, Department of Women's and Children's Health, Karolinska Institutet, Stockholm, Sweden
| | - Magnus Sabel
- Childhood Cancer Centre, Queen Silvia Children's Hospital, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Bengt Gustavsson
- Department of Neurosurgery, Karolinska University Hospital, Stockholm, Sweden
| | - Per Kogner
- Childhood Cancer Research Unit, Department of Women's and Children's Health, Karolinska Institutet, Stockholm, Sweden
| | - Susan Pfeifer
- Pediatric Hematology/Oncology, Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
| | - Gustaf Ljungman
- Pediatric Hematology/Oncology, Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
| | - Johanna Sandgren
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
- Clinical Pathology and Cancer Diagnostics, Karolinska University Hospital, Stockholm, Sweden
| | - Monica Nistér
- Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
4
|
Guerra-Assunção JA, Conde L, Moghul I, Webster AP, Ecker S, Chervova O, Chatzipantsiou C, Prieto PP, Beck S, Herrero J. GenomeChronicler: The Personal Genome Project UK Genomic Report Generator Pipeline. Front Genet 2020; 11:518644. [PMID: 33193602 PMCID: PMC7541957 DOI: 10.3389/fgene.2020.518644] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Accepted: 09/02/2020] [Indexed: 11/13/2022] Open
Abstract
In recent years, there has been a significant increase in whole genome sequencing data of individual genomes produced by research projects as well as direct to consumer service providers. While many of these sources provide their users with an interpretation of the data, there is a lack of free, open tools for generating reports exploring the data in an easy to understand manner. GenomeChronicler was developed as part of the Personal Genome Project UK (PGP-UK) to address this need. PGP-UK provides genomic, transcriptomic, epigenomic and self-reported phenotypic data under an open-access model with full ethical approval. As a result, the reports generated by GenomeChronicler are intended for research purposes only and include information relating to potentially beneficial and potentially harmful variants, but without clinical curation. GenomeChronicler can be used with data from whole genome or whole exome sequencing, producing a genome report containing information on variant statistics, ancestry and known associated phenotypic traits. Example reports are available from the PGP-UK data page (personalgenomes.org.uk/data). The objective of this method is to leverage existing resources to find known phenotypes associated with the genotypes detected in each sample. The provided trait data is based primarily upon information available in SNPedia, but also collates data from ClinVar, GETevidence, and gnomAD to provide additional details on potential health implications, presence of genotype in other PGP participants and population frequency of each genotype. The analysis can be run in a self-contained environment without requiring internet access, making it a good choice for cases where privacy is essential or desired: any third party project can embed GenomeChronicler within their off-line safe-haven environments. GenomeChronicler can be run for one sample at a time, or in parallel making use of the Nextflow workflow manager. The source code is available from GitHub (https://github.com/PGP-UK/GenomeChronicler), container recipes are available for Docker and Singularity, as well as a pre-built container from SingularityHub (https://singularity-hub.org/collections/3664) enabling easy deployment in a variety of settings. Users without access to computational resources to run GenomeChronicler can access the software from the Lifebit CloudOS platform (https://lifebit.ai/cloudos) enabling the production of reports and variant calls from raw sequencing data in a scalable fashion.
Collapse
Affiliation(s)
- José Afonso Guerra-Assunção
- Infection and Immunity, University College London, London, United Kingdom.,Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London, United Kingdom
| | - Lucia Conde
- Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London, United Kingdom
| | - Ismail Moghul
- Medical Genomics, UCL Cancer Institute, University College London, London, United Kingdom
| | - Amy P Webster
- Medical Genomics, UCL Cancer Institute, University College London, London, United Kingdom
| | - Simone Ecker
- Medical Genomics, UCL Cancer Institute, University College London, London, United Kingdom
| | - Olga Chervova
- Medical Genomics, UCL Cancer Institute, University College London, London, United Kingdom
| | | | | | - Stephan Beck
- Medical Genomics, UCL Cancer Institute, University College London, London, United Kingdom
| | - Javier Herrero
- Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London, United Kingdom
| |
Collapse
|