1
|
Salignon J, Millan-Ariño L, Garcia MU, Riedel CG. Cactus: A user-friendly and reproducible ATAC-Seq and mRNA-Seq analysis pipeline for data preprocessing, differential analysis, and enrichment analysis. Genomics 2024; 116:110858. [PMID: 38735595 DOI: 10.1016/j.ygeno.2024.110858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 05/06/2024] [Accepted: 05/09/2024] [Indexed: 05/14/2024]
Abstract
The ever decreasing cost of Next-Generation Sequencing coupled with the emergence of efficient and reproducible analysis pipelines has rendered genomic methods more accessible. However, downstream analyses are basic or missing in most workflows, creating a significant barrier for non-bioinformaticians. To help close this gap, we developed Cactus, an end-to-end pipeline for analyzing ATAC-Seq and mRNA-Seq data, either separately or jointly. Its Nextflow-, container-, and virtual environment-based architecture ensures efficient and reproducible analyses. Cactus preprocesses raw reads, conducts differential analyses between conditions, and performs enrichment analyses in various databases, including DNA-binding motifs, ChIP-Seq binding sites, chromatin states, and ontologies. We demonstrate the utility of Cactus in a multi-modal and multi-species case study as well as by showcasing its unique capabilities as compared to other ATAC-Seq pipelines. In conclusion, Cactus can assist researchers in gaining comprehensive insights from chromatin accessibility and gene expression data in a quick, user-friendly, and reproducible manner.
Collapse
Affiliation(s)
- Jérôme Salignon
- Department of Bioscience and Nutrition, Karolinska Institute, Blickagången 16, Huddinge SE-141 83, Sweden.
| | - Lluís Millan-Ariño
- Department of Bioscience and Nutrition, Karolinska Institute, Blickagången 16, Huddinge SE-141 83, Sweden
| | - Maxime U Garcia
- National Genomics Infrastructure, Science for Life Laboratory, Tomtebodavägen 23A, Solna SE-171 65, Sweden; Department of Oncology-Pathology, Karolinska Institute, Visionsgatan 4, Solna SE-171 64, Sweden
| | - Christian G Riedel
- Department of Bioscience and Nutrition, Karolinska Institute, Blickagången 16, Huddinge SE-141 83, Sweden.
| |
Collapse
|
2
|
Chen BJ, Lin CH, Wu HY, Cai JJ, Chao DY. Experimental and analytical pipeline for sub-genomic RNA landscape of coronavirus by Nanopore sequencer. Microbiol Spectr 2024; 12:e0395423. [PMID: 38483513 PMCID: PMC10986531 DOI: 10.1128/spectrum.03954-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Accepted: 02/26/2024] [Indexed: 04/06/2024] Open
Abstract
Coronaviruses (CoVs), including severe acute respiratory syndrome coronavirus 2, can infect a variety of mammalian and avian hosts with significant medical and economic consequences. During the life cycle of CoV, a coordinated series of subgenomic RNAs, including canonical subgenomic messenger RNA and non-canonical defective viral genomes (DVGs), are generated with different biological implications. Studies that adopted the Nanopore sequencer (ONT) to investigate the landscape and dynamics of viral RNA subgenomic transcriptomes applied arbitrary bioinformatics parameters without justification or experimental validation. The current study used bovine coronavirus (BCoV), which can be performed under biosafety level 2 for library construction and experimental validation using traditional colony polymerase chain reaction and Sanger sequencing. Four different ONT protocols, including RNA direct and cDNA direct sequencing with or without exonuclease treatment, were used to generate RNA transcriptomic libraries from BCoV-infected cell lysates. Through rigorously examining the k-mer, gap size, segment size, and bin size, the optimal cutoffs for the bioinformatic pipeline were determined to remove the sequence noise while keeping the informative DVG reads. The sensitivity and specificity of identifying DVG reads using the proposed pipeline can reach 82.6% and 99.6% under the k-mer size cutoff of 15. Exonuclease treatment reduced the abundance of RNA transcripts; however, it was not necessary for future library preparation. Additional recovery of clipped BCoV nucleotide sequences with experimental validation expands the landscape of the CoV discontinuous RNA transcriptome, whose biological function requires future investigation. The results of this study provide the benchmarks for library construction and bioinformatic parameters for studying the discontinuous CoV RNA transcriptome.IMPORTANCEFunctional defective viral genomic RNA, containing all the cis-acting elements required for translation or replication, may play different roles in triggering cell innate immune signaling, interfering with the canonical subgenomic messenger RNA transcription/translation or assisting in establishing persistence infection. This study does not only provide benchmarks for library construction and bioinformatic parameters for studying the discontinuous coronavirus RNA transcriptome but also reveals the complexity of the bovine coronavirus transcriptome, whose functional assays will be critical in future studies.
Collapse
Affiliation(s)
- Bo-Jia Chen
- Doctoral Program in Microbial Genomics, National Chung Hsing University and Academia Sinica, Taichung, Taiwan
| | - Ching-Hung Lin
- Graduate Institute of Veterinary Pathobiology, College of Veterinary Medicine, National Chung Hsing University, Taichung, Taiwan
| | - Hung-Yi Wu
- Graduate Institute of Veterinary Pathobiology, College of Veterinary Medicine, National Chung Hsing University, Taichung, Taiwan
| | - James J. Cai
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas, USA
| | - Day-Yu Chao
- Doctoral Program in Microbial Genomics, National Chung Hsing University and Academia Sinica, Taichung, Taiwan
- Graduate Institute of Microbiology and Public Health, College of Veterinary Medicine, National Chung Hsing University, Taichung, Taiwan
- Department of Post-Baccalaureate Medicine, College of Medicine, National Chung Hsing University, Taichung, Taiwan
| |
Collapse
|
3
|
Bundschuh C, Weidner N, Klein J, Rausch T, Azevedo N, Telzerow A, Mallm JP, Kim H, Steiger S, Seufert I, Börner K, Bauer K, Hübschmann D, Jost KL, Parthé S, Schnitzler P, Boutros M, Rippe K, Müller B, Bartenschlager R, Kräusslich HG, Benes V. Evolution of SARS-CoV-2 in the Rhine-Neckar/Heidelberg Region 01/2021 - 07/2023. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2024; 119:105577. [PMID: 38403035 DOI: 10.1016/j.meegid.2024.105577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 01/31/2024] [Accepted: 02/21/2024] [Indexed: 02/27/2024]
Abstract
In January 2021, the monitoring of circulating variants of SARS-CoV-2 was initiated in Germany under the Corona Surveillance Act, which was discontinued after July 2023. This initiative aimed to enhance pandemic containment, as specific amino acid changes, particularly in the spike protein, were associated with increased transmission and reduced vaccine efficacy. Our group conducted whole genome sequencing using the ARTIC protocol (currently V4) on Illumina's NextSeq 500 platform (and, starting in May 2023, on the MiSeq DX platform) for SARS-CoV-2 positive specimen from patients at Heidelberg University Hospital, associated hospitals, and the public health office in the Rhine-Neckar/Heidelberg region. In total, we sequenced 26,795 SARS-CoV-2-positive samples between January 2021 and July 2023. Valid sequences, meeting the requirements for upload to the German electronic sequencing data hub (DESH) operated by the Robert Koch Institute (RKI), were determined for 24,852 samples, and the lineage/clade could be identified for 25,912 samples. The year 2021 witnessed significant dynamics in the circulating variants in the Rhine-Neckar/Heidelberg region, including A.27.RN, followed by the emergence of B.1.1.7 (Alpha), subsequently displaced by B.1.617.2 (Delta), and the initial occurrences of B.1.1.529 (Omicron). By January 2022, B.1.1.529 had superseded B.1.617.2, dominating with over 90%. The years 2022 and 2023 were then characterized by the dominance of B.1.1.529 and its sublineages, particularly BA.5 and BA.2, and more recently, the emergence of recombinant variants like XBB.1.5. Since the global dominance of B.1.617.2, the identified variant distribution in our local study, apart from a time delay in the spread of new variants, can be considered largely representative of the global distribution. om a time delay in the spread of new variants, can be considered largely representative of the global distribution.
Collapse
Affiliation(s)
- Christian Bundschuh
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany.
| | - Niklas Weidner
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany; Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Microbiology and Hygiene, Heidelberg, Germany
| | - Julian Klein
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany
| | - Tobias Rausch
- Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Nayara Azevedo
- Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Anja Telzerow
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany; Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Jan-Philipp Mallm
- Single Cell Open Lab, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany
| | - Heeyoung Kim
- Department of Infectious Diseases, Molecular Virology, Heidelberg University, Heidelberg, Germany
| | - Simon Steiger
- Division Chromatin Networks, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany
| | - Isabelle Seufert
- Division Chromatin Networks, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany
| | - Kathleen Börner
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany
| | - Katharina Bauer
- Single Cell Open Lab, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany
| | - Daniel Hübschmann
- Division Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Katharina Laurence Jost
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany
| | - Sylvia Parthé
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany
| | - Paul Schnitzler
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany
| | - Michael Boutros
- Division Signaling and Functional Genomics, German Cancer Research Center (DKFZ), Department for Cell and Molecular Biology, Medical Faculty Mannheim, Heidelberg University, Heidelberg, Germany
| | - Karsten Rippe
- Division Chromatin Networks, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany
| | - Barbara Müller
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany
| | - Ralf Bartenschlager
- Single Cell Open Lab, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany; Division Virus-Associated Carcinogenesis, German Cancer Research Center, Heidelberg, Germany; Deutsches Zentrum für Infektionsforschung, partner site Heidelberg, Germany
| | - Hans-Georg Kräusslich
- Heidelberg University, Medical Faculty Heidelberg, Department of Infectious Diseases, Virology, Heidelberg, Germany; Deutsches Zentrum für Infektionsforschung, partner site Heidelberg, Germany
| | - Vladimir Benes
- Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.
| |
Collapse
|
4
|
Lozovska A, Korovesi AG, Dias A, Lopes A, Fowler DA, Martins GG, Nóvoa A, Mallo M. Tgfbr1 controls developmental plasticity between the hindlimb and external genitalia by remodeling their regulatory landscape. Nat Commun 2024; 15:2509. [PMID: 38509075 PMCID: PMC10954616 DOI: 10.1038/s41467-024-46870-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 03/13/2024] [Indexed: 03/22/2024] Open
Abstract
The hindlimb and external genitalia of present-day tetrapods are thought to derive from an ancestral common primordium that evolved to generate a wide diversity of structures adapted for efficient locomotion and mating in the ecological niche occupied by the species. We show that despite long evolutionary distance from the ancestral condition, the early primordium of the mouse external genitalia preserved the capacity to take hindlimb fates. In the absence of Tgfbr1, the pericloacal mesoderm generates an extra pair of hindlimbs at the expense of the external genitalia. It has been shown that the hindlimb and the genital primordia share many of their key regulatory factors. Tgfbr1 controls the response to those factors by modulating the accessibility status of regulatory elements that control the gene regulatory networks leading to the formation of genital or hindlimb structures. Our work uncovers a remarkable tissue plasticity with potential implications in the evolution of the hindlimb/genital area of tetrapods, and identifies an additional mechanism for Tgfbr1 activity that might also contribute to the control of other physiological or pathological processes.
Collapse
Affiliation(s)
- Anastasiia Lozovska
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Artemis G Korovesi
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - André Dias
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
- Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Alexandre Lopes
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Donald A Fowler
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Gabriel G Martins
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Ana Nóvoa
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Moisés Mallo
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal.
| |
Collapse
|
5
|
Beschorner N, Künzle P, Voges M, Hauber I, Indenbirken D, Nakel J, Virdi S, Bradtke P, Lory NC, Rothe M, Paszkowski-Rogacz M, Buchholz F, Grundhoff A, Schambach A, Thirion C, Mittrücker HW, Schulze zur Wiesch J, Hauber J, Chemnitz J. Preclinical toxicity analyses of lentiviral vectors expressing the HIV-1 LTR-specific designer-recombinase Brec1. PLoS One 2024; 19:e0298542. [PMID: 38457474 PMCID: PMC10923487 DOI: 10.1371/journal.pone.0298542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 01/25/2024] [Indexed: 03/10/2024] Open
Abstract
Drug-based antiretroviral therapies (ART) efficiently suppress HIV replication in humans, but the virus persists as integrated proviral reservoirs in small numbers of cells. Importantly, ART cannot eliminate HIV from an infected individual, since it does not target the integrated provirus. Therefore, genome editing-based strategies that can inactivate or excise HIV genomes would provide the technology for novel curative therapies. In fact, the HIV-1 LTR-specific designer-recombinase Brec1 has been shown to remove integrated proviruses from infected cells and is highly efficacious on clinical HIV-1 isolates in vitro and in vivo, suggesting that Brec1 has the potential for clinical development of advanced HIV-1 eradication strategies in people living with HIV. In line with the preparation of a first-in-human advanced therapy medicinal product gene therapy trial, we here present an extensive preclinical evaluation of Brec1 and lentiviral vectors expressing the Brec1 transgene. This included detailed functional analysis of potential genomic off-target sites, assessing vector safety by investigating vector copy number (VCN) and the risk for potential vector-related insertional mutagenesis, as well as analyzing the potential of Brec1 to trigger an undesired strong T cell immune response. In conclusion, the antiviral designer-recombinase Brec1 is shown to lack any detectable cytopathic, genotoxic or T cell-related immunogenic effects, thereby meeting an important precondition for clinical application of the therapeutic lentiviral vector LV-Brec1 in novel HIV-1 curative strategies.
Collapse
Affiliation(s)
- Niklas Beschorner
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
- PROVIREX Genome Editing Therapies GmbH, Hamburg, Germany
| | - Paul Künzle
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
| | - Maike Voges
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- PROVIREX Genome Editing Therapies GmbH, Hamburg, Germany
| | - Ilona Hauber
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- PROVIREX Genome Editing Therapies GmbH, Hamburg, Germany
| | - Daniela Indenbirken
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
| | - Jacqueline Nakel
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
| | - Sanamjeet Virdi
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
| | - Peter Bradtke
- Institute of Immunology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Niels Christian Lory
- Institute of Immunology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Michael Rothe
- Institute of Experimental Hematology, Hannover Medical School, Hannover, Germany
| | | | - Frank Buchholz
- PROVIREX Genome Editing Therapies GmbH, Hamburg, Germany
- Medical Systems Biology, UCC, Medical Faculty Carl Gustav Carus, TU Dresden, Dresden, Germany
| | - Adam Grundhoff
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
| | - Axel Schambach
- Institute of Experimental Hematology, Hannover Medical School, Hannover, Germany
| | | | - Hans-Willi Mittrücker
- Institute of Immunology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- Hamburg Center for Translational Immunology (HCTI), University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Julian Schulze zur Wiesch
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
- Infectious Diseases Unit, I. Department of Medicine, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Joachim Hauber
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
- PROVIREX Genome Editing Therapies GmbH, Hamburg, Germany
| | - Jan Chemnitz
- Leibniz-Institute of Virology (LIV), Hamburg, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg–Lübeck–Borstel–Riems, Germany
- PROVIREX Genome Editing Therapies GmbH, Hamburg, Germany
| |
Collapse
|
6
|
Sanchez-Flores M, Corral-Juan M, Gasch-Navalón E, Cirillo D, Sanchez I, Matilla-Dueñas A. Novel genotype-phenotype correlations, differential cerebellar allele-specific methylation, and a common origin of the (ATTTC) n insertion in spinocerebellar ataxia type 37. Hum Genet 2024; 143:211-232. [PMID: 38396267 PMCID: PMC11043136 DOI: 10.1007/s00439-024-02644-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Accepted: 01/17/2024] [Indexed: 02/25/2024]
Abstract
Spinocerebellar ataxia subtype 37 (SCA37) is a rare disease originally identified in ataxia patients from the Iberian Peninsula with a pure cerebellar syndrome. SCA37 patients carry a pathogenic intronic (ATTTC)n repeat insertion flanked by two polymorphic (ATTTT)n repeats in the Disabled-1 (DAB1) gene leading to cerebellar dysregulation. Herein, we determine the precise configuration of the pathogenic 5'(ATTTT)n-(ATTTC)n-3'(ATTTT)n SCA37 alleles by CRISPR-Cas9 and long-read nanopore sequencing, reveal their epigenomic signatures in SCA37 lymphocytes, fibroblasts, and cerebellar samples, and establish new molecular and clinical correlations. The 5'(ATTTT)n-(ATTTC)n-3'(ATTTT)n pathogenic allele configurations revealed repeat instability and differential methylation signatures. Disease age of onset negatively correlated with the (ATTTC)n, and positively correlated with the 3'(ATTTT)n. Geographic origin and gender significantly correlated with age of onset. Furthermore, significant predictive regression models were obtained by machine learning for age of onset and disease evolution by considering gender, the (ATTTC)n, the 3'(ATTTT)n, and seven CpG positions differentially methylated in SCA37 cerebellum. A common 964-kb genomic region spanning the (ATTTC)n insertion was identified in all SCA37 patients analysed from Portugal and Spain, evidencing a common origin of the SCA37 mutation in the Iberian Peninsula originating 859 years ago (95% CI 647-1378). In conclusion, we demonstrate an accurate determination of the size and configuration of the regulatory 5'(ATTTT)n-(ATTTC)n-3'(ATTTT)n repeat tract, avoiding PCR bias amplification using CRISPR/Cas9-enrichment and nanopore long-read sequencing, resulting relevant for accurate genetic diagnosis of SCA37. Moreover, we determine novel significant genotype-phenotype correlations in SCA37 and identify differential cerebellar allele-specific methylation signatures that may underlie DAB1 pathogenic dysregulation.
Collapse
Affiliation(s)
- Marina Sanchez-Flores
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain
| | - Marc Corral-Juan
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain
| | - Esther Gasch-Navalón
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain
| | | | - Ivelisse Sanchez
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain
| | - Antoni Matilla-Dueñas
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain.
| |
Collapse
|
7
|
Brockman QR, Scherer A, McGivney GR, Gutierrez WR, Rytlewski J, Sheehan A, Warrier A, Laverty EA, Roughton G, Carnevale NC, Knepper-Adrian V, Dodd RD. Discrepancies in indel software resolution with somatic CRISPR/Cas9 tumorigenesis models. Sci Rep 2023; 13:14798. [PMID: 37684258 PMCID: PMC10491828 DOI: 10.1038/s41598-023-41109-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 08/22/2023] [Indexed: 09/10/2023] Open
Abstract
CRISPR/Cas9 gene editing has evolved from a simple laboratory tool to a powerful method of in vivo genomic engineering. As the applications of CRISPR/Cas9 technology have grown, the need to characterize the breadth and depth of indels generated by editing has expanded. Traditionally, investigators use one of several publicly-available platforms to determine CRISPR/Cas9-induced indels in an edited sample. However, to our knowledge, there has not been a cross-platform comparison of available indel analysis software in samples generated from somatic in vivo mouse models. Our group has pioneered using CRISPR/Cas9 to generate somatic primary mouse models of malignant peripheral nerve sheath tumors (MPNSTs) through genetic editing of Nf1. Here, we used sequencing data from the in vivo editing of the Nf1 gene in our CRISPR/Cas9 tumorigenesis model to directly compare results across four different software platforms. By analyzing the same genetic target across a wide panel of cell lines with the same sequence file, we are able to draw systematic conclusions about the differences in these software programs for analysis of in vivo-generated indels. Surprisingly, we report high variability in the reported number, size, and frequency of indels across each software platform. These data highlight the importance of selecting indel analysis platforms specific to the context that the gene editing approach is being applied. Taken together, this analysis shows that different software platforms can report widely divergent indel data from the same sample, particularly if larger indels are present, which are common in somatic, in vivo CRISPR/Cas9 tumor models.
Collapse
Affiliation(s)
- Qierra R Brockman
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
- Holden Comprehensive Cancer Center, University of Iowa, Iowa City, IA, USA
| | - Amanda Scherer
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
| | - Gavin R McGivney
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
- Cancer Biology Training Program, University of Iowa, Iowa City, IA, USA
- Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, IA, USA
| | - Wade R Gutierrez
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
- Holden Comprehensive Cancer Center, University of Iowa, Iowa City, IA, USA
- Cancer Biology Training Program, University of Iowa, Iowa City, IA, USA
- Medical Scientist Training Program, University of Iowa, Iowa City, IA, USA
| | - Jeffrey Rytlewski
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
| | - Alexa Sheehan
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
| | - Akshaya Warrier
- Cancer Biology Training Program, University of Iowa, Iowa City, IA, USA
- Medical Scientist Training Program, University of Iowa, Iowa City, IA, USA
| | - Emily A Laverty
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
| | - Grace Roughton
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
| | - Nina C Carnevale
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
| | - Vickie Knepper-Adrian
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA
| | - Rebecca D Dodd
- Department of Internal Medicine, Carver College of Medicine, University of Iowa, 375 Newton Rd, 5206 MERF, Iowa City, IA, 52246, USA.
- Holden Comprehensive Cancer Center, University of Iowa, Iowa City, IA, USA.
- Cancer Biology Training Program, University of Iowa, Iowa City, IA, USA.
| |
Collapse
|
8
|
Sønderstrup M, Batiuk MY, Mantas P, Tapias-Espinosa C, Oliveras I, Cañete T, Sampedro-Viana D, Brudek T, Rydbirk R, Khodosevich K, Fernandez-Teruel A, Elfving B, Aznar S. A maturational shift in the frontal cortex synaptic transcriptional landscape underlies schizophrenia-relevant behavioural traits: A congenital rat model. Eur Neuropsychopharmacol 2023; 74:32-46. [PMID: 37263043 DOI: 10.1016/j.euroneuro.2023.05.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 04/05/2023] [Accepted: 05/10/2023] [Indexed: 06/03/2023]
Abstract
Disruption of brain development early in life may underlie the neurobiology behind schizophrenia. We have reported more immature synaptic spines in the frontal cortex (FC) of adult Roman High-Avoidance (RHA-I) rats, a behavioural model displaying schizophrenia-like traits. Here, we performed a whole transcriptome analysis in the FC of 4 months old male RHA-I (n=8) and its counterpart, the Roman Low-Avoidance (RLA-I) (n=8). We identified 203 significant genes with overrepresentation of genes involved in synaptic function. Next, we performed a gene set enrichment analysis (GSEA) for genes co-expressed during neurodevelopment. Gene networks were obtained by weighted gene co-expression network analysis (WGCNA) of a transcriptomic dataset containing human FC during lifespan (n=269). Out of thirty-one functional gene networks, six were significantly enriched in the RHA-I. These were differentially regulated during infancy and enriched in biological ontologies related to myelination, synaptic function, and immune response. We validated differential gene expression in a new cohort of adolescent (<=2 months old) and young-adult (>=3 months old) RHA-I and RLA-I rats. The results confirmed overexpression of Gsn, Nt5cd1, Ppp1r1b, and Slc9a3r1 in young-adult RHA-I, while Cables1, a regulator of Cdk5 phosphorylation in actin regulation and involved in synaptic plasticity and maturation, was significantly downregulated in adolescent RHA-I. This age-related expression change was also observed for presynaptic components Snap25 and Snap29. Our results show a different maturational expression profile of synaptic components in the RHA-I strain, supporting a shift in FC maturation underlying schizophrenia-like behavioural traits and adding construct validity to this strain as a neurodevelopmental model.
Collapse
Affiliation(s)
- Marie Sønderstrup
- Centre for Neuroscience and Stereology, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark; Translational Neuropsychiatry Unit, Department of Clinical Medicine, Aarhus University, Denmark
| | - Mykhailo Y Batiuk
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Panagiotis Mantas
- Department of Health Technology, Technical University of Denmark (DTU), Denmark
| | - Carles Tapias-Espinosa
- Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain
| | - Ignasi Oliveras
- Centre for Neuroscience and Stereology, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark; Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain
| | - Toni Cañete
- Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain
| | - Daniel Sampedro-Viana
- Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain
| | - Tomasz Brudek
- Centre for Neuroscience and Stereology, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark; Center for Translational Research, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark
| | - Rasmus Rydbirk
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Konstantin Khodosevich
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Alberto Fernandez-Teruel
- Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain.
| | - Betina Elfving
- Translational Neuropsychiatry Unit, Department of Clinical Medicine, Aarhus University, Denmark
| | - Susana Aznar
- Centre for Neuroscience and Stereology, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark; Center for Translational Research, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark.
| |
Collapse
|
9
|
Bonner ER, Dawood A, Gordish-Dressman H, Eze A, Bhattacharya S, Yadavilli S, Mueller S, Waszak SM, Nazarian J. Pan-cancer atlas of somatic core and linker histone mutations. NPJ Genom Med 2023; 8:23. [PMID: 37640703 PMCID: PMC10462747 DOI: 10.1038/s41525-023-00367-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Accepted: 08/07/2023] [Indexed: 08/31/2023] Open
Abstract
Recent genomic data points to a growing role for somatic mutations altering core histone and linker histone-encoding genes in cancer. However, the prevalence and the clinical and biological implications of histone gene mutations in malignant tumors remain incompletely defined. To address these knowledge gaps, we analyzed somatic mutations in 88 linker and core histone genes across 12,743 tumors from pediatric, adolescent and young adult (AYA), and adult cancer patients. We established a pan-cancer histone mutation atlas contextualized by patient age, survival outcome, and tumor location. Overall, 11% of tumors harbored somatic histone mutations, with the highest rates observed among chondrosarcoma (67%), pediatric high-grade glioma (pHGG, >60%), and lymphoma (>30%). Previously unreported histone mutations were discovered in pHGG and other pediatric brain tumors, extending the spectrum of histone gene alterations associated with these cancers. Histone mutation status predicted patient survival outcome in tumor entities including adrenocortical carcinoma. Recurrent pan-cancer histone mutation hotspots were defined and shown to converge on evolutionarily conserved and functional residues. Moreover, we studied histone gene mutations in 1700 pan-cancer cell lines to validate the prevalence and spectrum of histone mutations seen in primary tumors and derived histone-associated drug response profiles, revealing candidate drugs targeting histone mutant cancer cells. This study presents the first-of-its-kind atlas of both core and linker histone mutations across pediatric, AYA, and adult cancers, providing a framework by which specific cancers may be redefined in the context of histone and chromatin alterations.
Collapse
Affiliation(s)
- Erin R Bonner
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | - Adam Dawood
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | | | - Augustine Eze
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | - Surajit Bhattacharya
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | - Sridevi Yadavilli
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | - Sabine Mueller
- Department of Neurology, Neurosurgery and Pediatrics, University of California San Francisco, San Francisco, CA, USA
- Department of Oncology, University Children's Hospital Zürich, Zürich, Switzerland
| | - Sebastian M Waszak
- Laboratory of Computational Neuro-Oncology, Swiss Institute for Experimental Cancer Research (ISREC), School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo and Oslo University Hospital, Oslo, Norway.
- Department of Neurology, University of California, San Francisco, San Francisco, CA, USA.
| | - Javad Nazarian
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA.
- Department of Oncology, University Children's Hospital Zürich, Zürich, Switzerland.
| |
Collapse
|
10
|
Das S, Biswas NK, Basu A. Mapinsights: deep exploration of quality issues and error profiles in high-throughput sequence data. Nucleic Acids Res 2023; 51:e75. [PMID: 37378434 PMCID: PMC10415152 DOI: 10.1093/nar/gkad539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 05/16/2023] [Accepted: 06/27/2023] [Indexed: 06/29/2023] Open
Abstract
High-throughput sequencing (HTS) has revolutionized science by enabling super-fast detection of genomic variants at base-pair resolution. Consequently, it poses the challenging problem of identification of technical artifacts, i.e. hidden non-random error patterns. Understanding the properties of sequencing artifacts holds the key in separating true variants from false positives. Here, we develop Mapinsights, a toolkit that performs quality control (QC) analysis of sequence alignment files, capable of detecting outliers based on sequencing artifacts of HTS data at a deeper resolution compared with existing methods. Mapinsights performs a cluster analysis based on novel and existing QC features derived from the sequence alignment for outlier detection. We applied Mapinsights on community standard open-source datasets and identified various quality issues including technical errors related to sequencing cycles, sequencing chemistry, sequencing libraries and across various orthogonal sequencing platforms. Mapinsights also enables identification of anomalies related to sequencing depth. A logistic regression-based model built on the features of Mapinsights shows high accuracy in detecting 'low-confidence' variant sites. Quantitative estimates and probabilistic arguments provided by Mapinsights can be utilized in identifying errors, bias and outlier samples, and also aid in improving the authenticity of variant calls.
Collapse
Affiliation(s)
- Subrata Das
- National Institute of Biomedical Genomics, Kalyani, 741251, West Bengal, India
| | - Nidhan K Biswas
- National Institute of Biomedical Genomics, Kalyani, 741251, West Bengal, India
| | - Analabha Basu
- National Institute of Biomedical Genomics, Kalyani, 741251, West Bengal, India
| |
Collapse
|
11
|
Haase MAB, Lazar-Stefanita L, Ólafsson G, Wudzinska A, Shen MJ, Truong DM, Boeke JD. Human macroH2A1 drives nucleosome dephasing and genome instability in histone-humanized yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.06.538725. [PMID: 37205538 PMCID: PMC10187286 DOI: 10.1101/2023.05.06.538725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
In addition to replicative histones, eukaryotic genomes encode a repertoire of non-replicative variant histones providing additional layers of structural and epigenetic regulation. Here, we systematically replaced individual replicative human histones with non-replicative human variant histones using a histone replacement system in yeast. Variants H2A.J, TsH2B, and H3.5 complemented for their respective replicative counterparts. However, macroH2A1 failed to complement and its expression was toxic in yeast, negatively interacting with native yeast histones and kinetochore genes. To isolate yeast with "macroH2A1 chromatin" we decoupled the effects of its macro and histone fold domains, which revealed that both domains sufficed to override native yeast nucleosome positioning. Furthermore, both modified constructs of macroH2A1 exhibited lower nucleosome occupancy that correlated with decreased short-range chromatin interactions (<20 Kb), disrupted centromeric clustering, and increased chromosome instability. While supporting viability, macroH2A1 dramatically alters chromatin organization in yeast, leading to genome instability and massive fitness defects.
Collapse
Affiliation(s)
- Max A. B. Haase
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, 430 East 29th Street, New York, 10016, USA
- Vilcek Institute of Graduate Biomedical Sciences, NYU School of Medicine, New York, NY, 10016, USA
| | - Luciana Lazar-Stefanita
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, 430 East 29th Street, New York, 10016, USA
| | - Guðjón Ólafsson
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, 430 East 29th Street, New York, 10016, USA
| | - Aleksandra Wudzinska
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, 430 East 29th Street, New York, 10016, USA
| | - Michael J. Shen
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, 430 East 29th Street, New York, 10016, USA
| | - David M. Truong
- Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, NY, 11201, USA
- Department of Pathology, NYU Langone Health, New York, NY 10016, USA
| | - Jef D. Boeke
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, 430 East 29th Street, New York, 10016, USA
- Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
- Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, NY, 11201, USA
| |
Collapse
|
12
|
Rausch T, Snajder R, Leger A, Simovic M, Giurgiu M, Villacorta L, Henssen AG, Fröhling S, Stegle O, Birney E, Bonder MJ, Ernst A, Korbel JO. Long-read sequencing of diagnosis and post-therapy medulloblastoma reveals complex rearrangement patterns and epigenetic signatures. CELL GENOMICS 2023; 3:100281. [PMID: 37082141 PMCID: PMC10112291 DOI: 10.1016/j.xgen.2023.100281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 06/14/2022] [Accepted: 02/22/2023] [Indexed: 04/22/2023]
Abstract
Cancer genomes harbor a broad spectrum of structural variants (SVs) driving tumorigenesis, a relevant subset of which escape discovery using short-read sequencing. We employed Oxford Nanopore Technologies (ONT) long-read sequencing in a paired diagnostic and post-therapy medulloblastoma to unravel the haplotype-resolved somatic genetic and epigenetic landscape. We assembled complex rearrangements, including a 1.55-Mbp chromothripsis event, and we uncover a complex SV pattern termed templated insertion (TI) thread, characterized by short (mostly <1 kb) insertions showing prevalent self-concatenation into highly amplified structures of up to 50 kbp in size. TI threads occur in 3% of cancers, with a prevalence up to 74% in liposarcoma, and frequent colocalization with chromothripsis. We also perform long-read-based methylome profiling and discover allele-specific methylation (ASM) effects, complex rearrangements exhibiting differential methylation, and differential promoter methylation in cancer-driver genes. Our study shows the advantage of long-read sequencing in the discovery and characterization of complex somatic rearrangements.
Collapse
Affiliation(s)
- Tobias Rausch
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- European Molecular Biology Laboratory (EMBL), GeneCore, Heidelberg, Germany
| | - Rene Snajder
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Faculty for Biosciences, Heidelberg University, Heidelberg, Germany
- HIDSS4Health, Helmholtz Information and Data Science School for Health, Heidelberg, Germany
| | - Adrien Leger
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Milena Simovic
- Group “Genome Instability in Tumors,” German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Mădălina Giurgiu
- Experimental and Clinical Research Center (ECRC) of the Max Delbrück Center (MDC) and Charité-Universitätsmedizin, Berlin, Germany
- Freie Universität Berlin, Berlin, Germany
| | - Laura Villacorta
- European Molecular Biology Laboratory (EMBL), GeneCore, Heidelberg, Germany
| | - Anton G. Henssen
- Department of Pediatric Oncology/Hematology, Charité-Universitätsmedizin, Berlin, Germany
- Experimental and Clinical Research Center (ECRC) of the Max Delbrück Center (MDC) and Charité-Universitätsmedizin, Berlin, Germany
- German Cancer Consortium (DKTK), partner site Berlin, and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Stefan Fröhling
- National Center for Tumor Diseases (NCT), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
| | - Oliver Stegle
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Ewan Birney
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Marc Jan Bonder
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Corresponding author
| | - Aurelie Ernst
- Group “Genome Instability in Tumors,” German Cancer Research Center (DKFZ), Heidelberg, Germany
- Corresponding author
| | - Jan O. Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Bridging Research Division on Mechanisms of Genomic Variation and Data Science, DKFZ, Heidelberg, Germany
- Corresponding author
| |
Collapse
|
13
|
Boyd JR, Gao C, Quinn K, Fritz A, Stein J, Stein G, Glass K, Frietze S. peaksat: an R package for ChIP-seq peak saturation analysis. BMC Genomics 2023; 24:43. [PMID: 36698077 PMCID: PMC9878872 DOI: 10.1186/s12864-023-09109-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 01/02/2023] [Indexed: 01/26/2023] Open
Abstract
BACKGROUND Epigenomic profiling assays such as ChIP-seq have been widely used to map the genome-wide enrichment profiles of chromatin-associated proteins and posttranslational histone modifications. Sequencing depth is a key parameter in experimental design and quality control. However, due to variable sequencing depth requirements across experimental conditions, it can be challenging to determine optimal sequencing depth, particularly for projects involving multiple targets or cell types. RESULTS We developed the peaksat R package to provide target read depth estimates for epigenomic experiments based on the analysis of peak saturation curves. We applied peaksat to establish the distinctive read depth requirements for ChIP-seq studies of histone modifications in different cell lines. Using peaksat, we were able to estimate the target read depth required per library to obtain high-quality peak calls for downstream analysis. In addition, peaksat was applied to other sequence-enrichment methods including CUT&RUN and ATAC-seq. CONCLUSION peaksat addresses a need for researchers to make informed decisions about whether their sequencing data has been generated to an adequate depth and subsequently sufficient meaningful peaks, and failing that, how many more reads would be required per library. peaksat is applicable to other sequence-based methods that include calling peaks in their analysis.
Collapse
Affiliation(s)
- Joseph R Boyd
- Department of Biomedical and Health Sciences, University of Vermont, 106 Carrigan Drive, 302 Rowell, VT, 04505, Burlington, USA
| | - Cong Gao
- Department of Biomedical and Health Sciences, University of Vermont, 106 Carrigan Drive, 302 Rowell, VT, 04505, Burlington, USA
| | - Kathleen Quinn
- Department of Biomedical and Health Sciences, University of Vermont, 106 Carrigan Drive, 302 Rowell, VT, 04505, Burlington, USA
| | - Andrew Fritz
- Department of Biochemistry, University of Vermont, Burlington, VT, USA
| | - Janet Stein
- Department of Biochemistry, University of Vermont, Burlington, VT, USA
- University of Vermont Cancer Center, Burlington, VT, USA
| | - Gary Stein
- Department of Biochemistry, University of Vermont, Burlington, VT, USA
- University of Vermont Cancer Center, Burlington, VT, USA
- Department of Surgery, University of Vermont, Burlington, VT, USA
| | - Karen Glass
- Department of Biochemistry, University of Vermont, Burlington, VT, USA
- University of Vermont Cancer Center, Burlington, VT, USA
- Department of Pharmacology, University of Vermont, Burlington, VT, USA
| | - Seth Frietze
- Department of Biomedical and Health Sciences, University of Vermont, 106 Carrigan Drive, 302 Rowell, VT, 04505, Burlington, USA.
- Department of Biochemistry, University of Vermont, Burlington, VT, USA.
- University of Vermont Cancer Center, Burlington, VT, USA.
| |
Collapse
|
14
|
Koh C, Frangeul L, Blanc H, Ngoagouni C, Boyer S, Dussart P, Grau N, Girod R, Duchemin JB, Saleh MC. Ribosomal RNA (rRNA) sequences from 33 globally distributed mosquito species for improved metagenomics and species identification. eLife 2023; 12:82762. [PMID: 36688360 PMCID: PMC10014081 DOI: 10.7554/elife.82762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 01/19/2023] [Indexed: 01/24/2023] Open
Abstract
Total RNA sequencing (RNA-seq) is an important tool in the study of mosquitoes and the RNA viruses they vector as it allows assessment of both host and viral RNA in specimens. However, there are two main constraints. First, as with many other species, abundant mosquito ribosomal RNA (rRNA) serves as the predominant template from which sequences are generated, meaning that the desired host and viral templates are sequenced far less. Second, mosquito specimens captured in the field must be correctly identified, in some cases to the sub-species level. Here, we generate mosquito rRNA datasets which will substantially mitigate both of these problems. We describe a strategy to assemble novel rRNA sequences from mosquito specimens and produce an unprecedented dataset of 234 full-length 28S and 18S rRNA sequences of 33 medically important species from countries with known histories of mosquito-borne virus circulation (Cambodia, the Central African Republic, Madagascar, and French Guiana). These sequences will allow both physical and computational removal of rRNA from specimens during RNA-seq protocols. We also assess the utility of rRNA sequences for molecular taxonomy and compare phylogenies constructed using rRNA sequences versus those created using the gold standard for molecular species identification of specimens-the mitochondrial cytochrome c oxidase I (COI) gene. We find that rRNA- and COI-derived phylogenetic trees are incongruent and that 28S and concatenated 28S+18S rRNA phylogenies reflect evolutionary relationships that are more aligned with contemporary mosquito systematics. This significant expansion to the current rRNA reference library for mosquitoes will improve mosquito RNA-seq metagenomics by permitting the optimization of species-specific rRNA depletion protocols for a broader range of species and streamlining species identification by rRNA sequence and phylogenetics.
Collapse
Affiliation(s)
- Cassandra Koh
- Institut Pasteur, Université Paris Cité, CNRS UMR3569, Viruses and RNA Interference Unit, F-75015ParisFrance
| | - Lionel Frangeul
- Institut Pasteur, Université Paris Cité, CNRS UMR3569, Viruses and RNA Interference Unit, F-75015ParisFrance
| | - Hervé Blanc
- Institut Pasteur, Université Paris Cité, CNRS UMR3569, Viruses and RNA Interference Unit, F-75015ParisFrance
| | - Carine Ngoagouni
- Institut Pasteur de Bangui, Medical Entomology LaboratoryBanguiCentral African Republic
| | - Sébastien Boyer
- Institut Pasteur du Cambodge, Medical and Veterinary Entomology UnitPhnom PenhCambodia
| | | | - Nina Grau
- Institut Pasteur de Madagascar, Medical Entomology UnitAntananarivoMadagascar
| | - Romain Girod
- Institut Pasteur de Madagascar, Medical Entomology UnitAntananarivoMadagascar
| | - Jean-Bernard Duchemin
- Institut Pasteur de la Guyane, Vectopôle Amazonien Emile AbonnencCayenneFrench Guiana
| | - Maria-Carla Saleh
- Institut Pasteur, Université Paris Cité, CNRS UMR3569, Viruses and RNA Interference Unit, F-75015ParisFrance
| |
Collapse
|
15
|
Jokinen V, Mehine M, Reinikka S, Khamaiseh S, Ahvenainen T, Äyräväinen A, Härkki P, Bützow R, Pasanen A, Vahteristo P. 3'RNA and whole-genome sequencing of archival uterine leiomyomas reveal a tumor subtype with chromosomal rearrangements affecting either HMGA2, HMGA1, or PLAG1. Genes Chromosomes Cancer 2023; 62:27-38. [PMID: 35822448 PMCID: PMC9804854 DOI: 10.1002/gcc.23088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 07/08/2022] [Accepted: 07/09/2022] [Indexed: 01/09/2023] Open
Abstract
Uterine leiomyomas, or fibroids, are very common smooth muscle tumors that arise from the myometrium. They can be divided into distinct molecular subtypes. We have previously shown that 3'RNA-sequencing is highly effective in classifying archival formalin-fixed paraffin-embedded (FFPE) leiomyomas according to the underlying mutation. In this study, we performed 3'RNA-sequencing with 111 FFPE leiomyomas previously classified as negative for driver alterations in mediator complex subunit 12 (MED12), high mobility group AT-hook 2 (HMGA2), and fumarate hydratase (FH) by Sanger sequencing and immunohistochemistry. This revealed 43 tumors that displayed expression features typically seen in HMGA2-positive tumors, including overexpression of PLAG1. We explored 12 such leiomyomas by whole-genome sequencing to identify their underlying genomic drivers and to evaluate the feasibility of detecting chromosomal driver alterations from FFPE material. Four tumors with significant HMGA2 overexpression at the protein-level served as controls. We identified chromosomal rearrangements targeting either HMGA2, HMGA1, or PLAG1 in all 16 tumors, demonstrating that it is possible to detect chromosomal driver alterations in archival leiomyoma specimens as old as 18 years. Furthermore, two tumors displayed biallelic loss of DEPDC5 and one tumor harbored a COL4A5-COL4A6 deletion. These observations suggest that instead of only HMGA2-positive leiomyomas, a distinct leiomyoma subtype is characterized by rearrangements targeting either HMGA2, HMGA1, or PLAG1. The results indicate that the frequency of HMGA2-positive leiomyomas may be higher than estimated in previous studies where immunohistochemistry has been used. This study also demonstrates the feasibility of detecting chromosomal driver alterations from archival FFPE material.
Collapse
Affiliation(s)
- Vilja Jokinen
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of Medical and Clinical GeneticsUniversity of HelsinkiHelsinkiFinland
| | - Miika Mehine
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of Medical and Clinical GeneticsUniversity of HelsinkiHelsinkiFinland
| | - Siiri Reinikka
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of Medical and Clinical GeneticsUniversity of HelsinkiHelsinkiFinland
| | - Sara Khamaiseh
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of Medical and Clinical GeneticsUniversity of HelsinkiHelsinkiFinland,iCAN Digital Precision Cancer Medicine FlagshipHelsinkiFinland
| | - Terhi Ahvenainen
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of Medical and Clinical GeneticsUniversity of HelsinkiHelsinkiFinland,iCAN Digital Precision Cancer Medicine FlagshipHelsinkiFinland
| | - Anna Äyräväinen
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of Medical and Clinical GeneticsUniversity of HelsinkiHelsinkiFinland,Department of Obstetrics and GynecologyUniversity of Helsinki and Helsinki University HospitalHelsinkiFinland
| | - Päivi Härkki
- Department of Obstetrics and GynecologyUniversity of Helsinki and Helsinki University HospitalHelsinkiFinland
| | - Ralf Bützow
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of PathologyUniversity of Helsinki and HUSLAB, Helsinki University HospitalHelsinkiFinland
| | - Annukka Pasanen
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of PathologyUniversity of Helsinki and HUSLAB, Helsinki University HospitalHelsinkiFinland
| | - Pia Vahteristo
- Applied Tumor Genomics Research ProgramUniversity of HelsinkiHelsinkiFinland,Department of Medical and Clinical GeneticsUniversity of HelsinkiHelsinkiFinland,iCAN Digital Precision Cancer Medicine FlagshipHelsinkiFinland
| |
Collapse
|
16
|
Ferreira JC, Alshamali F, Pereira L, Fernandes V. Characterization of Arabian Peninsula whole exomes: Contributing to the catalogue of human diversity. iScience 2022; 25:105336. [DOI: 10.1016/j.isci.2022.105336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 08/01/2022] [Accepted: 10/10/2022] [Indexed: 11/24/2022] Open
|
17
|
Benning L, Morath C, Bartenschlager M, Kim H, Reineke M, Beimler J, Buylaert M, Nusshag C, Kälble F, Reichel P, Töllner M, Schaier M, Klein K, Benes V, Rausch T, Rieger S, Stich M, Tönshoff B, Weidner N, Schnitzler P, Zeier M, Süsal C, Hien Tran T, Bartenschlager R, Speer C. Neutralizing antibody response against the B.1.617.2 (delta) and the B.1.1.529 (omicron) variants after a third mRNA SARS-CoV-2 vaccine dose in kidney transplant recipients. Am J Transplant 2022; 22:1873-1883. [PMID: 35384272 PMCID: PMC9111366 DOI: 10.1111/ajt.17054] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Revised: 03/24/2022] [Accepted: 04/01/2022] [Indexed: 01/25/2023]
Abstract
Seroconversion after COVID-19 vaccination is impaired in kidney transplant recipients. Emerging variants of concern such as the B.1.617.2 (delta) and the B.1.1.529 (omicron) variants pose an increasing threat to these patients. In this observational cohort study, we measured anti-S1 IgG, surrogate neutralizing, and anti-receptor-binding domain antibodies three weeks after a third mRNA vaccine dose in 49 kidney transplant recipients and compared results to 25 age-matched healthy controls. In addition, vaccine-induced neutralization of SARS-CoV-2 wild-type, the B.1.617.2 (delta), and the B.1.1.529 (omicron) variants was assessed using a live-virus assay. After a third vaccine dose, anti-S1 IgG, surrogate neutralizing, and anti-receptor-binding domain antibodies were significantly lower in kidney transplant recipients compared to healthy controls. Only 29/49 (59%) sera of kidney transplant recipients contained neutralizing antibodies against the SARS-CoV-2 wild-type or the B.1.617.2 (delta) variant and neutralization titers were significantly reduced compared to healthy controls (p < 0.001). Vaccine-induced cross-neutralization of the B.1.1.529 (omicron) variants was detectable in 15/35 (43%) kidney transplant recipients with seropositivity for anti-S1 IgG, surrogate neutralizing, and/or anti-RBD antibodies. Neutralization of the B.1.1.529 (omicron) variants was significantly reduced compared to neutralization of SARS-CoV-2 wild-type or the B.1.617.2 (delta) variant for both, kidney transplant recipients and healthy controls (p < .001 for all).
Collapse
Affiliation(s)
- Louise Benning
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Christian Morath
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Marie Bartenschlager
- Department of Infectious Diseases, Molecular Virology, University of Heidelberg, Heidelberg, Germany
| | - Heeyoung Kim
- Department of Infectious Diseases, Molecular Virology, University of Heidelberg, Heidelberg, Germany
| | - Marvin Reineke
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Jörg Beimler
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Mirabel Buylaert
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Christian Nusshag
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Florian Kälble
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Paula Reichel
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | | | - Matthias Schaier
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Katrin Klein
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Vladimir Benes
- Genomics Core Facility, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Tobias Rausch
- Genomics Core Facility, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Susanne Rieger
- Department of Pediatrics I, University Children’s Hospital Heidelberg, Heidelberg, Germany
| | - Maximilian Stich
- Department of Pediatrics I, University Children’s Hospital Heidelberg, Heidelberg, Germany
| | - Burkhard Tönshoff
- Department of Pediatrics I, University Children’s Hospital Heidelberg, Heidelberg, Germany
| | - Niklas Weidner
- Department of Virology, University of Heidelberg, Heidelberg, Germany
| | - Paul Schnitzler
- Department of Virology, University of Heidelberg, Heidelberg, Germany
| | - Martin Zeier
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
| | - Caner Süsal
- Institute of Immunology, University of Heidelberg, Heidelberg, Germany
- Transplant Immunology Research Center of Excellence, Koç University Hospital, Istanbul, Turkey
| | - Thuong Hien Tran
- Institute of Immunology, University of Heidelberg, Heidelberg, Germany
| | - Ralf Bartenschlager
- Department of Infectious Diseases, Molecular Virology, University of Heidelberg, Heidelberg, Germany
- German Center for Infection Research, Heidelberg, Germany
- Division Virus-Associated Carcinogenesis, German Cancer Research Center, Heidelberg, Germany
| | - Claudius Speer
- Department of Nephrology, University of Heidelberg, Heidelberg, Germany
- Department of Molecular Medicine Partnership Unit Heidelberg, European Molecular Biology Laboratory, Heidelberg, Germany
| |
Collapse
|
18
|
Whitford W, Hawkins V, Moodley KS, Grant MJ, Lehnert K, Snell RG, Jacobsen JC. Proof of concept for multiplex amplicon sequencing for mutation identification using the MinION nanopore sequencer. Sci Rep 2022; 12:8572. [PMID: 35595858 PMCID: PMC9122479 DOI: 10.1038/s41598-022-12613-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 05/04/2022] [Indexed: 12/22/2022] Open
Abstract
Rapid, cost-effective identification of genetic variants in small candidate genomic regions remains a challenge, particularly for less well equipped or lower throughput laboratories. The application of Oxford Nanopore Technologies’ MinION sequencer has the potential to fulfil this requirement. We demonstrate a proof of concept for a multiplexing assay that pools PCR amplicons for MinION sequencing to enable sequencing of multiple templates from multiple individuals, which could be applied to gene-targeted diagnostics. A combined strategy of barcoding and sample pooling was developed for simultaneous multiplex MinION sequencing of 100 PCR amplicons. The amplicons are family-specific, spanning a total of 30 loci in DNA isolated from 82 human neurodevelopmental cases and family members. The target regions were chosen for further interrogation because a potentially disease-causative variant had been identified in affected individuals following Illumina exome sequencing. The pooled MinION sequences were deconvoluted by aligning to custom references using the minimap2 aligner software. Our multiplexing approach produced an interpretable and expected sequence from 29 of the 30 targeted genetic loci. The sequence variant which was not correctly resolved in the MinION sequence was adjacent to a five nucleotide homopolymer. It is already known that homopolymers present a resolution problem with the MinION approach. Interestingly despite equimolar quantities of PCR amplicon pooled for sequencing, significant variation in the depth of coverage (127×–19,626×; mean = 8321×, std err = 452.99) was observed. We observed independent relationships between depth of coverage and target length, and depth of coverage and GC content. These relationships demonstrate biases of the MinION sequencer for longer templates and those with lower GC content. We demonstrate an efficient approach for variant discovery or confirmation from short DNA templates using the MinION sequencing device. With less than 130 × depth of coverage required for accurate genotyping, the methodology described here allows for rapid highly multiplexed targeted sequencing of large numbers of samples in a minimally equipped laboratory with a potential cost as much 200 × less than that from Sanger sequencing.
Collapse
Affiliation(s)
- Whitney Whitford
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland, 1142, New Zealand. .,Centre for Brain Research, The University of Auckland, Auckland, New Zealand.
| | - Victoria Hawkins
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland, 1142, New Zealand.,Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Kriebashne S Moodley
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland, 1142, New Zealand.,Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Matthew J Grant
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland, 1142, New Zealand.,Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Klaus Lehnert
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland, 1142, New Zealand.,Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Russell G Snell
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland, 1142, New Zealand.,Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Jessie C Jacobsen
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland, 1142, New Zealand.,Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| |
Collapse
|
19
|
Niu YN, Roberts EG, Denisko D, Hoffman MM. Assessing and assuring interoperability of a genomics file format. Bioinformatics 2022; 38:3327-3336. [PMID: 35575355 PMCID: PMC9237710 DOI: 10.1093/bioinformatics/btac327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 03/30/2022] [Accepted: 05/11/2022] [Indexed: 12/01/2022] Open
Abstract
Motivation Bioinformatics software tools operate largely through the use of specialized genomics file formats. Often these formats lack formal specification, making it difficult or impossible for the creators of these tools to robustly test them for correct handling of input and output. This causes problems in interoperability between different tools that, at best, wastes time and frustrates users. At worst, interoperability issues could lead to undetected errors in scientific results. Results We developed a new verification system, Acidbio, which tests for correct behavior in bioinformatics software packages. We crafted tests to unify correct behavior when tools encounter various edge cases—potentially unexpected inputs that exemplify the limits of the format. To analyze the performance of existing software, we tested the input validation of 80 Bioconda packages that parsed the Browser Extensible Data (BED) format. We also used a fuzzing approach to automatically perform additional testing. Of 80 software packages examined, 75 achieved less than 70% correctness on our test suite. We categorized multiple root causes for the poor performance of different types of software. Fuzzing detected other errors that the manually designed test suite could not. We also created a badge system that developers can use to indicate more precisely which BED variants their software accepts and to advertise the software’s performance on the test suite. Availability and implementation Acidbio is available at https://github.com/hoffmangroup/acidbio. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yi Nian Niu
- Princess Margaret Cancer Centre University Health Network, Toronto, ON, M5G 2C1, Canada
| | - Eric G Roberts
- Princess Margaret Cancer Centre University Health Network, Toronto, ON, M5G 2C1, Canada
| | - Danielle Denisko
- Princess Margaret Cancer Centre University Health Network, Toronto, ON, M5G 2C1, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 1L7, Canada
| | - Michael M Hoffman
- Princess Margaret Cancer Centre University Health Network, Toronto, ON, M5G 2C1, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 1L7, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada.,Vector Institute, Toronto, ON, M5G 1M1, Canada
| |
Collapse
|
20
|
Abstract
Distilling biologically meaningful information from cancer genome sequencing data requires comprehensive identification of somatic alterations using rigorous computational methods. As the amount and complexity of sequencing data have increased, so has the number of tools for analysing them. Here, we describe the main steps involved in the bioinformatic analysis of cancer genomes, review key algorithmic developments and highlight popular tools and emerging technologies. These tools include those that identify point mutations, copy number alterations, structural variations and mutational signatures in cancer genomes. We also discuss issues in experimental design, the strengths and limitations of sequencing modalities and methodological challenges for the future.
Collapse
|
21
|
Wallmeroth D, Lackmann JW, Kueckelmann S, Altmüller J, Dieterich C, Boehm V, Gehring NH. Human UPF3A and UPF3B enable fault-tolerant activation of nonsense-mediated mRNA decay. EMBO J 2022; 41:e109191. [PMID: 35451084 PMCID: PMC9108619 DOI: 10.15252/embj.2021109191] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 03/18/2022] [Accepted: 03/31/2022] [Indexed: 12/14/2022] Open
Abstract
The paralogous human proteins UPF3A and UPF3B are involved in recognizing mRNAs targeted by nonsense‐mediated mRNA decay (NMD). UPF3B has been demonstrated to support NMD, presumably by bridging an exon junction complex (EJC) to the NMD factor UPF2. The role of UPF3A has been described either as a weak NMD activator or an NMD inhibitor. Here, we present a comprehensive functional analysis of UPF3A and UPF3B in human cells using combinatory experimental approaches. Overexpression or knockout of UPF3A as well as knockout of UPF3B did not substantially change global NMD activity. In contrast, the co‐depletion of UPF3A and UPF3B resulted in a marked NMD inhibition and a transcriptome‐wide upregulation of NMD substrates, demonstrating a functional redundancy between both NMD factors. In rescue experiments, UPF2 or EJC binding‐deficient UPF3B largely retained NMD activity. However, combinations of different mutants, including deletion of the middle domain, showed additive or synergistic effects and therefore failed to maintain NMD. Collectively, UPF3A and UPF3B emerge as fault‐tolerant, functionally redundant NMD activators in human cells.
Collapse
Affiliation(s)
- Damaris Wallmeroth
- Institute for Genetics, University of Cologne, Cologne, Germany.,Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
| | | | - Sabrina Kueckelmann
- Institute for Genetics, University of Cologne, Cologne, Germany.,Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
| | - Janine Altmüller
- Cologne Center for Genomics (CCG), University of Cologne, Cologne, Germany
| | - Christoph Dieterich
- Section of Bioinformatics and Systems Cardiology, Department of Internal Medicine III and Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg University Hospital, Heidelberg, Germany.,DZHK (German Centre for Cardiovascular Research), Partner site Heidelberg/Mannheim, Heidelberg, Germany
| | - Volker Boehm
- Institute for Genetics, University of Cologne, Cologne, Germany.,Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
| | - Niels H Gehring
- Institute for Genetics, University of Cologne, Cologne, Germany.,Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
| |
Collapse
|
22
|
Fernández-Orth D, Rueda M, Singh B, Moldes M, Jene A, Ferri M, Vasallo C, Fromont LA, Navarro A, Rambla J. A quality control portal for sequencing data deposited at the European genome-phenome archive. Brief Bioinform 2022; 23:6570012. [PMID: 35438138 PMCID: PMC9116225 DOI: 10.1093/bib/bbac136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 03/01/2022] [Accepted: 03/23/2022] [Indexed: 11/15/2022] Open
Abstract
Since its launch in 2008, the European Genome-Phenome Archive (EGA) has been leading the archiving and distribution of human identifiable genomic data. In this regard, one of the community concerns is the potential usability of the stored data, as of now, data submitters are not mandated to perform any quality control (QC) before uploading their data and associated metadata information. Here, we present a new File QC Portal developed at EGA, along with QC reports performed and created for 1 694 442 files [Fastq, sequence alignment map (SAM)/binary alignment map (BAM)/CRAM and variant call format (VCF)] submitted at EGA. QC reports allow anonymous EGA users to view summary-level information regarding the files within a specific dataset, such as quality of reads, alignment quality, number and type of variants and other features. Researchers benefit from being able to assess the quality of data prior to the data access decision and thereby, increasing the reusability of data (https://ega-archive.org/blog/data-upcycling-powered-by-ega/).
Collapse
Affiliation(s)
- Dietmar Fernández-Orth
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Manuel Rueda
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Babita Singh
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Mauricio Moldes
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Aina Jene
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Marta Ferri
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Claudia Vasallo
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Lauren A Fromont
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Arcadi Navarro
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| | - Jordi Rambla
- European Genome-phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology Dr. Aiguader 88, Barcelona, 08003 Spain
| |
Collapse
|
23
|
Valls-Margarit J, Galván-Femenía I, Matías-Sánchez D, Blay N, Puiggròs M, Carreras A, Salvoro C, Cortés B, Amela R, Farre X, Lerga-Jaso J, Puig M, Sánchez-Herrero J, Moreno V, Perucho M, Sumoy L, Armengol L, Delaneau O, Cáceres M, de Cid R, Torrents D. GCAT|Panel, a comprehensive structural variant haplotype map of the Iberian population from high-coverage whole-genome sequencing. Nucleic Acids Res 2022; 50:2464-2479. [PMID: 35176773 PMCID: PMC8934637 DOI: 10.1093/nar/gkac076] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 12/24/2021] [Accepted: 02/09/2022] [Indexed: 11/17/2022] Open
Abstract
The combined analysis of haplotype panels with phenotype clinical cohorts is a common approach to explore the genetic architecture of human diseases. However, genetic studies are mainly based on single nucleotide variants (SNVs) and small insertions and deletions (indels). Here, we contribute to fill this gap by generating a dense haplotype map focused on the identification, characterization, and phasing of structural variants (SVs). By integrating multiple variant identification methods and Logistic Regression Models (LRMs), we present a catalogue of 35 431 441 variants, including 89 178 SVs (≥50 bp), 30 325 064 SNVs and 5 017 199 indels, across 785 Illumina high coverage (30x) whole-genomes from the Iberian GCAT Cohort, containing a median of 3.52M SNVs, 606 336 indels and 6393 SVs per individual. The haplotype panel is able to impute up to 14 360 728 SNVs/indels and 23 179 SVs, showing a 2.7-fold increase for SVs compared with available genetic variation panels. The value of this panel for SVs analysis is shown through an imputed rare Alu element located in a new locus associated with Mononeuritis of lower limb, a rare neuromuscular disease. This study represents the first deep characterization of genetic variation within the Iberian population and the first operational haplotype panel to systematically include the SVs into genome-wide genetic studies.
Collapse
Affiliation(s)
| | | | | | - Natalia Blay
- Genomes for Life-GCAT lab Group, Institute for Health Science Research Germans Trias i Pujol (IGTP), Badalona 08916, Spain
| | - Montserrat Puiggròs
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
| | - Anna Carreras
- Genomes for Life-GCAT lab Group, Institute for Health Science Research Germans Trias i Pujol (IGTP), Badalona 08916, Spain
| | - Cecilia Salvoro
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
| | - Beatriz Cortés
- Genomes for Life-GCAT lab Group, Institute for Health Science Research Germans Trias i Pujol (IGTP), Badalona 08916, Spain
| | - Ramon Amela
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
| | - Xavier Farre
- Genomes for Life-GCAT lab Group, Institute for Health Science Research Germans Trias i Pujol (IGTP), Badalona 08916, Spain
| | - Jon Lerga-Jaso
- Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Marta Puig
- Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Jose Francisco Sánchez-Herrero
- High Content Genomics and Bioinformatics Unit, Institute for Health Science Research Germans Trias i Pujol (IGTP), 08916 Badalona, Spain
| | - Victor Moreno
- Catalan Institute of Oncology, Hospitalet del Llobregat, 08908, Spain
- Bellvitge Biomedical Research Institute (IDIBELL), Hospitalet del Llobregat, 08908, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid 28029, Spain
- Universitat de Barcelona (UB), Barcelona 08007, Spain
| | - Manuel Perucho
- Sanford Burnham Prebys Medical Discovery Institute (SBP), La Jolla, CA 92037, USA
- Cancer Genetics and Epigenetics, Program of Predictive and Personalized Medicine of Cancer (PMPPC), Health Science Research Institute Germans Trias i Pujol (IGTP), Badalona 08916, Spain
| | - Lauro Sumoy
- High Content Genomics and Bioinformatics Unit, Institute for Health Science Research Germans Trias i Pujol (IGTP), 08916 Badalona, Spain
| | - Lluís Armengol
- Quantitative Genomic Medicine Laboratories (qGenomics), Esplugues del Llobregat, 08950, Spain
| | - Olivier Delaneau
- Department of Computational Biology, University of Lausanne, Génopode, 1015 Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), University of Lausanne, Quartier Sorge – Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Mario Cáceres
- Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- ICREA, Barcelona 08010, Spain
| | - Rafael de Cid
- Correspondence may also be addressed to Rafael de Cid. Tel: +34 930330542;
| | - David Torrents
- To whom correspondence should be addressed. Tel: +34 934134074;
| |
Collapse
|
24
|
Gaio D, Anantanawat K, To J, Liu M, Monahan L, Darling AE. Hackflex: low-cost, high-throughput, Illumina Nextera Flex library construction. Microb Genom 2022; 8. [PMID: 35014949 PMCID: PMC8914357 DOI: 10.1099/mgen.0.000744] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
We developed a low-cost method for the production of Illumina-compatible sequencing libraries that allows up to 14 times more libraries for high-throughput Illumina sequencing to be generated for the same cost. We call this new method Hackflex. The quality of library preparation was tested by constructing libraries from Escherichia coli MG1655 genomic DNA using either Hackflex, standard Nextera Flex (recently renamed as Illumina DNA Prep) or a variation of standard Nextera Flex in which the bead-linked transposase is diluted prior to use. In order to test the library quality for genomes with a higher and a lower G+C content, library construction methods were also tested on Pseudomonas aeruginosa PAO1 and Staphylococcus aureus ATCC 25923, respectively. We demonstrated that Hackflex can produce high-quality libraries and yields a highly uniform coverage, equivalent to the standard Nextera Flex kit. We show that strongly size-selected libraries produce sufficient yield and complexity to support de novo microbial genome assembly, and that assemblies of the large-insert libraries can be much more contiguous than standard libraries without strong size selection. We introduce a new set of sample barcodes that are distinct from standard Illumina barcodes, enabling Hackflex samples to be multiplexed with samples barcoded using standard Illumina kits. Using Hackflex, we were able to achieve a per-sample reagent cost for library prep of A$7.22 (Australian dollars) (US $5.60; UK £3.87, £1=A$1.87), which is 9.87 times lower than the standard Nextera Flex protocol at advertised retail price. An additional simple modification and further simplification of the protocol by omitting the wash step enables a further price reduction to reach an overall 14-fold cost saving. This method will allow researchers to construct more libraries within a given budget, thereby yielding more data and facilitating research programmes where sequencing large numbers of libraries is beneficial.
Collapse
Affiliation(s)
- Daniela Gaio
- iThree Institute, University of Technology Sydney, Sydney, NSW, Australia
| | - Kay Anantanawat
- iThree Institute, University of Technology Sydney, Sydney, NSW, Australia
| | - Joyce To
- iThree Institute, University of Technology Sydney, Sydney, NSW, Australia
| | - Michael Liu
- iThree Institute, University of Technology Sydney, Sydney, NSW, Australia
| | - Leigh Monahan
- iThree Institute, University of Technology Sydney, Sydney, NSW, Australia
| | - Aaron E Darling
- iThree Institute, University of Technology Sydney, Sydney, NSW, Australia
| |
Collapse
|
25
|
Sadlowski H, Schmidt V, Hiss J, Kuehn JA, Schneider CG, Zulu G, Hachangu A, Sikasunge CS, Mwape KE, Winkler AS, Schuelke M. Diagnosis of Taenia solium infections based on "mail order" RNA-sequencing of single tapeworm egg isolates from stool samples. PLoS Negl Trop Dis 2021; 15:e0009787. [PMID: 34890398 PMCID: PMC8694474 DOI: 10.1371/journal.pntd.0009787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 12/22/2021] [Accepted: 11/22/2021] [Indexed: 11/18/2022] Open
Abstract
Combined community health programs aiming at health education, preventive anti-parasitic chemotherapy, and vaccination of pigs have proven their potential to regionally reduce and even eliminate Taenia solium infections that are associated with a high risk of neurological disease through ingestion of T. solium eggs. Yet it remains challenging to target T. solium endemic regions precisely or to make exact diagnoses in individual patients. One major reason is that the widely available stool microscopy may identify Taenia ssp. eggs in stool samples as such, but fails to distinguish between invasive (T. solium) and less invasive Taenia (T. saginata, T. asiatica, and T. hydatigena) species. The identification of Taenia ssp. eggs in routine stool samples often prompts a time-consuming and frequently unsuccessful epidemiologic workup in remote villages far away from a diagnostic laboratory. Here we present "mail order" single egg RNA-sequencing, a new method allowing the identification of the exact Taenia ssp. based on a few eggs found in routine diagnostic stool samples. We provide first T. solium transcriptome data, which show extremely high mitochondrial DNA (mtDNA) transcript counts that can be used for subspecies classification. "Mail order" RNA-sequencing can be administered by health personnel equipped with basic laboratory tools such as a microscope, a Bunsen burner, and access to an international post office for shipment of samples to a next generation sequencing facility. Our suggested workflow combines traditional stool microscopy, RNA-extraction from single Taenia eggs with mitochondrial RNA-sequencing, followed by bioinformatic processing with a basic laptop computer. The workflow could help to better target preventive healthcare measures and improve diagnostic specificity in individual patients based on incidental findings of Taenia ssp. eggs in diagnostic laboratories with limited resources.
Collapse
Affiliation(s)
- Henrik Sadlowski
- Charité-Universitätsmedizin Berlin, corporate member of the Freie Universität Berlin and Humboldt-Universität zu Berlin, NeuroCure Cluster of Excellence, Berlin, Germany
| | - Veronika Schmidt
- Department of Neurology, Centre for Global Health, Klinikum rechts der Isar, Technical University Munich (TUM), Munich, Germany
- Centre for Global Health, Institute of Health and Society, University of Oslo, Oslo, Norway
| | - Jonathan Hiss
- Charité-Universitätsmedizin Berlin, corporate member of the Freie Universität Berlin and Humboldt-Universität zu Berlin, NeuroCure Cluster of Excellence, Berlin, Germany
| | - Johannes A. Kuehn
- Charité-Universitätsmedizin Berlin, corporate member of the Freie Universität Berlin and Humboldt-Universität zu Berlin, NeuroCure Cluster of Excellence, Berlin, Germany
| | - Christian G. Schneider
- Charité-Universitätsmedizin Berlin, corporate member of the Freie Universität Berlin and Humboldt-Universität zu Berlin, NeuroCure Cluster of Excellence, Berlin, Germany
| | - Gideon Zulu
- School of Veterinary Medicine, University of Zambia, Lusaka, Zambia
| | - Alex Hachangu
- School of Veterinary Medicine, University of Zambia, Lusaka, Zambia
| | | | - Kabemba E. Mwape
- School of Veterinary Medicine, University of Zambia, Lusaka, Zambia
| | - Andrea S. Winkler
- Department of Neurology, Centre for Global Health, Klinikum rechts der Isar, Technical University Munich (TUM), Munich, Germany
- Centre for Global Health, Institute of Health and Society, University of Oslo, Oslo, Norway
| | - Markus Schuelke
- Charité-Universitätsmedizin Berlin, corporate member of the Freie Universität Berlin and Humboldt-Universität zu Berlin, NeuroCure Cluster of Excellence, Berlin, Germany
- Charité-Universitätsmedizin Berlin, corporate member of the Freie Universität Berlin and Humboldt-Universität zu Berlin, Department of Neuropediatrics, Berlin, Germany
- * E-mail:
| |
Collapse
|
26
|
Seleit A, Aulehla A, Paix A. Endogenous protein tagging in medaka using a simplified CRISPR/Cas9 knock-in approach. eLife 2021; 10:75050. [PMID: 34870593 PMCID: PMC8691840 DOI: 10.7554/elife.75050] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 12/05/2021] [Indexed: 12/19/2022] Open
Abstract
The CRISPR/Cas9 system has been used to generate fluorescently labelled fusion proteins by homology-directed repair in a variety of species. Despite its revolutionary success, there remains an urgent need for increased simplicity and efficiency of genome editing in research organisms. Here, we establish a simplified, highly efficient, and precise strategy for CRISPR/Cas9-mediated endogenous protein tagging in medaka (Oryzias latipes). We use a cloning-free approach that relies on PCR-amplified donor fragments containing the fluorescent reporter sequences flanked by short homology arms (30–40 bp), a synthetic single-guide RNA and Cas9 mRNA. We generate eight novel knock-in lines with high efficiency of F0 targeting and germline transmission. Whole genome sequencing results reveal single-copy integration events only at the targeted loci. We provide an initial characterization of these fusion protein lines, significantly expanding the repertoire of genetic tools available in medaka. In particular, we show that the mScarlet-pcna line has the potential to serve as an organismal-wide label for proliferative zones and an endogenous cell cycle reporter.
Collapse
Affiliation(s)
- Ali Seleit
- Developmental Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Alexander Aulehla
- Developmental Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Alexandre Paix
- Developmental Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| |
Collapse
|
27
|
Smith JP, Corces MR, Xu J, Reuter VP, Chang HY, Sheffield NC. PEPATAC: an optimized pipeline for ATAC-seq data analysis with serial alignments. NAR Genom Bioinform 2021; 3:lqab101. [PMID: 34859208 PMCID: PMC8632735 DOI: 10.1093/nargab/lqab101] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Revised: 09/30/2021] [Accepted: 11/15/2021] [Indexed: 12/18/2022] Open
Abstract
As chromatin accessibility data from ATAC-seq experiments continues to expand, there is continuing need for standardized analysis pipelines. Here, we present PEPATAC, an ATAC-seq pipeline that is easily applied to ATAC-seq projects of any size, from one-off experiments to large-scale sequencing projects. PEPATAC leverages unique features of ATAC-seq data to optimize for speed and accuracy, and it provides several unique analytical approaches. Output includes convenient quality control plots, summary statistics, and a variety of generally useful data formats to set the groundwork for subsequent project-specific data analysis. Downstream analysis is simplified by a standard definition format, modularity of components, and metadata APIs in R and Python. It is restartable, fault-tolerant, and can be run on local hardware, using any cluster resource manager, or in provided Linux containers. We also demonstrate the advantage of aligning to the mitochondrial genome serially, which improves the accuracy of alignment statistics and quality control metrics. PEPATAC is a robust and portable first step for any ATAC-seq project. BSD2-licensed code and documentation are available at https://pepatac.databio.org.
Collapse
Affiliation(s)
- Jason P Smith
- Center for Public Health Genomics, University of Virginia, VA,22908, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, VA 22908 USA
| | - M Ryan Corces
- Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94304, USA
| | - Jin Xu
- Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94304, USA
| | - Vincent P Reuter
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, PA 19087, USA
| | - Howard Y Chang
- Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94304, USA
| | - Nathan C Sheffield
- Center for Public Health Genomics, University of Virginia, VA,22908, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, VA 22908 USA
- Department of Public Health Sciences, University of Virginia, VA 22908, USA
- Department of Biomedical Engineering, University of Virginia, VA 22908, USA
| |
Collapse
|
28
|
Hirschfeldova K, Cerny J, Bozikova P, Kuchtiak V, Rausch T, Benes V, Spaniel F, Gregus D, Horacek J, Vyklicky L, Balik A. Evidence for the Association between the Intronic Haplotypes of Ionotropic Glutamate Receptors and First-Episode Schizophrenia. J Pers Med 2021; 11:1250. [PMID: 34945722 PMCID: PMC8708351 DOI: 10.3390/jpm11121250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 11/14/2021] [Accepted: 11/16/2021] [Indexed: 11/16/2022] Open
Abstract
The heritable component of schizophrenia (SCH) as a polygenic trait is represented by numerous variants from a heterogeneous group of genes each contributing a relatively small effect. Various SNPs have already been found and analyzed in genes encoding the NMDAR subunits. However, less is known about genetic variations of genes encoding the AMPA and kainate receptor subunits. We analyzed sixteen iGluR genes in full length to determine the sequence variability of iGluR genes. Our aim was to describe the rate of genetic variability, its distribution, and the co-occurrence of variants and to identify new candidate risk variants or haplotypes. The cumulative effect of genetic risk was then estimated using a simple scoring model. GRIN2A-B, GRIN3A-B, and GRIK4 genes showed significantly increased genetic variation in SCH patients. The fixation index statistic revealed eight intronic haplotypes and an additional four intronic SNPs within the sequences of iGluR genes associated with SCH (p < 0.05). The haplotypes were used in the proposed simple scoring model and moreover as a test for genetic predisposition to schizophrenia. The positive likelihood ratio for the scoring model test reached 7.11. We also observed 41 protein-altering variants (38 missense variants, four frameshifts, and one nonsense variant) that were not significantly associated with SCH. Our data suggest that some intronic regulatory regions of iGluR genes and their common variability are among the components from which the genetic predisposition to SCH is composed.
Collapse
Affiliation(s)
- Katerina Hirschfeldova
- Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University, 12800 Prague, Czech Republic;
- Institute of Physiology, Czech Academy of Sciences, 14220 Prague, Czech Republic; (J.C.); (V.K.); (L.V.)
| | - Jiri Cerny
- Institute of Physiology, Czech Academy of Sciences, 14220 Prague, Czech Republic; (J.C.); (V.K.); (L.V.)
- Institute of Biotechnology, Czech Academy of Sciences, BIOCEV, 25250 Vestec, Czech Republic;
| | - Paulina Bozikova
- Institute of Biotechnology, Czech Academy of Sciences, BIOCEV, 25250 Vestec, Czech Republic;
| | - Viktor Kuchtiak
- Institute of Physiology, Czech Academy of Sciences, 14220 Prague, Czech Republic; (J.C.); (V.K.); (L.V.)
- Faculty of Science, Charles University, 12800 Prague, Czech Republic
| | - Tobias Rausch
- Genomics Core Facility, EMBL, 69117 Heidelberg, Germany; (T.R.); (V.B.)
| | - Vladimir Benes
- Genomics Core Facility, EMBL, 69117 Heidelberg, Germany; (T.R.); (V.B.)
| | - Filip Spaniel
- The National Institute of Mental Health, 25067 Klecany, Czech Republic; (F.S.); (D.G.); (J.H.)
| | - David Gregus
- The National Institute of Mental Health, 25067 Klecany, Czech Republic; (F.S.); (D.G.); (J.H.)
| | - Jiri Horacek
- The National Institute of Mental Health, 25067 Klecany, Czech Republic; (F.S.); (D.G.); (J.H.)
| | - Ladislav Vyklicky
- Institute of Physiology, Czech Academy of Sciences, 14220 Prague, Czech Republic; (J.C.); (V.K.); (L.V.)
| | - Ales Balik
- Institute of Physiology, Czech Academy of Sciences, 14220 Prague, Czech Republic; (J.C.); (V.K.); (L.V.)
- Institute of Physiology, Czech Academy of Sciences, BIOCEV, 25250 Vestec, Czech Republic
| |
Collapse
|
29
|
Sánchez-Corrales L, Tovar-Aguirre OL, Galeano-Vanegas NF, Castaño Jiménez PA, Martínez-Vega RA, Maldonado-Londoño CE, Hernández-Botero JS, Siller-López F. Phylogenomic analysis and Mycobacterium tuberculosis antibiotic resistance prediction by whole-genome sequencing from clinical isolates of Caldas, Colombia. PLoS One 2021; 16:e0258402. [PMID: 34618869 PMCID: PMC8496870 DOI: 10.1371/journal.pone.0258402] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 09/25/2021] [Indexed: 12/30/2022] Open
Abstract
Mycobacterium tuberculosis (M. tuberculosis) was the pathogen responsible for the highest number of deaths from infectious diseases in the world, before the arrival of the COVID-19 pandemic. Whole genome sequencing (WGS) has contributed to the understanding of genetic diversity, the mechanisms involved in drug resistance and the transmission dynamics of this pathogen. The object of this study is to use WGS for the epidemiological and molecular characterization of M. tuberculosis clinical strains from Chinchiná, Caldas, a small town in Colombia with a high incidence of TB. Sputum samples were obtained during the first semester of 2020 from six patients and cultured in solid Löwenstein-Jensen medium. DNA extraction was obtained from positive culture samples and WGS was performed with the Illumina HiSeq 2500 platform for subsequent bioinformatic analysis. M. tuberculosis isolates were typified as Euro-American lineage 4 with a predominance of the Harlem and LAM sublineages. All samples were proven sensitive to antituberculosis drugs by genomic analysis, although no phenotype antimicrobial tests were performed on the samples, unreported mutations were identified that could require further analysis. The present study provides preliminary data for the construction of a genomic database line and the follow-up of lineages in this region.
Collapse
Affiliation(s)
- Lusayda Sánchez-Corrales
- Maestría en Investigación en Enfermedades Infecciosas, Universidad de Santander, Bucaramanga, Santander, Colombia
| | | | - Narmer Fernando Galeano-Vanegas
- Instituto de Investigación en Microbiología y Biotecnología Agroindustrial, Universidad Católica de Manizales, Manizales, Caldas, Colombia
- Departamento de Biotecnología, BIOS Centro de Bioinformática y Biología Computacional, Manizales, Caldas, Colombia
| | | | | | | | - Johan Sebastián Hernández-Botero
- Grupo de Investigación Médica, Escuela de Medicina, Universidad de Manizales, Manizales, Caldas, Colombia
- Grupo de Resistencia Antibiótica de Manizales, Manizales, Caldas, Colombia
| | - Fernando Siller-López
- Programa de Bacteriología, Universidad Católica de Manizales, Manizales, Caldas, Colombia
- Programa de Microbiología, Universidad Libre, Pereira, Risaralda, Colombia
- * E-mail:
| |
Collapse
|
30
|
SMG5-SMG7 authorize nonsense-mediated mRNA decay by enabling SMG6 endonucleolytic activity. Nat Commun 2021; 12:3965. [PMID: 34172724 PMCID: PMC8233366 DOI: 10.1038/s41467-021-24046-3] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 05/30/2021] [Indexed: 12/28/2022] Open
Abstract
Eukaryotic gene expression is constantly controlled by the translation-coupled nonsense-mediated mRNA decay (NMD) pathway. Aberrant translation termination leads to NMD activation, resulting in phosphorylation of the central NMD factor UPF1 and robust clearance of NMD targets via two seemingly independent and redundant mRNA degradation branches. Here, we uncover that the loss of the first SMG5-SMG7-dependent pathway also inactivates the second SMG6-dependent branch, indicating an unexpected functional connection between the final NMD steps. Transcriptome-wide analyses of SMG5-SMG7-depleted cells confirm exhaustive NMD inhibition resulting in massive transcriptomic alterations. Intriguingly, we find that the functionally underestimated SMG5 can substitute the role of SMG7 and individually activate NMD. Furthermore, the presence of either SMG5 or SMG7 is sufficient to support SMG6-mediated endonucleolysis of NMD targets. Our data support an improved model for NMD execution that features two-factor authentication involving UPF1 phosphorylation and SMG5-SMG7 recruitment to access SMG6 activity. Degradation of nonsense mediated mRNA decay (NMD) substrates is carried out by two seemingly independent pathways, SMG6-mediated endonucleolytic cleavage and/or SMG5-SMG7-induced accelerated deadenylation. Here the authors show that SMG5-SMG7 maintain NMD activity by permitting SMG6 activation.
Collapse
|
31
|
New evaluation methods of read mapping by 17 aligners on simulated and empirical NGS data: an updated comparison of DNA- and RNA-Seq data from Illumina and Ion Torrent technologies. Neural Comput Appl 2021; 33:15669-15692. [PMID: 34155424 PMCID: PMC8208613 DOI: 10.1007/s00521-021-06188-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Accepted: 06/02/2021] [Indexed: 12/13/2022]
Abstract
During the last (15) years, improved omics sequencing technologies have expanded the scale and resolution of various biological applications, generating high-throughput datasets that require carefully chosen software tools to be processed. Therefore, following the sequencing development, bioinformatics researchers have been challenged to implement alignment algorithms for next-generation sequencing reads. However, nowadays selection of aligners based on genome characteristics is poorly studied, so our benchmarking study extended the “state of art” comparing 17 different aligners. The chosen tools were assessed on empirical human DNA- and RNA-Seq data, as well as on simulated datasets in human and mouse, evaluating a set of parameters previously not considered in such kind of benchmarks. As expected, we found that each tool was the best in specific conditions. For Ion Torrent single-end RNA-Seq samples, the most suitable aligners were CLC and BWA-MEM, which reached the best results in terms of efficiency, accuracy, duplication rate, saturation profile and running time. About Illumina paired-end osteomyelitis transcriptomics data, instead, the best performer algorithm, together with the already cited CLC, resulted Novoalign, which excelled in accuracy and saturation analyses. Segemehl and DNASTAR performed the best on both DNA-Seq data, with Segemehl particularly suitable for exome data. In conclusion, our study could guide users in the selection of a suitable aligner based on genome and transcriptome characteristics. However, several other aspects, emerged from our work, should be considered in the evolution of alignment research area, such as the involvement of artificial intelligence to support cloud computing and mapping to multiple genomes.
Collapse
|
32
|
Bulankova P, Sekulić M, Jallet D, Nef C, van Oosterhout C, Delmont TO, Vercauteren I, Osuna-Cruz CM, Vancaester E, Mock T, Sabbe K, Daboussi F, Bowler C, Vyverman W, Vandepoele K, De Veylder L. Mitotic recombination between homologous chromosomes drives genomic diversity in diatoms. Curr Biol 2021; 31:3221-3232.e9. [PMID: 34102110 DOI: 10.1016/j.cub.2021.05.013] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Revised: 03/17/2021] [Accepted: 05/06/2021] [Indexed: 01/31/2023]
Abstract
Diatoms, an evolutionarily successful group of microalgae, display high levels of intraspecific genetic variability in natural populations. However, the contribution of various mechanisms generating such diversity is unknown. Here we estimated the genetic micro-diversity within a natural diatom population and mapped the genomic changes arising within clonally propagated diatom cell cultures. Through quantification of haplotype diversity by next-generation sequencing and amplicon re-sequencing of selected loci, we documented a rapid accumulation of multiple haplotypes accompanied by the appearance of novel protein variants in cell cultures initiated from a single founder cell. Comparison of the genomic changes between mother and daughter cells revealed copy number variation and copy-neutral loss of heterozygosity leading to the fixation of alleles within individual daughter cells. The loss of heterozygosity can be accomplished by recombination between homologous chromosomes. To test this hypothesis, we established an endogenous readout system and estimated that the frequency of interhomolog mitotic recombination was under standard growth conditions 4.2 events per 100 cell divisions. This frequency is increased under environmental stress conditions, including treatment with hydrogen peroxide and cadmium. These data demonstrate that copy number variation and mitotic recombination between homologous chromosomes underlie clonal variability in diatom populations. We discuss the potential adaptive evolutionary benefits of the plastic response in the interhomolog mitotic recombination rate, and we propose that this may have contributed to the ecological success of diatoms.
Collapse
Affiliation(s)
- Petra Bulankova
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium.
| | - Mirna Sekulić
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium; Protistology and Aquatic Ecology, Department of Biology, Ghent University, 9000 Ghent, Belgium
| | - Denis Jallet
- TBI, Université de Toulouse, CNRS, INRAE, INSA, 135 Avenue de Rangueil, 31077 Toulouse, France
| | - Charlotte Nef
- Institut de Biologie de l'École Normale Supérieure (IBENS), École Normale Supérieure, CNRS, INSERM, PSL Université Paris, 75005 Paris, France
| | - Cock van Oosterhout
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
| | - Tom O Delmont
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91000 Evry, France
| | - Ilse Vercauteren
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| | - Cristina Maria Osuna-Cruz
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| | - Emmelien Vancaester
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| | - Thomas Mock
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
| | - Koen Sabbe
- Protistology and Aquatic Ecology, Department of Biology, Ghent University, 9000 Ghent, Belgium
| | - Fayza Daboussi
- TBI, Université de Toulouse, CNRS, INRAE, INSA, 135 Avenue de Rangueil, 31077 Toulouse, France
| | - Chris Bowler
- Institut de Biologie de l'École Normale Supérieure (IBENS), École Normale Supérieure, CNRS, INSERM, PSL Université Paris, 75005 Paris, France
| | - Wim Vyverman
- Protistology and Aquatic Ecology, Department of Biology, Ghent University, 9000 Ghent, Belgium
| | - Klaas Vandepoele
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| | - Lieven De Veylder
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium.
| |
Collapse
|
33
|
Quantification, Dynamic Visualization, and Validation of Bias in ATAC-Seq Data with ataqv. Cell Syst 2021; 10:298-306.e4. [PMID: 32213349 DOI: 10.1016/j.cels.2020.02.009] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Revised: 11/15/2019] [Accepted: 02/25/2020] [Indexed: 12/17/2022]
Abstract
The assay for transposase-accessible chromatin using sequencing (ATAC-seq) has become the preferred method for mapping chromatin accessibility due to its time and input material efficiency. However, it can be difficult to evaluate data quality and identify sources of technical bias across samples. Here, we present ataqv, a computational toolkit for efficiently measuring, visualizing, and comparing quality control (QC) results across samples and experiments. We use ataqv to analyze 2,009 public ATAC-seq datasets; their QC metrics display a 10-fold range. Tn5 dosage experiments and statistical modeling show that technical variation in the ratio of Tn5 transposase to nuclei and sequencing flowcell density induces systematic bias in ATAC-seq data by changing the enrichment of reads across functional genomic annotations including promoters, enhancers, and transcription-factor-bound regions, with the notable exception of CTCF. ataqv can be integrated into existing computational pipelines and is freely available at https://github.com/ParkerLab/ataqv/.
Collapse
|
34
|
Wang J, Rojas P, Mao J, Mustè Sadurnì M, Garnier O, Xiao S, Higgs MR, Garcia P, Saponaro M. Persistence of RNA transcription during DNA replication delays duplication of transcription start sites until G2/M. Cell Rep 2021; 34:108759. [PMID: 33596418 PMCID: PMC7900609 DOI: 10.1016/j.celrep.2021.108759] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Revised: 11/09/2020] [Accepted: 01/26/2021] [Indexed: 12/22/2022] Open
Abstract
As transcription and replication use DNA as substrate, conflicts between transcription and replication can occur, leading to genome instability with direct consequences for human health. To determine how the two processes are coordinated throughout S phase, we characterize both processes together at high resolution. We find that transcription occurs during DNA replication, with transcription start sites (TSSs) not fully replicated along with surrounding regions and remaining under-replicated until late in the cell cycle. TSSs undergo completion of DNA replication specifically when cells enter mitosis, when RNA polymerase II is removed. Intriguingly, G2/M DNA synthesis occurs at high frequency in unperturbed cell culture, but it is not associated with increased DNA damage and is fundamentally separated from mitotic DNA synthesis. TSSs duplicated in G2/M are characterized by a series of specific features, including high levels of antisense transcription, making them difficult to duplicate during S phase.
Collapse
Affiliation(s)
- Jianming Wang
- Transcription Associated Genome Instability Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Patricia Rojas
- Transcription Associated Genome Instability Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Jingwen Mao
- Transcription Associated Genome Instability Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Martina Mustè Sadurnì
- Transcription Associated Genome Instability Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Olivia Garnier
- Transcription Associated Genome Instability Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Songshu Xiao
- Transcription Associated Genome Instability Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Martin R Higgs
- Lysine Methylation and DNA Damage Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Paloma Garcia
- Stem Cells and Genome Stability Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Marco Saponaro
- Transcription Associated Genome Instability Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK.
| |
Collapse
|
35
|
Bolognini D, Magi A, Benes V, Korbel JO, Rausch T. TRiCoLOR: tandem repeat profiling using whole-genome long-read sequencing data. Gigascience 2020; 9:giaa101. [PMID: 33034633 PMCID: PMC7539535 DOI: 10.1093/gigascience/giaa101] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 08/07/2020] [Accepted: 09/07/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Tandem repeat sequences are widespread in the human genome, and their expansions cause multiple repeat-mediated disorders. Genome-wide discovery approaches are needed to fully elucidate their roles in health and disease, but resolving tandem repeat variation accurately remains a challenging task. While traditional mapping-based approaches using short-read data have severe limitations in the size and type of tandem repeats they can resolve, recent third-generation sequencing technologies exhibit substantially higher sequencing error rates, which complicates repeat resolution. RESULTS We developed TRiCoLOR, a freely available tool for tandem repeat profiling using error-prone long reads from third-generation sequencing technologies. The method can identify repetitive regions in sequencing data without a prior knowledge of their motifs or locations and resolve repeat multiplicity and period size in a haplotype-specific manner. The tool includes methods to interactively visualize the identified repeats and to trace their Mendelian consistency in pedigrees. CONCLUSIONS TRiCoLOR demonstrates excellent performance and improved sensitivity and specificity compared with alternative tools on synthetic data. For real human whole-genome sequencing data, TRiCoLOR achieves high validation rates, suggesting its suitability to identify tandem repeat variation in personal genomes.
Collapse
Affiliation(s)
- Davide Bolognini
- Department of Experimental and Clinical Medicine, University of Florence, Viale Pieraccini 6, Florence 50134, Italy
- European Molecular Biology Laboratory (EMBL), GeneCore, Meyerhofstraße 1, Heidelberg 69117, Germany
| | - Alberto Magi
- Department of Information Engineering, University of Florence, Via di S. Marta 3, Florence 50134, Italy
| | - Vladimir Benes
- European Molecular Biology Laboratory (EMBL), GeneCore, Meyerhofstraße 1, Heidelberg 69117, Germany
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, Heidelberg 69117, Germany
| | - Tobias Rausch
- European Molecular Biology Laboratory (EMBL), GeneCore, Meyerhofstraße 1, Heidelberg 69117, Germany
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, Heidelberg 69117, Germany
| |
Collapse
|
36
|
Erarslan-Uysal B, Kunz JB, Rausch T, Richter-Pechańska P, van Belzen IA, Frismantas V, Bornhauser B, Ordoñez-Rueada D, Paulsen M, Benes V, Stanulla M, Schrappe M, Cario G, Escherich G, Bakharevich K, Kirschner-Schwabe R, Eckert C, Loukanov T, Gorenflo M, Waszak SM, Bourquin JP, Muckenthaler MU, Korbel JO, Kulozik AE. Chromatin accessibility landscape of pediatric T-lymphoblastic leukemia and human T-cell precursors. EMBO Mol Med 2020; 12:e12104. [PMID: 32755029 PMCID: PMC7507092 DOI: 10.15252/emmm.202012104] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2020] [Revised: 06/19/2020] [Accepted: 06/24/2020] [Indexed: 01/11/2023] Open
Abstract
We aimed at identifying the developmental stage at which leukemic cells of pediatric T-ALLs are arrested and at defining leukemogenic mechanisms based on ATAC-Seq. Chromatin accessibility maps of seven developmental stages of human healthy T cells revealed progressive chromatin condensation during T-cell maturation. Developmental stages were distinguished by 2,823 signature chromatin regions with 95% accuracy. Open chromatin surrounding SAE1 was identified to best distinguish thymic developmental stages suggesting a potential role of SUMOylation in T-cell development. Deconvolution using signature regions revealed that T-ALLs, including those with mature immunophenotypes, resemble the most immature populations, which was confirmed by TF-binding motif profiles. We integrated ATAC-Seq and RNA-Seq and found DAB1, a gene not related to leukemia previously, to be overexpressed, abnormally spliced and hyper-accessible in T-ALLs. DAB1-negative patients formed a distinct subgroup with particularly immature chromatin profiles and hyper-accessible binding sites for SPI1 (PU.1), a TF crucial for normal T-cell maturation. In conclusion, our analyses of chromatin accessibility and TF-binding motifs showed that pediatric T-ALL cells are most similar to immature thymic precursors, indicating an early developmental arrest.
Collapse
Affiliation(s)
- Büşra Erarslan-Uysal
- Department of Pediatric Oncology, Hematology, and Immunology, University of Heidelberg, Heidelberg, Germany.,Hopp Children's Cancer Center (KiTZ) Heidelberg, Heidelberg, Germany.,Molecular Medicine Partnership Unit (MMPU), European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Joachim B Kunz
- Department of Pediatric Oncology, Hematology, and Immunology, University of Heidelberg, Heidelberg, Germany.,Hopp Children's Cancer Center (KiTZ) Heidelberg, Heidelberg, Germany.,Molecular Medicine Partnership Unit (MMPU), European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.,German Consortium for Translational Cancer Research (DKTK), Heidelberg, Germany
| | - Tobias Rausch
- Molecular Medicine Partnership Unit (MMPU), European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.,Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.,Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Paulina Richter-Pechańska
- Department of Pediatric Oncology, Hematology, and Immunology, University of Heidelberg, Heidelberg, Germany.,Hopp Children's Cancer Center (KiTZ) Heidelberg, Heidelberg, Germany.,Molecular Medicine Partnership Unit (MMPU), European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Ianthe Aem van Belzen
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Viktoras Frismantas
- Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Beat Bornhauser
- Division of Pediatric Oncology, University Children's Hospital, Zürich, Switzerland
| | - Diana Ordoñez-Rueada
- Flow Cytometry Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Malte Paulsen
- Flow Cytometry Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Vladimir Benes
- Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Martin Stanulla
- Department of Pediatric Hematology and Oncology, Hannover Medical School, Hannover, Germany
| | - Martin Schrappe
- Department of Pediatrics, University Hospital Schleswig-Holstein, Kiel, Germany
| | - Gunnar Cario
- Department of Pediatrics, University Hospital Schleswig-Holstein, Kiel, Germany
| | - Gabriele Escherich
- Clinic of Pediatric Hematology and Oncology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Kseniya Bakharevich
- Clinic of Pediatric Hematology and Oncology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Renate Kirschner-Schwabe
- Department of Pediatric Oncology/Hematology, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Cornelia Eckert
- Department of Pediatric Oncology/Hematology, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Tsvetomir Loukanov
- Department of Cardiac Surgery, University of Heidelberg, Heidelberg, Germany
| | - Matthias Gorenflo
- Department of Pediatric Cardiology and Congenital Heart Diseases, University of Heidelberg, Heidelberg, Germany
| | - Sebastian M Waszak
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Jean-Pierre Bourquin
- Division of Pediatric Oncology, University Children's Hospital, Zürich, Switzerland
| | - Martina U Muckenthaler
- Department of Pediatric Oncology, Hematology, and Immunology, University of Heidelberg, Heidelberg, Germany.,Hopp Children's Cancer Center (KiTZ) Heidelberg, Heidelberg, Germany.,Molecular Medicine Partnership Unit (MMPU), European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Jan O Korbel
- Molecular Medicine Partnership Unit (MMPU), European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.,Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Andreas E Kulozik
- Department of Pediatric Oncology, Hematology, and Immunology, University of Heidelberg, Heidelberg, Germany.,Hopp Children's Cancer Center (KiTZ) Heidelberg, Heidelberg, Germany.,Molecular Medicine Partnership Unit (MMPU), European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| |
Collapse
|
37
|
Iwanowicz DD, Wu-Smart JY, Olgun T, Smart AH, Otto CRV, Lopez D, Evans JD, Cornman R. An updated genetic marker for detection of Lake Sinai Virus and metagenetic applications. PeerJ 2020; 8:e9424. [PMID: 32742773 PMCID: PMC7370930 DOI: 10.7717/peerj.9424] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Accepted: 06/05/2020] [Indexed: 11/29/2022] Open
Abstract
Background Lake Sinai Viruses (LSV) are common RNA viruses of honey bees (Apis mellifera) that frequently reach high abundance but are not linked to overt disease. LSVs are genetically heterogeneous and collectively widespread, but despite frequent detection in surveys, the ecological and geographic factors structuring their distribution in A. mellifera are not understood. Even less is known about their distribution in other species. Better understanding of LSV prevalence and ecology have been hampered by high sequence diversity within the LSV clade. Methods Here we report a new polymerase chain reaction (PCR) assay that is compatible with currently known lineages with minimal primer degeneracy, producing an expected 365 bp amplicon suitable for end-point PCR and metagenetic sequencing. Using the Illumina MiSeq platform, we performed pilot metagenetic assessments of three sample sets, each representing a distinct variable that might structure LSV diversity (geography, tissue, and species). Results The first sample set in our pilot assessment compared cDNA pools from managed A. mellifera hives in California (n = 8) and Maryland (n = 6) that had previously been evaluated for LSV2, confirming that the primers co-amplify divergent lineages in real-world samples. The second sample set included cDNA pools derived from different tissues (thorax vs. abdomen, n = 24 paired samples), collected from managed A. mellifera hives in North Dakota. End-point detection of LSV frequently differed between the two tissue types; LSV metagenetic composition was similar in one pair of sequenced samples but divergent in a second pair. Overall, LSV1 and intermediate lineages were common in these samples whereas variants clustering with LSV2 were rare. The third sample set included cDNA from individual pollinator specimens collected from diverse landscapes in the vicinity of Lincoln, Nebraska. We detected LSV in the bee Halictus ligatus (four of 63 specimens tested, 6.3%) at a similar rate as A. mellifera (nine of 115 specimens, 7.8%), but only one H. ligatus sequencing library yielded sufficient data for compositional analysis. Sequenced samples often contained multiple divergent LSV lineages, including individual specimens. While these studies were exploratory rather than statistically powerful tests of hypotheses, they illustrate the utility of high-throughput sequencing for understanding LSV transmission within and among species.
Collapse
Affiliation(s)
- Deborah D Iwanowicz
- Leetown Science Center, U.S. Geological Survey, Kearneysville, WV, United States of America
| | - Judy Y Wu-Smart
- Entomology, University of Nebraska-Lincoln, Lincoln, NE, United States of America
| | - Tugce Olgun
- Entomology, University of Nebraska-Lincoln, Lincoln, NE, United States of America
| | - Autumn H Smart
- Entomology, University of Nebraska-Lincoln, Lincoln, NE, United States of America
| | - Clint R V Otto
- Northern Prairie Wildlife Research Center, U.S. Geological Survey, Jamestown, ND, United States of America
| | - Dawn Lopez
- Beltsville Agricultural Research Center, U.S. Department of Agriculture, Agricultural Research Service, Beltsville, MD, United States of America
| | - Jay D Evans
- Beltsville Agricultural Research Center, U.S. Department of Agriculture, Agricultural Research Service, Beltsville, MD, United States of America
| | - Robert Cornman
- Fort Collins Science Center, United States Geological Survey, Fort Collins, CO, United States of America
| |
Collapse
|
38
|
Smith JP, Sheffield NC. Analytical Approaches for ATAC-seq Data Analysis. CURRENT PROTOCOLS IN HUMAN GENETICS 2020; 106:e101. [PMID: 32543102 PMCID: PMC8191135 DOI: 10.1002/cphg.101] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
ATAC-seq, the assay for transposase-accessible chromatin using sequencing, is a quick and efficient approach to investigating the chromatin accessibility landscape. Investigating chromatin accessibility has broad utility for answering many biological questions, such as mapping nucleosomes, identifying transcription factor binding sites, and measuring differential activity of DNA regulatory elements. Because the ATAC-seq protocol is both simple and relatively inexpensive, there has been a rapid increase in the availability of chromatin accessibility data. Furthermore, advances in ATAC-seq protocols are rapidly extending its breadth to additional experimental conditions, cell types, and species. Accompanying the increase in data, there has also been an explosion of new tools and analytical approaches for analyzing it. Here, we explain the fundamentals of ATAC-seq data processing, summarize common analysis approaches, and review computational tools to provide recommendations for different research questions. This primer provides a starting point and a reference for analysis of ATAC-seq data. © 2020 Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Jason P. Smith
- Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia
| | - Nathan C. Sheffield
- Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia
- Department of Public Health Sciences, University of Virginia, Charlottesville, Virginia
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia
| |
Collapse
|
39
|
Tracy: basecalling, alignment, assembly and deconvolution of sanger chromatogram trace files. BMC Genomics 2020; 21:230. [PMID: 32171249 PMCID: PMC7071639 DOI: 10.1186/s12864-020-6635-8] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 02/28/2020] [Indexed: 12/18/2022] Open
Abstract
Background DNA sequencing is at the core of many molecular biology laboratories. Despite its long history, there is a lack of user-friendly Sanger sequencing data analysis tools that can be run interactively as a web application or at large-scale in batch from the command-line. Results We present Tracy, an efficient and versatile command-line application that enables basecalling, alignment, assembly and deconvolution of sequencing chromatogram files. Its companion web applications make all functionality of Tracy easily accessible using standard web browser technologies and interactive graphical user interfaces. Tracy can be easily integrated in large-scale pipelines and high-throughput settings, and it uses state-of-the-art file formats such as JSON and BCF for reporting chromatogram sequencing results and variant calls. The software is open-source and freely available at https://github.com/gear-genomics/tracy, the companion web applications are hosted at https://www.gear-genomics.com. Conclusions Tracy can be routinely applied in large-scale validation efforts conducted in clinical genomics studies as well as for high-throughput genome editing techniques that require a fast and rapid method to confirm discovered variants or engineered mutations. Molecular biologists benefit from the companion web applications that enable installation-free Sanger chromatogram analyses using intuitive, graphical user interfaces.
Collapse
|
40
|
Wein T, Dagan T. The Effect of Population Bottleneck Size and Selective Regime on Genetic Diversity and Evolvability in Bacteria. Genome Biol Evol 2019; 11:3283-3290. [PMID: 31688900 PMCID: PMC7145630 DOI: 10.1093/gbe/evz243] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/31/2019] [Indexed: 12/20/2022] Open
Abstract
Population bottlenecks leading to a drastic reduction of the population size are common in the evolutionary dynamics of natural populations; their occurrence is known to have implications for genome evolution due to genetic drift, the consequent reduction in genetic diversity, and the rate of adaptation. Nevertheless, an empirical characterization of the effect of population bottleneck size on evolutionary dynamics of bacteria is currently lacking. In this study, we show that selective conditions have a stronger effect on the evolutionary history of bacteria in comparison to population bottlenecks. We evolved Escherichia coli populations under three different population bottleneck sizes (small, medium, and large) in two temperature regimes (37 °C and 20 °C). We find a high genetic diversity in the large in comparison to the small bottleneck size. Nonetheless, the cold temperature led to reduced genetic diversity regardless the bottleneck size; hence, the temperature has a stronger effect on the genetic diversity in comparison to the bottleneck size. A comparison of the fitness gain among the evolved populations reveals a similar pattern where the temperature has a significant effect on the fitness. Our study demonstrates that population bottlenecks are an important determinant of bacterial evolvability; their consequences depend on the selective conditions and are best understood via their effect on the standing genetic variation.
Collapse
Affiliation(s)
- Tanita Wein
- Institute of Microbiology, Kiel University, Germany
| | - Tal Dagan
- Institute of Microbiology, Kiel University, Germany
| |
Collapse
|
41
|
Zhang X, Garnerone S, Simonetti M, Harbers L, Nicoś M, Mirzazadeh R, Venesio T, Sapino A, Hartman J, Marchiò C, Bienko M, Crosetto N. CUTseq is a versatile method for preparing multiplexed DNA sequencing libraries from low-input samples. Nat Commun 2019; 10:4732. [PMID: 31628304 PMCID: PMC6802095 DOI: 10.1038/s41467-019-12570-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Accepted: 09/17/2019] [Indexed: 12/14/2022] Open
Abstract
Current multiplexing strategies for massively parallel sequencing of genomic DNA mainly rely on library indexing in the final steps of library preparation. This procedure is costly and time-consuming, because a library must be generated separately for each sample. Furthermore, library preparation is challenging in the case of fixed samples, such as DNA extracted from formalin-fixed paraffin-embedded (FFPE) tissues. Here we describe CUTseq, a method that uses restriction enzymes and in vitro transcription to barcode and amplify genomic DNA prior to library construction. We thoroughly assess the sensitivity and reproducibility of CUTseq in both cell lines and FFPE samples, and demonstrate an application of CUTseq for multi-region DNA copy number profiling within single FFPE tumor sections, to assess intratumor genetic heterogeneity at high spatial resolution. In conclusion, CUTseq is a versatile and cost-effective method for library preparation for reduced representation genome sequencing, which can find numerous applications in research and diagnostics. Genomics DNA library preparation from formalin-fixed paraffin-embedded tissues is challenging. Here the authors describe CUTseq that uses restriction enzymes and in vitro amplification to barcode samples for reduced representation genome sequencing.
Collapse
Affiliation(s)
- Xiaolu Zhang
- Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17165, Sweden
| | - Silvano Garnerone
- Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17165, Sweden
| | - Michele Simonetti
- Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17165, Sweden
| | - Luuk Harbers
- Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17165, Sweden
| | - Marcin Nicoś
- Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17165, Sweden.,Department of Pneumonology, Oncology and Allergology, Medical University of Lublin, 20954, Lublin, Poland
| | - Reza Mirzazadeh
- Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17165, Sweden
| | - Tiziana Venesio
- Pathology Unit, Candiolo Cancer Institute, FPO-IRCCS, 10060, Candiolo (TO), Italy
| | - Anna Sapino
- Pathology Unit, Candiolo Cancer Institute, FPO-IRCCS, 10060, Candiolo (TO), Italy.,Department of Medical Sciences, University of Turin, Turin, Italy
| | - Johan Hartman
- Department of Oncology and Pathology, Karolinska Institutet, Stockholm, SE-17177, Sweden.,Department of Clinical Pathology, Karolinska University Laboratory, 17176, Stockholm, Sweden
| | - Caterina Marchiò
- Pathology Unit, Candiolo Cancer Institute, FPO-IRCCS, 10060, Candiolo (TO), Italy.,Department of Medical Sciences, University of Turin, Turin, Italy
| | - Magda Bienko
- Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17165, Sweden.
| | - Nicola Crosetto
- Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17165, Sweden.
| |
Collapse
|
42
|
The NSL complex maintains nuclear architecture stability via lamin A/C acetylation. Nat Cell Biol 2019; 21:1248-1260. [PMID: 31576060 DOI: 10.1038/s41556-019-0397-z] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Accepted: 08/21/2019] [Indexed: 12/26/2022]
Abstract
While nuclear lamina abnormalities are hallmarks of human diseases, their interplay with epigenetic regulators and precise epigenetic landscape remain poorly understood. Here, we show that loss of the lysine acetyltransferase MOF or its associated NSL-complex members KANSL2 or KANSL3 leads to a stochastic accumulation of nuclear abnormalities with genomic instability patterns including chromothripsis. SILAC-based MOF and KANSL2 acetylomes identified lamin A/C as an acetylation target of MOF. HDAC inhibition or acetylation-mimicking lamin A derivatives rescue nuclear abnormalities observed in MOF-deficient cells. Mechanistically, loss of lamin A/C acetylation resulted in its increased solubility, defective phosphorylation dynamics and impaired nuclear mechanostability. We found that nuclear abnormalities include EZH2-dependent histone H3 Lys 27 trimethylation and loss of nascent transcription. We term this altered epigenetic landscape "heterochromatin enrichment in nuclear abnormalities" (HENA). Collectively, the NSL-complex-dependent lamin A/C acetylation provides a mechanism that maintains nuclear architecture and genome integrity.
Collapse
|