101
|
Resources for Genetic and Genomic Analysis of Emerging Pathogen Acinetobacter baumannii. J Bacteriol 2015; 197:2027-35. [PMID: 25845845 DOI: 10.1128/jb.00131-15] [Citation(s) in RCA: 206] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 03/31/2015] [Indexed: 02/06/2023] Open
Abstract
UNLABELLED Acinetobacter baumannii is a Gram-negative bacterial pathogen notorious for causing serious nosocomial infections that resist antibiotic therapy. Research to identify factors responsible for the pathogen's success has been limited by the resources available for genome-scale experimental studies. This report describes the development of several such resources for A. baumannii strain AB5075, a recently characterized wound isolate that is multidrug resistant and displays robust virulence in animal models. We report the completion and annotation of the genome sequence, the construction of a comprehensive ordered transposon mutant library, the extension of high-coverage transposon mutant pool sequencing (Tn-seq) to the strain, and the identification of the genes essential for growth on nutrient-rich agar. These resources should facilitate large-scale genetic analysis of virulence, resistance, and other clinically relevant traits that make A. baumannii a formidable public health threat. IMPORTANCE Acinetobacter baumannii is one of six bacterial pathogens primarily responsible for antibiotic-resistant infections that have become the scourge of health care facilities worldwide. Eliminating such infections requires a deeper understanding of the factors that enable the pathogen to persist in hospital environments, establish infections, and resist antibiotics. We present a set of resources that should accelerate genome-scale genetic characterization of these traits for a reference isolate of A. baumannii that is highly virulent and representative of current outbreak strains.
Collapse
|
102
|
Santiago M, Matano LM, Moussa SH, Gilmore MS, Walker S, Meredith TC. A new platform for ultra-high density Staphylococcus aureus transposon libraries. BMC Genomics 2015; 16:252. [PMID: 25888466 PMCID: PMC4389836 DOI: 10.1186/s12864-015-1361-3] [Citation(s) in RCA: 73] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2014] [Accepted: 02/19/2015] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Staphylococcus aureus readily develops resistance to antibiotics and achieving effective therapies to overcome resistance requires in-depth understanding of S. aureus biology. High throughput, parallel-sequencing methods for analyzing transposon mutant libraries have the potential to revolutionize studies of S. aureus, but the genetic tools to take advantage of the power of next generation sequencing have not been fully developed. RESULTS Here we report a phage-based transposition system to make ultra-high density transposon libraries for genome-wide analysis of mutant fitness in any Φ11-transducible S. aureus strain. The high efficiency of the delivery system has made it possible to multiplex transposon cassettes containing different regulatory elements in order to make libraries in which genes are over- or under-expressed as well as deleted. By incorporating transposon-specific barcodes into the cassettes, we can evaluate how null mutations and changes in gene expression levels affect fitness in a single sequencing data set. Demonstrating the power of the system, we have prepared a library containing more than 690,000 unique insertions. Because one unique feature of the phage-based approach is that temperature-sensitive mutants are retained, we have carried out a genome-wide study of S. aureus genes involved in withstanding temperature stress. We find that many genes previously identified as essential are temperature sensitive and also identify a number of genes that, when disrupted, confer a growth advantage at elevated temperatures. CONCLUSIONS The platform described here reliably provides mutant collections of unparalleled genotypic diversity and will enable a wide range of functional genomic studies in S. aureus.
Collapse
Affiliation(s)
- Marina Santiago
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA, 02115, USA.
| | - Leigh M Matano
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA, 02115, USA.
| | - Samir H Moussa
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA, 02115, USA.
| | - Michael S Gilmore
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA, 02115, USA.
- Department of Ophthalmology, Harvard Medical School, Massachusetts Eye and Ear Infirmary, Boston, MA, 02114, USA.
| | - Suzanne Walker
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA, 02115, USA.
| | - Timothy C Meredith
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA, 02115, USA.
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, 16802, USA.
| |
Collapse
|
103
|
Abstract
Clostridium difficile is the most common cause of antibiotic-associated intestinal infections and a significant cause of morbidity and mortality. Infection with C. difficile requires disruption of the intestinal microbiota, most commonly by antibiotic usage. Therapeutic intervention largely relies on a small number of broad-spectrum antibiotics, which further exacerbate intestinal dysbiosis and leave the patient acutely sensitive to reinfection. Development of novel targeted therapeutic interventions will require a detailed knowledge of essential cellular processes, which represent attractive targets, and species-specific processes, such as bacterial sporulation. Our knowledge of the genetic basis of C. difficile infection has been hampered by a lack of genetic tools, although recent developments have made some headway in addressing this limitation. Here we describe the development of a method for rapidly generating large numbers of transposon mutants in clinically important strains of C. difficile. We validated our transposon mutagenesis approach in a model strain of C. difficile and then generated a comprehensive transposon library in the highly virulent epidemic strain R20291 (027/BI/NAP1) containing more than 70,000 unique mutants. Using transposon-directed insertion site sequencing (TraDIS), we have identified a core set of 404 essential genes, required for growth in vitro. We then applied this technique to the process of sporulation, an absolute requirement for C. difficile transmission and pathogenesis, identifying 798 genes that are likely to impact spore production. The data generated in this study will form a valuable resource for the community and inform future research on this important human pathogen. Clostridium difficile is a common cause of potentially fatal intestinal infections in hospital patients, particularly those who have been treated with antibiotics. Our knowledge of this bacterium has been hampered by a lack of tools for dissecting the organism. We have developed a method to study the function of every gene in the bacterium simultaneously. Using this tool, we have identified a set of 404 genes that are required for growth of the bacteria in the laboratory. C. difficile also produces a highly resistant spore that can survive in the environment for a long time and is a requirement for transmission of the bacteria between patients. We have applied our genetic tool to identify all of the genes required for production of a spore. All of these genes represent attractive targets for new drugs to treat infection.
Collapse
|
104
|
Garcia-Fraile P, Seaman JC, Karunakaran R, Edwards A, Poole PS, Downie JA. Arabinose and protocatechuate catabolism genes are important for growth of Rhizobium leguminosarum biovar viciae in the pea rhizosphere. PLANT AND SOIL 2015; 390:251-264. [PMID: 26166901 PMCID: PMC4495286 DOI: 10.1007/s11104-015-2389-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Accepted: 01/14/2015] [Indexed: 05/29/2023]
Abstract
BACKGROUND AND AIMS To form nitrogen-fixing nodules on pea roots, Rhizobium leguminosarum biovar viciae must be competitive in the rhizosphere. Our aim was to identify genes important for rhizosphere fitness. METHODS Signature-tagged mutants were screened using microarrays to identify mutants reduced for growth in pea rhizospheres. Candidate mutants were assessed relative to controls for growth in minimal medium, growth in pea rhizospheres and for infection of peas in mixed inoculants. Mutated genes were identified by DNA sequencing and confirmed by transduction. RESULTS Of 5508 signature-tagged mutants, microarrays implicated 50 as having decreased rhizosphere fitness. Growth tests identified six mutants with rhizosphere-specific phenotypes. The mutation in one of the genes (araE) was in an arabinose catabolism operon and blocked growth on arabinose. The mutation in another gene (pcaM), encoding a predicted solute binding protein for protocatechuate and hydroxybenzoate uptake, decreased growth on protocatechuate. Both mutants were decreased for nodule infection competitiveness with mixed inoculants, but nodulated peas normally when inoculated alone. Other mutants with similar phenotypes had mutations predicted to affect secondary metabolism. CONCLUSIONS Catabolism of arabinose and protocatechuate in the pea rhizosphere is important for competitiveness of R.l. viciae. Other genes predicted to be involved in secondary metabolism are also important.
Collapse
Affiliation(s)
- Paula Garcia-Fraile
- Department of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
- Present Address: Institute of Microbiology, Academy of Sciences of Czech Republic, Videnska 1083, 142 20 Prague, Czech Republic
| | - Jonathan C. Seaman
- School of Biological Sciences, University of Reading, Reading, RG6 6AJ UK
| | - Ramakrishnan Karunakaran
- Department of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - Anne Edwards
- Department of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - Philip S. Poole
- Department of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
- Present Address: Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, OX1 3RB UK
| | - J. Allan Downie
- Department of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| |
Collapse
|
105
|
Klein BA, Duncan MJ, Hu LT. Defining essential genes and identifying virulence factors of Porphyromonas gingivalis by massively parallel sequencing of transposon libraries (Tn-seq). Methods Mol Biol 2015; 1279:25-43. [PMID: 25636611 PMCID: PMC4824196 DOI: 10.1007/978-1-4939-2398-4_3] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Porphyromonas gingivalis is a keystone pathogen in the development and progression of periodontal disease. Obstacles to the development of saturated transposon libraries have previously limited transposon mutant-based screens as well as essential gene studies. We have developed a system for efficient transposon mutagenesis of P. gingivalis using a modified mariner transposon. Tn-seq is a technique that allows for quantitative assessment of individual mutants within a transposon mutant library by sequencing the transposon-genome junctions and then compiling mutant presence by mapping to a base genome. Using Tn-seq, it is possible to quickly define all the insertional mutants in a library and thus identify nonessential genes under the conditions in which the library was produced. Identification of fitness of individual mutants under specific conditions can be performed by exposing the library to selective pressures.
Collapse
Affiliation(s)
- Brian A Klein
- Graduate Program of Molecular Microbiology, Tufts University School of Medicine, Boston, MA, USA
| | | | | |
Collapse
|
106
|
Gardner PP, Eldai H. Annotating RNA motifs in sequences and alignments. Nucleic Acids Res 2015; 43:691-8. [PMID: 25520192 PMCID: PMC4333381 DOI: 10.1093/nar/gku1327] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Revised: 11/30/2014] [Accepted: 12/05/2014] [Indexed: 11/21/2022] Open
Abstract
RNA performs a diverse array of important functions across all cellular life. These functions include important roles in translation, building translational machinery and maturing messenger RNA. More recent discoveries include the miRNAs and bacterial sRNAs that regulate gene expression, the thermosensors, riboswitches and other cis-regulatory elements that help prokaryotes sense their environment and eukaryotic piRNAs that suppress transposition. However, there can be a long period between the initial discovery of a RNA and determining its function. We present a bioinformatic approach to characterize RNA motifs, which are critical components of many RNA structure-function relationships. These motifs can, in some instances, provide researchers with functional hypotheses for uncharacterized RNAs. Moreover, we introduce a new profile-based database of RNA motifs--RMfam--and illustrate some applications for investigating the evolution and functional characterization of RNA. All the data and scripts associated with this work are available from: https://github.com/ppgardne/RMfam.
Collapse
Affiliation(s)
- Paul P Gardner
- School of Biological Sciences, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand Biomolecular Interaction Centre, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand
| | - Hisham Eldai
- School of Biological Sciences, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand Biomolecular Interaction Centre, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand
| |
Collapse
|
107
|
Yamaichi Y, Chao MC, Sasabe J, Clark L, Davis BM, Yamamoto N, Mori H, Kurokawa K, Waldor MK. High-resolution genetic analysis of the requirements for horizontal transmission of the ESBL plasmid from Escherichia coli O104:H4. Nucleic Acids Res 2015; 43:348-60. [PMID: 25477379 PMCID: PMC4288162 DOI: 10.1093/nar/gku1262] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2014] [Revised: 11/12/2014] [Accepted: 11/17/2014] [Indexed: 12/20/2022] Open
Abstract
Horizontal dissemination of the genes encoding extended spectrum beta-lactamases (ESBLs) via conjugative plasmids is facilitating the increasingly widespread resistance of pathogens to beta-lactam antibiotics. However, there is relatively little known about the regulatory factors and mechanisms that govern the spread of these plasmids. Here, we carried out a high-throughput, transposon insertion site sequencing analysis (TnSeq) to identify genes that enable the maintenance and transmission of pESBL, an R64 (IncI1)-related resistance plasmid that was isolated from Escherichia coli O104:H4 linked to a recent large outbreak of gastroenteritis. With a few exceptions, the majority of the genes identified as required for maintenance and transmission of pESBL matched those of their previously defined R64 counterparts. However, our analyses of the high-density transposon insertion library in pESBL also revealed two very short and linked regions that constitute a previously unrecognized regulatory system controlling spread of IncI1 plasmids. In addition, we investigated the function of the pESBL-encoded M.EcoGIX methyltransferase, which is also encoded by many other IncI1 and IncF plasmids. This enzyme proved to protect pESBL from restriction in new hosts, suggesting it aids in expanding the plasmid's host range. Collectively, our work illustrates the power of the TnSeq approach to enable rapid and comprehensive analyses of plasmid genes and sequences that facilitate the dissemination of determinants of antibiotic resistance.
Collapse
Affiliation(s)
- Yoshiharu Yamaichi
- Division of Infectious Diseases, Brigham and Women's Hospital, Boston, MA 02115, USA
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA
- Institute for Integrative Biology of the Cell (I2BC), CNRS, CEA, Université Paris-Sud, Gif-sur-Yvette 91198, France
| | - Michael C. Chao
- Division of Infectious Diseases, Brigham and Women's Hospital, Boston, MA 02115, USA
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Jumpei Sasabe
- Division of Infectious Diseases, Brigham and Women's Hospital, Boston, MA 02115, USA
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA
- Department of Anatomy, Keio University School of Medicine, Shinjuku-ku, Tokyo160-8582, Japan
| | - Lars Clark
- Division of Infectious Diseases, Brigham and Women's Hospital, Boston, MA 02115, USA
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Brigid M. Davis
- Division of Infectious Diseases, Brigham and Women's Hospital, Boston, MA 02115, USA
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Nozomi Yamamoto
- Earth-Life Science Institute, Tokyo Institute of Technology, Meguro-ku, Tokyo 152-8551, Japan
| | - Hiroshi Mori
- Department of Biological Information, Tokyo Institute of Technology, Yokohama, Kanagawa 226-8501, Japan
| | - Ken Kurokawa
- Earth-Life Science Institute, Tokyo Institute of Technology, Meguro-ku, Tokyo 152-8551, Japan
- Department of Biological Information, Tokyo Institute of Technology, Yokohama, Kanagawa 226-8501, Japan
| | - Matthew K. Waldor
- Division of Infectious Diseases, Brigham and Women's Hospital, Boston, MA 02115, USA
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA
- Howard Hughes Medical Institute, Boston, MA 02115, USA
| |
Collapse
|
108
|
Perry BJ, Yost CK. Construction of a mariner-based transposon vector for use in insertion sequence mutagenesis in selected members of the Rhizobiaceae. BMC Microbiol 2014; 14:298. [PMID: 25433486 PMCID: PMC4255674 DOI: 10.1186/s12866-014-0298-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2014] [Accepted: 11/18/2014] [Indexed: 12/11/2022] Open
Abstract
Background The Rhizobiaceae family of Gram-negative bacteria often engage in symbiosis with plants of economic importance. Historically, genetic studies to identify the function of individual genes, and characterize the biology of these bacteria have relied on the use of classical transposon mutagenesis. To increase the rate of scientific discovery in the Rhizobiaceae there is a need to adapt high-throughput genetic screens like insertion sequencing for use in this family of bacteria. Here we describe a Rhizobiaceae compatible MmeI-adapted mariner transposon that can be used with insertion sequencing for high-throughput genetic screening. Results The newly constructed mariner transposon pSAM_Rl mutagenized R. leguminosarum, S. meliloti, and A. tumefaciens at a high frequency. In R. leguminosarum, mutant pools were generated that saturated 88% of potential mariner insertions sites in the genome. Analysis of the R. leguminosarum transposon insertion sequencing data with a previously described hidden Markov model-based method resulted in assignment of the contribution of all annotated genes in the R. leguminosarum 3841 genome for growth on a complex medium. Good concordance was observed between genes observed to be required for growth on the complex medium, and previous studies. Conclusions The newly described Rhizobiaceaee compatible mariner transposon insertion sequencing vector pSAM_Rl has been shown to mutagenize at a high frequency and to be an effective tool for use in high-throughput genetic screening. The construction and validation of this transposon insertion sequencing tool for use in the Rhizobiziaceae will provide an opportunity for researchers in the Rhizobiaceae community to use high-throughput genetic screening, allowing for significant increase in the rate of genetic discovery, particularly given the recent release of genome sequences from many Rhizobiaceae strains. Electronic supplementary material The online version of this article (doi:10.1186/s12866-014-0298-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Benjamin J Perry
- Department of Biology, University of Regina, 3737 Wascana Parkway, Regina, SK, S4S 0A2, Canada.
| | - Christopher K Yost
- Department of Biology, University of Regina, 3737 Wascana Parkway, Regina, SK, S4S 0A2, Canada.
| |
Collapse
|
109
|
ARTIST: high-resolution genome-wide assessment of fitness using transposon-insertion sequencing. PLoS Genet 2014; 10:e1004782. [PMID: 25375795 PMCID: PMC4222735 DOI: 10.1371/journal.pgen.1004782] [Citation(s) in RCA: 119] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2014] [Accepted: 09/26/2014] [Indexed: 12/20/2022] Open
Abstract
Transposon-insertion sequencing (TIS) is a powerful approach for deciphering genetic requirements for bacterial growth in different conditions, as it enables simultaneous genome-wide analysis of the fitness of thousands of mutants. However, current methods for comparative analysis of TIS data do not adjust for stochastic experimental variation between datasets and are limited to interrogation of annotated genomic elements. Here, we present ARTIST, an accessible TIS analysis pipeline for identifying essential regions that are required for growth under optimal conditions as well as conditionally essential loci that participate in survival only under specific conditions. ARTIST uses simulation-based normalization to model and compensate for experimental noise, and thereby enhances the statistical power in conditional TIS analyses. ARTIST also employs a novel adaptation of the hidden Markov model to generate statistically robust, high-resolution, annotation-independent maps of fitness-linked loci across the entire genome. Using ARTIST, we sensitively and comprehensively define Mycobacterium tuberculosis and Vibrio cholerae loci required for host infection while limiting inclusion of false positive loci. ARTIST is applicable to a broad range of organisms and will facilitate TIS-based dissection of pathways required for microbial growth and survival under a multitude of conditions. Transposon insertion sequencing (TIS) is a powerful method that couples high-density transposon mutagenesis with next-generation sequencing to comprehensively assess the fitness of thousands of transposon mutants across a genome. TIS is an extremely flexible technique that has been used to define genomic loci required for bacterial growth and survival in a variety of species and in many different growth conditions, including during host infection. However, there remain several important limitations to current TIS analysis methods. First, TIS data are not routinely normalized for the impact of experimental variability; second, most analyses are restricted to annotated loci and do not completely exploit the richness of TIS datasets; finally, TIS analysis methods are not easily accessible to most biologists. Here we present a pipeline—ARTIST—that addresses these issues and will transform TIS-based studies. We used ARTIST to conduct robust analyses of Mycobacterium tuberculosis and Vibrio cholerae in vivo TIS datasets and comprehensively defined the genetic requirements of these pathogens for host infection. The ARTIST pipeline will make TIS analysis accessible to many researchers and greatly enhance the rigor of and insights gained from TIS studies in a wide range of microorganisms.
Collapse
|
110
|
Mobegi FM, van Hijum SAFT, Burghout P, Bootsma HJ, de Vries SPW, van der Gaast-de Jongh CE, Simonetti E, Langereis JD, Hermans PWM, de Jonge MI, Zomer A. From microbial gene essentiality to novel antimicrobial drug targets. BMC Genomics 2014; 15:958. [PMID: 25373505 PMCID: PMC4233050 DOI: 10.1186/1471-2164-15-958] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2014] [Accepted: 10/23/2014] [Indexed: 01/12/2023] Open
Abstract
Background Bacterial respiratory tract infections, mainly caused by Streptococcus pneumoniae, Haemophilus influenzae and Moraxella catarrhalis are among the leading causes of global mortality and morbidity. Increased resistance of these pathogens to existing antibiotics necessitates the search for novel targets to develop potent antimicrobials. Result Here, we report a proof of concept study for the reliable identification of potential drug targets in these human respiratory pathogens by combining high-density transposon mutagenesis, high-throughput sequencing, and integrative genomics. Approximately 20% of all genes in these three species were essential for growth and viability, including 128 essential and conserved genes, part of 47 metabolic pathways. By comparing these essential genes to the human genome, and a database of genes from commensal human gut microbiota, we identified and excluded potential drug targets in respiratory tract pathogens that will have off-target effects in the host, or disrupt the natural host microbiota. We propose 249 potential drug targets, 67 of which are targets for 75 FDA-approved antimicrobials and 35 other researched small molecule inhibitors. Two out of four selected novel targets were experimentally validated, proofing the concept. Conclusion Here we have pioneered an attempt in systematically combining the power of high-density transposon mutagenesis, high-throughput sequencing, and integrative genomics to discover potential drug targets at genome-scale. By circumventing the time-consuming and expensive laboratory screens traditionally used to select potential drug targets, our approach provides an attractive alternative that could accelerate the much needed discovery of novel antimicrobials. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-958) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Sacha A F T van Hijum
- Radboud Institute for Molecular Life Sciences, Laboratory of Paediatric Infectious Diseases, Radboud University Medical Centre, Nijmegen 6500 HB, The Netherlands.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
111
|
Lindgreen S, Umu SU, Lai ASW, Eldai H, Liu W, McGimpsey S, Wheeler NE, Biggs PJ, Thomson NR, Barquist L, Poole AM, Gardner PP. Robust identification of noncoding RNA from transcriptomes requires phylogenetically-informed sampling. PLoS Comput Biol 2014; 10:e1003907. [PMID: 25357249 PMCID: PMC4214555 DOI: 10.1371/journal.pcbi.1003907] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Accepted: 09/11/2014] [Indexed: 02/03/2023] Open
Abstract
Noncoding RNAs are integral to a wide range of biological processes, including translation, gene regulation, host-pathogen interactions and environmental sensing. While genomics is now a mature field, our capacity to identify noncoding RNA elements in bacterial and archaeal genomes is hampered by the difficulty of de novo identification. The emergence of new technologies for characterizing transcriptome outputs, notably RNA-seq, are improving noncoding RNA identification and expression quantification. However, a major challenge is to robustly distinguish functional outputs from transcriptional noise. To establish whether annotation of existing transcriptome data has effectively captured all functional outputs, we analysed over 400 publicly available RNA-seq datasets spanning 37 different Archaea and Bacteria. Using comparative tools, we identify close to a thousand highly-expressed candidate noncoding RNAs. However, our analyses reveal that capacity to identify noncoding RNA outputs is strongly dependent on phylogenetic sampling. Surprisingly, and in stark contrast to protein-coding genes, the phylogenetic window for effective use of comparative methods is perversely narrow: aggregating public datasets only produced one phylogenetic cluster where these tools could be used to robustly separate unannotated noncoding RNAs from a null hypothesis of transcriptional noise. Our results show that for the full potential of transcriptomics data to be realized, a change in experimental design is paramount: effective transcriptomics requires phylogeny-aware sampling. We have analysed more than 400 public transcriptomes, generated using RNA-seq, from almost 40 strains of Bacteria and Archaea. We discovered that the capacity to identify noncoding RNA outputs from this data is strongly dependent on phylogenetic sampling. Our results show that, for the full potential of transcriptomics data as a discovery tool to be realized, a change in experimental design is critical: effective comparative transcriptomics requires phylogeny-aware sampling. We also examined how comparative transcriptomics experiments can be used to effectively identify RNA elements. We find that, for RNA element discovery, a phylogeny-informed sampling approach is more effective than analyses of individual species. Phylogeny-informed sampling reveals a narrow ‘Goldilocks Zone’ (where species are not too similar and not too divergent) for RNA identification using clusters of related species. In stark contrast to protein-coding genes, not only is the phylogenetic window for the effective use of comparative methods for noncoding RNA identification perversely narrow, but few existing datasets sit within this Goldilocks Zone: by aggregating public datasets, we were only able to create one phylogenetic cluster where comparative tools could be used to confidently separate unannotated noncoding RNAs from transcriptional noise.
Collapse
MESH Headings
- Archaea/genetics
- Bacteria/genetics
- Cluster Analysis
- Computational Biology
- Databases, Genetic
- Gene Expression Profiling/methods
- Phylogeny
- RNA, Archaeal/chemistry
- RNA, Archaeal/classification
- RNA, Archaeal/genetics
- RNA, Bacterial/chemistry
- RNA, Bacterial/classification
- RNA, Bacterial/genetics
- RNA, Untranslated/chemistry
- RNA, Untranslated/classification
- RNA, Untranslated/genetics
- Transcriptome/genetics
Collapse
Affiliation(s)
- Stinus Lindgreen
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Sinan Uğur Umu
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
- Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand
| | - Alicia Sook-Wei Lai
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Hisham Eldai
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Wenting Liu
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Stephanie McGimpsey
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Nicole E. Wheeler
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Patrick J. Biggs
- Institute of Veterinary, Animal & Biomedical Sciences, Massey University, Palmerston North, New Zealand
- Allan Wilson Centre for Molecular Ecology & Evolution, Massey University, Palmerston North, New Zealand
| | - Nick R. Thomson
- Pathogen Genetics, Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Lars Barquist
- Pathogen Genetics, Wellcome Trust Sanger Institute, Hinxton, United Kingdom
- Institute for Molecular Infection Biology, University of Wuerzburg, Wuerzburg, Germany
| | - Anthony M. Poole
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
- Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand
- Allan Wilson Centre for Molecular Ecology & Evolution, Massey University, Palmerston North, New Zealand
- * E-mail: (AMP); (PPG)
| | - Paul P. Gardner
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
- Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand
- * E-mail: (AMP); (PPG)
| |
Collapse
|
112
|
Genome-scale metabolic network validation of Shewanella oneidensis using transposon insertion frequency analysis. PLoS Comput Biol 2014; 10:e1003848. [PMID: 25233219 PMCID: PMC4168976 DOI: 10.1371/journal.pcbi.1003848] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2014] [Accepted: 08/07/2014] [Indexed: 01/08/2023] Open
Abstract
Transposon mutagenesis, in combination with parallel sequencing, is becoming a powerful tool for en-masse mutant analysis. A probability generating function was used to explain observed miniHimar transposon insertion patterns, and gene essentiality calls were made by transposon insertion frequency analysis (TIFA). TIFA incorporated the observed genome and sequence motif bias of the miniHimar transposon. The gene essentiality calls were compared to: 1) previous genome-wide direct gene-essentiality assignments; and, 2) flux balance analysis (FBA) predictions from an existing genome-scale metabolic model of Shewanella oneidensis MR-1. A three-way comparison between FBA, TIFA, and the direct essentiality calls was made to validate the TIFA approach. The refinement in the interpretation of observed transposon insertions demonstrated that genes without insertions are not necessarily essential, and that genes that contain insertions are not always nonessential. The TIFA calls were in reasonable agreement with direct essentiality calls for S. oneidensis, but agreed more closely with E. coli essentiality calls for orthologs. The TIFA gene essentiality calls were in good agreement with the MR-1 FBA essentiality predictions, and the agreement between TIFA and FBA predictions was substantially better than between the FBA and the direct gene essentiality predictions.
Collapse
|
113
|
Peng C, Gao F. Protein localization analysis of essential genes in prokaryotes. Sci Rep 2014; 4:6001. [PMID: 25105358 PMCID: PMC4126397 DOI: 10.1038/srep06001] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 07/22/2014] [Indexed: 01/27/2023] Open
Abstract
Essential genes, those critical for the survival of an organism under certain conditions, play a significant role in pharmaceutics and synthetic biology. Knowledge of protein localization is invaluable for understanding their function as well as the interaction of different proteins. However, systematical examination of essential genes from the aspect of the localizations of proteins they encode has not been explored before. Here, a comprehensive protein localization analysis of essential genes in 27 prokaryotes including 24 bacteria, 2 mycoplasmas and 1 archaeon has been performed. Both statistical analysis of localization information in these genomes and GO (Gene Ontology) terms enriched in the essential genes show that proteins encoded by essential genes are enriched in internal location sites, while exist in cell envelope with a lower proportion compared with non-essential ones. Meanwhile, there are few essential proteins in the external subcellular location sites such as flagellum and fimbrium, and proteins encoded by non-essential genes tend to have diverse localizations. These results would provide further insights into the understanding of fundamental functions needed to support a cellular life and improve gene essentiality prediction by taking the protein localization and enriched GO terms into consideration.
Collapse
Affiliation(s)
- Chong Peng
- Department of Physics, Tianjin University, Tianjin 300072, China
| | - Feng Gao
- Department of Physics, Tianjin University, Tianjin 300072, China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin University, Tianjin 300072, China
- Collaborative Innovation Center of Chemical Science and Engineering, Tianjin 300072, China
| |
Collapse
|
114
|
Combinatorial strategies for improving multiple-stress resistance in industrially relevant Escherichia coli strains. Appl Environ Microbiol 2014; 80:6223-42. [PMID: 25085490 DOI: 10.1128/aem.01542-14] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
High-cell-density fermentation for industrial production of chemicals can impose numerous stresses on cells due to high substrate, product, and by-product concentrations; high osmolarity; reactive oxygen species; and elevated temperatures. There is a need to develop platform strains of industrial microorganisms that are more tolerant toward these typical processing conditions. In this study, the growth of six industrially relevant strains of Escherichia coli was characterized under eight stress conditions representative of fed-batch fermentation, and strains W and BL21(DE3) were selected as platforms for transposon (Tn) mutagenesis due to favorable resistance characteristics. Selection experiments, followed by either targeted or genome-wide next-generation-sequencing-based Tn insertion site determination, were performed to identify mutants with improved growth properties under a subset of three stress conditions and two combinations of individual stresses. A subset of the identified loss-of-function mutants were selected for a combinatorial approach, where strains with combinations of two and three gene deletions were systematically constructed and tested for single and multistress resistance. These approaches allowed identification of (i) strain-background-specific stress resistance phenotypes, (ii) novel gene deletion mutants in E. coli that confer single and multistress resistance in a strain-background-dependent manner, and (iii) synergistic effects of multiple gene deletions that confer improved resistance over single deletions. The results of this study underscore the suboptimality and strain-specific variability of the genetic network regulating growth under stressful conditions and suggest that further exploration of the combinatorial gene deletion space in multiple strain backgrounds is needed for optimizing strains for microbial bioprocessing applications.
Collapse
|
115
|
McAdam PR, Richardson EJ, Fitzgerald JR. High-throughput sequencing for the study of bacterial pathogen biology. Curr Opin Microbiol 2014; 19:106-113. [PMID: 25033019 PMCID: PMC4150483 DOI: 10.1016/j.mib.2014.06.002] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2014] [Revised: 06/02/2014] [Accepted: 06/07/2014] [Indexed: 12/23/2022]
Abstract
A revolution in sequencing technologies in recent years has led to dramatically increased throughput and reduced cost of bacterial genome sequencing. An increasing number of applications of the new technologies are providing broad insights into bacterial evolution, epidemiology, and pathogenesis. For example, the capacity to sequence large numbers of bacterial isolates is enabling high resolution phylogenetic analyses of bacterial populations leading to greatly enhanced understanding of the emergence, adaptation, and transmission of pathogenic clones. In addition, RNA-seq offers improved quantification and resolution for transcriptomic analysis, and the combination of high-throughput sequencing with transposon mutagenesis is a powerful approach for the identification of bacterial determinants required for survival in vivo. In this concise review we provide selected examples of how high throughput sequencing is being applied to understand the biology of bacterial pathogens, and discuss future technological advances likely to have a profound impact on the field.
Collapse
Affiliation(s)
- Paul R McAdam
- The Roslin Institute and Edinburgh Infectious Diseases, University of Edinburgh, Easter Bush Campus, Edinburgh EH25 9RG, United Kingdom
| | - Emily J Richardson
- The Roslin Institute and Edinburgh Infectious Diseases, University of Edinburgh, Easter Bush Campus, Edinburgh EH25 9RG, United Kingdom
| | - J Ross Fitzgerald
- The Roslin Institute and Edinburgh Infectious Diseases, University of Edinburgh, Easter Bush Campus, Edinburgh EH25 9RG, United Kingdom.
| |
Collapse
|
116
|
Rusmini R, Vecchietti D, Macchi R, Vidal-Aroca F, Bertoni G. A shotgun antisense approach to the identification of novel essential genes in Pseudomonas aeruginosa. BMC Microbiol 2014; 14:24. [PMID: 24499134 PMCID: PMC3922391 DOI: 10.1186/1471-2180-14-24] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2013] [Accepted: 01/23/2014] [Indexed: 12/29/2022] Open
Abstract
Background Antibiotics in current use target a surprisingly small number of cellular functions: cell wall, DNA, RNA, and protein biosynthesis. Targeting of novel essential pathways is expected to play an important role in the discovery of new antibacterial agents against bacterial pathogens, such as Pseudomonas aeruginosa, that are difficult to control because of their ability to develop resistance, often multiple, to all current classes of clinical antibiotics. Results We aimed to identify novel essential genes in P. aeruginosa by shotgun antisense screening. This technique was developed in Staphylococcus aureus and, following a period of limited success in Gram-negative bacteria, has recently been used effectively in Escherichia coli. To also target low expressed essential genes, we included some variant steps that were expected to overcome the non-stringent regulation of the promoter carried by the expression vector used for the shotgun antisense libraries. Our antisense screenings identified 33 growth-impairing single-locus genomic inserts that allowed us to generate a list of 28 “essential-for-growth” genes: five were “classical” essential genes involved in DNA replication, transcription, translation, and cell division; seven were already reported as essential in other bacteria; and 16 were “novel” essential genes with no homologs reported to have an essential role in other bacterial species. Interestingly, the essential genes in our panel were suggested to take part in a broader range of cellular functions than those currently targeted by extant antibiotics, namely protein secretion, biosynthesis of cofactors, prosthetic groups and carriers, energy metabolism, central intermediary metabolism, transport of small molecules, translation, post-translational modification, non-ribosomal peptide synthesis, lipopolysaccharide synthesis/modification, and transcription regulation. This study also identified 43 growth-impairing inserts carrying multiple loci targeting 105 genes, of which 25 have homologs reported as essential in other bacteria. Finally, four multigenic growth-impairing inserts belonged to operons that have never been reported to play an essential role. Conclusions For the first time in P. aeruginosa, we applied regulated antisense RNA expression and showed the feasibility of this technology for the identification of novel essential genes.
Collapse
Affiliation(s)
| | | | | | | | - Giovanni Bertoni
- Department of Life Sciences, Università degli Studi di Milano, via Celoria 26, 20133 Milan, Italy.
| |
Collapse
|
117
|
Luo H, Lin Y, Gao F, Zhang CT, Zhang R. DEG 10, an update of the database of essential genes that includes both protein-coding genes and noncoding genomic elements. Nucleic Acids Res 2013; 42:D574-80. [PMID: 24243843 PMCID: PMC3965060 DOI: 10.1093/nar/gkt1131] [Citation(s) in RCA: 401] [Impact Index Per Article: 33.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The combination of high-density transposon-mediated mutagenesis and high-throughput sequencing has led to significant advancements in research on essential genes, resulting in a dramatic increase in the number of identified prokaryotic essential genes under diverse conditions and a revised essential-gene concept that includes all essential genomic elements, rather than focusing on protein-coding genes only. DEG 10, a new release of the Database of Essential Genes (available at http://www.essentialgene.org), has been developed to accommodate these quantitative and qualitative advancements. In addition to increasing the number of bacterial and archaeal essential genes determined by genome-wide gene essentiality screens, DEG 10 also harbors essential noncoding RNAs, promoters, regulatory sequences and replication origins. These essential genomic elements are determined not only in vitro, but also in vivo, under diverse conditions including those for survival, pathogenesis and antibiotic resistance. We have developed customizable BLAST tools that allow users to perform species- and experiment-specific BLAST searches for a single gene, a list of genes, annotated or unannotated genomes. Therefore, DEG 10 includes essential genomic elements under different conditions in three domains of life, with customizable BLAST tools.
Collapse
Affiliation(s)
- Hao Luo
- Department of Physics, Tianjin University, Tianjin 300072, People's Republic of China and Center for Molecular Medicine and Genetics, School of Medicine, Wayne State University, Detroit 48201, USA
| | | | | | | | | |
Collapse
|
118
|
Phan MD, Peters KM, Sarkar S, Lukowski SW, Allsopp LP, Moriel DG, Achard MES, Totsika M, Marshall VM, Upton M, Beatson SA, Schembri MA. The serum resistome of a globally disseminated multidrug resistant uropathogenic Escherichia coli clone. PLoS Genet 2013; 9:e1003834. [PMID: 24098145 PMCID: PMC3789825 DOI: 10.1371/journal.pgen.1003834] [Citation(s) in RCA: 143] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2013] [Accepted: 08/12/2013] [Indexed: 01/01/2023] Open
Abstract
Escherichia coli ST131 is a globally disseminated, multidrug resistant clone responsible for a high proportion of urinary tract and bloodstream infections. The rapid emergence and successful spread of E. coli ST131 is strongly associated with antibiotic resistance; however, this phenotype alone is unlikely to explain its dominance amongst multidrug resistant uropathogens circulating worldwide in hospitals and the community. Thus, a greater understanding of the molecular mechanisms that underpin the fitness of E. coli ST131 is required. In this study, we employed hyper-saturated transposon mutagenesis in combination with multiplexed transposon directed insertion-site sequencing to define the essential genes required for in vitro growth and the serum resistome (i.e. genes required for resistance to human serum) of E. coli EC958, a representative of the predominant E. coli ST131 clonal lineage. We identified 315 essential genes in E. coli EC958, 231 (73%) of which were also essential in E. coli K-12. The serum resistome comprised 56 genes, the majority of which encode membrane proteins or factors involved in lipopolysaccharide (LPS) biosynthesis. Targeted mutagenesis confirmed a role in serum resistance for 46 (82%) of these genes. The murein lipoprotein Lpp, along with two lipid A-core biosynthesis enzymes WaaP and WaaG, were most strongly associated with serum resistance. While LPS was the main resistance mechanism defined for E. coli EC958 in serum, the enterobacterial common antigen and colanic acid also impacted on this phenotype. Our analysis also identified a novel function for two genes, hyxA and hyxR, as minor regulators of O-antigen chain length. This study offers novel insight into the genetic make-up of E. coli ST131, and provides a framework for future research on E. coli and other Gram-negative pathogens to define their essential gene repertoire and to dissect the molecular mechanisms that enable them to survive in the bloodstream and cause disease. The emergence and rapid dissemination of new bacterial pathogens presents multiple challenges to healthcare systems, including the need for rapid detection, precise diagnostics, effective transmission control and effective treatment. E. coli ST131 is an example of a recently emerged multidrug resistant pathogen that is capable of causing urinary tract and bloodstream infections with limited available treatment options. In order to increase our molecular understanding of E. coli ST131, we developed a high-throughput transposon mutagenesis system in combination with next generation sequencing to test every gene for its essential role in growth and for its contribution to serum resistance. We identified 315 essential genes, 270 of which were conserved among all currently available complete E. coli genomes. Fifty-six genes that define the serum resistome of E. coli ST131 were identified, including genes encoding membrane proteins, proteins involved in LPS biosynthesis, regulators and several novel proteins with previously unknown function. This study therefore provides an inventory of essential and serum resistance genes that could form a framework for the future development of targeted therapeutics to prevent disease caused by multidrug-resistant E. coli ST131 strains.
Collapse
Affiliation(s)
- Minh-Duy Phan
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Kate M. Peters
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Sohinee Sarkar
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Samuel W. Lukowski
- Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia
| | - Luke P. Allsopp
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Danilo Gomes Moriel
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Maud E. S. Achard
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Makrina Totsika
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Vikki M. Marshall
- Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia
| | - Mathew Upton
- Faculty of Medicine and Dentistry, University of Plymouth, Plymouth, United Kingdom
| | - Scott A. Beatson
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Mark A. Schembri
- Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
- * E-mail:
| |
Collapse
|
119
|
Chao MC, Pritchard JR, Zhang YJ, Rubin EJ, Livny J, Davis BM, Waldor MK. High-resolution definition of the Vibrio cholerae essential gene set with hidden Markov model-based analyses of transposon-insertion sequencing data. Nucleic Acids Res 2013; 41:9033-48. [PMID: 23901011 PMCID: PMC3799429 DOI: 10.1093/nar/gkt654] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The coupling of high-density transposon mutagenesis to high-throughput DNA sequencing (transposon-insertion sequencing) enables simultaneous and genome-wide assessment of the contributions of individual loci to bacterial growth and survival. We have refined analysis of transposon-insertion sequencing data by normalizing for the effect of DNA replication on sequencing output and using a hidden Markov model (HMM)-based filter to exploit heretofore unappreciated information inherent in all transposon-insertion sequencing data sets. The HMM can smooth variations in read abundance and thereby reduce the effects of read noise, as well as permit fine scale mapping that is independent of genomic annotation and enable classification of loci into several functional categories (e.g. essential, domain essential or ‘sick’). We generated a high-resolution map of genomic loci (encompassing both intra- and intergenic sequences) that are required or beneficial for in vitro growth of the cholera pathogen, Vibrio cholerae. This work uncovered new metabolic and physiologic requirements for V. cholerae survival, and by combining transposon-insertion sequencing and transcriptomic data sets, we also identified several novel noncoding RNA species that contribute to V. cholerae growth. Our findings suggest that HMM-based approaches will enhance extraction of biological meaning from transposon-insertion sequencing genomic data.
Collapse
Affiliation(s)
- Michael C Chao
- Division of Infectious Disease, Brigham & Women's Hospital, Boston, MA 02115, USA, Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA, Howard Hughes Medical Institute, Boston, MA 02115, USA, Department of Immunology and Infectious Diseases, Harvard School of Public Health, Boston, MA 02115, USA and Genome Sequencing and Analysis Program, Broad Institute, Cambridge, MA 02142, USA
| | | | | | | | | | | | | |
Collapse
|