1951
|
Luef B, Frischkorn KR, Wrighton KC, Holman HYN, Birarda G, Thomas BC, Singh A, Williams KH, Siegerist CE, Tringe SG, Downing KH, Comolli LR, Banfield JF. Diverse uncultivated ultra-small bacterial cells in groundwater. Nat Commun 2015; 6:6372. [DOI: 10.1038/ncomms7372] [Citation(s) in RCA: 243] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2014] [Accepted: 01/23/2015] [Indexed: 02/04/2023] Open
|
1952
|
Lee YP, Gan HM, Tan MH, Lys I, Page R, Dias Wanigasekera B, Austin CM. The complete mitogenome of the New Zealand freshwater crayfish Paranephrops planifrons White 1842 (Crustacea: Decapoda: Parastacidae). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3333-4. [DOI: 10.3109/19401736.2015.1018209] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Yin Peng Lee
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Selangor, Malaysia,
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia,
| | - Han Ming Gan
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Selangor, Malaysia,
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia,
| | - Mun Hua Tan
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Selangor, Malaysia,
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia,
| | - Isabelle Lys
- School of Science, Faculty of Health Sciences, Australian Catholic University, Banyo, Queensland, Australia, and
| | - Rachel Page
- School of Food and Nutrition, Massey University, Wellington, New Zealand
| | | | - Christopher M. Austin
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Selangor, Malaysia,
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia,
| |
Collapse
|
1953
|
Gan HY, Gan HM, Lee YP, Austin CM. The complete mitogenome of the Australian freshwater shrimp Paratya australiensis Kemp, 1917 (Crustacea: Decapoda: Atyidae). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3157-8. [PMID: 25693707 DOI: 10.3109/19401736.2015.1007312] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The mitochondrial genome sequence of the Australian freshwater shrimp, Paratya australiensis, is presented, which is the fourth for genera of the superfamily Atyoidea and the first atyid from the southern hemisphere. The base composition of the P. australiensis, mitogenome is 33.55% for T, 18.24% for C, 35.16% for A, and 13.06% for G, with an AT bias of 71.58%. It has a mitogenome of 15,990 base pairs comprised of 13 protein-coding, 2 ribosomal subunit and 22 transfer RNAs genes and a non-coding AT-rich region. The mitogenome gene order for the species is typical for atyid shrimps, which conform to the primitive pan crustacean model.
Collapse
Affiliation(s)
- Huan You Gan
- a School of Science, Monash University, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility , Monash University, Jalan Lagoon Selatan , Selangor , Malaysia
| | - Han Ming Gan
- a School of Science, Monash University, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility , Monash University, Jalan Lagoon Selatan , Selangor , Malaysia
| | - Yin Peng Lee
- a School of Science, Monash University, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility , Monash University, Jalan Lagoon Selatan , Selangor , Malaysia
| | - Christopher M Austin
- a School of Science, Monash University, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility , Monash University, Jalan Lagoon Selatan , Selangor , Malaysia
| |
Collapse
|
1954
|
Gan HY, Gan HM, Lee YP, Austin CM. The complete mitogenome of the rock pool prawn Palaemon serenus (Heller, 1862) (Crustacea: Decapoda: Palaemonidae). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3155-6. [PMID: 25693708 DOI: 10.3109/19401736.2015.1007311] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The mitochondrial genome of the rock pool prawn (Palaemon serenus), is sequenced, making it the third for genera of the family Palaemonidae and the first for the genus Palaemon. The mitogenome is 15,967 base pairs in length and comprises 13 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs and a non-coding AT-rich region. The P. serenus mitogenome has an AT bias of 58.97% and a base composition of 29.79% for T, 24.14% for C, 29.18% for A, and 16.89% for G. The mitogenome gene order of P. serenus is identical to Exopalaemon carinicauda.
Collapse
Affiliation(s)
- Huan You Gan
- a Genomics Facility , Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia and.,b School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia
| | - Han Ming Gan
- a Genomics Facility , Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia and.,b School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia
| | - Yin Peng Lee
- a Genomics Facility , Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia and.,b School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia
| | - Christopher M Austin
- a Genomics Facility , Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia and.,b School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia
| |
Collapse
|
1955
|
Castelle CJ, Wrighton KC, Thomas BC, Hug LA, Brown CT, Wilkins MJ, Frischkorn KR, Tringe SG, Singh A, Markillie LM, Taylor RC, Williams KH, Banfield JF. Genomic expansion of domain archaea highlights roles for organisms from new phyla in anaerobic carbon cycling. Curr Biol 2015; 25:690-701. [PMID: 25702576 DOI: 10.1016/j.cub.2015.01.014] [Citation(s) in RCA: 350] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2014] [Revised: 12/30/2014] [Accepted: 01/06/2015] [Indexed: 01/23/2023]
Abstract
BACKGROUND Archaea represent a significant fraction of Earth's biodiversity, yet they remain much less well understood than Bacteria. Gene surveys, a few metagenomic studies, and some single-cell sequencing projects have revealed numerous little-studied archaeal phyla. Certain lineages appear to branch deeply and may be part of a major phylum radiation. The structure of this radiation and the physiology of the organisms remain almost unknown. RESULTS We used genome-resolved metagenomic analyses to investigate the diversity, genomes sizes, metabolic capacities, and potential roles of Archaea in terrestrial subsurface biogeochemical cycles. We sequenced DNA from complex sediment and planktonic consortia from an aquifer adjacent to the Colorado River (USA) and reconstructed the first complete genomes for Archaea using cultivation-independent methods. To provide taxonomic context, we analyzed an additional 151 newly sampled archaeal sequences. We resolved two new phyla within a major, apparently deep-branching group of phyla (a superphylum). The organisms have small genomes, and metabolic predictions indicate that their primary contributions to Earth's biogeochemical cycles involve carbon and hydrogen metabolism, probably associated with symbiotic and/or fermentation-based lifestyles. CONCLUSIONS The results dramatically expand genomic sampling of the domain Archaea and clarify taxonomic designations within a major superphylum. This study, in combination with recently published work on bacterial phyla lacking cultivated representatives, reveals a fascinating phenomenon of major radiations of organisms with small genomes, novel proteome composition, and strong interdependence in both domains.
Collapse
Affiliation(s)
- Cindy J Castelle
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720 USA
| | - Kelly C Wrighton
- Department of Microbiology, The Ohio State University, Columbus, OH 43210 USA
| | - Brian C Thomas
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720 USA
| | - Laura A Hug
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720 USA
| | - Christopher T Brown
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720 USA
| | - Michael J Wilkins
- Department of Microbiology, The Ohio State University, Columbus, OH 43210 USA; School of Earth Sciences, The Ohio State University, Columbus, OH 43210 USA
| | - Kyle R Frischkorn
- Department of Earth and Environmental Sciences and the Lamont-Doherty Earth Observatory, Columbia University, Palisades, New York, NY 10964 USA
| | - Susannah G Tringe
- Metagenome Program, DOE Joint Genome Institute, Walnut Creek, CA 94598 USA
| | - Andrea Singh
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720 USA
| | - Lye Meng Markillie
- Environmental Molecular Sciences Laboratory, Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | - Ronald C Taylor
- Environmental Molecular Sciences Laboratory, Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | - Kenneth H Williams
- Earth Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Jillian F Banfield
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720 USA; Department of Environmental Science, Policy, and Management, University of California, Berkeley, CA 94720 USA.
| |
Collapse
|
1956
|
Zhong C, Yang Y, Yooseph S. GRASP: guided reference-based assembly of short peptides. Nucleic Acids Res 2015; 43:e18. [PMID: 25414351 PMCID: PMC4330339 DOI: 10.1093/nar/gku1210] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2014] [Revised: 11/01/2014] [Accepted: 11/05/2014] [Indexed: 12/22/2022] Open
Abstract
Protein sequences predicted from metagenomic datasets are annotated by identifying their homologs via sequence comparisons with reference or curated proteins. However, a majority of metagenomic protein sequences are partial-length, arising as a result of identifying genes on sequencing reads or on assembled nucleotide contigs, which themselves are often very fragmented. The fragmented nature of metagenomic protein predictions adversely impacts homology detection and, therefore, the quality of the overall annotation of the dataset. Here we present a novel algorithm called GRASP that accurately identifies the homologs of a given reference protein sequence from a database consisting of partial-length metagenomic proteins. Our homology detection strategy is guided by the reference sequence, and involves the simultaneous search and assembly of overlapping database sequences. GRASP was compared to three commonly used protein sequence search programs (BLASTP, PSI-BLAST and FASTM). Our evaluations using several simulated and real datasets show that GRASP has a significantly higher sensitivity than these programs while maintaining a very high specificity. GRASP can be a very useful program for detecting and quantifying taxonomic and protein family abundances in metagenomic datasets. GRASP is implemented in GNU C++, and is freely available at http://sourceforge.net/projects/grasp-release.
Collapse
Affiliation(s)
- Cuncong Zhong
- Informatics Department, J. Craig Venter Institute, La Jolla, CA 92037, USA
| | - Youngik Yang
- Informatics Department, J. Craig Venter Institute, La Jolla, CA 92037, USA
| | - Shibu Yooseph
- Informatics Department, J. Craig Venter Institute, La Jolla, CA 92037, USA
| |
Collapse
|
1957
|
Ellison CE, Bachtrog D. Non-allelic gene conversion enables rapid evolutionary change at multiple regulatory sites encoded by transposable elements. eLife 2015; 4. [PMID: 25688566 PMCID: PMC4384637 DOI: 10.7554/elife.05899] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2014] [Accepted: 02/16/2015] [Indexed: 12/02/2022] Open
Abstract
Transposable elements (TEs) allow rewiring of regulatory networks, and the recent amplification of the ISX element dispersed 77 functional but suboptimal binding sites for the dosage compensation complex to a newly formed X chromosome in Drosophila. Here we identify two linked refining mutations within ISX that interact epistatically to increase binding affinity to the dosage compensation complex. Selection has increased the frequency of this derived haplotype in the population, which is fixed at 30% of ISX insertions and polymorphic among another 41%. Sharing of this haplotype indicates that high levels of gene conversion among ISX elements allow them to ‘crowd-source’ refining mutations, and a refining mutation that occurs at any single ISX element can spread in two dimensions: horizontally across insertion sites by non-allelic gene conversion, and vertically through the population by natural selection. These results describe a novel route by which fully functional regulatory elements can arise rapidly from TEs and implicate non-allelic gene conversion as having an important role in accelerating the evolutionary fine-tuning of regulatory networks. DOI:http://dx.doi.org/10.7554/eLife.05899.001 Mutations change genes and provide the raw material for evolution. Genes are sections of DNA that contain the instructions for making proteins or other molecules, and so determine the physical characteristics of each organism. Genetic mutations that increase an organism's number of offspring and chances of survival are more likely to be passed on to future generations. Changes to when or where a gene is switched on (so-called regulatory mutations) can also provide fitness benefits and can therefore be selected for during evolution. Transposable elements are sequences of DNA that are also called ‘jumping genes’ because they can make copies of themselves and these copies of the transposable element can move to other locations in the genome. Some transposable elements contain sequences that switch on nearby genes. If different copies of a transposable element that contains such a regulatory sequence insert themselves in more than one place, it can result in a network of genes that can all be controlled in the same way. The regulatory sequences contained within transposable elements are not always optimal, but they can be fine-tuned through evolution. A fruit fly called Drosophila miranda has a transposable element called ISX that has, over time, placed up to 77 regulatory sequences around one of this species' sex chromosomes. Just as in humans, female flies are XX and males are XY; but having only one copy of the X chromosome means that male flies need to increase the expression of certain genes to produce a full-dose of the molecules made by the genes. This process is called dosage compensation and in 2013 the 77 ISX regulatory sequences on the fruit fly's X chromosome were shown to help recruit the molecular machinery that carries out dosage compensation to nearby genes, albeit inefficiently. Now Ellison and Bachtrog—who also conducted the 2013 study—report how these transposable elements have been fine-tuned to make them more effective for dosage compensation. Ellison and Bachtrog uncovered two mutations that make the ISX transposable element better at recruiting the dosage compensation molecular machinery. ISX spread around different locations along the fly's X chromosome before these mutations arose; this means that initially none of the 77 insertions carried the two mutations, but now 30% of the 77 elements have the mutations in all flies, and 41% have them in only some flies. The same mutations have spread between the different ISX elements because transposable elements with the mutations have been used to directly convert other ISX elements without them. These mutations have also become more common in the fruit fly population by being passed on to offspring and increasing their survival. These two routes have accelerated the fine-tuning of these transposable elements for use in gene regulation. This implies that regulatory sequences derived from transposable elements evolve in a way that is fundamentally different from those that arise by other means, as the direct conversion between these insertions allows fine-tuning mutations to spread more rapidly. DOI:http://dx.doi.org/10.7554/eLife.05899.002
Collapse
Affiliation(s)
- Christopher E Ellison
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Doris Bachtrog
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| |
Collapse
|
1958
|
|
1959
|
Genomes of planktonic Acidimicrobiales: widening horizons for marine Actinobacteria by metagenomics. mBio 2015; 6:mBio.02083-14. [PMID: 25670777 PMCID: PMC4337565 DOI: 10.1128/mbio.02083-14] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The genomes of four novel marine Actinobacteria have been assembled from large metagenomic data sets derived from the Mediterranean deep chlorophyll maximum (DCM). These are the first marine representatives belonging to the order Acidimicrobiales and only the second group of planktonic marine Actinobacteria to be described. Their streamlined genomes and photoheterotrophic lifestyle suggest that they are planktonic, free-living microbes. A novel rhodopsin clade, acidirhodopsins, related to freshwater actinorhodopsins, was found in these organisms. Their genomes suggest a capacity to assimilate C2 compounds, some using the glyoxylate bypass and others with the ethylmalonyl-coenzyme A (CoA) pathway. They are also able to derive energy from dimethylsulfopropionate (DMSP), sulfonate, and carbon monoxide oxidation, all commonly available in the marine habitat. These organisms appear to be prevalent in the deep photic zone at or around the DCM. The presence of sister clades to the marine Acidimicrobiales in freshwater aquatic habitats provides a new example of marine-freshwater transitions with potential evolutionary insights. Despite several studies showing the importance and abundance of planktonic Actinobacteria in the marine habitat, a representative genome was only recently described. In order to expand the genomic repertoire of marine Actinobacteria, we describe here the first Acidimicrobidae genomes of marine origin and provide insights about their ecology. They display metabolic versatility in the acquisition of carbon and appear capable of utilizing diverse sources of energy. One of the genomes harbors a new kind of rhodopsin related to the actinorhodopsin clade of freshwater origin that is widespread in the oceans. Our data also support their preference to inhabit the deep chlorophyll maximum and the deep photic zone. This work contributes to the perception of marine actinobacterial groups as important players in the marine environment with distinct and important contributions to nutrient cycling in the oceans.
Collapse
|
1960
|
Gan HM, Gan HY, Lee YP, Grandjean F, Austin CM. The complete mitogenome of the invasive spiny-cheek crayfish Orconectes limosus (Rafinesque, 1817) (Crustacea: Decapoda: Cambaridae). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3181-3. [DOI: 10.3109/19401736.2015.1007326] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Affiliation(s)
- Han Ming Gan
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Selangor, Malaysia,
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Selangor, Malaysia, and
| | - Huan You Gan
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Selangor, Malaysia,
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Selangor, Malaysia, and
| | - Yin Peng Lee
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Selangor, Malaysia,
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Selangor, Malaysia, and
| | - Frederic Grandjean
- Laboratoire Ecologie et Biologie des Interactions – UMR CNRS 7267 Equipe Ecologie Evolution Symbiose, Poitiers Cedex, France
| | - Christopher M. Austin
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Selangor, Malaysia,
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Selangor, Malaysia, and
| |
Collapse
|
1961
|
Gan HM, Tan MH, Gan HY, Lee YP, Austin CM. The complete mitogenome of the Norway lobster Nephrops norvegicus (Linnaeus, 1758) (Crustacea: Decapoda: Nephropidae). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3179-80. [PMID: 25648918 DOI: 10.3109/19401736.2015.1007325] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
The clawed lobster Nephrops norvegicus is an important commercial species in European waters. We have sequenced the complete mitochondrial genome of the species from a partial genome scan using Next-Gen sequencing. The N. norvegicus has a mitogenome of 16,132 base pairs (71.22% A+ T content) comprising 13 protein-coding genes, 2 ribosomal subunit genes, 21 transfer RNAs, and a putative 1259 bp non-coding AT-rich region. This mitogenome is the second fully characterized for the family Nephropidae and the first for the genus Nephrops. The mitogenome gene order is identical to the Maine lobster, Homarus americanus with the exception of the possible loss of the trnI gene.
Collapse
Affiliation(s)
- Han Ming Gan
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan , Selangor , Malaysia
| | - Mun Hua Tan
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan , Selangor , Malaysia
| | - Huan You Gan
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan , Selangor , Malaysia
| | - Yin Peng Lee
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan , Selangor , Malaysia
| | - Christopher M Austin
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Selangor , Malaysia and.,b Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan , Selangor , Malaysia
| |
Collapse
|
1962
|
Gan HM, Gan HY, Tan MH, Penny SS, Willan RC, Austin CM. The complete mitogenome of the giant clam Tridacna squamosa (Heterodonta: Bivalvia: Tridacnidae). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3220-1. [PMID: 25648928 DOI: 10.3109/19401736.2015.1007355] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The complete mitochondrial genome of the commercially and ecologically important and internationally vulnerable giant clam Tridacna squamosa was recovered by genome skimming using the MiSeq platform. The T. squamosa mitogenome has 20,930 base pairs (62.35% A+T content) and is made up of 12 protein-coding genes, 2 ribosomal subunit genes, 24 transfer RNAs, and a 2594 bp non-coding AT-rich region. The mitogenome has a relatively large insertion in the atp6 gene. This is the first mitogenome to be sequenced from the genus Tridacna, and the family Tridacnidae and represents a new gene order.
Collapse
Affiliation(s)
- Han Ming Gan
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia .,b Genomics Facility, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia
| | - Huan You Gan
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia .,b Genomics Facility, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia
| | - Mun Hua Tan
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia .,b Genomics Facility, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia
| | - Shane S Penny
- c Research Institute for the Environment and Livelihoods, Charles Darwin University , Darwin , Australia , and
| | - Richard C Willan
- d Museum and Art Gallery of the Northern Territory , Darwin , Australia
| | - Christopher M Austin
- a School of Science, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia .,b Genomics Facility, Monash University Malaysia, Jalan Lagoon Selatan , Petaling Jaya , Selangor , Malaysia .,c Research Institute for the Environment and Livelihoods, Charles Darwin University , Darwin , Australia , and.,d Museum and Art Gallery of the Northern Territory , Darwin , Australia
| |
Collapse
|
1963
|
Hug LA, Thomas BC, Brown CT, Frischkorn KR, Williams KH, Tringe SG, Banfield JF. Aquifer environment selects for microbial species cohorts in sediment and groundwater. ISME JOURNAL 2015; 9:1846-56. [PMID: 25647349 DOI: 10.1038/ismej.2015.2] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Revised: 12/17/2014] [Accepted: 12/23/2014] [Indexed: 12/11/2022]
Abstract
Little is known about the biogeography or stability of sediment-associated microbial community membership because these environments are biologically complex and generally difficult to sample. High-throughput-sequencing methods provide new opportunities to simultaneously genomically sample and track microbial community members across a large number of sampling sites or times, with higher taxonomic resolution than is associated with 16 S ribosomal RNA gene surveys, and without the disadvantages of primer bias and gene copy number uncertainty. We characterized a sediment community at 5 m depth in an aquifer adjacent to the Colorado River and tracked its most abundant 133 organisms across 36 different sediment and groundwater samples. We sampled sites separated by centimeters, meters and tens of meters, collected on seven occasions over 6 years. Analysis of 1.4 terabase pairs of DNA sequence showed that these 133 organisms were more consistently detected in saturated sediments than in samples from the vadose zone, from distant locations or from groundwater filtrates. Abundance profiles across aquifer locations and from different sampling times identified organism cohorts that comprised subsets of the 133 organisms that were consistently associated. The data suggest that cohorts are partly selected for by shared environmental adaptation.
Collapse
Affiliation(s)
- Laura A Hug
- Department of Earth and Planetary Science, UC Berkeley, Berkeley, CA, USA
| | - Brian C Thomas
- Department of Earth and Planetary Science, UC Berkeley, Berkeley, CA, USA
| | | | - Kyle R Frischkorn
- Department of Earth and Environmental Science, Columbia University, New York, NY, USA
| | - Kenneth H Williams
- Geophysics Department, Earth Sciences Division, Lawrence Berkeley National Lab, Berkeley, CA, USA
| | - Susannah G Tringe
- Metagenome Program, DOE Joint Genome Institute, Walnut Creek, CA, USA
| | - Jillian F Banfield
- 1] Department of Earth and Planetary Science, UC Berkeley, Berkeley, CA, USA [2] Department of Environmental Science, Policy, and Management, Berkeley, CA, USA
| |
Collapse
|
1964
|
Wawrzyniak I, Courtine D, Osman M, Hubans-Pierlot C, Cian A, Nourrisson C, Chabe M, Poirier P, Bart A, Polonais V, Delgado-Viscogliosi P, El Alaoui H, Belkorchia A, van Gool T, Tan KSW, Ferreira S, Viscogliosi E, Delbac F. Draft genome sequence of the intestinal parasite Blastocystis subtype 4-isolate WR1. GENOMICS DATA 2015; 4:22-3. [PMID: 26484170 PMCID: PMC4535960 DOI: 10.1016/j.gdata.2015.01.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Revised: 01/23/2015] [Accepted: 01/24/2015] [Indexed: 11/27/2022]
Abstract
The intestinal protistan parasite Blastocystis is characterized by an extensive genetic variability with 17 subtypes (ST1–ST17) described to date. Only the whole genome of a human ST7 isolate was previously sequenced. Here we report the draft genome sequence of Blastocystis ST4-WR1 isolated from a laboratory rodent at Singapore.
Collapse
Affiliation(s)
- Ivan Wawrzyniak
- Clermont Université, Université Blaise Pascal-Université d'Auvergne-CNRS, UMR 6023 Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France
| | - Damien Courtine
- Clermont Université, Université Blaise Pascal-Université d'Auvergne-CNRS, UMR 6023 Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France
| | - Marwan Osman
- Institut Pasteur de Lille, Centre d'Infection et d'Immunité de Lille, Inserm U1019, CNRS UMR 8204, Université Lille, Nord de France, France
| | | | - Amandine Cian
- Institut Pasteur de Lille, Centre d'Infection et d'Immunité de Lille, Inserm U1019, CNRS UMR 8204, Université Lille, Nord de France, France
| | - Céline Nourrisson
- Clermont Université, Université Blaise Pascal-Université d'Auvergne-CNRS, UMR 6023 Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France
| | - Magali Chabe
- Institut Pasteur de Lille, Centre d'Infection et d'Immunité de Lille, Inserm U1019, CNRS UMR 8204, Université Lille, Nord de France, France
| | - Philippe Poirier
- Clermont Université, Université Blaise Pascal-Université d'Auvergne-CNRS, UMR 6023 Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France
| | - Aldert Bart
- Department of Medical Microbiology, Section Parasitology, Center for Infection and Immunity Amsterdam, Academic Medical Center, Amsterdam, The Netherlands
| | - Valérie Polonais
- Clermont Université, Université Blaise Pascal-Université d'Auvergne-CNRS, UMR 6023 Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France
| | - Pilar Delgado-Viscogliosi
- Institut Pasteur de Lille, Centre d'Infection et d'Immunité de Lille, Inserm U1019, CNRS UMR 8204, Université Lille, Nord de France, France
| | - Hicham El Alaoui
- Clermont Université, Université Blaise Pascal-Université d'Auvergne-CNRS, UMR 6023 Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France
| | - Abdel Belkorchia
- Clermont Université, Université Blaise Pascal-Université d'Auvergne-CNRS, UMR 6023 Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France
| | - Tom van Gool
- Department of Medical Microbiology, Section Parasitology, Center for Infection and Immunity Amsterdam, Academic Medical Center, Amsterdam, The Netherlands
| | - Kevin S W Tan
- Laboratory of Molecular and Cellular Parasitology, Department of Microbiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | | | - Eric Viscogliosi
- Institut Pasteur de Lille, Centre d'Infection et d'Immunité de Lille, Inserm U1019, CNRS UMR 8204, Université Lille, Nord de France, France
| | - Frédéric Delbac
- Clermont Université, Université Blaise Pascal-Université d'Auvergne-CNRS, UMR 6023 Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France
| |
Collapse
|
1965
|
De novo sequences of Haloquadratum walsbyi from Lake Tyrrell, Australia, reveal a variable genomic landscape. ARCHAEA-AN INTERNATIONAL MICROBIOLOGICAL JOURNAL 2015; 2015:875784. [PMID: 25709557 PMCID: PMC4330952 DOI: 10.1155/2015/875784] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Revised: 09/02/2014] [Accepted: 09/16/2014] [Indexed: 11/29/2022]
Abstract
Hypersaline systems near salt saturation levels represent an extreme environment, in which organisms grow and survive near the limits of life. One of the abundant members of the microbial communities in hypersaline systems is the square archaeon, Haloquadratum walsbyi. Utilizing a short-read metagenome from Lake Tyrrell, a hypersaline ecosystem in Victoria, Australia, we performed a comparative genomic analysis of H. walsbyi to better understand the extent of variation between strains/subspecies. Results revealed that previously isolated strains/subspecies do not fully describe the complete repertoire of the genomic landscape present in H. walsbyi. Rearrangements, insertions, and deletions were observed for the Lake Tyrrell derived Haloquadratum genomes and were supported by environmental de novo sequences, including shifts in the dominant genomic landscape of the two most abundant strains. Analysis pertaining to halomucins indicated that homologs for this large protein are not a feature common for all species of Haloquadratum. Further, we analyzed ATP-binding cassette transporters (ABC-type transporters) for evidence of niche partitioning between different strains/subspecies. We were able to identify unique and variable transporter subunits from all five genomes analyzed and the de novo environmental sequences, suggesting that differences in nutrient and carbon source acquisition may play a role in maintaining distinct strains/subspecies.
Collapse
|
1966
|
Chen X, Bracht JR, Goldman AD, Dolzhenko E, Clay DM, Swart EC, Perlman DH, Doak TG, Stuart A, Amemiya CT, Sebra RP, Landweber LF. The architecture of a scrambled genome reveals massive levels of genomic rearrangement during development. Cell 2015; 158:1187-1198. [PMID: 25171416 DOI: 10.1016/j.cell.2014.07.034] [Citation(s) in RCA: 118] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2013] [Revised: 05/18/2014] [Accepted: 07/03/2014] [Indexed: 10/24/2022]
Abstract
Programmed DNA rearrangements in the single-celled eukaryote Oxytricha trifallax completely rewire its germline into a somatic nucleus during development. This elaborate, RNA-mediated pathway eliminates noncoding DNA sequences that interrupt gene loci and reorganizes the remaining fragments by inversions and permutations to produce functional genes. Here, we report the Oxytricha germline genome and compare it to the somatic genome to present a global view of its massive scale of genome rearrangements. The remarkably encrypted genome architecture contains >3,500 scrambled genes, as well as >800 predicted germline-limited genes expressed, and some posttranslationally modified, during genome rearrangements. Gene segments for different somatic loci often interweave with each other. Single gene segments can contribute to multiple, distinct somatic loci. Terminal precursor segments from neighboring somatic loci map extremely close to each other, often overlapping. This genome assembly provides a draft of a scrambled genome and a powerful model for studies of genome rearrangement.
Collapse
Affiliation(s)
- Xiao Chen
- Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
| | - John R Bracht
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA
| | - Aaron David Goldman
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA
| | - Egor Dolzhenko
- Department of Mathematics and Statistics, University of South Florida, Tampa, FL 33620, USA
| | - Derek M Clay
- Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
| | - Estienne C Swart
- Institute of Cell Biology, University of Bern, 3012 Bern, Switzerland
| | - David H Perlman
- Collaborative Proteomics and Mass Spectrometry Center, Molecular Biology Department and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Thomas G Doak
- Department of Biology, University of Indiana, Bloomington, IN 47405, USA
| | - Andrew Stuart
- Benaroya Research Institute at Virginia Mason, Seattle, WA 98101, USA
| | - Chris T Amemiya
- Benaroya Research Institute at Virginia Mason, Seattle, WA 98101, USA
| | - Robert P Sebra
- Icahn Institute and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Laura F Landweber
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA.
| |
Collapse
|
1967
|
Norman JM, Handley SA, Baldridge MT, Droit L, Liu CY, Keller BC, Kambal A, Monaco CL, Zhao G, Fleshner P, Stappenbeck TS, McGovern DPB, Keshavarzian A, Mutlu EA, Sauk J, Gevers D, Xavier RJ, Wang D, Parkes M, Virgin HW. Disease-specific alterations in the enteric virome in inflammatory bowel disease. Cell 2015; 160:447-60. [PMID: 25619688 DOI: 10.1016/j.cell.2015.01.002] [Citation(s) in RCA: 899] [Impact Index Per Article: 89.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2014] [Revised: 10/13/2014] [Accepted: 12/24/2014] [Indexed: 12/13/2022]
Abstract
Decreases in the diversity of enteric bacterial populations are observed in patients with Crohn's disease (CD) and ulcerative colitis (UC). Less is known about the virome in these diseases. We show that the enteric virome is abnormal in CD and UC patients. In-depth analysis of preparations enriched for free virions in the intestine revealed that CD and UC were associated with a significant expansion of Caudovirales bacteriophages. The viromes of CD and UC patients were disease and cohort specific. Importantly, it did not appear that expansion and diversification of the enteric virome was secondary to changes in bacterial populations. These data support a model in which changes in the virome may contribute to intestinal inflammation and bacterial dysbiosis. We conclude that the virome is a candidate for contributing to, or being a biomarker for, human inflammatory bowel disease and speculate that the enteric virome may play a role in other diseases.
Collapse
Affiliation(s)
- Jason M Norman
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Scott A Handley
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Megan T Baldridge
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Lindsay Droit
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Catherine Y Liu
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Brian C Keller
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Amal Kambal
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Cynthia L Monaco
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Guoyan Zhao
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Molecular Microbiology, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Phillip Fleshner
- Division of Colorectal Surgery, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA
| | - Thaddeus S Stappenbeck
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Dermot P B McGovern
- The F. Widjaja Foundation Inflammatory Bowel and Immunobiology Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA
| | - Ali Keshavarzian
- Department of Medicine, Division of Digestive Diseases and Nutrition, Rush University Medical Center, Chicago, IL 60612, USA
| | - Ece A Mutlu
- Department of Medicine, Division of Digestive Diseases and Nutrition, Rush University Medical Center, Chicago, IL 60612, USA
| | - Jenny Sauk
- Gastrointestinal Unit and Center for the Study of Inflammatory Bowel Disease, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - Dirk Gevers
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ramnik J Xavier
- Gastrointestinal Unit and Center for the Study of Inflammatory Bowel Disease, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - David Wang
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Molecular Microbiology, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Miles Parkes
- Division of Gastroenterology Addenbrooke's Hospital and Department of Medicine, University of Cambridge, Cambridge CB2 0QQ, UK
| | - Herbert W Virgin
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA.
| |
Collapse
|
1968
|
Jørgensen TS, Kiil AS, Hansen MA, Sørensen SJ, Hansen LH. Current strategies for mobilome research. Front Microbiol 2015; 5:750. [PMID: 25657641 PMCID: PMC4302988 DOI: 10.3389/fmicb.2014.00750] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 12/10/2014] [Indexed: 11/30/2022] Open
Abstract
Mobile genetic elements (MGEs) are pivotal for bacterial evolution and adaptation, allowing shuffling of genes even between distantly related bacterial species. The study of these elements is biologically interesting as the mode of genetic propagation is kaleidoscopic and important, as MGEs are the main vehicles of the increasing bacterial antibiotic resistance that causes thousands of human deaths each year. The study of MGEs has previously focused on plasmids from individual isolates, but the revolution in sequencing technology has allowed the study of mobile genomic elements of entire communities using metagenomic approaches. The problem in using metagenomic sequencing for the study of MGEs is that plasmids and other mobile elements only comprise a small fraction of the total genetic content that are difficult to separate from chromosomal DNA based on sequence alone. The distinction between plasmid and chromosome is important as the mobility and regulation of genes largely depend on their genetic context. Several different approaches have been proposed that specifically enrich plasmid DNA from community samples. Here, we review recent approaches used to study entire plasmid pools from complex environments, and point out possible future developments for and pitfalls of these approaches. Further, we discuss the use of the PacBio long-read sequencing technology for MGE discovery.
Collapse
Affiliation(s)
- Tue S Jørgensen
- Section of Microbiology, Department of Biology, University of Copenhagen Copenhagen, Denmark
| | - Anne S Kiil
- Section of Microbiology, Department of Biology, University of Copenhagen Copenhagen, Denmark
| | - Martin A Hansen
- Section of Microbiology, Department of Biology, University of Copenhagen Copenhagen, Denmark
| | - Søren J Sørensen
- Section of Microbiology, Department of Biology, University of Copenhagen Copenhagen, Denmark
| | - Lars H Hansen
- Section of Microbiology, Department of Biology, University of Copenhagen Copenhagen, Denmark ; Environmental Microbiology and Biotechnology, Department of Environmental Science, Aarhus University Roskilde, Denmark
| |
Collapse
|
1969
|
Park JY, An YR, Kanda N, An CM, An HS, Kang JH, Kim EM, An DH, Jung H, Joung M, Park MH, Yoon SH, Lee BY, Lee T, Kim KW, Park WC, Shin DH, Lee YS, Kim J, Kwak W, Kim HJ, Kwon YJ, Moon S, Kim Y, Burt DW, Cho S, Kim H. Cetaceans evolution: insights from the genome sequences of common minke whales. BMC Genomics 2015; 16:13. [PMID: 25609461 PMCID: PMC4311506 DOI: 10.1186/s12864-015-1213-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2014] [Accepted: 01/02/2015] [Indexed: 12/11/2022] Open
Abstract
Background Whales have captivated the human imagination for millennia. These incredible cetaceans are the only mammals that have adapted to life in the open oceans and have been a source of human food, fuel and tools around the globe. The transition from land to water has led to various aquatic specializations related to hairless skin and ability to regulate their body temperature in cold water. Results We present four common minke whale (Balaenoptera acutorostrata) genomes with depth of ×13 ~ ×17 coverage and perform resequencing technology without a reference sequence. Our results indicated the time to the most recent common ancestors of common minke whales to be about 2.3574 (95% HPD, 1.1521 – 3.9212) million years ago. Further, we found that genes associated with epilation and tooth-development showed signatures of positive selection, supporting the morphological uniqueness of whales. Conclusions This whole-genome sequencing offers a chance to better understand the evolutionary journey of one of the largest mammals on earth. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1213-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jung Youn Park
- Biotechnology Research Division, National Fisheries Research & Development Institute, Gijang gun, Busan, 619-705, Republic of Korea.
| | - Yong-Rock An
- Cetacean Research Institute, National Fisheries Research & Development Institute, Nam-gu, Ulsan, 680-050, Republic of Korea.
| | - Naohisa Kanda
- The Institute of Cetacean Research, Toyomi 4-5, Chuo-ku, Tokyo, 104-0055, Japan.
| | - Chul-Min An
- Biotechnology Research Division, National Fisheries Research & Development Institute, Gijang gun, Busan, 619-705, Republic of Korea.
| | - Hye Suck An
- Biotechnology Research Division, National Fisheries Research & Development Institute, Gijang gun, Busan, 619-705, Republic of Korea.
| | - Jung-Ha Kang
- Biotechnology Research Division, National Fisheries Research & Development Institute, Gijang gun, Busan, 619-705, Republic of Korea.
| | - Eun Mi Kim
- Biotechnology Research Division, National Fisheries Research & Development Institute, Gijang gun, Busan, 619-705, Republic of Korea.
| | - Du-Hae An
- Cetacean Research Institute, National Fisheries Research & Development Institute, Nam-gu, Ulsan, 680-050, Republic of Korea.
| | - Hojin Jung
- Codes division, Insilicogen,Inc., Suwon, Gyeonggi-do, 441-813, Republic of Korea.
| | - Myunghee Joung
- Codes division, Insilicogen,Inc., Suwon, Gyeonggi-do, 441-813, Republic of Korea.
| | - Myung Hum Park
- TNT Research, #924 Doosan Venture Digm, Anyang, Gyeonggi-do, 431-755, Republic of Korea.
| | - Sook Hee Yoon
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 151-921, Republic of Korea.
| | - Bo-Young Lee
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 151-921, Republic of Korea.
| | - Taeheon Lee
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 151-921, Republic of Korea.
| | - Kyu-Won Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-742, Republic of Korea.
| | - Won Cheoul Park
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 151-921, Republic of Korea.
| | - Dong Hyun Shin
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 151-921, Republic of Korea.
| | - Young Sub Lee
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-742, Republic of Korea.
| | - Jaemin Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-742, Republic of Korea. .,C&K Genomics, Seoul National University Research Park, Seoul, 151-919, Republic of Korea.
| | - Woori Kwak
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-742, Republic of Korea. .,C&K Genomics, Seoul National University Research Park, Seoul, 151-919, Republic of Korea.
| | - Hyeon Jeong Kim
- C&K Genomics, Seoul National University Research Park, Seoul, 151-919, Republic of Korea.
| | - Young-Jun Kwon
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-742, Republic of Korea.
| | - Sunjin Moon
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195-5065, USA.
| | - Yuseob Kim
- Department of Life Science, Ewha Womans University, Seoul, Korea.
| | - David W Burt
- The Roslin Institute, University of Edinburgh, Midlothian, EH25 9GR, UK.
| | - Seoae Cho
- C&K Genomics, Seoul National University Research Park, Seoul, 151-919, Republic of Korea.
| | - Heebal Kim
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 151-921, Republic of Korea. .,Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-742, Republic of Korea. .,C&K Genomics, Seoul National University Research Park, Seoul, 151-919, Republic of Korea.
| |
Collapse
|
1970
|
Li D, Liu CM, Luo R, Sadakane K, Lam TW. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. ACTA ACUST UNITED AC 2015; 31:1674-6. [PMID: 25609793 DOI: 10.1093/bioinformatics/btv033] [Citation(s) in RCA: 4627] [Impact Index Per Article: 462.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2014] [Accepted: 01/14/2015] [Indexed: 01/25/2023]
Abstract
MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. It finished assembling a soil metagenomics dataset with 252 Gbps in 44.1 and 99.6 h on a single computing node with and without a graphics processing unit, respectively. MEGAHIT assembles the data as a whole, i.e. no pre-processing like partitioning and normalization was needed. When compared with previous methods on assembling the soil data, MEGAHIT generated a three-time larger assembly, with longer contig N50 and average contig length; furthermore, 55.8% of the reads were aligned to the assembly, giving a fourfold improvement.
Collapse
Affiliation(s)
- Dinghua Li
- HKU-BGI Bioinformatics Algorithms Research Laboratory & Department of Computer Science, University of Hong Kong, Hong Kong, L3 Bioinformatics Limited, Hong Kong and National Institute of Informatics, Chiyoda-ku, Tokyo, Japan
| | - Chi-Man Liu
- HKU-BGI Bioinformatics Algorithms Research Laboratory & Department of Computer Science, University of Hong Kong, Hong Kong, L3 Bioinformatics Limited, Hong Kong and National Institute of Informatics, Chiyoda-ku, Tokyo, Japan
| | - Ruibang Luo
- HKU-BGI Bioinformatics Algorithms Research Laboratory & Department of Computer Science, University of Hong Kong, Hong Kong, L3 Bioinformatics Limited, Hong Kong and National Institute of Informatics, Chiyoda-ku, Tokyo, Japan
| | - Kunihiko Sadakane
- HKU-BGI Bioinformatics Algorithms Research Laboratory & Department of Computer Science, University of Hong Kong, Hong Kong, L3 Bioinformatics Limited, Hong Kong and National Institute of Informatics, Chiyoda-ku, Tokyo, Japan
| | - Tak-Wah Lam
- HKU-BGI Bioinformatics Algorithms Research Laboratory & Department of Computer Science, University of Hong Kong, Hong Kong, L3 Bioinformatics Limited, Hong Kong and National Institute of Informatics, Chiyoda-ku, Tokyo, Japan HKU-BGI Bioinformatics Algorithms Research Laboratory & Department of Computer Science, University of Hong Kong, Hong Kong, L3 Bioinformatics Limited, Hong Kong and National Institute of Informatics, Chiyoda-ku, Tokyo, Japan
| |
Collapse
|
1971
|
Deng X, Naccache SN, Ng T, Federman S, Li L, Chiu CY, Delwart EL. An ensemble strategy that significantly improves de novo assembly of microbial genomes from metagenomic next-generation sequencing data. Nucleic Acids Res 2015; 43:e46. [PMID: 25586223 PMCID: PMC4402509 DOI: 10.1093/nar/gkv002] [Citation(s) in RCA: 206] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Accepted: 01/04/2015] [Indexed: 11/12/2022] Open
Abstract
Next-generation sequencing (NGS) approaches rapidly produce millions to billions of short reads, which allow pathogen detection and discovery in human clinical, animal and environmental samples. A major limitation of sequence homology-based identification for highly divergent microorganisms is the short length of reads generated by most highly parallel sequencing technologies. Short reads require a high level of sequence similarities to annotated genes to confidently predict gene function or homology. Such recognition of highly divergent homologues can be improved by reference-free (de novo) assembly of short overlapping sequence reads into larger contigs. We describe an ensemble strategy that integrates the sequential use of various de Bruijn graph and overlap-layout-consensus assemblers with a novel partitioned sub-assembly approach. We also proposed new quality metrics that are suitable for evaluating metagenome de novo assembly. We demonstrate that this new ensemble strategy tested using in silico spike-in, clinical and environmental NGS datasets achieved significantly better contigs than current approaches.
Collapse
Affiliation(s)
- Xutao Deng
- Blood Systems Research Institute, San Francisco, CA 94118, USA Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA 94107, USA
| | - Samia N Naccache
- Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA 94107, USA UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, CA 94107, USA
| | - Terry Ng
- Blood Systems Research Institute, San Francisco, CA 94118, USA Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA 94107, USA
| | - Scot Federman
- Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA 94107, USA UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, CA 94107, USA
| | - Linlin Li
- Blood Systems Research Institute, San Francisco, CA 94118, USA Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA 94107, USA
| | - Charles Y Chiu
- Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA 94107, USA UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, CA 94107, USA Department of Medicine, Division of Infectious Diseases, UCSF, San Francisco, CA 94143, USA
| | - Eric L Delwart
- Blood Systems Research Institute, San Francisco, CA 94118, USA Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA 94107, USA
| |
Collapse
|
1972
|
Sim M, Kim J. Metagenome assembly through clustering of next-generation sequencing data using protein sequences. J Microbiol Methods 2015; 109:180-7. [PMID: 25572018 DOI: 10.1016/j.mimet.2015.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Revised: 01/03/2015] [Accepted: 01/03/2015] [Indexed: 11/16/2022]
Abstract
The study of environmental microbial communities, called metagenomics, has gained a lot of attention because of the recent advances in next-generation sequencing (NGS) technologies. Microbes play a critical role in changing their environments, and the mode of their effect can be solved by investigating metagenomes. However, the difficulty of metagenomes, such as the combination of multiple microbes and different species abundance, makes metagenome assembly tasks more challenging. In this paper, we developed a new metagenome assembly method by utilizing protein sequences, in addition to the NGS read sequences. Our method (i) builds read clusters by using mapping information against available protein sequences, and (ii) creates contig sequences by finding consensus sequences through probabilistic choices from the read clusters. By using simulated NGS read sequences from real microbial genome sequences, we evaluated our method in comparison with four existing assembly programs. We found that our method could generate relatively long and accurate metagenome assemblies, indicating that the idea of using protein sequences, as a guide for the assembly, is promising.
Collapse
Affiliation(s)
- Mikang Sim
- Department of Animal Biotechnology, Konkuk University, Seoul 143-701, Republic of Korea
| | - Jaebum Kim
- Department of Animal Biotechnology, Konkuk University, Seoul 143-701, Republic of Korea.
| |
Collapse
|
1973
|
Hamilton TL, Jones DS, Schaperdoth I, Macalady JL. Metagenomic insights into S(0) precipitation in a terrestrial subsurface lithoautotrophic ecosystem. Front Microbiol 2015; 5:756. [PMID: 25620962 PMCID: PMC4288042 DOI: 10.3389/fmicb.2014.00756] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2014] [Accepted: 12/11/2014] [Indexed: 11/13/2022] Open
Abstract
The Frasassi and Acquasanta Terme cave systems in Italy host isolated lithoautotrophic ecosystems characterized by sulfur-oxidizing biofilms with up to 50% S(0) by mass. The net contributions of microbial taxa in the biofilms to production and consumption of S(0) are poorly understood and have implications for understanding the formation of geological sulfur deposits as well as the ecological niches of sulfur-oxidizing autotrophs. Filamentous Epsilonproteobacteria are among the principal biofilm architects in Frasassi and Acquasanta Terme streams, colonizing high-sulfide, low-oxygen niches relative to other major biofilm-forming populations. Metagenomic sequencing of eight biofilm samples indicated the presence of diverse and abundant Epsilonproteobacteria. Populations of Sulfurovum-like organisms were the most abundant Epsilonproteobacteria regardless of differences in biofilm morphology, temperature, or water chemistry. After assembling and binning the metagenomic data, we retrieved four nearly-complete genomes of Sulfurovum-like organisms as well as a Sulfuricurvum spp. Analyses of the binned and assembled metagenomic data indicate that the Epsilonproteobacteria are autotrophic and therefore provide organic carbon to the isolated subsurface ecosystem. Multiple homologs of sulfide-quinone oxidoreductase (Sqr), together with incomplete or absent Sox pathways, suggest that cave Sulfurovum-like Epsilonproteobacteria oxidize sulfide incompletely to S(0) using either O2 or nitrate as a terminal electron acceptor, consistent with previous evidence that they are most successful in niches with high dissolved sulfide to oxygen ratios. In contrast, we recovered homologs of the complete complement of Sox proteins affiliated Gammaproteobacteria and with less abundant Sulfuricurvum spp. and Arcobacter spp., suggesting that these populations are capable of the complete oxidation of sulfide to sulfate. These and other genomic data presented here offer new clues into the physiology and genetic potential of the largely uncultivated and ecologically successful cave Sulfurovum-like populations, and suggest that they play an integral role in subsurface S(0) formation.
Collapse
Affiliation(s)
- Trinity L Hamilton
- Department of Geosciences, Penn State Astrobiology Research Center, The Pennsylvania State University University Park, PA, USA
| | - Daniel S Jones
- Department of Geosciences, Penn State Astrobiology Research Center, The Pennsylvania State University University Park, PA, USA ; Department of Earth Sciences, University of Minnesota Minneapolis, MN, USA
| | - Irene Schaperdoth
- Department of Geosciences, Penn State Astrobiology Research Center, The Pennsylvania State University University Park, PA, USA
| | - Jennifer L Macalady
- Department of Geosciences, Penn State Astrobiology Research Center, The Pennsylvania State University University Park, PA, USA
| |
Collapse
|
1974
|
Clingenpeel S, Clum A, Schwientek P, Rinke C, Woyke T. Reconstructing each cell's genome within complex microbial communities-dream or reality? Front Microbiol 2015; 5:771. [PMID: 25620966 PMCID: PMC4287102 DOI: 10.3389/fmicb.2014.00771] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 12/17/2014] [Indexed: 11/24/2022] Open
Abstract
As the vast majority of microorganisms have yet to be cultivated in a laboratory setting, access to their genetic makeup has largely been limited to cultivation-independent methods. These methods, namely metagenomics and more recently single-cell genomics, have become cornerstones for microbial ecology and environmental microbiology. One ultimate goal is the recovery of genome sequences from each cell within an environment to move toward a better understanding of community metabolic potential and to provide substrate for experimental work. As single-cell sequencing has the ability to decipher all sequence information contained in an individual cell, this method holds great promise in tackling such challenge. Methodological limitations and inherent biases however do exist, which will be discussed here based on environmental and benchmark data, to assess how far we are from reaching this goal.
Collapse
Affiliation(s)
| | - Alicia Clum
- DOE Joint Genome Institute Walnut Creek, CA, USA
| | | | | | - Tanja Woyke
- DOE Joint Genome Institute Walnut Creek, CA, USA
| |
Collapse
|
1975
|
Baker BJ, Lazar CS, Teske AP, Dick GJ. Genomic resolution of linkages in carbon, nitrogen, and sulfur cycling among widespread estuary sediment bacteria. MICROBIOME 2015; 3:14. [PMID: 25922666 PMCID: PMC4411801 DOI: 10.1186/s40168-015-0077-6] [Citation(s) in RCA: 233] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Accepted: 03/12/2015] [Indexed: 05/20/2023]
Abstract
BACKGROUND Estuaries are among the most productive habitats on the planet. Bacteria in estuary sediments control the turnover of organic carbon and the cycling of nitrogen and sulfur. These communities are complex and primarily made up of uncultured lineages, thus little is known about how ecological and metabolic processes are partitioned in sediments. RESULTS De novo assembly and binning resulted in the reconstruction of 82 bacterial genomes from different redox regimes of estuary sediments. These genomes belong to 23 bacterial groups, including uncultured candidate phyla (for example, KSB1, TA06, and KD3-62) and three newly described phyla (White Oak River (WOR)-1, WOR-2, and WOR-3). The uncultured phyla are generally most abundant in the sulfate-methane transition (SMTZ) and methane-rich zones, and genomic data predict that they mediate essential biogeochemical processes of the estuarine environment, including organic carbon degradation and fermentation. Among the most abundant organisms in the sulfate-rich layer are novel Gammaproteobacteria that have genes for the oxidation of sulfur and the reduction of nitrate and nitrite. Interestingly, the terminal steps of denitrification (NO3 to N2O and then N2O to N2) are present in distinct bacterial populations. CONCLUSIONS This dataset extends our knowledge of the metabolic potential of several uncultured phyla. Within the sediments, there is redundancy in the genomic potential in different lineages, often distinct phyla, for essential biogeochemical processes. We were able to chart the flow of carbon and nutrients through the multiple geochemical layers of bacterial processing and reveal potential ecological interactions within the communities.
Collapse
Affiliation(s)
- Brett J Baker
- />Department of Marine Science, University of Texas-Austin, Marine Science Institute, 750 Channel View Dr., Port Aransas, TX 78373 USA
- />Department of Earth and Environmental Sciences, University of Michigan, 1100 N. University Ave., Ann Arbor, MI 48109 USA
| | - Cassandre Sara Lazar
- />Department of Marine Sciences, University of North Carolina, Chapel Hill, NC USA
- />Organic Geochemistry Group, MARUM Center for Marine Environmental Sciences, Department of Geosciences, University of Bremen, Bremen, Germany
| | - Andreas P Teske
- />Department of Marine Sciences, University of North Carolina, Chapel Hill, NC USA
| | - Gregory J Dick
- />Department of Earth and Environmental Sciences, University of Michigan, 1100 N. University Ave., Ann Arbor, MI 48109 USA
- />Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI USA
| |
Collapse
|
1976
|
Young JC, Chehoud C, Bittinger K, Bailey A, Diamond JM, Cantu E, Haas AR, Abbas A, Frye L, Christie JD, Bushman FD, Collman RG. Viral metagenomics reveal blooms of anelloviruses in the respiratory tract of lung transplant recipients. Am J Transplant 2015; 15:200-9. [PMID: 25403800 PMCID: PMC4276431 DOI: 10.1111/ajt.13031] [Citation(s) in RCA: 142] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2014] [Revised: 08/01/2014] [Accepted: 08/28/2014] [Indexed: 01/25/2023]
Abstract
Few studies have examined the lung virome in health and disease. Outcomes of lung transplantation are known to be influenced by several recognized respiratory viruses, but global understanding of the virome of the transplanted lung is incomplete. To define the DNA virome within the respiratory tract following lung transplantation we carried out metagenomic analysis of allograft bronchoalveolar lavage (BAL), and compared with healthy and HIV+ subjects. Viral concentrates were purified from BAL and analyzed by shotgun DNA sequencing. All of the BAL samples contained reads mapping to anelloviruses, with high proportions in lung transplant samples. Anellovirus populations in transplant recipients were complex, with multiple concurrent variants. Quantitative polymerase chain reaction quantification revealed that anellovirus sequences were 56-fold more abundant in BAL from lung transplant recipients compared with healthy controls or HIV+ subjects (p < 0.0001). Anellovirus sequences were also more abundant in upper respiratory tract specimens from lung transplant recipients than controls (p = 0.006). Comparison to metagenomic data on bacterial populations showed that high anellovirus loads correlated with dysbiotic bacterial communities in allograft BAL (p = 0.008). Thus the respiratory tracts of lung transplant recipients contain high levels and complex populations of anelloviruses, warranting studies of anellovirus lung infection and transplant outcome.
Collapse
Affiliation(s)
- Jacque C. Young
- Department of Microbiology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Christel Chehoud
- Department of Microbiology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Kyle Bittinger
- Department of Microbiology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Aubrey Bailey
- Department of Microbiology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Joshua M. Diamond
- Pulmonary, Allergy and Critical Care Division, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Edward Cantu
- Department of Surgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Andrew R. Haas
- Pulmonary, Allergy and Critical Care Division, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Arwa Abbas
- Department of Microbiology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Laura Frye
- Pulmonary, Allergy and Critical Care Division, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Jason D. Christie
- Pulmonary, Allergy and Critical Care Division, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, Center for Clinical Epidemiology and Biostatistics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania,Penn Cardiovascular Institute, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Frederic D. Bushman
- Department of Microbiology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Ronald G. Collman
- Department of Microbiology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania,Pulmonary, Allergy and Critical Care Division, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| |
Collapse
|
1977
|
Hodkinson BP, Grice EA. Next-Generation Sequencing: A Review of Technologies and Tools for Wound Microbiome Research. Adv Wound Care (New Rochelle) 2015; 4:50-58. [PMID: 25566414 DOI: 10.1089/wound.2014.0542] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2014] [Accepted: 06/23/2014] [Indexed: 12/26/2022] Open
Abstract
Significance: The colonization of wounds by specific microbes or communities of microbes may delay healing and/or lead to infection-related complication. Studies of wound-associated microbial communities (microbiomes) to date have primarily relied upon culture-based methods, which are known to have extreme biases and are not reliable for the characterization of microbiomes. Biofilms are very resistant to culture and are therefore especially difficult to study with techniques that remain standard in clinical settings. Recent Advances: Culture-independent approaches employing next-generation DNA sequencing have provided researchers and clinicians a window into wound-associated microbiomes that could not be achieved before and has begun to transform our view of wound-associated biodiversity. Within the past decade, many platforms have arisen for performing this type of sequencing, with various types of applications for microbiome research being possible on each. Critical Issues: Wound care incorporating knowledge of microbiomes gained from next-generation sequencing could guide clinical management and treatments. The purpose of this review is to outline the current platforms, their applications, and the steps necessary to undertake microbiome studies using next-generation sequencing. Future Directions: As DNA sequencing technology progresses, platforms will continue to produce longer reads and more reads per run at lower costs. A major future challenge is to implement these technologies in clinical settings for more precise and rapid identification of wound bioburden.
Collapse
Affiliation(s)
- Brendan P. Hodkinson
- Department of Dermatology, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania
| | - Elizabeth A. Grice
- Department of Dermatology, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania
| |
Collapse
|
1978
|
Walter MC, Öhrman C, Myrtennäs K, Sjödin A, Byström M, Larsson P, Macellaro A, Forsman M, Frangoulidis D. Genome sequence of Coxiella burnetii strain Namibia. Stand Genomic Sci 2014; 9:22. [PMID: 25593636 PMCID: PMC4286197 DOI: 10.1186/1944-3277-9-22] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Accepted: 11/17/2014] [Indexed: 11/28/2022] Open
Abstract
We present the whole genome sequence and annotation of the Coxiella burnetii strain Namibia. This strain was isolated from an aborting goat in 1991 in Windhoek, Namibia. The plasmid type QpRS was confirmed in our work. Further genomic typing placed the strain into a unique genomic group. The genome sequence is 2,101,438 bp long and contains 1,979 protein-coding and 51 RNA genes, including one rRNA operon. To overcome the poor yield from cell culture systems, an additional DNA enrichment with whole genome amplification (WGA) methods was applied. We describe a bioinformatics pipeline for improved genome assembly including several filters with a special focus on WGA characteristics.
Collapse
Affiliation(s)
- Mathias C Walter
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany.,Department of Genome-Oriented Bioinformatics, Center of Life and Food Science Weihenstephan, Technische Universität München, Freising, Germany
| | - Caroline Öhrman
- CBRN Defence and Security, Swedish Defence Research Agency (FOI), Umeå, Sweden
| | - Kerstin Myrtennäs
- CBRN Defence and Security, Swedish Defence Research Agency (FOI), Umeå, Sweden
| | - Andreas Sjödin
- CBRN Defence and Security, Swedish Defence Research Agency (FOI), Umeå, Sweden
| | - Mona Byström
- CBRN Defence and Security, Swedish Defence Research Agency (FOI), Umeå, Sweden
| | - Pär Larsson
- CBRN Defence and Security, Swedish Defence Research Agency (FOI), Umeå, Sweden
| | - Anna Macellaro
- CBRN Defence and Security, Swedish Defence Research Agency (FOI), Umeå, Sweden
| | - Mats Forsman
- CBRN Defence and Security, Swedish Defence Research Agency (FOI), Umeå, Sweden
| | | |
Collapse
|
1979
|
Tan MH, Gan HM, Lee YP, Austin CM. The complete mitogenome of the porcelain crab Petrolisthes haswelli Miers, 1884 (Crustacea: Decapoda: Anomura). Mitochondrial DNA A DNA Mapp Seq Anal 2014; 27:3983-3984. [DOI: 10.3109/19401736.2014.989515] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Mun Hua Tan
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia and
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia
| | - Han Ming Gan
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia and
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia
| | - Yin Peng Lee
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia and
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia
| | - Christopher M. Austin
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia and
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia
| |
Collapse
|
1980
|
Tan MH, Gan HM, Lee YP, Austin CM. The complete mitogenome of purple mottled shore crab Cyclograpsus granulosus H. Milne-Edwards, 1853 (Crustacea: Decapoda: Grapsoidea). Mitochondrial DNA A DNA Mapp Seq Anal 2014; 27:3981-3982. [DOI: 10.3109/19401736.2014.989514] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Affiliation(s)
- Mun Hua Tan
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia and
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia
| | - Han Ming Gan
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia and
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia
| | - Yin Peng Lee
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia and
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia
| | - Christopher M. Austin
- School of Science, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia and
- Monash University Malaysia Genomics Facility, Jalan Lagoon Selatan, Bandar Sunway, Petaling Jaya, Selangor, Malaysia
| |
Collapse
|
1981
|
Complete Genome Sequence of Klebsiella pneumoniae Sequence Type 17, a Multidrug-Resistant Strain Isolated during Tigecycline Treatment. GENOME ANNOUNCEMENTS 2014; 2:2/6/e01337-14. [PMID: 25540349 PMCID: PMC4276827 DOI: 10.1128/genomea.01337-14] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Klbesiella pneumoniae is one of the most important human pathogens and frequently causes many diseases. To facilitate the comparative genome analysis in tigecycline resistance mechanism, we report the complete chromosomal sequence of a multidrug-resistance K. pneumoniae strain before tigecycline treatment for reference genome.
Collapse
|
1982
|
Leung HCM, Yiu SM, Chin FYL. IDBA-MTP: A Hybrid Metatranscriptomic Assembler Based on Protein Information. J Comput Biol 2014; 22:367-76. [PMID: 25535824 DOI: 10.1089/cmb.2014.0139] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Metatranscriptomic analysis provides information on how a microbial community reacts to environmental changes. Using next-generation sequencing (NGS) technology, biologists can study the microbe community by sampling short reads from a mixture of mRNAs (metatranscriptomic data). As most microbial genome sequences are unknown, it would seem that de novo assembly of the mRNAs is needed. However, NGS reads are short and mRNAs share many similar regions and differ tremendously in abundance levels, making de novo assembly challenging. The existing assembler, IDBA-MT, designed specifically for the assembly of metatranscriptomic data and performs well only on high-expressed mRNAs. This article introduces IDBA-MTP, which adopts a novel approach to metatranscriptomic assembly that makes use of the fact that there is a database of millions of known protein sequences associated with mRNAs. How to effectively use the protein information is nontrivial given the size of the database and given that different mRNAs might lead to proteins with similar functions (because different amino acids might have similar characteristics). IDBA-MTP employs a similarity measure between mRNAs and protein sequences, dynamic programming techniques, and seed-and-extend heuristics to tackle the problem effectively and efficiently. Experimental results show that IDBA-MTP outperforms existing assemblers by reconstructing 14% more mRNAs.
Collapse
Affiliation(s)
- Henry C M Leung
- Department of Computer Science, The University of Hong Kong , Hong Kong , Hong Kong
| | | | | |
Collapse
|
1983
|
Martin-Cuadrado AB, Garcia-Heredia I, Moltó AG, López-Úbeda R, Kimes N, López-García P, Moreira D, Rodriguez-Valera F. A new class of marine Euryarchaeota group II from the Mediterranean deep chlorophyll maximum. ISME JOURNAL 2014; 9:1619-34. [PMID: 25535935 DOI: 10.1038/ismej.2014.249] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Revised: 11/14/2014] [Accepted: 11/19/2014] [Indexed: 11/09/2022]
Abstract
We have analyzed metagenomic fosmid clones from the deep chlorophyll maximum (DCM), which, by genomic parameters, correspond to the 16S ribosomal RNA (rRNA)-defined marine Euryarchaeota group IIB (MGIIB). The fosmid collections associated with this group add up to 4 Mb and correspond to at least two species within this group. From the proposed essential genes contained in the collections, we infer that large sections of the conserved regions of the genomes of these microbes have been recovered. The genomes indicate a photoheterotrophic lifestyle, similar to that of the available genome of MGIIA (assembled from an estuarine metagenome in Puget Sound, Washington Pacific coast), with a proton-pumping rhodopsin of the same kind. Several genomic features support an aerobic metabolism with diversified substrate degradation capabilities that include xenobiotics and agar. On the other hand, these MGIIB representatives are non-motile and possess similar genome size to the MGIIA-assembled genome, but with a lower GC content. The large phylogenomic gap with other known archaea indicates that this is a new class of marine Euryarchaeota for which we suggest the name Thalassoarchaea. The analysis of recruitment from available metagenomes indicates that the representatives of group IIB described here are largely found at the DCM (ca. 50 m deep), in which they are abundant (up to 0.5% of the reads), and at the surface mostly during the winter mixing, which explains formerly described 16S rRNA distribution patterns. Their uneven representation in environmental samples that are close in space and time might indicate sporadic blooms.
Collapse
Affiliation(s)
- Ana-Belen Martin-Cuadrado
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, San Juan de Alicante, Alicante, Spain
| | - Inmaculada Garcia-Heredia
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, San Juan de Alicante, Alicante, Spain
| | - Aitor Gonzaga Moltó
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, San Juan de Alicante, Alicante, Spain
| | - Rebeca López-Úbeda
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, San Juan de Alicante, Alicante, Spain
| | - Nikole Kimes
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, San Juan de Alicante, Alicante, Spain
| | - Purificación López-García
- Unité d'Ecologie, Systématique et Evolution, UMR CNRS 8079, Université Paris-Sud, Orsay Cedex, France
| | - David Moreira
- Unité d'Ecologie, Systématique et Evolution, UMR CNRS 8079, Université Paris-Sud, Orsay Cedex, France
| | - Francisco Rodriguez-Valera
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, San Juan de Alicante, Alicante, Spain
| |
Collapse
|
1984
|
Marbouty M, Cournac A, Flot JF, Marie-Nelly H, Mozziconacci J, Koszul R. Metagenomic chromosome conformation capture (meta3C) unveils the diversity of chromosome organization in microorganisms. eLife 2014; 3:e03318. [PMID: 25517076 PMCID: PMC4381813 DOI: 10.7554/elife.03318] [Citation(s) in RCA: 132] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2014] [Accepted: 11/05/2014] [Indexed: 12/18/2022] Open
Abstract
Genomic analyses of microbial populations in their natural environment remain limited by the difficulty to assemble full genomes of individual species. Consequently, the chromosome organization of microorganisms has been investigated in a few model species, but the extent to which the features described can be generalized to other taxa remains unknown. Using controlled mixes of bacterial and yeast species, we developed meta3C, a metagenomic chromosome conformation capture approach that allows characterizing individual genomes and their average organization within a mix of organisms. Not only can meta3C be applied to species already sequenced, but a single meta3C library can be used for assembling, scaffolding and characterizing the tridimensional organization of unknown genomes. By applying meta3C to a semi-complex environmental sample, we confirmed its promising potential. Overall, this first meta3C study highlights the remarkable diversity of microorganisms chromosome organization, while providing an elegant and integrated approach to metagenomic analysis. DOI:http://dx.doi.org/10.7554/eLife.03318.001 Microbial communities play vital roles in the environment and sustain animal and plant life. Marine microbes are part of the ocean's food chain; soil microbes support the turnover of major nutrients and facilitate plant growth; and the microbial communities residing in the human gut support digestion and the immune system, among other roles. These communities are very complex systems, often containing 1000s of different species engaged in co-dependent relationships, and are therefore very difficult to study. The entire DNA sequence of an organism constitutes its genome, and much of this genetic information is stored in large structures called chromosomes. Examining the genome of a species can provide important clues about its lifestyle and how it evolved. To do this, DNA is extracted from cells and is then usually cut into smaller fragments, amplified, and sequenced. The small stretches of sequence obtained, called reads, are finally assembled, yielding ideally the complete genome of the organism under study. Metagenomics attempts to interpret the combined genome of all the different species in a microbial community and has been instrumental in deciphering how the different species interact with each other. Metagenomics involves sequencing stretches of the community's DNA and matching these pieces to individual species to ultimately assemble whole genomes. While this may be a relatively straightforward task for communities that contain only a handful of members, the metagenomes derived from complex microbial communities are huge, fragmented, and incomplete. This often makes it very difficult or even nearly impossible to match the inferred DNA stretches to individual species. A method called chromosome conformation capture (or ‘3C’ for short) can reveal the physical contacts between different regions of a chromosome and between the different chromosomes of a cell. How often each of these chromosomal contacts occurs provides a kind of physical signature to each genome and each individual chromosome within it. Marbouty et al. took advantage of these interactions to develop a technique that combines metagenomics and chromosome conformation capture—called meta3C—that can analyze the DNA of many different species mixed together. Testing meta3C on artificial mixtures of a few species of yeast or bacteria showed that meta3C can separate the genomes of the different species without any prior knowledge of the composition of the mix. In a single experiment, meta3C can identify individual chromosomes, match each of them to its species of origin, and reveal the three-dimensional structure of each genome in the mix. Further tests showed that meta3C can also interpret more complex communities where the number and types of the species present are not known. Meta3C holds great promise for understanding how microbial communities work and how the genomes of the species within a community are organized. However, further developments of the technique will be required to investigate communities as diverse as those present in most natural environments. DOI:http://dx.doi.org/10.7554/eLife.03318.002
Collapse
Affiliation(s)
- Martial Marbouty
- Groupe Régulation Spatiale des Génomes, Département Génomes et Génétique, Institut Pasteur, Paris, France
| | - Axel Cournac
- Groupe Régulation Spatiale des Génomes, Département Génomes et Génétique, Institut Pasteur, Paris, France
| | - Jean-François Flot
- Biological Physics and Evolutionary Dynamics Group, Max Planck Institute for Dynamics and Self-Organization, Göttingen, Germany
| | - Hervé Marie-Nelly
- Groupe Régulation Spatiale des Génomes, Département Génomes et Génétique, Institut Pasteur, Paris, France
| | - Julien Mozziconacci
- Department of Physics, Laboratoire de physique théorique de la matière condensée, Université Pierre et Marie Curie, Paris, France
| | - Romain Koszul
- Groupe Régulation Spatiale des Génomes, Département Génomes et Génétique, Institut Pasteur, Paris, France
| |
Collapse
|
1985
|
Justice NB, Norman A, Brown CT, Singh A, Thomas BC, Banfield JF. Comparison of environmental and isolate Sulfobacillus genomes reveals diverse carbon, sulfur, nitrogen, and hydrogen metabolisms. BMC Genomics 2014; 15:1107. [PMID: 25511286 PMCID: PMC4378227 DOI: 10.1186/1471-2164-15-1107] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2014] [Accepted: 11/27/2014] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND Bacteria of the genus Sulfobacillus are found worldwide as members of microbial communities that accelerate sulfide mineral dissolution in acid mine drainage environments (AMD), acid-rock drainage environments (ARD), as well as in industrial bioleaching operations. Despite their frequent identification in these environments, their role in biogeochemical cycling is poorly understood. RESULTS Here we report draft genomes of five species of the Sulfobacillus genus (AMDSBA1-5) reconstructed by cultivation-independent sequencing of biofilms sampled from the Richmond Mine (Iron Mountain, CA). Three of these species (AMDSBA2, AMDSBA3, and AMDSBA4) have no cultured representatives while AMDSBA1 is a strain of S. benefaciens, and AMDSBA5 a strain of S. thermosulfidooxidans. We analyzed the diversity of energy conservation and central carbon metabolisms for these genomes and previously published Sulfobacillus genomes. Pathways of sulfur oxidation vary considerably across the genus, including the number and type of subunits of putative heterodisulfide reductase complexes likely involved in sulfur oxidation. The number and type of nickel-iron hydrogenase proteins varied across the genus, as does the presence of different central carbon pathways. Only the AMDSBA3 genome encodes a dissimilatory nitrate reducatase and only the AMDSBA5 and S. thermosulfidooxidans genomes encode assimilatory nitrate reductases. Within the genus, AMDSBA4 is unusual in that its electron transport chain includes a cytochrome bc type complex, a unique cytochrome c oxidase, and two distinct succinate dehydrogenase complexes. CONCLUSIONS Overall, the results significantly expand our understanding of carbon, sulfur, nitrogen, and hydrogen metabolism within the Sulfobacillus genus.
Collapse
Affiliation(s)
- Nicholas B Justice
- />Department of Earth and Planetary Science, University of California, Berkeley, CA 94720 USA
- />Physical Biosciences Division, Lawrence Berkeley National Lab, Berkeley, CA USA
| | - Anders Norman
- />Department of Earth and Planetary Science, University of California, Berkeley, CA 94720 USA
- />Section for Infection Microbiology, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
| | - Christopher T Brown
- />Department of Earth and Planetary Science, University of California, Berkeley, CA 94720 USA
| | - Andrea Singh
- />Department of Earth and Planetary Science, University of California, Berkeley, CA 94720 USA
| | - Brian C Thomas
- />Department of Earth and Planetary Science, University of California, Berkeley, CA 94720 USA
| | - Jillian F Banfield
- />Department of Earth and Planetary Science, University of California, Berkeley, CA 94720 USA
| |
Collapse
|
1986
|
Kwak W, Song KD, Oh JD, Heo KN, Lee JH, Lee WK, Yoon SH, Kim H, Cho S, Lee HK. Uncovering genomic features and maternal origin of korean native chicken by whole genome sequencing. PLoS One 2014; 9:e114763. [PMID: 25501044 PMCID: PMC4263466 DOI: 10.1371/journal.pone.0114763] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2014] [Accepted: 11/13/2014] [Indexed: 11/18/2022] Open
Abstract
The Korean Native Chicken (KNC) is an important endemic biological resource in Korea. While numerous studies have been conducted exploring this breed, none have used next-generation sequencing to identify its specific genomic features. We sequenced five strains of KNC and identified 10.9 million SNVs and 1.3 million InDels. Through the analysis, we found that the highly variable region common to all 5 strains had genes like PCHD15, CISD1, PIK3C2A, and NUCB2 that might be related to the phenotypic traits of the chicken such as auditory sense, growth rate and egg traits. In addition, we assembled unaligned reads that could not be mapped to the reference genome. By assembling the unaligned reads, we were able to present genomic sequences characteristic to the KNC. Based on this, we also identified genes related to the olfactory receptors and antigen that are common to all 5 strains. Finally, through the reconstructed mitochondrial genome sequences, we performed phylogenomic analysis and elucidated the maternal origin of the artificially restored KNC. Our results revealed that the KNC has multiple maternal origins which are in agreement with Korea's history of chicken breed imports. The results presented here provide a valuable basis for future research on genomic features of KNC and further understanding of KNC's origin.
Collapse
Affiliation(s)
- Woori Kwak
- C & K genomics, Seoul, Republic of Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Ki-Duk Song
- Genomic Informatics Center, Hankyong National University, Anseong, Republic of Korea
| | - Jae-Don Oh
- Genomic Informatics Center, Hankyong National University, Anseong, Republic of Korea
| | - Kang-Nyeong Heo
- Poultry Science Division, National Institute of Animal Science, Cheonan, Republic of Korea
| | - Jun-Heon Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, Republic of Korea
| | - Woon Kyu Lee
- Inha Research Institute for Medical Sciences, Inha University School of Medicine, Inchon, Republic of Korea
| | - Sook Hee Yoon
- Department of Agricultural Biotechnology and Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
| | - Heebal Kim
- Department of Agricultural Biotechnology and Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
| | - Seoae Cho
- C & K genomics, Seoul, Republic of Korea
| | - Hak-Kyo Lee
- Genomic Informatics Center, Hankyong National University, Anseong, Republic of Korea
| |
Collapse
|
1987
|
Abbas MM, Malluhi QM, Balakrishnan P. Assessment of de novo assemblers for draft genomes: a case study with fungal genomes. BMC Genomics 2014; 15 Suppl 9:S10. [PMID: 25521762 PMCID: PMC4290589 DOI: 10.1186/1471-2164-15-s9-s10] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task. RESULTS We have evaluated the performance of frequently used de novo assemblers namely ABySS, IDBA-UD, Minia, SOAP, SPAdes, Sparse, and Velvet. These assemblers are assessed based on their output quality during the assembly process conducted over fungal data. We compared the performance of these assemblers by considering both computational as well as quality metrics. By analyzing these performance metrics, the assemblers are ranked and a procedure for choosing the candidate assembler is illustrated. CONCLUSIONS In this study, we propose an assessment method for the selection of de novo assemblers by considering their computational as well as quality metrics at the draft genome level. We divide the quality metrics into three groups: g1 measures the goodness of the assemblies, g2 measures the problems of the assemblies, and g3 measures the conservation elements in the assemblies. Our results demonstrate that the assemblers ABySS and IDBA-UD exhibit a good performance for the studied data from fungal genomes in terms of running time, memory, and quality. The results suggest that whole genome shotgun sequencing projects should make use of different assemblers by considering their merits.
Collapse
|
1988
|
Facile, high quality sequencing of bacterial genomes from small amounts of DNA. Int J Genomics 2014; 2014:434575. [PMID: 25478564 PMCID: PMC4247979 DOI: 10.1155/2014/434575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 10/02/2014] [Indexed: 11/26/2022] Open
Abstract
Sequencing bacterial genomes has traditionally required large amounts of genomic DNA (~1 μg). There have been few studies to determine the effects of the input DNA amount or library preparation method on the quality of sequencing data. Several new commercially available library preparation methods enable shotgun sequencing from as little as 1 ng of input DNA. In this study, we evaluated the NEBNext Ultra library preparation reagents for sequencing bacterial genomes. We have evaluated the utility of NEBNext Ultra for resequencing and de novo assembly of four bacterial genomes and compared its performance with the TruSeq library preparation kit. The NEBNext Ultra reagents enable high quality resequencing and de novo assembly of a variety of bacterial genomes when using 100 ng of input genomic DNA. For the two most challenging genomes (Burkholderia spp.), which have the highest GC content and are the longest, we also show that the quality of both resequencing and de novo assembly is not decreased when only 10 ng of input genomic DNA is used.
Collapse
|
1989
|
Zhu X, Leung HCM, Chin FYL, Yiu SM, Quan G, Liu B, Wang Y. PERGA: a paired-end read guided de novo assembler for extending contigs using SVM and look ahead approach. PLoS One 2014; 9:e114253. [PMID: 25461763 PMCID: PMC4252104 DOI: 10.1371/journal.pone.0114253] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2014] [Accepted: 11/05/2014] [Indexed: 12/31/2022] Open
Abstract
Since the read lengths of high throughput sequencing (HTS) technologies are short, de novo assembly which plays significant roles in many applications remains a great challenge. Most of the state-of-the-art approaches base on de Bruijn graph strategy and overlap-layout strategy. However, these approaches which depend on k-mers or read overlaps do not fully utilize information of paired-end and single-end reads when resolving branches. Since they treat all single-end reads with overlapped length larger than a fix threshold equally, they fail to use the more confident long overlapped reads for assembling and mix up with the relative short overlapped reads. Moreover, these approaches have not been special designed for handling tandem repeats (repeats occur adjacently in the genome) and they usually break down the contigs near the tandem repeats. We present PERGA (Paired-End Reads Guided Assembler), a novel sequence-reads-guided de novo assembly approach, which adopts greedy-like prediction strategy for assembling reads to contigs and scaffolds using paired-end reads and different read overlap size ranging from Omax to Omin to resolve the gaps and branches. By constructing a decision model using machine learning approach based on branch features, PERGA can determine the correct extension in 99.7% of cases. When the correct extension cannot be determined, PERGA will try to extend the contig by all feasible extensions and determine the correct extension by using look-ahead approach. Many difficult-resolved branches are due to tandem repeats which are close in the genome. PERGA detects such different copies of the repeats to resolve the branches to make the extension much longer and more accurate. We evaluated PERGA on both Illumina real and simulated datasets ranging from small bacterial genomes to large human chromosome, and it constructed longer and more accurate contigs and scaffolds than other state-of-the-art assemblers. PERGA can be freely downloaded at https://github.com/hitbio/PERGA.
Collapse
Affiliation(s)
- Xiao Zhu
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | | | | | - Siu Ming Yiu
- Department of Computer Science, University of Hong Kong, Hong Kong
| | - Guangri Quan
- National Pilot School of Software, Harbin Institute of Technology, Weihai, Shandong, China
| | - Bo Liu
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Yadong Wang
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
- * E-mail:
| |
Collapse
|
1990
|
The complex task of choosing a de novo assembly: Lessons from fungal genomes. Comput Biol Chem 2014; 53 Pt A:97-107. [DOI: 10.1016/j.compbiolchem.2014.08.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/11/2014] [Indexed: 12/21/2022]
|
1991
|
Afiahayati, Sato K, Sakakibara Y. MetaVelvet-SL: an extension of the Velvet assembler to a de novo metagenomic assembler utilizing supervised learning. DNA Res 2014; 22:69-77. [PMID: 25431440 PMCID: PMC4379979 DOI: 10.1093/dnares/dsu041] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The assembly of multiple genomes from mixed sequence reads is a bottleneck in metagenomic analysis. A single-genome assembly program (assembler) is not capable of resolving metagenome sequences, so assemblers designed specifically for metagenomics have been developed. MetaVelvet is an extension of the single-genome assembler Velvet. It has been proved to generate assemblies with higher N50 scores and higher quality than single-genome assemblers such as Velvet and SOAPdenovo when applied to metagenomic sequence reads and is frequently used in this research community. One important open problem for MetaVelvet is its low accuracy and sensitivity in detecting chimeric nodes in the assembly (de Bruijn) graph, which prevents the generation of longer contigs and scaffolds. We have tackled this problem of classifying chimeric nodes using supervised machine learning to significantly improve the performance of MetaVelvet and developed a new tool, called MetaVelvet-SL. A Support Vector Machine is used for learning the classification model based on 94 features extracted from candidate nodes. In extensive experiments, MetaVelvet-SL outperformed the original MetaVelvet and other state-of-the-art metagenomic assemblers, IDBA-UD, Ray Meta and Omega, to reconstruct accurate longer assemblies with higher N50 scores for both simulated data sets and real data sets of human gut microbial sequences.
Collapse
Affiliation(s)
- Afiahayati
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan
| | - Kengo Sato
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan
| | - Yasubumi Sakakibara
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan
| |
Collapse
|
1992
|
Biology of a widespread uncultivated archaeon that contributes to carbon fixation in the subsurface. Nat Commun 2014; 5:5497. [PMID: 25425419 DOI: 10.1038/ncomms6497] [Citation(s) in RCA: 81] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Accepted: 10/07/2014] [Indexed: 02/03/2023] Open
Abstract
Subsurface microbial life contributes significantly to biogeochemical cycling, yet it remains largely uncharacterized, especially its archaeal members. This 'microbial dark matter' has been explored by recent studies that were, however, mostly based on DNA sequence information only. Here, we use diverse techniques including ultrastuctural analyses to link genomics to biology for the SM1 Euryarchaeon lineage, an uncultivated group of subsurface archaea. Phylogenomic analyses reveal this lineage to belong to a widespread group of archaea that we propose to classify as a new euryarchaeal order ('Candidatus Altiarchaeales'). The representative, double-membraned species 'Candidatus Altiarchaeum hamiconexum' has an autotrophic metabolism that uses a not-yet-reported Factor420-free reductive acetyl-CoA pathway, confirmed by stable carbon isotopic measurements of archaeal lipids. Our results indicate that this lineage has evolved specific metabolic and structural features like nano-grappling hooks empowering this widely distributed archaeon to predominate anaerobic groundwater, where it may represent an important carbon dioxide sink.
Collapse
|
1993
|
Community-integrated omics links dominance of a microbial generalist to fine-tuned resource usage. Nat Commun 2014; 5:5603. [PMID: 25424998 PMCID: PMC4263124 DOI: 10.1038/ncomms6603] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2014] [Accepted: 10/20/2014] [Indexed: 11/08/2022] Open
Abstract
Microbial communities are complex and dynamic systems that are primarily structured according to their members’ ecological niches. To investigate how niche breadth (generalist versus specialist lifestyle strategies) relates to ecological success, we develop and apply an integrative workflow for the multi-omic analysis of oleaginous mixed microbial communities from a biological wastewater treatment plant. Time- and space-resolved coupled metabolomic and taxonomic analyses demonstrate that the community-wide lipid accumulation phenotype is associated with the dominance of the generalist bacterium Candidatus Microthrix spp. By integrating population-level genomic reconstructions (reflecting fundamental niches) with transcriptomic and proteomic data (realised niches), we identify finely tuned gene expression governing resource usage by Candidatus Microthrix parvicella over time. Moreover, our results indicate that the fluctuating environmental conditions constrain the accumulation of genetic variation in Candidatus Microthrix parvicella likely due to fitness trade-offs. Based on our observations, niche breadth has to be considered as an important factor for understanding the evolutionary processes governing (microbial) population sizes and structures in situ. Within microbial communities, microorganisms adopt different lifestyle strategies to use the available resources. Here, the authors use an integrated ‘multi-omic’ approach to study niche breadth (generalist versus specialist lifestyles) in oleaginous microbial assemblages from an anoxic wastewater treatment tank.
Collapse
|
1994
|
Human papillomavirus type 197 is commonly present in skin tumors. Int J Cancer 2014; 136:2546-55. [DOI: 10.1002/ijc.29325] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2014] [Accepted: 10/20/2014] [Indexed: 11/07/2022]
|
1995
|
Tan MH, Gan HM, Lee YP, Austin CM. The complete mitogenome of the soldier crab Mictyris longicarpus (Latreille, 1806) (Crustacea: Decapoda: Mictyridae). Mitochondrial DNA A DNA Mapp Seq Anal 2014; 27:2121-2. [PMID: 25423510 DOI: 10.3109/19401736.2014.982585] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The Mictyris longicarpus (soldier crab) complete mitochondrial genome sequence is reported making it the first for the family Mictyridae and the second for the superfamily Ocypodoidea. The mitogenome is 15,548 base pairs made up of 13 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs and a non-coding AT-rich region. The soldier crab mitogenome gene order is characteristic of brachyuran crabs with a base composition of 36.58% for T, 19.15% for C, 32.43% for A and 11.83% for G, with an AT bias of 69.01%.
Collapse
Affiliation(s)
- Mun Hua Tan
- a School of Science, Monash University Malaysia , Selangor , Malaysia and.,b Genomics Facility, Monash University Malaysia , Selangor , Malaysia
| | - Han Ming Gan
- a School of Science, Monash University Malaysia , Selangor , Malaysia and.,b Genomics Facility, Monash University Malaysia , Selangor , Malaysia
| | - Yin Peng Lee
- a School of Science, Monash University Malaysia , Selangor , Malaysia and.,b Genomics Facility, Monash University Malaysia , Selangor , Malaysia
| | - Christopher M Austin
- a School of Science, Monash University Malaysia , Selangor , Malaysia and.,b Genomics Facility, Monash University Malaysia , Selangor , Malaysia
| |
Collapse
|
1996
|
Ghai R, Mizuno CM, Picazo A, Camacho A, Rodriguez‐Valera F. Key roles for freshwater
A
ctinobacteria revealed by deep metagenomic sequencing. Mol Ecol 2014; 23:6073-90. [DOI: 10.1111/mec.12985] [Citation(s) in RCA: 113] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2014] [Revised: 10/15/2014] [Accepted: 10/17/2014] [Indexed: 02/04/2023]
Affiliation(s)
- Rohit Ghai
- Evolutionary Genomics Group Departamento de Producción Vegetal y Microbiología Universidad Miguel Hernández San Juan de Alicante 03550 Alicante Spain
| | - Carolina Megumi Mizuno
- Evolutionary Genomics Group Departamento de Producción Vegetal y Microbiología Universidad Miguel Hernández San Juan de Alicante 03550 Alicante Spain
| | - Antonio Picazo
- Cavanilles Institute of Biodiversity and Evolutionary Biology University of Valencia Burjassot E‐46100 Valencia Spain
| | - Antonio Camacho
- Cavanilles Institute of Biodiversity and Evolutionary Biology University of Valencia Burjassot E‐46100 Valencia Spain
| | - Francisco Rodriguez‐Valera
- Evolutionary Genomics Group Departamento de Producción Vegetal y Microbiología Universidad Miguel Hernández San Juan de Alicante 03550 Alicante Spain
| |
Collapse
|
1997
|
Predicting the functional repertoire of an organism from unassembled RNA-seq data. BMC Genomics 2014; 15:1003. [PMID: 25409897 PMCID: PMC4258056 DOI: 10.1186/1471-2164-15-1003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2014] [Accepted: 10/30/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The annotation of biomolecular functions is an essential step in the analysis of newly sequenced organisms. Usually, the functions are inferred from predicted genes on the genome using homology search techniques. A high quality genomic sequence is an important prerequisite which, however, is difficult to achieve for certain organisms, such as hybrids or organisms with a large genome. For functional analysis it is also possible to use a de novo transcriptome assembly but the computational requirements can be demanding. Up to now, it is unclear how much of the functional repertoire of an organism can be reliably predicted from unassembled RNA-seq short reads alone. RESULTS We have conducted a study to investigate to what degree it is possible to reconstruct the functional profile of an organism from unassembled transcriptome data. We simulated the de novo prediction of biomolecular functions for Arabidopsis thaliana using a comprehensive RNA-seq data set. We evaluated the prediction performance using several homology search methods in combination with different evidence measures. For the decision on the presence or absence of a particular function under noisy conditions we propose a statistical mixture model enabling unsupervised estimation of a detection threshold. Our results indicate that the prediction of the biomolecular functions from the KEGG database is possible with a high sensitivity up to 94 percent. In this setting, the application of the mixture model for automatic threshold calibration allowed the reduction of the falsely predicted functions down to 4 percent. Furthermore, we found that our statistical approach even outperforms the prediction from a de novo transcriptome assembly. CONCLUSION The analysis of an organism's transcriptome can provide a solid basis for the prediction of biomolecular functions. Using RNA-seq short reads directly, the functional profile of an organism can be reconstructed in a computationally efficient way to provide a draft annotation in cases where the classical genome-based approaches cannot be applied.
Collapse
|
1998
|
Lo CC, Chain PSG. Rapid evaluation and quality control of next generation sequencing data with FaQCs. BMC Bioinformatics 2014; 15:366. [PMID: 25408143 PMCID: PMC4246454 DOI: 10.1186/s12859-014-0366-2] [Citation(s) in RCA: 160] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2014] [Accepted: 10/29/2014] [Indexed: 11/15/2022] Open
Abstract
Background Next generation sequencing (NGS) technologies that parallelize the sequencing process and produce thousands to millions, or even hundreds of millions of sequences in a single sequencing run, have revolutionized genomic and genetic research. Because of the vagaries of any platform’s sequencing chemistry, the experimental processing, machine failure, and so on, the quality of sequencing reads is never perfect, and often declines as the read is extended. These errors invariably affect downstream analysis/application and should therefore be identified early on to mitigate any unforeseen effects. Results Here we present a novel FastQ Quality Control Software (FaQCs) that can rapidly process large volumes of data, and which improves upon previous solutions to monitor the quality and remove poor quality data from sequencing runs. Both the speed of processing and the memory footprint of storing all required information have been optimized via algorithmic and parallel processing solutions. The trimmed output compared side-by-side with the original data is part of the automated PDF output. We show how this tool can help data analysis by providing a few examples, including an increased percentage of reads recruited to references, improved single nucleotide polymorphism identification as well as de novo sequence assembly metrics. Conclusion FaQCs combines several features of currently available applications into a single, user-friendly process, and includes additional unique capabilities such as filtering the PhiX control sequences, conversion of FASTQ formats, and multi-threading. The original data and trimmed summaries are reported within a variety of graphics and reports, providing a simple way to do data quality control and assurance. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0366-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Chien-Chi Lo
- Bioenergy and Biome Sciences Group, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
| | - Patrick S G Chain
- Bioenergy and Biome Sciences Group, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA. .,Genome Science Group, Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
| |
Collapse
|
1999
|
McCorrison JM, Venepally P, Singh I, Fouts DE, Lasken RS, Methé BA. NeatFreq: reference-free data reduction and coverage normalization for De Novo sequence assembly. BMC Bioinformatics 2014; 15:357. [PMID: 25407910 PMCID: PMC4245761 DOI: 10.1186/s12859-014-0357-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 10/22/2014] [Indexed: 11/10/2022] Open
Abstract
Background Deep shotgun sequencing on next generation sequencing (NGS) platforms has contributed significant amounts of data to enrich our understanding of genomes, transcriptomes, amplified single-cell genomes, and metagenomes. However, deep coverage variations in short-read data sets and high sequencing error rates of modern sequencers present new computational challenges in data interpretation, including mapping and de novo assembly. New lab techniques such as multiple displacement amplification (MDA) of single cells and sequence independent single primer amplification (SISPA) allow for sequencing of organisms that cannot be cultured, but generate highly variable coverage due to amplification biases. Results Here we introduce NeatFreq, a software tool that reduces a data set to more uniform coverage by clustering and selecting from reads binned by their median kmer frequency (RMKF) and uniqueness. Previous algorithms normalize read coverage based on RMKF, but do not include methods for the preferred selection of (1) extremely low coverage regions produced by extremely variable sequencing of random-primed products and (2) 2-sided paired-end sequences. The algorithm increases the incorporation of the most unique, lowest coverage, segments of a genome using an error-corrected data set. NeatFreq was applied to bacterial, viral plaque, and single-cell sequencing data. The algorithm showed an increase in the rate at which the most unique reads in a genome were included in the assembled consensus while also reducing the count of duplicative and erroneous contigs (strings of high confidence overlaps) in the deliverable consensus. The results obtained from conventional Overlap-Layout-Consensus (OLC) were compared to simulated multi-de Bruijn graph assembly alternatives trained for variable coverage input using sequence before and after normalization of coverage. Coverage reduction was shown to increase processing speed and reduce memory requirements when using conventional bacterial assembly algorithms. Conclusions The normalization of deep coverage spikes, which would otherwise inhibit consensus resolution, enables High Throughput Sequencing (HTS) assembly projects to consistently run to completion with existing assembly software. The NeatFreq software package is free, open source and available at https://github.com/bioh4x/NeatFreq. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0357-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jamison M McCorrison
- Informatics Core Services, The J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD, 20850, USA.
| | - Pratap Venepally
- Informatics Core Services, The J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD, 20850, USA.
| | - Indresh Singh
- Informatics Core Services, The J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD, 20850, USA.
| | - Derrick E Fouts
- Department of Genomic Medicine, The J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD, 20850, USA.
| | - Roger S Lasken
- Department of Microbial & Environmental Genomics, The J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD20850, USA.
| | - Barbara A Methé
- Department of Genomic Medicine, The J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD, 20850, USA. .,Department of Microbial & Environmental Genomics, The J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD20850, USA.
| |
Collapse
|
2000
|
Aguirre de Cárcer D, Angly FE, Alcamí A. Evaluation of viral genome assembly and diversity estimation in deep metagenomes. BMC Genomics 2014; 15:989. [PMID: 25407630 PMCID: PMC4247695 DOI: 10.1186/1471-2164-15-989] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2014] [Accepted: 10/30/2014] [Indexed: 01/21/2023] Open
Abstract
Background Viruses have unique properties, small genome and regions of high similarity, whose effects on metagenomic assemblies have not been characterized so far. This study uses diverse in silico simulated viromes to evaluate how extensively genomes can be assembled using different sequencing platforms and assemblers. Further, it investigates the suitability of different methods to estimate viral diversity in metagenomes. Results We created in silico metagenomes mimicking various platforms at different sequencing depths. The CLC assembler revealed subpar compared to IDBA_UD and CAMERA , which are metagenomic-specific. Up to a saturation point, Illumina platforms proved more capable of reconstructing large portions of viral genomes compared to 454. Read length was an important factor for limiting chimericity, while scaffolding marginally improved contig length and accuracy. The genome length of the various viruses in the metagenomes did not significantly affect genome reconstruction, but the co-existence of highly similar genomes was detrimental. When evaluating diversity estimation tools, we found that PHACCS results were more accurate than those from CatchAll and clustering, which were both orders of magnitude above expected. Conclusions Assemblers designed specifically for the analysis of metagenomes should be used to facilitate the creation of high-quality long contigs. Despite the high coverage possible, scientists should not expect to always obtain complete genomes, because their reconstruction may be hindered by co-existing species bearing highly similar genomic regions. Further development of metagenomics-oriented assemblers may help bypass these limitations in future studies. Meanwhile, the lack of fully reconstructed communities keeps methods to estimate viral diversity relevant. While none of the three methods tested had absolute precision, only PHACCS was deemed suitable for comparative studies. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-989) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Daniel Aguirre de Cárcer
- Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Autónoma de Madrid, Madrid, Spain.
| | | | | |
Collapse
|