1
|
Muñoz-Marín MDC, Magasin JD, Zehr JP. Open ocean and coastal strains of the N2-fixing cyanobacterium UCYN-A have distinct transcriptomes. PLoS One 2023; 18:e0272674. [PMID: 37130101 PMCID: PMC10153697 DOI: 10.1371/journal.pone.0272674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2022] [Accepted: 02/18/2023] [Indexed: 05/03/2023] Open
Abstract
Decades of research on marine N2 fixation focused on Trichodesmium, which are generally free-living cyanobacteria, but in recent years the endosymbiotic cyanobacterium Candidatus Atelocyanobacterium thalassa (UCYN-A) has received increasing attention. However, few studies have shed light on the influence of the host versus the habitat on UCYN-A N2 fixation and overall metabolism. Here we compared transcriptomes from natural populations of UCYN-A from oligotrophic open-ocean versus nutrient-rich coastal waters, using a microarray that targets the full genomes of UCYN-A1 and UCYN-A2 and known genes for UCYN-A3. We found that UCYN-A2, usually regarded as adapted to coastal environments, was transcriptionally very active in the open ocean and appeared to be less impacted by habitat change than UCYN-A1. Moreover, for genes with 24 h periodic expression we observed strong but inverse correlations among UCYN-A1, A2, and A3 to oxygen and chlorophyll, which suggests distinct host-symbiont relationships. Across habitats and sublineages, genes for N2 fixation and energy production had high transcript levels, and, intriguingly, were among the minority of genes that kept the same schedule of diel expression. This might indicate different regulatory mechanisms for genes that are critical to the symbiosis for the exchange of nitrogen for carbon from the host. Our results underscore the importance of N2 fixation in UCYN-A symbioses across habitats, with consequences for community interactions and global biogeochemical cycles.
Collapse
Affiliation(s)
- María Del Carmen Muñoz-Marín
- Department of Ocean Sciences, University of California Santa Cruz, Santa Cruz, California, United States of America
- Departamento de Bioquímica y Biología Molecular, Campus de Excelencia Internacional Agroalimentario, Universidad de Córdoba, Córdoba, Spain
| | - Jonathan D Magasin
- Department of Ocean Sciences, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Jonathan P Zehr
- Department of Ocean Sciences, University of California Santa Cruz, Santa Cruz, California, United States of America
| |
Collapse
|
2
|
Bhanse P, Kumar M, Singh L, Awasthi MK, Qureshi A. Role of plant growth-promoting rhizobacteria in boosting the phytoremediation of stressed soils: Opportunities, challenges, and prospects. CHEMOSPHERE 2022; 303:134954. [PMID: 35595111 DOI: 10.1016/j.chemosphere.2022.134954] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 04/30/2022] [Accepted: 05/10/2022] [Indexed: 05/02/2023]
Abstract
Soil is considered as a vital natural resource equivalent to air and water which supports growth of the plants and provides habitats to microorganisms. Changes in soil properties, productivity, and, inevitably contamination/stress are the result of urbanisation, industrialization, and long-term use of synthetic fertiliser. Therefore, in the recent scenario, reclamation of contaminated/stressed soils has become a potential challenge. Several customized, such as, physical, chemical, and biological technologies have been deployed so far to restore contaminated land. Among them, microbial-assisted phytoremediation is considered as an economical and greener approach. In recent decades, soil microbes have successfully been used to improve plants' ability to tolerate biotic and abiotic stress and strengthen their phytoremediation capacity. Therefore, in this context, the current review work critically explored the microbial assisted phytoremediation mechanisms to restore different types of stressed soil. The role of plant growth-promoting rhizobacteria (PGPR) and their potential mechanisms that foster plants' growth and also enhance phytoremediation capacity are focussed. Finally, this review has emphasized on the application of advanced tools and techniques to effectively characterize potent soil microbial communities and their significance in boosting the phytoremediation process of stressed soils along with prospects for future research.
Collapse
Affiliation(s)
- Poonam Bhanse
- Environmental Biotechnology and Genomics Division, CSIR-National Environmental Engineering Research Institute (CSIR-NEERI), Nehru Marg, Nagpur, 440020, Maharashtra, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India
| | - Manish Kumar
- Environmental Biotechnology and Genomics Division, CSIR-National Environmental Engineering Research Institute (CSIR-NEERI), Nehru Marg, Nagpur, 440020, Maharashtra, India
| | - Lal Singh
- Environmental Biotechnology and Genomics Division, CSIR-National Environmental Engineering Research Institute (CSIR-NEERI), Nehru Marg, Nagpur, 440020, Maharashtra, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India
| | - Mukesh Kumar Awasthi
- College of Natural Resources and Environment, Northwest A&F University, Yangling, 712100, Shaanxi Province, PR China.
| | - Asifa Qureshi
- Environmental Biotechnology and Genomics Division, CSIR-National Environmental Engineering Research Institute (CSIR-NEERI), Nehru Marg, Nagpur, 440020, Maharashtra, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.
| |
Collapse
|
3
|
Microbial Involvement in the Bioremediation of Total Petroleum Hydrocarbon Polluted Soils: Challenges and Perspectives. ENVIRONMENTS 2022. [DOI: 10.3390/environments9040052] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Nowadays, soil contamination by total petroleum hydrocarbons is still one of the most widespread forms of contamination. Intervention technologies are consolidated; however, full-scale interventions turn out to be not sustainable. Sustainability is essential not only in terms of costs, but also in terms of restoration of the soil resilience. Bioremediation has the possibility to fill the gap of sustainability with proper knowledge. Bioremediation should be optimized by the exploitation of the recent “omic” approaches to the study of hydrocarburoclastic microbiomes. To reach the goal, an extensive and deep knowledge in the study of bacterial and fungal degradative pathways, their interactions within microbiomes and of microbiomes with the soil matrix has to be gained. “Omic” approaches permits to study both the culturable and the unculturable soil microbial communities active in degradation processes, offering the instruments to identify the key organisms responsible for soil contaminant depletion and restoration of soil resilience. Tools for the investigation of both microbial communities, their degradation pathways and their interaction, will be discussed, describing the dedicated genomic and metagenomic approaches, as well as the interpretative tools of the deriving data, that are exploitable for both optimizing bio-based approaches for the treatment of total petroleum hydrocarbon contaminated soils and for the correct scaling up of the technologies at the industrial scale.
Collapse
|
4
|
Williams EP, Bachvaroff TR, Place AR. A Global Approach to Estimating the Abundance and Duplication of Polyketide Synthase Domains in Dinoflagellates. Evol Bioinform Online 2021; 17:11769343211031871. [PMID: 34345159 PMCID: PMC8283056 DOI: 10.1177/11769343211031871] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2020] [Accepted: 06/23/2021] [Indexed: 11/17/2022] Open
Abstract
Many dinoflagellate species make toxins in a myriad of different molecular configurations but the underlying chemistry in all cases is presumably via modular synthases, primarily polyketide synthases. In many organisms modular synthases occur as discrete synthetic genes or domains within a gene that act in coordination thus forming a module that produces a particular fragment of a natural product. The modules usually occur in tandem as gene clusters with a syntenic arrangement that is often predictive of the resultant structure. Dinoflagellate genomes however are notoriously complex with individual genes present in many tandem repeats and very few synthetic modules occurring as gene clusters, unlike what has been seen in bacteria and fungi. However, modular synthesis in all organisms requires a free thiol group that acts as a carrier for sequential synthesis called a thiolation domain. We scanned 47 dinoflagellate transcriptomes for 23 modular synthase domain models and compared their abundance among 10 orders of dinoflagellates as well as their co-occurrence with thiolation domains. The total count of domain types was quite large with over thirty-thousand identified, 29 000 of which were in the core dinoflagellates. Although there were no specific trends in domain abundance associated with types of toxins, there were readily observable lineage specific differences. The Gymnodiniales, makers of long polyketide toxins such as brevetoxin and karlotoxin had a high relative abundance of thiolation domains as well as multiple thiolation domains within a single transcript. Orders such as the Gonyaulacales, makers of small polyketides such as spirolides, had fewer thiolation domains but a relative increase in the number of acyl transferases. Unique to the core dinoflagellates, however, were thiolation domains occurring alongside tetratricopeptide repeats that facilitate protein-protein interactions, especially hexa and hepta-repeats, that may explain the scaffolding required for synthetic complexes capable of making large toxins. Clustering analysis for each type of domain was also used to discern possible origins of duplication for the multitude of single domain transcripts. Single domain transcripts frequently clustered with synonymous domains from multi-domain transcripts such as the BurA and ZmaK like genes as well as the multi-ketosynthase genes, sometimes with a large degree of apparent gene duplication, while fatty acid synthesis genes formed distinct clusters. Surprisingly the acyl-transferases and ketoreductases involved in fatty acid synthesis (FabD and FabG, respectively) were found in very large clusters indicating an unprecedented degree of gene duplication for these genes. These results demonstrate a complex evolutionary history of core dinoflagellate modular synthases with domain specific duplications throughout the lineage as well as clues to how large protein complexes can be assembled to synthesize the largest natural products known.
Collapse
Affiliation(s)
- Ernest P Williams
- Institute of Marine and Environmental Technologies, University of Maryland Center for Environmental Science, Baltimore, MD, USA
| | - Tsvetan R Bachvaroff
- Institute of Marine and Environmental Technologies, University of Maryland Center for Environmental Science, Baltimore, MD, USA
| | - Allen R Place
- Institute of Marine and Environmental Technologies, University of Maryland Center for Environmental Science, Baltimore, MD, USA
| |
Collapse
|
5
|
Saito MA, Saunders JK, Chagnon M, Gaylord DA, Shepherd A, Held NA, Dupont C, Symmonds N, York A, Charron M, Kinkade DB. Development of an Ocean Protein Portal for Interactive Discovery and Education. J Proteome Res 2020; 20:326-336. [PMID: 32897077 DOI: 10.1021/acs.jproteome.0c00382] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Proteins are critical in catalyzing chemical reactions, forming key cellular structures, and in regulating cellular processes. Investigation of marine microbial proteins by metaproteomics methods enables the discovery of numerous aspects of microbial biogeochemical processes. However, these datasets present big data challenges as they often involve many samples collected across broad geospatial and temporal scales, resulting in thousands of protein identifications, abundances, and corresponding annotation information. The Ocean Protein Portal (OPP) was created to enable data sharing and discovery among multiple scientific domains and serve both research and education functions. The portal focuses on three use case questions: "Where is my protein of interest?", "Who makes it?", and "How much is there?" and provides profile and section visualizations, real-time taxonomic analysis, and links to metadata, sequence analysis, and other external resources to enable connections to be made between biogeochemical and proteomics datasets.
Collapse
Affiliation(s)
- Mak A Saito
- Woods Hole Oceanographic Institution, Woods Hole, Falmouth, Massachusetts 02543, United States
| | - Jaclyn K Saunders
- Woods Hole Oceanographic Institution, Woods Hole, Falmouth, Massachusetts 02543, United States
| | - Michael Chagnon
- RPS Group, South Kingston, Rhode Island 02879, United States.,Kaimika Technology, Cumberland, Rhode Island 02864, United States
| | - David A Gaylord
- Woods Hole Oceanographic Institution, Woods Hole, Falmouth, Massachusetts 02543, United States
| | - Adam Shepherd
- Woods Hole Oceanographic Institution, Woods Hole, Falmouth, Massachusetts 02543, United States
| | - Noelle A Held
- Woods Hole Oceanographic Institution, Woods Hole, Falmouth, Massachusetts 02543, United States
| | - Christopher Dupont
- Woods Hole Oceanographic Institute, Falmouth, Massachusetts 02543, United States
| | - Nicholas Symmonds
- Woods Hole Oceanographic Institution, Woods Hole, Falmouth, Massachusetts 02543, United States
| | - Amber York
- Woods Hole Oceanographic Institution, Woods Hole, Falmouth, Massachusetts 02543, United States
| | - Matthew Charron
- Kaimika Technology, Cumberland, Rhode Island 02864, United States
| | - Danie B Kinkade
- Woods Hole Oceanographic Institute, Falmouth, Massachusetts 02543, United States
| |
Collapse
|
6
|
Santos-Júnior CD, Sarmento H, de Miranda FP, Henrique-Silva F, Logares R. Uncovering the genomic potential of the Amazon River microbiome to degrade rainforest organic matter. MICROBIOME 2020; 8:151. [PMID: 33126925 PMCID: PMC7597016 DOI: 10.1186/s40168-020-00930-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Accepted: 10/06/2020] [Indexed: 06/11/2023]
Abstract
BACKGROUND The Amazon River is one of the largest in the world and receives huge amounts of terrestrial organic matter (TeOM) from the surrounding rainforest. Despite this TeOM is typically recalcitrant (i.e. resistant to degradation), only a small fraction of it reaches the ocean, pointing to a substantial TeOM degradation by the river microbiome. Yet, microbial genes involved in TeOM degradation in the Amazon River were barely known. Here, we examined the Amazon River microbiome by analysing 106 metagenomes from 30 sampling points distributed along the river. RESULTS We constructed the Amazon River basin Microbial non-redundant Gene Catalogue (AMnrGC) that includes ~ 3.7 million non-redundant genes, affiliating mostly to bacteria. We found that the Amazon River microbiome contains a substantial gene-novelty compared to other relevant known environments (rivers and rainforest soil). Genes encoding for proteins potentially involved in lignin degradation pathways were correlated to tripartite tricarboxylates transporters and hemicellulose degradation machinery, pointing to a possible priming effect. Based on this, we propose a model on how the degradation of recalcitrant TeOM could be modulated by labile compounds in the Amazon River waters. Our results also suggest changes of the microbial community and its genomic potential along the river course. CONCLUSIONS Our work contributes to expand significantly our comprehension of the world's largest river microbiome and its potential metabolism related to TeOM degradation. Furthermore, the produced gene catalogue (AMnrGC) represents an important resource for future research in tropical rivers. Video abstract.
Collapse
Affiliation(s)
- Célio Dias Santos-Júnior
- Molecular Biology Laboratory, Department of Genetics and Evolution – DGE, Universidade Federal de São Carlos – UFSCar, Rod. Washington Luis KM 235 - Monjolinho, São Carlos, SP 13565-905 Brazil
- Institute of Science and Technology for Brain-Inspired Intelligence – ISTBI, Fudan University, Handan Rd 220, Wu Jiao Chang, Yangpu, Shanghai, 200433 China
| | - Hugo Sarmento
- Laboratory of Microbial Processes & Biodiversity, Department of Hydrobiology – DHB, Universidade Federal de São Carlos – UFSCar, Via Washington Luis KM 235 - Monjolinho, São Carlos, SP 13565-905 Brazil
| | - Fernando Pellon de Miranda
- Centro de Pesquisas e Desenvolvimento Leopoldo Américo Miguez de Mello, Petróleo Brasileiro S.A. (Petrobras), Av. Horácio Macedo 950, Rio de Janeiro, RJ 21941-915 Brazil
| | - Flávio Henrique-Silva
- Molecular Biology Laboratory, Department of Genetics and Evolution – DGE, Universidade Federal de São Carlos – UFSCar, Rod. Washington Luis KM 235 - Monjolinho, São Carlos, SP 13565-905 Brazil
| | - Ramiro Logares
- Institute of Marine Sciences (ICM), CSIC, Passeig Marítim de la Barceloneta 37-49, ES08003, Barcelona, Catalonia Spain
| |
Collapse
|
7
|
Blanco-Míguez A, Fdez-Riverola F, Sánchez B, Lourenço A. Resources and tools for the high-throughput, multi-omic study of intestinal microbiota. Brief Bioinform 2020; 20:1032-1056. [PMID: 29186315 DOI: 10.1093/bib/bbx156] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Revised: 10/23/2017] [Indexed: 12/18/2022] Open
Abstract
The human gut microbiome impacts several aspects of human health and disease, including digestion, drug metabolism and the propensity to develop various inflammatory, autoimmune and metabolic diseases. Many of the molecular processes that play a role in the activity and dynamics of the microbiota go beyond species and genic composition and thus, their understanding requires advanced bioinformatics support. This article aims to provide an up-to-date view of the resources and software tools that are being developed and used in human gut microbiome research, in particular data integration and systems-level analysis efforts. These efforts demonstrate the power of standardized and reproducible computational workflows for integrating and analysing varied omics data and gaining deeper insights into microbe community structure and function as well as host-microbe interactions.
Collapse
Affiliation(s)
| | | | | | - Anália Lourenço
- Dpto. de Informática - Universidade de Vigo, ESEI - Escuela Superior de Ingeniería Informática, Edificio politécnico, Campus Universitario As Lagoas s/n, 32004 Ourense, Spain
| |
Collapse
|
8
|
Grimshaw SG, Smith AM, Arnold DS, Xu E, Hoptroff M, Murphy B. The diversity and abundance of fungi and bacteria on the healthy and dandruff affected human scalp. PLoS One 2019; 14:e0225796. [PMID: 31851674 PMCID: PMC6919596 DOI: 10.1371/journal.pone.0225796] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Accepted: 11/12/2019] [Indexed: 12/22/2022] Open
Abstract
Dandruff is a skin condition that affects the scalp of up to half the world's population, it is characterised by an itchy, flaky scalp and is associated with colonisation of the skin by Malassezia spp. Management of this condition is typically via antifungal therapies, however the precise role of microbes in the aggravation of the condition are incompletely characterised. Here, a combination of 454 sequencing and qPCR techniques were used to compare the scalp microbiota of dandruff and non-dandruff affected Chinese subjects. Based on 454 sequencing of the scalp microbiome, the two most abundant bacterial genera found on the scalp surface were Cutibacterium (formerly Propionibacterium) and Staphylococcus, while Malassezia was the main fungal inhabitant. Quantitative PCR (qPCR) analysis of four scalp taxa (M. restricta, M. globosa, C. acnes and Staphylococcus spp.) believed to represent the bulk of the overall population was additionally carried out. Metataxonomic and qPCR analyses were performed on healthy and lesional buffer scrub samples to facilitate assessment of whether the scalp condition is associated with differential microbial communities on the sampled skin. Dandruff was associated with greater frequencies of M. restricta and Staphylococcus spp. compared with the healthy population (p<0.05). Analysis also revealed the presence of an unclassified fungal taxon that could represent a novel Malassezia species.
Collapse
Affiliation(s)
- Sally G. Grimshaw
- Unilever Research & Development, Port Sunlight, England, United Kingdom
| | - Adrian M. Smith
- Unilever Research & Development, Colworth, England, United Kingdom
| | - David S. Arnold
- Unilever Research & Development, Port Sunlight, England, United Kingdom
| | - Elaine Xu
- Unilever Research & Development, Shanghai, China
| | - Michael Hoptroff
- Unilever Research & Development, Port Sunlight, England, United Kingdom
| | - Barry Murphy
- Unilever Research & Development, Port Sunlight, England, United Kingdom
| |
Collapse
|
9
|
Perz AI, Giles CB, Brown CA, Porter H, Roopnarinesingh X, Wren JD. MNEMONIC: MetageNomic Experiment Mining to create an OTU Network of Inhabitant Correlations. BMC Bioinformatics 2019; 20:96. [PMID: 30871469 PMCID: PMC6419333 DOI: 10.1186/s12859-019-2623-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Background The number of publicly available metagenomic experiments in various environments has been rapidly growing, empowering the potential to identify similar shifts in species abundance between different experiments. This could be a potentially powerful way to interpret new experiments, by identifying common themes and causes behind changes in species abundance. Results We propose a novel framework for comparing microbial shifts between conditions. Using data from one of the largest human metagenome projects to date, the American Gut Project (AGP), we obtain differential abundance vectors for microbes using experimental condition information provided with the AGP metadata, such as patient age, dietary habits, or health status. We show it can be used to identify similar and opposing shifts in microbial species, and infer putative interactions between microbes. Our results show that groups of shifts with similar effects on microbiome can be identified and that similar dietary interventions display similar microbial abundance shifts. Conclusions Without comparison to prior data, it is difficult for experimentalists to know if their observed changes in species abundance have been observed by others, both in their conditions and in others they would never consider comparable. Yet, this can be a very important contextual factor in interpreting the significance of a shift. We’ve proposed and tested an algorithmic solution to this problem, which also allows for comparing the metagenomic signature shifts between conditions in the existing body of data. Electronic supplementary material The online version of this article (10.1186/s12859-019-2623-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Aleksandra I Perz
- Arthritis and Clinical Immunology Program, Division of Genomics and Data Sciences, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104-5005, USA.
| | - Cory B Giles
- Arthritis and Clinical Immunology Program, Division of Genomics and Data Sciences, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104-5005, USA.,Department of Geriatric Medicine, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Chase A Brown
- Arthritis and Clinical Immunology Program, Division of Genomics and Data Sciences, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104-5005, USA.,Oklahoma Center for Neuroscience, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Hunter Porter
- Arthritis and Clinical Immunology Program, Division of Genomics and Data Sciences, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104-5005, USA.,Oklahoma Center for Neuroscience, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Xiavan Roopnarinesingh
- Arthritis and Clinical Immunology Program, Division of Genomics and Data Sciences, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104-5005, USA.,Department of Biochemistry and Molecular Biology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Jonathan D Wren
- Arthritis and Clinical Immunology Program, Division of Genomics and Data Sciences, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104-5005, USA. .,Department of Biochemistry and Molecular Biology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA. .,Oklahoma Center for Neuroscience, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA. .,Department of Geriatric Medicine, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA.
| |
Collapse
|
10
|
The Transcriptional Cycle Is Suited to Daytime N 2 Fixation in the Unicellular Cyanobacterium " Candidatus Atelocyanobacterium thalassa" (UCYN-A). mBio 2019; 10:mBio.02495-18. [PMID: 30602582 PMCID: PMC6315102 DOI: 10.1128/mbio.02495-18] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The symbiotic N2-fixing cyanobacterium UCYN-A, which is closely related to Braarudosphaera bigelowii, and its eukaryotic algal host have been shown to be globally distributed and important in open-ocean N2 fixation. These unique cyanobacteria have reduced metabolic capabilities, even lacking genes for oxygenic photosynthesis and carbon fixation. Cyanobacteria generally use energy from photosynthesis for nitrogen fixation but require mechanisms for avoiding inactivation of the oxygen-sensitive nitrogenase enzyme by ambient oxygen (O2) or the O2 evolved through photosynthesis. This study showed that symbiosis between the N2-fixing cyanobacterium UCYN-A and its eukaryotic algal host has led to adaptation of its daily gene expression pattern in order to enable daytime aerobic N2 fixation, which is likely more energetically efficient than fixing N2 at night, as found in other unicellular marine cyanobacteria. Symbiosis between a marine alga and a N2-fixing cyanobacterium (Cyanobacterium UCYN-A) is geographically widespread in the oceans and is important in the marine N cycle. UCYN-A is uncultivated and is an unusual unicellular cyanobacterium because it lacks many metabolic functions, including oxygenic photosynthesis and carbon fixation, which are typical in cyanobacteria. It is now presumed to be an obligate symbiont of haptophytes closely related to Braarudosphaera bigelowii. N2-fixing cyanobacteria use different strategies to avoid inhibition of N2 fixation by the oxygen evolved in photosynthesis. Most unicellular cyanobacteria temporally separate the two incompatible activities by fixing N2 only at night, but, surprisingly, UCYN-A appears to fix N2 during the day. The goal of this study was to determine how the unicellular UCYN-A strain coordinates N2 fixation and general metabolism compared to other marine cyanobacteria. We found that UCYN-A has distinct daily cycles of many genes despite the fact that it lacks two of the three circadian clock genes found in most cyanobacteria. We also found that the transcription patterns in UCYN-A are more similar to those in marine cyanobacteria that are capable of aerobic N2 fixation in the light, such as Trichodesmium and heterocyst-forming cyanobacteria, than to those in Crocosphaera or Cyanothece species, which are more closely related to unicellular marine cyanobacteria evolutionarily. Our findings suggest that the symbiotic interaction has resulted in a shift of transcriptional regulation to coordinate UCYN-A metabolism with that of the phototrophic eukaryotic host, thus allowing efficient coupling of N2 fixation (by the cyanobacterium) to the energy obtained from photosynthesis (by the eukaryotic unicellular alga) in the light.
Collapse
|
11
|
Tangherlini M, Miralto M, Colantuono C, Sangiovanni M, Dell’ Anno A, Corinaldesi C, Danovaro R, Chiusano ML. GLOSSary: the GLobal Ocean 16S subunit web accessible resource. BMC Bioinformatics 2018; 19:443. [PMID: 30497362 PMCID: PMC6266928 DOI: 10.1186/s12859-018-2423-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Environmental metagenomics is a challenging approach that is exponentially spreading in the scientific community to investigate taxonomic diversity and possible functions of the biological components. The massive amount of sequence data produced, often endowed with rich environmental metadata, needs suitable computational tools to fully explore the embedded information. Bioinformatics plays a key role in providing methodologies to manage, process and mine molecular data, integrated with environmental metagenomics collections. One such relevant example is represented by the Tara Ocean Project. RESULTS We considered the Tara 16S miTAGs released by the consortium, representing raw sequences from a shotgun metagenomics approach with similarities to 16S rRNA genes. We generated assembled 16S rDNA sequences, which were classified according to their lengths, the possible presence of chimeric reads, the putative taxonomic affiliation. The dataset was included in GLOSSary (the GLobal Ocean 16S Subunit web accessible resource), a bioinformatics platform to organize environmental metagenomics data. The aims of this work were: i) to present alternative computational approaches to manage challenging metagenomics data; ii) to set up user friendly web-based platforms to allow the integration of environmental metagenomics sequences and of the associated metadata; iii) to implement an appropriate bioinformatics platform supporting the analysis of 16S rDNA sequences exploiting reference datasets, such as the SILVA database. We organized the data in a next-generation NoSQL "schema-less" database, allowing flexible organization of large amounts of data and supporting native geospatial queries. A web interface was developed to permit an interactive exploration and a visual geographical localization of the data, either raw miTAG reads or 16S contigs, from our processing pipeline. Information on unassembled sequences is also available. The taxonomic affiliations of contigs and miTAGs, and the spatial distribution of the sampling sites and their associated sequence libraries, as they are contained in the Tara metadata, can be explored by a query interface, which allows both textual and visual investigations. In addition, all the sequence data were made available for a dedicated BLAST-based web application alongside the SILVA collection. CONCLUSIONS GLOSSary provides an expandable bioinformatics environment, able to support the scientific community in current and forthcoming environmental metagenomics analyses.
Collapse
Affiliation(s)
- M. Tangherlini
- Stazione Zoologica “Anton Dohrn”, Villa Comunale, 80121 Naples, Italy
| | - M. Miralto
- Stazione Zoologica “Anton Dohrn”, Villa Comunale, 80121 Naples, Italy
| | - C. Colantuono
- Stazione Zoologica “Anton Dohrn”, Villa Comunale, 80121 Naples, Italy
| | - M. Sangiovanni
- Stazione Zoologica “Anton Dohrn”, Villa Comunale, 80121 Naples, Italy
| | - A. Dell’ Anno
- Dipartimento di Scienze della Vita e dell’Ambiente, Polytechnic University of Marche, Via Brecce Bianche, 60131 Ancona, Italy
| | - C. Corinaldesi
- Dipartimento di Scienze e Ingegneria della Materia, dell’Ambiente ed Urbanistica, Polytechnic University of Marche, Via Brecce Bianche, 60131 Ancona, Italy
| | - R. Danovaro
- Stazione Zoologica “Anton Dohrn”, Villa Comunale, 80121 Naples, Italy
- Dipartimento di Scienze della Vita e dell’Ambiente, Polytechnic University of Marche, Via Brecce Bianche, 60131 Ancona, Italy
| | - M. L. Chiusano
- Stazione Zoologica “Anton Dohrn”, Villa Comunale, 80121 Naples, Italy
- Dipartimento di Agraria, University of Naples “Federico II”, via Università 100, 80055 Portici, Italy
| |
Collapse
|
12
|
Versluis D, Nijsse B, Naim MA, Koehorst JJ, Wiese J, Imhoff JF, Schaap PJ, van Passel MWJ, Smidt H, Sipkema D. Comparative Genomics Highlights Symbiotic Capacities and High Metabolic Flexibility of the Marine Genus Pseudovibrio. Genome Biol Evol 2018; 10:125-142. [PMID: 29319806 PMCID: PMC5765558 DOI: 10.1093/gbe/evx271] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/18/2017] [Indexed: 12/19/2022] Open
Abstract
Pseudovibrio is a marine bacterial genus members of which are predominantly isolated from sessile marine animals, and particularly sponges. It has been hypothesized that Pseudovibrio spp. form mutualistic relationships with their hosts. Here, we studied Pseudovibrio phylogeny and genetic adaptations that may play a role in host colonization by comparative genomics of 31 Pseudovibrio strains, including 25 sponge isolates. All genomes were highly similar in terms of encoded core metabolic pathways, albeit with substantial differences in overall gene content. Based on gene composition, Pseudovibrio spp. clustered by geographic region, indicating geographic speciation. Furthermore, the fact that isolates from the Mediterranean Sea clustered by sponge species suggested host-specific adaptation or colonization. Genome analyses suggest that Pseudovibrio hongkongensis UST20140214-015BT is only distantly related to other Pseudovibrio spp., thereby challenging its status as typical Pseudovibrio member. All Pseudovibrio genomes were found to encode numerous proteins with SEL1 and tetratricopeptide repeats, which have been suggested to play a role in host colonization. For evasion of the host immune system, Pseudovibrio spp. may depend on type III, IV, and VI secretion systems that can inject effector molecules into eukaryotic cells. Furthermore, Pseudovibrio genomes carry on average seven secondary metabolite biosynthesis clusters, reinforcing the role of Pseudovibrio spp. as potential producers of novel bioactive compounds. Tropodithietic acid, bacteriocin, and terpene biosynthesis clusters were highly conserved within the genus, suggesting an essential role in survival, for example through growth inhibition of bacterial competitors. Taken together, these results support the hypothesis that Pseudovibrio spp. have mutualistic relations with sponges.
Collapse
Affiliation(s)
- Dennis Versluis
- Laboratory of Microbiology, Wageningen University & Research, The Netherlands
| | - Bart Nijsse
- Laboratory of Microbiology, Wageningen University & Research, The Netherlands.,Laboratory of Systems and Synthetic Biology, Wageningen University & Research, The Netherlands
| | - Mohd Azrul Naim
- Laboratory of Microbiology, Wageningen University & Research, The Netherlands
| | - Jasper J Koehorst
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, The Netherlands
| | - Jutta Wiese
- Marine Microbiology, GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
| | - Johannes F Imhoff
- Marine Microbiology, GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
| | - Peter J Schaap
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, The Netherlands
| | - Mark W J van Passel
- Laboratory of Microbiology, Wageningen University & Research, The Netherlands.,National Institute for Public Health and the Environment, Bilthoven, The Netherlands
| | - Hauke Smidt
- Laboratory of Microbiology, Wageningen University & Research, The Netherlands
| | - Detmer Sipkema
- Laboratory of Microbiology, Wageningen University & Research, The Netherlands
| |
Collapse
|
13
|
Kinjo S, Monma N, Misu S, Kitamura N, Imoto J, Yoshitake K, Gojobori T, Ikeo K. Maser: one-stop platform for NGS big data from analysis to visualization. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:4970007. [PMID: 29688385 PMCID: PMC5905357 DOI: 10.1093/database/bay027] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 02/26/2018] [Indexed: 11/13/2022]
Abstract
A major challenge in analyzing the data from high-throughput next-generation sequencing (NGS) is how to handle the huge amounts of data and variety of NGS tools and visualize the resultant outputs. To address these issues, we developed a cloud-based data analysis platform, Maser (Management and Analysis System for Enormous Reads), and an original genome browser, Genome Explorer (GE). Maser enables users to manage up to 2 terabytes of data to conduct analyses with easy graphical user interface operations and offers analysis pipelines in which several individual tools are combined as a single pipeline for very common and standard analyses. GE automatically visualizes genome assembly and mapping results output from Maser pipelines, without requiring additional data upload. With this function, the Maser pipelines can graphically display the results output from all the embedded tools and mapping results in a web browser. Therefore Maser realized a more user-friendly analysis platform especially for beginners by improving graphical display and providing the selected standard pipelines that work with built-in genome browser. In addition, all the analyses executed on Maser are recorded in the analysis history, helping users to trace and repeat the analyses. The entire process of analysis and its histories can be shared with collaborators or opened to the public. In conclusion, our system is useful for managing, analyzing, and visualizing NGS data and achieves traceability, reproducibility, and transparency of NGS analysis. Database URL: http://cell-innovation.nig.ac.jp/maser/
Collapse
Affiliation(s)
- Sonoko Kinjo
- Center for Information Biology, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
| | | | | | - Norikazu Kitamura
- Center for Information Biology, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
| | - Junichi Imoto
- Center for Information Biology, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
| | - Kazutoshi Yoshitake
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Takashi Gojobori
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Kazuho Ikeo
- Center for Information Biology, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan.,Department of Genetics, SOKENDAI, Mishima, Japan
| |
Collapse
|
14
|
Ugarte A, Vicedomini R, Bernardes J, Carbone A. A multi-source domain annotation pipeline for quantitative metagenomic and metatranscriptomic functional profiling. MICROBIOME 2018; 6:149. [PMID: 30153857 PMCID: PMC6114274 DOI: 10.1186/s40168-018-0532-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 08/13/2018] [Indexed: 05/23/2023]
Abstract
BACKGROUND Biochemical and regulatory pathways have until recently been thought and modelled within one cell type, one organism and one species. This vision is being dramatically changed by the advent of whole microbiome sequencing studies, revealing the role of symbiotic microbial populations in fundamental biochemical functions. The new landscape we face requires the reconstruction of biochemical and regulatory pathways at the community level in a given environment. In order to understand how environmental factors affect the genetic material and the dynamics of the expression from one environment to another, we want to evaluate the quantity of gene protein sequences or transcripts associated to a given pathway by precisely estimating the abundance of protein domains, their weak presence or absence in environmental samples. RESULTS MetaCLADE is a novel profile-based domain annotation pipeline based on a multi-source domain annotation strategy. It applies directly to reads and improves identification of the catalog of functions in microbiomes. MetaCLADE is applied to simulated data and to more than ten metagenomic and metatranscriptomic datasets from different environments where it outperforms InterProScan in the number of annotated domains. It is compared to the state-of-the-art non-profile-based and profile-based methods, UProC and HMM-GRASPx, showing complementary predictions to UProC. A combination of MetaCLADE and UProC improves even further the functional annotation of environmental samples. CONCLUSIONS Learning about the functional activity of environmental microbial communities is a crucial step to understand microbial interactions and large-scale environmental impact. MetaCLADE has been explicitly designed for metagenomic and metatranscriptomic data and allows for the discovery of patterns in divergent sequences, thanks to its multi-source strategy. MetaCLADE highly improves current domain annotation methods and reaches a fine degree of accuracy in annotation of very different environments such as soil and marine ecosystems, ancient metagenomes and human tissues.
Collapse
Affiliation(s)
- Ari Ugarte
- Sorbonne Université, UPMC-Univ P6, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative - UMR 7238, 4 Place Jussieu, Paris, 75005 France
| | - Riccardo Vicedomini
- Sorbonne Université, UPMC-Univ P6, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative - UMR 7238, 4 Place Jussieu, Paris, 75005 France
- Sorbonne Université, UPMC-Univ P6, CNRS, Institut des Sciences du Calcul et des Donnees, 4 Place Jussieu, Paris, 75005 France
| | - Juliana Bernardes
- Sorbonne Université, UPMC-Univ P6, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative - UMR 7238, 4 Place Jussieu, Paris, 75005 France
| | - Alessandra Carbone
- Sorbonne Université, UPMC-Univ P6, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative - UMR 7238, 4 Place Jussieu, Paris, 75005 France
- Institut Universitaire de France, Paris, 75005 France
| |
Collapse
|
15
|
Dittberner H, Ohlmann N, Acquisti C. Stoichio-Metagenomics of Ocean Waters: A Molecular Evolution Approach to Trace the Dynamics of Nitrogen Conservation in Natural Communities. Front Microbiol 2018; 9:1590. [PMID: 30072968 PMCID: PMC6058095 DOI: 10.3389/fmicb.2018.01590] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 06/26/2018] [Indexed: 11/21/2022] Open
Abstract
Nitrogen is crucially limiting in ocean surface waters, and its availability varies substantially with coastal regions typically richer in nutrients than open oceans. In a biological stoichiometry framework, a parsimonious strategy of nitrogen allocation predicts nitrogen content of proteins to be lower in communities adapted to open ocean than to coastal regions. To test this hypothesis we have directly interrogated marine microbial communities, using a series of metagenomics datasets with a broad geographical distribution from the Global Ocean Sampling Expedition. Analyzing over 20 million proteins, we document a ubiquitous signal of nitrogen conservation in open ocean communities, both in membrane and non-membrane proteins. Efficient nitrogen allocation is expected to specifically target proteins that are expressed at high rate in response to nitrogen starvation. Furthermore, in order to preserve protein functional efficiency, economic nitrogen allocation is predicted to target primarily the least functionally constrained regions of proteins. Contrasting the NtcA-induced pathway, typically up-regulated in response to nitrogen starvation, with the arginine anabolic pathway, which is instead up-regulated in response to nitrogen abundance, we show how both these predictions are fulfilled. Using evolutionary rates as an informative proxy of functional constraints, we show that variation in nitrogen allocation between open ocean and coastal communities is primarily localized in the least functionally constrained regions of the genes triggered by NtcA. As expected, such a pattern is not detectable in the genes involved in the arginine anabolic pathway. These results directly link environmental nitrogen availability to different adaptive strategies of genome evolution, and emphasize the relevance of the material costs of evolutionary change in natural ecosystems.
Collapse
Affiliation(s)
- Hannes Dittberner
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Niklas Ohlmann
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Claudia Acquisti
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| |
Collapse
|
16
|
Kauffman KM, Brown JM, Sharma RS, VanInsberghe D, Elsherbini J, Polz M, Kelly L. Viruses of the Nahant Collection, characterization of 251 marine Vibrionaceae viruses. Sci Data 2018; 5:180114. [PMID: 29969110 PMCID: PMC6029569 DOI: 10.1038/sdata.2018.114] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Accepted: 04/11/2018] [Indexed: 11/09/2022] Open
Abstract
Viruses are highly discriminating in their interactions with host cells and are thought to play a major role in maintaining diversity of environmental microbes. However, large-scale ecological and genomic studies of co-occurring virus-host pairs, required to characterize the mechanistic and genomic foundations of virus-host interactions, are lacking. Here, we present the largest dataset of cultivated and sequenced co-occurring virus-host pairs that captures ecologically representative fine-scale diversity. Using the ubiquitous and ecologically diverse marine Vibrionaceae as a host platform, we isolate and sequence 251 dsDNA viruses and their hosts from three time points within a 93-day time-series study. The virus collection includes representatives of the three Caudovirales tailed virus morphotypes, a novel family of nontailed viruses, and the smallest (10,046 bp) and largest (348,911 bp) Vibrio virus genomes described. We provide general characterization and annotation of the viruses and describe read-mapping protocols to standardize genome presentation. The rich ecological and genomic contextualization of hosts and viruses make the Nahant Collection a unique platform for high-resolution studies of environmental virus-host infection networks.
Collapse
Affiliation(s)
- Kathryn M. Kauffman
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02141 USA
| | - Julia M. Brown
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY 10461 USA
| | - Radhey S. Sharma
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02141 USA
| | - David VanInsberghe
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02141 USA
| | - Joseph Elsherbini
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02141 USA
| | - Martin Polz
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02141 USA
| | - Libusha Kelly
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY 10461 USA
| |
Collapse
|
17
|
Mihara T, Koyano H, Hingamp P, Grimsley N, Goto S, Ogata H. Taxon Richness of "Megaviridae" Exceeds those of Bacteria and Archaea in the Ocean. Microbes Environ 2018; 33:162-171. [PMID: 29806626 PMCID: PMC6031395 DOI: 10.1264/jsme2.me17203] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Since the discovery of the giant mimivirus, evolutionarily related viruses have been isolated or identified from various environments. Phylogenetic analyses of this group of viruses, tentatively referred to as the family “Megaviridae”, suggest that it has an ancient origin that may predate the emergence of major eukaryotic lineages. Environmental genomics has since revealed that Megaviridae represents one of the most abundant and diverse groups of viruses in the ocean. In the present study, we compared the taxon richness and phylogenetic diversity of Megaviridae, Bacteria, and Archaea using DNA-dependent RNA polymerase as a common marker gene. By leveraging existing microbial metagenomic data, we found higher richness and phylogenetic diversity in this single viral family than in the two prokaryotic domains. We also obtained results showing that the evolutionary rate alone cannot account for the observed high diversity of Megaviridae lineages. These results suggest that the Megaviridae family has a deep co-evolutionary history with diverse marine protists since the early “Big-Bang” radiation of the eukaryotic tree of life.
Collapse
Affiliation(s)
- Tomoko Mihara
- Bioinformatics Center, Institute for Chemical Research, Kyoto University
| | - Hitoshi Koyano
- School of Life Science and Technology, Laboratory of Genome Informatics, Tokyo Institute of Technology
| | | | - Nigel Grimsley
- Integrative Marine Biology Laboratory (BIOM), CNRS UMR7232, Sorbonne Universities
| | - Susumu Goto
- Database Center for Life Science, Joint-Support Center for Data Science Research, Research Organization of Information and Systems
| | - Hiroyuki Ogata
- Bioinformatics Center, Institute for Chemical Research, Kyoto University
| |
Collapse
|
18
|
Zaidi SSA, Zhang X. Computational operon prediction in whole-genomes and metagenomes. Brief Funct Genomics 2018; 16:181-193. [PMID: 27659221 DOI: 10.1093/bfgp/elw034] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Microbial diversity in unique environmental settings enables abrupt responses catalysed by altering the gene regulation and formation of gene clusters called operons. Operons increases bacterial adaptability, which in turn increases their survival. This review article presents the emergence of computational operon prediction methods for whole microbial genomes and metagenomes, and discusses their strengths and limitations. Most of the whole-genome operon prediction methods struggle to generalize on unrelated genomes. The applicability of universal whole-genome operon prediction methods to metagenomic data is an interesting yet less investigated question. We have evaluated the potential of various operon prediction features for genomic and metagenomic data. Most of operon prediction methods with high accuracy have been compiled into databases. Despite of the high predictive performance, the data among many databases are not completely consistent for similar species. We performed a correlation analysis between the computationally predicted operon databases and experimentally validated data for Escherichia coli, Bacillus subtilis and Mycobacterium tuberculosis. Operon prediction for most of the less characterized microbes cannot be verified due to absence of experimentally validated operons. The generation of validated information for other microbes would test the authenticity of operon databases for other less annotated microbes as well. Advances in sequencing technologies and development of better analysis methods will help researchers to overcome the technological hurdles (such as long sequencing reads and improved contig size) and further improve operon predictions and better utilize operonic information.
Collapse
|
19
|
Jaenicke S, Albaum SP, Blumenkamp P, Linke B, Stoye J, Goesmann A. Flexible metagenome analysis using the MGX framework. MICROBIOME 2018; 6:76. [PMID: 29690922 PMCID: PMC5937802 DOI: 10.1186/s40168-018-0460-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2017] [Accepted: 04/11/2018] [Indexed: 06/08/2023]
Abstract
BACKGROUND The characterization of microbial communities based on sequencing and analysis of their genetic information has become a popular approach also referred to as metagenomics; in particular, the recent advances in sequencing technologies have enabled researchers to study even the most complex communities. Metagenome analysis, the assignment of sequences to taxonomic and functional entities, however, remains a tedious task: large amounts of data need to be processed. There are a number of approaches addressing particular aspects, but scientific questions are often too specific to be answered by a general-purpose method. RESULTS We present MGX, a flexible and extensible client/server-framework for the management and analysis of metagenomic datasets; MGX features a comprehensive set of adaptable workflows required for taxonomic and functional metagenome analysis, combined with an intuitive and easy-to-use graphical user interface offering customizable result visualizations. At the same time, MGX allows to include own data sources and devise custom analysis pipelines, thus enabling researchers to perform basic as well as highly specific analyses within a single application. CONCLUSIONS With MGX, we provide a novel metagenome analysis platform giving researchers access to the most recent analysis tools. MGX covers taxonomic and functional metagenome analysis, statistical evaluation, and a wide range of visualizations easing data interpretation. Its default taxonomic classification pipeline provides equivalent or superior results in comparison to existing tools.
Collapse
Affiliation(s)
- Sebastian Jaenicke
- Bioinformatics and Systems Biology, Justus-Liebig-University, Gießen, Germany
- Genome Informatics, Faculty of Technology and Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, Germany
| | - Stefan P. Albaum
- Bioinformatics Resource Facility, Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, Germany
| | - Patrick Blumenkamp
- Bioinformatics and Systems Biology, Justus-Liebig-University, Gießen, Germany
| | - Burkhard Linke
- Bioinformatics and Systems Biology, Justus-Liebig-University, Gießen, Germany
| | - Jens Stoye
- Genome Informatics, Faculty of Technology and Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, Germany
| | - Alexander Goesmann
- Bioinformatics and Systems Biology, Justus-Liebig-University, Gießen, Germany
| |
Collapse
|
20
|
Fonseca EL, Andrade BGN, Vicente ACP. The Resistome of Low-Impacted Marine Environments Is Composed by Distant Metallo-β-Lactamases Homologs. Front Microbiol 2018; 9:677. [PMID: 29675014 PMCID: PMC5895761 DOI: 10.3389/fmicb.2018.00677] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 03/22/2018] [Indexed: 01/20/2023] Open
Abstract
The worldwide dispersion and sudden emergence of new antibiotic resistance genes (ARGs) determined the need in uncovering which environment participate most as their source and reservoir. ARGs closely related to those currently found in human pathogens occur in the resistome of anthropogenic impacted environments. However, the role of pristine environment as the origin and source of ARGs remains underexplored and controversy, particularly, the marine environments represented by the oceans. Here, due to the ocean nature, we hypothesized that the resistome of this pristine/low-impacted marine environment is represented by distant ARG homologs. To test this hypothesis we performed an in silico analysis on the Global Ocean Sampling (GOS) metagenomic project dataset focusing on the metallo-β-lactamases (MβLs) as the ARG model. MβLs have been a challenge to public health, since they hydrolyze the carbapenems, one of the last therapeutic choice in clinics. Using Hidden Markov Model (HMM) profiles, we were successful in identifying a high diversity of distant MβL homologs, related to the B1, B2, and B3 subclasses. The majority of them were distributed across the Atlantic, Indian, and Pacific Oceans being related to the chromosomally encoded MβL GOB present in Elizabethkingia genus. It was observed only a reduced number of metagenomic sequence homologs related to the acquired MβL enzymes (VIM, SPM-1, and AIM-1) that currently have impact in clinics. Therefore, low antibiotic impacted marine environment, as the ocean, are unlikely the source of ARGs that have been causing enormous threat to the public health.
Collapse
Affiliation(s)
- Erica L Fonseca
- Laboratório de Genética Molecular de Microrganismos, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Bruno G N Andrade
- Laboratório de Genética Molecular de Microrganismos, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Ana C P Vicente
- Laboratório de Genética Molecular de Microrganismos, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| |
Collapse
|
21
|
Bannerman BP, Kramer S, Dorrell RG, Carrington M. Multispecies reconstructions uncover widespread conservation, and lineage-specific elaborations in eukaryotic mRNA metabolism. PLoS One 2018; 13:e0192633. [PMID: 29561870 PMCID: PMC5862402 DOI: 10.1371/journal.pone.0192633] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Accepted: 01/26/2018] [Indexed: 12/01/2022] Open
Abstract
The degree of conservation and evolution of cytoplasmic mRNA metabolism pathways across the eukaryotes remains incompletely resolved. In this study, we describe a comprehensive genome and transcriptome-wide analysis of proteins involved in mRNA maturation, translation, and mRNA decay across representative organisms from the six eukaryotic super-groups. We demonstrate that eukaryotes share common pathways for mRNA metabolism that were almost certainly present in the last eukaryotic common ancestor, and show for the first time a correlation between intron density and a selective absence of some Exon Junction Complex (EJC) components in eukaryotes. In addition, we identify pathways that have diversified in individual lineages, with a specific focus on the unique gene gains and losses in members of the Excavata and SAR groups that contribute to their unique gene expression pathways compared to other organisms.
Collapse
Affiliation(s)
| | - Susanne Kramer
- Biozentrum, Lehrstuhl für Zell-und Entwicklungsbiologie, Universität Würzburg, Am Hubland, Würzburg, Germany
| | - Richard G. Dorrell
- Institute of Biology, École Normale Supérieure, PSL Research University, Paris, France
| | - Mark Carrington
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
22
|
Eyice Ö, Myronova N, Pol A, Carrión O, Todd JD, Smith TJ, Gurman SJ, Cuthbertson A, Mazard S, Mennink-Kersten MA, Bugg TD, Andersson KK, Johnston AW, Op den Camp HJ, Schäfer H. Bacterial SBP56 identified as a Cu-dependent methanethiol oxidase widely distributed in the biosphere. ISME JOURNAL 2017; 12:145-160. [PMID: 29064480 PMCID: PMC5739008 DOI: 10.1038/ismej.2017.148] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2017] [Revised: 06/07/2017] [Accepted: 07/27/2017] [Indexed: 12/05/2022]
Abstract
Oxidation of methanethiol (MT) is a significant step in the sulfur cycle. MT is an intermediate of metabolism of globally significant organosulfur compounds including dimethylsulfoniopropionate (DMSP) and dimethylsulfide (DMS), which have key roles in marine carbon and sulfur cycling. In aerobic bacteria, MT is degraded by a MT oxidase (MTO). The enzymatic and genetic basis of MT oxidation have remained poorly characterized. Here, we identify for the first time the MTO enzyme and its encoding gene (mtoX) in the DMS-degrading bacterium Hyphomicrobium sp. VS. We show that MTO is a homotetrameric metalloenzyme that requires Cu for enzyme activity. MTO is predicted to be a soluble periplasmic enzyme and a member of a distinct clade of the Selenium-binding protein (SBP56) family for which no function has been reported. Genes orthologous to mtoX exist in many bacteria able to degrade DMS, other one-carbon compounds or DMSP, notably in the marine model organism Ruegeria pomeroyi DSS-3, a member of the Rhodobacteraceae family that is abundant in marine environments. Marker exchange mutagenesis of mtoX disrupted the ability of R. pomeroyi to metabolize MT confirming its function in this DMSP-degrading bacterium. In R. pomeroyi, transcription of mtoX was enhanced by DMSP, methylmercaptopropionate and MT. Rates of MT degradation increased after pre-incubation of the wild-type strain with MT. The detection of mtoX orthologs in diverse bacteria, environmental samples and its abundance in a range of metagenomic data sets point to this enzyme being widely distributed in the environment and having a key role in global sulfur cycling.
Collapse
Affiliation(s)
- Özge Eyice
- School of Life Sciences, University of Warwick, Coventry, UK.,School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| | | | - Arjan Pol
- Department of Microbiology, Institute for Water and Wetland Research, Faculty of Science, Radboud University, Nijmegen, The Netherlands
| | - Ornella Carrión
- School of Biological Sciences, University of East Anglia, Norwich, UK
| | - Jonathan D Todd
- School of Biological Sciences, University of East Anglia, Norwich, UK
| | - Tom J Smith
- Department of Biosciences and Chemistry, Sheffield Hallam University, Sheffield, UK
| | - Stephen J Gurman
- Department of Physics and Astronomy, University of Leicester, Leicester, UK
| | | | - Sophie Mazard
- School of Life Sciences, University of Warwick, Coventry, UK
| | - Monique Ash Mennink-Kersten
- Department of Microbiology, Institute for Water and Wetland Research, Faculty of Science, Radboud University, Nijmegen, The Netherlands
| | - Timothy Dh Bugg
- Department of Chemistry, University of Warwick, Coventry, UK
| | | | | | - Huub Jm Op den Camp
- Department of Microbiology, Institute for Water and Wetland Research, Faculty of Science, Radboud University, Nijmegen, The Netherlands
| | - Hendrik Schäfer
- School of Life Sciences, University of Warwick, Coventry, UK
| |
Collapse
|
23
|
Lewin A, Zhou J, Pham VTT, Haugen T, Zeiny ME, Aarstad O, Liebl W, Wentzel A, Liles MR. Novel archaeal thermostable cellulases from an oil reservoir metagenome. AMB Express 2017; 7:183. [PMID: 28963711 PMCID: PMC5622026 DOI: 10.1186/s13568-017-0485-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Accepted: 09/20/2017] [Indexed: 01/24/2023] Open
Abstract
Microbial assemblages were sampled from an offshore deep sub-surface petroleum reservoir 2.5 km below the ocean floor off the coast of Norway, providing conditions of high temperature and pressure, to identify new thermostable enzymes. In this study, we used DNA sequences obtained directly from the sample metagenome and from a derived fosmid library to survey the functional diversity of this extreme habitat. The metagenomic fosmid library containing 11,520 clones was screened using function- and sequence-based methods to identify recombinant clones expressing carbohydrate-degrading enzymes. Open reading frames (ORFs) encoding carbohydrate-degrading enzymes were predicted by BLAST against the CAZy database, and many fosmid clones expressing carbohydrate-degrading activities were discovered by functional screening using Escherichia coli as a heterologous host. Each complete ORF predicted to encode a cellulase identified from sequence- or function-based screening was subcloned in an expression vector. Five subclones was found to have significant activity using a fluorescent cellulose model substrate, and three of these were observed to be highly thermostable. Based on phylogenetic analyses, the thermostable cellulases were derived from thermophilic Archaea and are distinct from known cellulases. Cellulase F1, obtained from function-based screening, contains two distinct cellulase modules, perhaps resulting from fusion of two archaeal cellulases and with a novel protein structure that may result in enhanced activity and thermostability. This enzyme was found to exhibit exocellulase function and to have a remarkably high activity compared to commercially available enzymes. Results from this study highlight the complementarity of hybrid approaches for enzyme discovery, combining sequence- and function-based screening.
Collapse
|
24
|
Moreno PS, Wagner J, Mansfield CS, Stevens M, Gilkerson JR, Kirkwood CD. Characterisation of the canine faecal virome in healthy dogs and dogs with acute diarrhoea using shotgun metagenomics. PLoS One 2017; 12:e0178433. [PMID: 28570584 PMCID: PMC5453527 DOI: 10.1371/journal.pone.0178433] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 05/12/2017] [Indexed: 01/01/2023] Open
Abstract
The virome has been increasingly investigated in numerous animal species and in different sites of the body, facilitating the identification and discovery of a variety of viruses. In spite of this, the faecal virome of healthy dogs has not been investigated. In this study we describe the faecal virome of healthy dogs and dogs with acute diarrhoea in Australia, using a shotgun metagenomic approach. Viral sequences from a range of different virus families, including both RNA and DNA families, and known pathogens implicated in enteric disease were documented. Twelve viral families were identified, of which four were bacteriophages. Eight eukaryotic viral families were detected: Astroviridae, Coronaviridae, Reoviridae, Picornaviridae, Caliciviridae, Parvoviridae, Adenoviridae and Papillomaviridae. Families Astroviridae, Picornaviridae and Caliciviridae were found only in dogs with acute diarrhoea, with Astroviridae being the most common family identified in this group. Due to its prevalence, characterisation the complete genome of a canine astrovirus was performed. These studies indicate that metagenomic analyses are useful for the investigation of viral populations in the faeces of dogs. Further studies to elucidate the epidemiological and biological relevance of these findings are warranted.
Collapse
Affiliation(s)
- Paloma S. Moreno
- Enteric Viruses Group, Murdoch Childrens Research Institute, Melbourne, Victoria, Australia
- Translational Research and Animal Clinical Trial Study (TRACTS) group, Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Melbourne, Victoria, Australia
- * E-mail:
| | - Josef Wagner
- Enteric Viruses Group, Murdoch Childrens Research Institute, Melbourne, Victoria, Australia
- Department of Paediatrics, The University of Melbourne, Melbourne, Victoria, Australia
| | - Caroline S. Mansfield
- Translational Research and Animal Clinical Trial Study (TRACTS) group, Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Melbourne, Victoria, Australia
| | - Matthew Stevens
- Australian Genome Research Facility, Melbourne, Victoria, Australia
| | - James R. Gilkerson
- Centre for Equine Infectious Diseases, Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Melbourne, Victoria, Australia
| | - Carl D. Kirkwood
- Enteric Viruses Group, Murdoch Childrens Research Institute, Melbourne, Victoria, Australia
- Department of Paediatrics, The University of Melbourne, Melbourne, Victoria, Australia
| |
Collapse
|
25
|
Bonifay V, Wawrik B, Sunner J, Snodgrass EC, Aydin E, Duncan KE, Callaghan AV, Oldham A, Liengen T, Beech I. Metabolomic and Metagenomic Analysis of Two Crude Oil Production Pipelines Experiencing Differential Rates of Corrosion. Front Microbiol 2017; 8:99. [PMID: 28197141 PMCID: PMC5281625 DOI: 10.3389/fmicb.2017.00099] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Accepted: 01/13/2017] [Indexed: 01/06/2023] Open
Abstract
Corrosion processes in two North Sea oil production pipelines were studied by analyzing pig envelope samples via metagenomic and metabolomic techniques. Both production systems have similar physico-chemical properties and injection waters are treated with nitrate, but one pipeline experiences severe corrosion and the other does not. Early and late pigging material was collected to gain insight into the potential causes for differential corrosion rates. Metabolites were extracted and analyzed via ultra-high performance liquid chromatography/high-resolution mass spectrometry with electrospray ionization (ESI) in both positive and negative ion modes. Metabolites were analyzed by comparison with standards indicative of aerobic and anaerobic hydrocarbon metabolism and by comparison to predicted masses for KEGG metabolites. Microbial community structure was analyzed via 16S rRNA gene qPCR, sequencing of 16S PCR products, and MySeq Illumina shotgun sequencing of community DNA. Metagenomic data were used to reconstruct the full length 16S rRNA genes and genomes of dominant microorganisms. Sequence data were also interrogated via KEGG annotation and for the presence of genes related to terminal electron accepting (TEA) processes as well as aerobic and anaerobic hydrocarbon degradation. Significant and distinct differences were observed when comparing the ‘high corrosion’ (HC) and the ‘low corrosion’ (LC) pipeline systems, especially with respect to the TEA utilization potential. The HC samples were dominated by sulfate-reducing bacteria (SRB) and archaea known for their ability to utilize simple carbon substrates, whereas LC samples were dominated by pseudomonads with the genetic potential for denitrification and aerobic hydrocarbon degradation. The frequency of aerobic hydrocarbon degradation genes was low in the HC system, and anaerobic hydrocarbon degradation genes were not detected in either pipeline. This is in contrast with metabolite analysis, which demonstrated the presence of several succinic acids in HC samples that are diagnostic of anaerobic hydrocarbon metabolism. Identifiable aerobic metabolites were confined to the LC samples, consistent with the metagenomic data. Overall, these data suggest that corrosion management might benefit from a more refined understanding of microbial community resilience in the face of disturbances such as nitrate treatment or pigging, which frequently prove insufficient to alter community structure toward a stable, less-corrosive assemblage.
Collapse
Affiliation(s)
- Vincent Bonifay
- Department of Microbiology and Plant Biology, University of Oklahoma, Norman OK, USA
| | - Boris Wawrik
- Department of Microbiology and Plant Biology, University of Oklahoma, Norman OK, USA
| | - Jan Sunner
- Department of Microbiology and Plant Biology, University of Oklahoma, NormanOK, USA; Institute for Energy and the Environment, University of Oklahoma, NormanOK, USA
| | - Emily C Snodgrass
- Department of Microbiology and Plant Biology, University of Oklahoma, Norman OK, USA
| | - Egemen Aydin
- Department of Microbiology and Plant Biology, University of Oklahoma, Norman OK, USA
| | - Kathleen E Duncan
- Department of Microbiology and Plant Biology, University of Oklahoma, Norman OK, USA
| | - Amy V Callaghan
- Department of Microbiology and Plant Biology, University of Oklahoma, Norman OK, USA
| | - Athenia Oldham
- Department of Biology, University of Texas of the Permian Basin, Odessa TX, USA
| | - Turid Liengen
- Research Centre Porsgrunn, Statoil ASA, Herøya Industripark Porsgrunn, Norway
| | - Iwona Beech
- Department of Microbiology and Plant Biology, University of Oklahoma, NormanOK, USA; Institute for Energy and the Environment, University of Oklahoma, NormanOK, USA
| |
Collapse
|
26
|
Coloma SE, Dienstbier A, Bamford DH, Sivonen K, Roine E, Hiltunen T. Newly isolatedNodulariaphage influences cyanobacterial community dynamics. Environ Microbiol 2017; 19:273-286. [DOI: 10.1111/1462-2920.13601] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Revised: 10/25/2016] [Accepted: 11/16/2016] [Indexed: 12/18/2022]
Affiliation(s)
- S. E. Coloma
- Department of Food and Environmental SciencesViikinkaari 9, 00014 University of HelsinkiHelsinki Finland
| | - A. Dienstbier
- Department of BiosciencesViikinkaari 9, 00014 University of HelsinkiHelsinki Finland
| | - D. H. Bamford
- Department of BiosciencesViikinkaari 9, 00014 University of HelsinkiHelsinki Finland
- Institute of BiotechnologyViikinkaari 9, 00014 University of HelsinkiHelsinki Finland
| | - K. Sivonen
- Department of Food and Environmental SciencesViikinkaari 9, 00014 University of HelsinkiHelsinki Finland
| | - E. Roine
- Department of BiosciencesViikinkaari 9, 00014 University of HelsinkiHelsinki Finland
- Institute of BiotechnologyViikinkaari 9, 00014 University of HelsinkiHelsinki Finland
| | - T. Hiltunen
- Department of Food and Environmental SciencesViikinkaari 9, 00014 University of HelsinkiHelsinki Finland
| |
Collapse
|
27
|
Blomström AL, Fossum C, Wallgren P, Berg M. Viral Metagenomic Analysis Displays the Co-Infection Situation in Healthy and PMWS Affected Pigs. PLoS One 2016; 11:e0166863. [PMID: 27907010 PMCID: PMC5131951 DOI: 10.1371/journal.pone.0166863] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Accepted: 11/05/2016] [Indexed: 11/18/2022] Open
Abstract
The development of high-throughput sequencing technologies have allowed the possibility to investigate and characterise the entire microbiome of individuals, providing better insight to the complex interaction between different microorganisms. This will help to understand how the microbiome influence the susceptibility of secondary agents and development of disease. We have applied viral metagenomics to investigate the virome of lymph nodes from Swedish pigs suffering from the multifactorial disease postweaning multisystemic wasting syndrome (PMWS) as well as from healthy pigs. The aim is to increase knowledge of potential viruses, apart from porcine circovirus type 2 (PCV2), involved in PMWS development as well as to increase knowledge on the virome of healthy individuals. In healthy individuals, a diverse viral flora was seen with several different viruses present simultaneously. The majority of the identified viruses were small linear and circular DNA viruses, such as different circoviruses, anelloviruses and bocaviruses. In the pigs suffering from PMWS, PCV2 sequences were, as expected, detected to a high extent but other viruses were also identified in the background of PCV2. Apart from DNA viruses also RNA viruses were identified, among them were a porcine pestivirus showing high similarity to a recently (in 2015) discovered atypical porcine pestivirus in the US. Majority of the viruses identified in the background of PCV2 in PMWS pigs could also be identified in the healthy pigs. PCV2 sequences were also identified in the healthy pigs but to a much lower extent than in PMWS affected pigs. Although the method used here is not quantitative the very clear difference in amount of PCV2 sequences in PMWS affected pigs and healthy pigs most likely reflect the very strong replication of PCV2 known to be a hallmark of PMWS. Taken together, these findings illustrate that pigs appear to have a considerable viral flora consisting to a large extent of small single-stranded and circular DNA viruses. Future research on these types of viruses will help to better understand the role that these ubiquitous viruses may have on health and disease of pigs. We also demonstrate for the first time, in Europe, the presence of a novel porcine pestivirus.
Collapse
Affiliation(s)
- Anne-Lie Blomström
- Department of Biomedical Sciences and Veterinary Public Health, Section of Virology, Swedish University of Agricultural Sciences, Uppsala, Sweden
- * E-mail:
| | - Caroline Fossum
- Department of Biomedical Sciences and Veterinary Public Health, Section of Immunology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Per Wallgren
- National Veterinary Institute (SVA), Uppsala, Sweden
| | - Mikael Berg
- Department of Biomedical Sciences and Veterinary Public Health, Section of Virology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| |
Collapse
|
28
|
Fróes AM, da Mota FF, Cuadrat RRC, Dávila AMR. Distribution and Classification of Serine β-Lactamases in Brazilian Hospital Sewage and Other Environmental Metagenomes Deposited in Public Databases. Front Microbiol 2016; 7:1790. [PMID: 27895627 PMCID: PMC5108929 DOI: 10.3389/fmicb.2016.01790] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2016] [Accepted: 10/25/2016] [Indexed: 01/09/2023] Open
Abstract
β-lactam is the most used antibiotic class in the clinical area and it acts on blocking the bacteria cell wall synthesis, causing cell death. However, some bacteria have evolved resistance to these antibiotics mainly due the production of enzymes known as β-lactamases. Hospital sewage is an important source of dispersion of multidrug-resistant bacteria in rivers and oceans. In this work, we used next-generation DNA sequencing to explore the diversity and dissemination of serine β-lactamases in two hospital sewage from Rio de Janeiro, Brazil (South Zone, SZ and North Zone, NZ), presenting different profiles, and to compare them with public environmental data available. Also, we propose a Hidden-Markov-Model approach to screen potential serine β-lactamases genes (in public environments samples and generated hospital sewage data), exploring its evolutionary relationships. Due to the high variability in β-lactamases, we used a position-specific scoring matrix search method (RPS-BLAST) against conserved domain database profiles (CDD, Pfam, and COG) followed by visual inspection to detect conserved motifs, to increase the reliability of the results and remove possible false positives. We were able to identify novel β-lactamases from Brazilian hospital sewage and to estimate relative abundance of its types. The highest relative abundance found in SZ was the Class A (50%), while Class D is predominant in NZ (55%). CfxA (65%) and ACC (47%) types were the most abundant genes detected in SZ, while in NZ the most frequent were OXA-10 (32%), CfxA (28%), ACC (21%), CEPA (20%), and FOX (19%). Phylogenetic analysis revealed β-lactamases from Brazilian hospital sewage grouped in the same clade and close to sequences belonging to Firmicutes and Bacteroidetes groups, but distant from potential β-lactamases screened from public environmental data, that grouped closer to β-lactamases of Proteobacteria. Our results demonstrated that HMM-based approach identified homologs of serine β-lactamases, indicating the specificity and high sensitivity of this approach in large datasets, contributing for the identification and classification of a large number of homologous genes, comprising possible new ones. Phylogenetic analysis revealed the potential reservoir of β-lactam resistance genes in the environment, contributing to understanding the evolution and dissemination of these genes.
Collapse
Affiliation(s)
- Adriana M Fróes
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, FIOCRUZ Rio de Janeiro, Brazil
| | - Fábio F da Mota
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, FIOCRUZ Rio de Janeiro, Brazil
| | - Rafael R C Cuadrat
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, FIOCRUZ Rio de Janeiro, Brazil
| | - Alberto M R Dávila
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, FIOCRUZ Rio de Janeiro, Brazil
| |
Collapse
|
29
|
Blomström AL, Lalander C, Komakech AJ, Vinnerås B, Boqvist S. A metagenomic analysis displays the diverse microbial community of a vermicomposting system in Uganda. Infect Ecol Epidemiol 2016; 6:32453. [PMID: 27834174 PMCID: PMC5103666 DOI: 10.3402/iee.v6.32453] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Revised: 10/17/2016] [Accepted: 10/18/2016] [Indexed: 11/14/2022] Open
Abstract
Background Vermicomposting is a mesophilic process using earthworms to efficiently and at low cost process large volumes of organic waste. It has been suggested to not only increase soil fertility but also increase biomass of beneficial bacteria while reducing harmful bacteria. The aim of this study was to set up a strategy to investigate and characterise the viral as well as the bacterial composition of a vermicomposting system. Material and methods The vermicomposting unit used in this study was placed at the Makerere University Agricultural Research Institute Kabanyolo on the outskirts of Kampala, Uganda, and was fed with 80% cattle manure and 20% food waste. On Day 172, the compost was terminated and compost samples were collected from three layers of the unit: the top, the middle and the bottom layer. A metagenomic approach was then applied to characterise the viral and bacterial composition of the vermicomposting system. Results and discussion A high abundance and diversity of bacteria were identified. Proteobacteria was the largest phyla in the compost (mainly Alpha-, Gamma- and Betaproteobacteria), constituting almost 65% of the bacterial reads in the data sets. DNA samples from several possible pathogenic bacteria, such as Salmonella spp., Escherichia coli, Enterobacter spp., Enterococcus spp. and Clostridium spp, were detected in the vermicompost, suggesting that there might still be harmful bacteria in the vermicast. Phages constituted the main viral group; apart from phages, mainly insect viruses were identified. The only animal or human virus identified was kobuvirus. In summary, metagenomic analysis was shown to be an efficient technology to characterise the microbial composition of vermicast. The data from this study contribute to a better understanding of the microbes present in this kind of composting system and can help determine measures necessary for safe manure handling.
Collapse
Affiliation(s)
- Anne-Lie Blomström
- Section of Virology, Department of Biomedical Sciences and Veterinary Public Health, Swedish University of Agricultural Sciences, Uppsala, Sweden;
| | - Cecilia Lalander
- Department of Energy and Technology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Allan John Komakech
- Department of Energy and Technology, Swedish University of Agricultural Sciences, Uppsala, Sweden.,Department of Agricultural and Bio-systems Engineering, Makerere University, Kampala, Uganda
| | - Björn Vinnerås
- Department of Energy and Technology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Sofia Boqvist
- Department of Biomedical Sciences and Veterinary Public Health, Swedish University of Agricultural Sciences, Uppsala, Sweden
| |
Collapse
|
30
|
Søborg DA, Hendriksen NB, Kilian M, Christensen JH, Kroer N. Bacterial Human Virulence Genes across Diverse Habitats As Assessed by In silico Analysis of Environmental Metagenomes. Front Microbiol 2016; 7:1712. [PMID: 27857707 PMCID: PMC5093120 DOI: 10.3389/fmicb.2016.01712] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2016] [Accepted: 10/12/2016] [Indexed: 01/18/2023] Open
Abstract
The occurrence and distribution of clinically relevant bacterial virulence genes across natural (non-human) environments is not well understood. We aimed to investigate the occurrence of homologs to bacterial human virulence genes in a variety of ecological niches to better understand the role of natural environments in the evolution of bacterial virulence. Twenty four bacterial virulence genes were analyzed in 46 diverse environmental metagenomic datasets, representing various soils, seawater, freshwater, marine sediments, hot springs, the deep-sea, hypersaline mats, microbialites, gutless worms and glacial ice. Homologs to 16 bacterial human virulence genes, involved in urinary tract infections, gastrointestinal diseases, skin diseases, and wound and systemic infections, showed global ubiquity. A principal component analysis did not demonstrate clear trends across the metagenomes with respect to occurrence and frequency of observed gene homologs. Full-length (>95%) homologs of several virulence genes were identified, and translated sequences of the environmental and clinical genes were up to 50-100% identical. Furthermore, phylogenetic analyses indicated deep branching positions of some of the environmental gene homologs, suggesting that they represent ancient lineages in the phylogeny of the clinical genes. Fifteen virulence gene homologs were detected in metatranscriptomes, providing evidence of environmental expression. The ubiquitous presence and transcription of the virulence gene homologs in non-human environments point to an important ecological role of the genes for the activity and survival of environmental bacteria. Furthermore, the high degree of sequence conservation between several of the environmental and clinical genes suggests common ancestral origins.
Collapse
Affiliation(s)
- Ditte A Søborg
- Department of Environmental Science, Aarhus UniversityRoskilde, Denmark; Research Group for Energy and Environment, VIA University CollegeHorsens, Denmark
| | - Niels B Hendriksen
- Department of Environmental Science, Aarhus University Roskilde, Denmark
| | - Mogens Kilian
- Department of Biomedicine, Aarhus University Aarhus, Denmark
| | - Jan H Christensen
- Department of Plant and Environmental Sciences, University of Copenhagen Frederiksberg, Denmark
| | - Niels Kroer
- Department of Biology, University of Copenhagen Copenhagen, Denmark
| |
Collapse
|
31
|
Carlos C, Pereira LB, Ottoboni LMM. Comparative genomics of Paracoccus sp. SM22M-07 isolated from coral mucus: insights into bacteria-host interactions. Curr Genet 2016; 63:509-518. [PMID: 27796486 DOI: 10.1007/s00294-016-0658-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2016] [Accepted: 10/17/2016] [Indexed: 10/20/2022]
Abstract
One of the main goals of coral microbiology is to understand the ways in which coral-bacteria associations are established and maintained. This work describes the sequencing of the genome of Paracoccus sp. SM22M-07 isolated from the mucus of the endemic Brazilian coral species Mussismilia hispida. Comparative analysis was used to identify unique genomic features of SM22M-07 that might be involved in its adaptation to the marine ecosystem and the nutrient-rich environment provided by coral mucus, as well as in the establishment and strengthening of the interaction with the host. These features included genes related to the type IV protein secretion system, erythritol catabolism, and succinoglycan biosynthesis. We experimentally confirmed the production of succinoglycan by Paracoccus sp. SM22M-07 and we hypothesize that it may be involved in the association of the bacterium with coral surfaces.
Collapse
Affiliation(s)
- Camila Carlos
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, 53703, USA.
| | - Letícia Bianca Pereira
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas (UNICAMP), C. P. 6010, Campinas, SP, 13083-875, Brazil
| | - Laura Maria Mariscal Ottoboni
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas (UNICAMP), C. P. 6010, Campinas, SP, 13083-875, Brazil
| |
Collapse
|
32
|
Direct 16S rRNA-seq from bacterial communities: a PCR-independent approach to simultaneously assess microbial diversity and functional activity potential of each taxon. Sci Rep 2016; 6:32165. [PMID: 27577787 PMCID: PMC5006002 DOI: 10.1038/srep32165] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Accepted: 07/28/2016] [Indexed: 12/04/2022] Open
Abstract
The analysis of environmental microbial communities has largely relied on a PCR-dependent amplification of genes entailing species identity as 16S rRNA. This approach is susceptible to biases depending on the level of primer matching in different species. Moreover, possible yet-to-discover taxa whose rRNA could differ enough from known ones would not be revealed. DNA-based methods moreover do not provide information on the actual physiological relevance of each taxon within an environment and are affected by the variable number of rRNA operons in different genomes. To overcome these drawbacks we propose an approach of direct sequencing of 16S ribosomal RNA without any primer- or PCR-dependent step. The method was tested on a microbial community developing in an anammox bioreactor sampled at different time-points. A conventional PCR-based amplicon pyrosequencing was run in parallel. The community resulting from direct rRNA sequencing was highly consistent with the known biochemical processes operative in the reactor. As direct rRNA-seq is based not only on taxon abundance but also on physiological activity, no comparison between its results and those from PCR-based approaches can be applied. The novel principle is in this respect proposed not as an alternative but rather as a complementary methodology in microbial community studies.
Collapse
|
33
|
Laffy PW, Wood-Charlson EM, Turaev D, Weynberg KD, Botté ES, van Oppen MJH, Webster NS, Rattei T. HoloVir: A Workflow for Investigating the Diversity and Function of Viruses in Invertebrate Holobionts. Front Microbiol 2016; 7:822. [PMID: 27375564 PMCID: PMC4899465 DOI: 10.3389/fmicb.2016.00822] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 05/16/2016] [Indexed: 11/13/2022] Open
Abstract
Abundant bioinformatics resources are available for the study of complex microbial metagenomes, however their utility in viral metagenomics is limited. HoloVir is a robust and flexible data analysis pipeline that provides an optimized and validated workflow for taxonomic and functional characterization of viral metagenomes derived from invertebrate holobionts. Simulated viral metagenomes comprising varying levels of viral diversity and abundance were used to determine the optimal assembly and gene prediction strategy, and multiple sequence assembly methods and gene prediction tools were tested in order to optimize our analysis workflow. HoloVir performs pairwise comparisons of single read and predicted gene datasets against the viral RefSeq database to assign taxonomy and additional comparison to phage-specific and cellular markers is undertaken to support the taxonomic assignments and identify potential cellular contamination. Broad functional classification of the predicted genes is provided by assignment of COG microbial functional category classifications using EggNOG and higher resolution functional analysis is achieved by searching for enrichment of specific Swiss-Prot keywords within the viral metagenome. Application of HoloVir to viral metagenomes from the coral Pocillopora damicornis and the sponge Rhopaloeides odorabile demonstrated that HoloVir provides a valuable tool to characterize holobiont viral communities across species, environments, or experiments.
Collapse
Affiliation(s)
- Patrick W. Laffy
- Australian Institute of Marine ScienceTownsville, QLD, Australia
| | - Elisha M. Wood-Charlson
- Center for Microbial Oceanography: Research and Education, University of Hawai‘i at MānoaHonolulu, HI, USA
| | - Dmitrij Turaev
- Division of Computational Systems Biology, Department of Microbiology and Ecosystem Science, University of ViennaVienna, Austria
| | | | | | - Madeleine J. H. van Oppen
- Australian Institute of Marine ScienceTownsville, QLD, Australia
- School of Biosciences, University of MelbourneMelbourne, VIC, Australia
| | | | - Thomas Rattei
- Division of Computational Systems Biology, Department of Microbiology and Ecosystem Science, University of ViennaVienna, Austria
| |
Collapse
|
34
|
Fondi M, Karkman A, Tamminen MV, Bosi E, Virta M, Fani R, Alm E, McInerney JO. "Every Gene Is Everywhere but the Environment Selects": Global Geolocalization of Gene Sharing in Environmental Samples through Network Analysis. Genome Biol Evol 2016; 8:1388-400. [PMID: 27190206 PMCID: PMC4898794 DOI: 10.1093/gbe/evw077] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The spatial distribution of microbes on our planet is famously formulated in the Baas Becking hypothesis as “everything is everywhere but the environment selects.” While this hypothesis does not strictly rule out patterns caused by geographical effects on ecology and historical founder effects, it does propose that the remarkable dispersal potential of microbes leads to distributions generally shaped by environmental factors rather than geographical distance. By constructing sequence similarity networks from uncultured environmental samples, we show that microbial gene pool distributions are not influenced nearly as much by geography as ecology, thus extending the Bass Becking hypothesis from whole organisms to microbial genes. We find that gene pools are shaped by their broad ecological niche (such as sea water, fresh water, host, and airborne). We find that freshwater habitats act as a gene exchange bridge between otherwise disconnected habitats. Finally, certain antibiotic resistance genes deviate from the general trend of habitat specificity by exhibiting a high degree of cross-habitat mobility. The strong cross-habitat mobility of antibiotic resistance genes is a cause for concern and provides a paradigmatic example of the rate by which genes colonize new habitats when new selective forces emerge.
Collapse
Affiliation(s)
- Marco Fondi
- Laboratory of Microbial and Molecular Evolution, Department of Biology, University of Florence, Italy Computational Biology Group, University of Florence, Italy
| | - Antti Karkman
- Department of Food and Environmental Sciences, University of Helsinki, Finland
| | - Manu V Tamminen
- Department of Environmental Systems Science, ETH Zürich, Switzerland Department of Aquatic Ecology, Eawag, Switzerland
| | - Emanuele Bosi
- Laboratory of Microbial and Molecular Evolution, Department of Biology, University of Florence, Italy Computational Biology Group, University of Florence, Italy
| | - Marko Virta
- Department of Food and Environmental Sciences, University of Helsinki, Finland
| | - Renato Fani
- Laboratory of Microbial and Molecular Evolution, Department of Biology, University of Florence, Italy Computational Biology Group, University of Florence, Italy
| | - Eric Alm
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology
| | - James O McInerney
- Department of Biology, National University of Ireland Maynooth, County Kildare, Ireland Computational Evolutionary Biology, Faculty of Life Sciences, The University of Manchester, United Kingdom
| |
Collapse
|
35
|
Ledermann B, Béjà O, Frankenberg-Dinkel N. New biosynthetic pathway for pink pigments from uncultured oceanic viruses. Environ Microbiol 2016; 18:4337-4347. [PMID: 26950653 DOI: 10.1111/1462-2920.13290] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2016] [Accepted: 03/04/2016] [Indexed: 12/18/2022]
Abstract
The pink open-chain tetrapyrrole pigment phycoerythrobilin (PEB) is employed by marine cyanobacteria, red algae and cryptophytes as a light-harvesting chromophore in phycobiliproteins. Genes encoding biosynthesis proteins for PEB have also been discovered in cyanophages, viruses that infect cyanobacteria, and mimic host pigment biosynthesis with the exception of PebS which combines the enzymatic activities of two host enzymes. In this study, we have identified novel members of the PEB biosynthetic enzyme families, heme oxygenases and ferredoxin-dependent bilin reductases. Encoding genes were found in metagenomic datasets and could be traced back to bacteriophage but not cyanophage origin. While the heme oxygenase exhibited standard activity, a new bilin reductase with highest homology to the teal pigment producing enzyme PcyA revealed PEB biosynthetic activity. Although PcyX possesses PebS-like activity both enzymes share only 9% sequence identity and likely catalyze the reaction via two independent mechanisms. Our data point towards the presence of phycobilin biosynthetic genes in phages that probably infect alphaproteobacteria and, therefore, further support a role of phycobilins outside oxygenic phototrophs.
Collapse
Affiliation(s)
- Benjamin Ledermann
- Department of Biology, Microbiology, Technical University Kaiserslautern, Kaiserslautern, Germany
| | - Oded Béjà
- Technion-Israel Institute of Technology, Haifa, Israel
| | | |
Collapse
|
36
|
Discovery Strategies of Bioactive Compounds Synthesized by Nonribosomal Peptide Synthetases and Type-I Polyketide Synthases Derived from Marine Microbiomes. Mar Drugs 2016; 14:md14040080. [PMID: 27092515 PMCID: PMC4849084 DOI: 10.3390/md14040080] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Revised: 04/01/2016] [Accepted: 04/08/2016] [Indexed: 11/17/2022] Open
Abstract
Considering that 70% of our planet's surface is covered by oceans, it is likely that undiscovered biodiversity is still enormous. A large portion of marine biodiversity consists of microbiomes. They are very attractive targets of bioprospecting because they are able to produce a vast repertoire of secondary metabolites in order to adapt in diverse environments. In many cases secondary metabolites of pharmaceutical and biotechnological interest such as nonribosomal peptides (NRPs) and polyketides (PKs) are synthesized by multimodular enzymes named nonribosomal peptide synthetases (NRPSes) and type-I polyketide synthases (PKSes-I), respectively. Novel findings regarding the mechanisms underlying NRPS and PKS evolution demonstrate how microorganisms could leverage their metabolic potential. Moreover, these findings could facilitate synthetic biology approaches leading to novel bioactive compounds. Ongoing advances in bioinformatics and next-generation sequencing (NGS) technologies are driving the discovery of NRPs and PKs derived from marine microbiomes mainly through two strategies: genome-mining and metagenomics. Microbial genomes are now sequenced at an unprecedented rate and this vast quantity of biological information can be analyzed through genome mining in order to identify gene clusters encoding NRPSes and PKSes of interest. On the other hand, metagenomics is a fast-growing research field which directly studies microbial genomes and their products present in marine environments using culture-independent approaches. The aim of this review is to examine recent developments regarding discovery strategies of bioactive compounds synthesized by NRPS and type-I PKS derived from marine microbiomes and to highlight the vast diversity of NRPSes and PKSes present in marine environments by giving examples of recently discovered bioactive compounds.
Collapse
|
37
|
Li S, Hwang XY, Stav S, Breaker RR. The yjdF riboswitch candidate regulates gene expression by binding diverse azaaromatic compounds. RNA (NEW YORK, N.Y.) 2016; 22:530-41. [PMID: 26843526 PMCID: PMC4793209 DOI: 10.1261/rna.054890.115] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 12/10/2015] [Indexed: 05/21/2023]
Abstract
The yjdF motif RNA is an orphan riboswitch candidate that almost exclusively associates with the yjdF protein-coding gene in many bacteria. The function of the YjdF protein is unknown, which has made speculation regarding the natural ligand for this putative riboswitch unusually challenging. By using a structure-probing assay for ligand binding, we found that a surprisingly broad diversity of nitrogen-containing aromatic heterocycles, or "azaaromatics," trigger near-identical changes in the structures adopted by representative yjdF motif RNAs. Regions of the RNA that undergo ligand-induced structural modulation reside primarily in portions of the putative aptamer region that are highly conserved in nucleotide sequence, as is typical for riboswitches. Some azaaromatic molecules are bound by the RNA with nanomolar dissociation constants, and a subset of these ligands activate riboswitch-mediated gene expression in cells. Furthermore, genetic elements most commonly adjacent to the yjdF motif RNA or to the yjdF protein-coding region are homologous to protein regulators implicated in mitigating the toxic effects of diverse phenolic acids or polycyclic compounds. Although the precise type of natural ligand sensed by yjdF motif RNAs remains unknown, our findings suggest that this riboswitch class might serve as part of a genetic response system to toxic or signaling compounds with chemical structures similar to azaaromatics.
Collapse
Affiliation(s)
- Sanshu Li
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Xue Ying Hwang
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Shira Stav
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Ronald R Breaker
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520-8103, USA
| |
Collapse
|
38
|
Abstract
Dissolved organic matter (DOM) in the oceans is one of the largest pools of reduced carbon on Earth, comparable in size to the atmospheric CO2 reservoir. A vast number of compounds are present in DOM, and they play important roles in all major element cycles, contribute to the storage of atmospheric CO2 in the ocean, support marine ecosystems, and facilitate interactions between organisms. At the heart of the DOM cycle lie molecular-level relationships between the individual compounds in DOM and the members of the ocean microbiome that produce and consume them. In the past, these connections have eluded clear definition because of the sheer numerical complexity of both DOM molecules and microorganisms. Emerging tools in analytical chemistry, microbiology, and informatics are breaking down the barriers to a fuller appreciation of these connections. Here we highlight questions being addressed using recent methodological and technological developments in those fields and consider how these advances are transforming our understanding of some of the most important reactions of the marine carbon cycle.
Collapse
|
39
|
Bench SR, Frank I, Robidart J, Zehr JP. Two subpopulations of C
rocosphaera watsonii
have distinct distributions in the North and South Pacific. Environ Microbiol 2016; 18:514-24. [DOI: 10.1111/1462-2920.13180] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2015] [Revised: 11/25/2015] [Accepted: 12/04/2015] [Indexed: 11/29/2022]
Affiliation(s)
- Shellie R. Bench
- Department of Ocean Sciences; University of California Santa Cruz; 1156 High Street Santa Cruz CA 95064 USA
| | - Ildiko Frank
- Department of Ocean Sciences; University of California Santa Cruz; 1156 High Street Santa Cruz CA 95064 USA
| | - Julie Robidart
- Department of Ocean Sciences; University of California Santa Cruz; 1156 High Street Santa Cruz CA 95064 USA
- Monterey Bay Aquarium Research Institute; 7700 Sandholdt Road Moss Landing CA 95039 USA
- Center for Microbial Oceanography: Research and Education; University of Hawaii; 1950 East-West Road Honolulu HI 96822 USA
| | - Jonathan P. Zehr
- Department of Ocean Sciences; University of California Santa Cruz; 1156 High Street Santa Cruz CA 95064 USA
- Monterey Bay Aquarium Research Institute; 7700 Sandholdt Road Moss Landing CA 95039 USA
| |
Collapse
|
40
|
Roux S, Enault F, Ravet V, Colombet J, Bettarel Y, Auguet JC, Bouvier T, Lucas-Staat S, Vellet A, Prangishvili D, Forterre P, Debroas D, Sime-Ngando T. Analysis of metagenomic data reveals common features of halophilic viral communities across continents. Environ Microbiol 2015; 18:889-903. [PMID: 26472517 DOI: 10.1111/1462-2920.13084] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Revised: 09/28/2015] [Accepted: 10/08/2015] [Indexed: 12/31/2022]
Abstract
Microbial communities from hypersaline ponds, dominated by halophilic archaea, are considered specific of such extreme conditions. The associated viral communities have accordingly been shown to display specific features, such as similar morphologies among different sites. However, little is known about the genetic diversity of these halophilic viral communities across the Earth. Here, we studied viral communities in hypersaline ponds sampled on the coast of Senegal (8-36% of salinity) using metagenomics approach, and compared them with hypersaline viromes from Australia and Spain. The specificity of hyperhalophilic viruses could first be demonstrated at a community scale, salinity being a strong discriminating factor between communities. For the major viral group detected in all samples (Caudovirales), only a limited number of halophilic Caudovirales clades were highlighted. These clades gather viruses from different continents and display consistent genetic composition, indicating that they represent related lineages with a worldwide distribution. Non-tailed hyperhalophilic viruses display a greater rate of gene transfer and recombination, with uncharacterized genes conserved across different kind of viruses and plasmids. Thus, hypersaline viral communities around the world appear to form a genetically consistent community that are likely to harbour new genes coding for enzymes specifically adapted to these environments.
Collapse
Affiliation(s)
- Simon Roux
- Université Blaise Pascal, Laboratoire 'Microorganismes: Génome et Environnement', Clermont Université, Clermont-Ferrand, France.,CNRS UMR 6023, LMGE, Aubière, France
| | - Francois Enault
- Université Blaise Pascal, Laboratoire 'Microorganismes: Génome et Environnement', Clermont Université, Clermont-Ferrand, France.,CNRS UMR 6023, LMGE, Aubière, France
| | - Viviane Ravet
- Université Blaise Pascal, Laboratoire 'Microorganismes: Génome et Environnement', Clermont Université, Clermont-Ferrand, France.,CNRS UMR 6023, LMGE, Aubière, France
| | - Jonathan Colombet
- Université Blaise Pascal, Laboratoire 'Microorganismes: Génome et Environnement', Clermont Université, Clermont-Ferrand, France.,CNRS UMR 6023, LMGE, Aubière, France
| | - Yvan Bettarel
- IRD UMR 5119 ECOSYM, Université Montpellier 2, Montpellier, 34095, France
| | - Jean-Christophe Auguet
- Marine Biodiversity, Exploitation and Conservation (MARBEC), UMR CNRS 9190, Montpellier University, Place Eugéne Bataillon, Montpellier, France
| | - Thierry Bouvier
- IRD UMR 5119 ECOSYM, Université Montpellier 2, Montpellier, 34095, France
| | - Soizick Lucas-Staat
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Département de Microbiologie, Institut Pasteur, Paris, France
| | - Agnès Vellet
- Université Blaise Pascal, Laboratoire 'Microorganismes: Génome et Environnement', Clermont Université, Clermont-Ferrand, France.,CNRS UMR 6023, LMGE, Aubière, France
| | - David Prangishvili
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Département de Microbiologie, Institut Pasteur, Paris, France
| | - Patrick Forterre
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Département de Microbiologie, Institut Pasteur, Paris, France.,Laboratoire de Biologie Moléculaire du Gène chez les Extrêmophiles, CNRS UMR 8621, Université Paris Sud, Institut de Génétique et Microbiologie, Orsay, France
| | - Didier Debroas
- Université Blaise Pascal, Laboratoire 'Microorganismes: Génome et Environnement', Clermont Université, Clermont-Ferrand, France.,CNRS UMR 6023, LMGE, Aubière, France
| | - Telesphore Sime-Ngando
- Université Blaise Pascal, Laboratoire 'Microorganismes: Génome et Environnement', Clermont Université, Clermont-Ferrand, France.,CNRS UMR 6023, LMGE, Aubière, France
| |
Collapse
|
41
|
Dudhagara P, Bhavsar S, Bhagat C, Ghelani A, Bhatt S, Patel R. Web Resources for Metagenomics Studies. GENOMICS PROTEOMICS & BIOINFORMATICS 2015; 13:296-303. [PMID: 26602607 PMCID: PMC4678780 DOI: 10.1016/j.gpb.2015.10.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Revised: 10/09/2015] [Accepted: 10/13/2015] [Indexed: 10/26/2022]
Abstract
The development of next-generation sequencing (NGS) platforms spawned an enormous volume of data. This explosion in data has unearthed new scalability challenges for existing bioinformatics tools. The analysis of metagenomic sequences using bioinformatics pipelines is complicated by the substantial complexity of these data. In this article, we review several commonly-used online tools for metagenomics data analysis with respect to their quality and detail of analysis using simulated metagenomics data. There are at least a dozen such software tools presently available in the public domain. Among them, MGRAST, IMG/M, and METAVIR are the most well-known tools according to the number of citations by peer-reviewed scientific media up to mid-2015. Here, we describe 12 online tools with respect to their web link, annotation pipelines, clustering methods, online user support, and availability of data storage. We have also done the rating for each tool to screen more potential and preferential tools and evaluated five best tools using synthetic metagenome. The article comprehensively deals with the contemporary problems and the prospects of metagenomics from a bioinformatics viewpoint.
Collapse
Affiliation(s)
- Pravin Dudhagara
- Department of Biosciences, Veer Narmad South Gujarat University, Surat 395007, India.
| | - Sunil Bhavsar
- Department of Biosciences, Veer Narmad South Gujarat University, Surat 395007, India
| | - Chintan Bhagat
- Department of Biosciences, Veer Narmad South Gujarat University, Surat 395007, India
| | - Anjana Ghelani
- Department of Microbiology, Shree Ramkrishna Institute of Computer Education and Applied Sciences, Surat 395001, India
| | - Shreyas Bhatt
- Department of Life Sciences, Hemchandracharya North Gujarat University, Patan 384265, India
| | - Rajesh Patel
- Department of Life Sciences, Hemchandracharya North Gujarat University, Patan 384265, India
| |
Collapse
|
42
|
Metagenomics: Retrospect and Prospects in High Throughput Age. BIOTECHNOLOGY RESEARCH INTERNATIONAL 2015; 2015:121735. [PMID: 26664751 PMCID: PMC4664791 DOI: 10.1155/2015/121735] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Accepted: 10/26/2015] [Indexed: 01/30/2023]
Abstract
In recent years, metagenomics has emerged as a powerful tool for mining of hidden microbial treasure in a culture independent manner. In the last two decades, metagenomics has been applied extensively to exploit concealed potential of microbial communities from almost all sorts of habitats. A brief historic progress made over the period is discussed in terms of origin of metagenomics to its current state and also the discovery of novel biological functions of commercial importance from metagenomes of diverse habitats. The present review also highlights the paradigm shift of metagenomics from basic study of community composition to insight into the microbial community dynamics for harnessing the full potential of uncultured microbes with more emphasis on the implication of breakthrough developments, namely, Next Generation Sequencing, advanced bioinformatics tools, and systems biology.
Collapse
|
43
|
Yutin N, Shevchenko S, Kapitonov V, Krupovic M, Koonin EV. A novel group of diverse Polinton-like viruses discovered by metagenome analysis. BMC Biol 2015; 13:95. [PMID: 26560305 PMCID: PMC4642659 DOI: 10.1186/s12915-015-0207-4] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 10/28/2015] [Indexed: 01/08/2023] Open
Abstract
Background The rapidly growing metagenomic databases provide increasing opportunities for computational discovery of new groups of organisms. Identification of new viruses is particularly straightforward given the comparatively small size of viral genomes, although fast evolution of viruses complicates the analysis of novel sequences. Here we report the metagenomic discovery of a distinct group of diverse viruses that are distantly related to the eukaryotic virus-like transposons of the Polinton superfamily. Results The sequence of the putative major capsid protein (MCP) of the unusual linear virophage associated with Phaeocystis globosa virus (PgVV) was used as a bait to identify potential related viruses in metagenomic databases. Assembly of the contigs encoding the PgVV MCP homologs followed by comprehensive sequence analysis of the proteins encoded in these contigs resulted in the identification of a large group of Polinton-like viruses (PLV) that resemble Polintons (polintoviruses) and virophages in genome size, and share with them a conserved minimal morphogenetic module that consists of major and minor capsid proteins and the packaging ATPase. With a single exception, the PLV lack the retrovirus-type integrase that is encoded in the genomes of all Polintons and the Mavirus group of virophages. However, some PLV encode a newly identified tyrosine recombinase-integrase that is common in bacteria and bacteriophages and is also found in the Organic Lake virophage group. Although several PLV genomes and individual genes are integrated into algal genomes, it appears likely that most of the PLV are viruses. Given the absence of protease and retrovirus-type integrase, the PLV could resemble the ancestral polintoviruses that evolved from bacterial tectiviruses. Apart from the conserved minimal morphogenetic module, the PLV widely differ in their genome complements but share a gene network with Polintons and virophages, suggestive of multiple gene exchanges within a shared gene pool. Conclusions The discovery of PLV substantially expands the emerging class of eukaryotic viruses and transposons that also includes Polintons and virophages. This class of selfish elements is extremely widespread and might have been a hotbed of eukaryotic virus, transposon and plasmid evolution. New families of these elements are expected to be discovered. Electronic supplementary material The online version of this article (doi:10.1186/s12915-015-0207-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Natalya Yutin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Sofiya Shevchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Vladimir Kapitonov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Mart Krupovic
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Institut Pasteur, Paris, France
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
| |
Collapse
|
44
|
Bose T, Haque MM, Reddy CVSK, Mande SS. COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets. PLoS One 2015; 10:e0142102. [PMID: 26561344 PMCID: PMC4641738 DOI: 10.1371/journal.pone.0142102] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2015] [Accepted: 10/16/2015] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Recent advances in sequencing technologies have resulted in an unprecedented increase in the number of metagenomes that are being sequenced world-wide. Given their volume, functional annotation of metagenomic sequence datasets requires specialized computational tools/techniques. In spite of having high accuracy, existing stand-alone functional annotation tools necessitate end-users to perform compute-intensive homology searches of metagenomic datasets against "multiple" databases prior to functional analysis. Although, web-based functional annotation servers address to some extent the problem of availability of compute resources, uploading and analyzing huge volumes of sequence data on a shared public web-service has its own set of limitations. In this study, we present COGNIZER, a comprehensive stand-alone annotation framework which enables end-users to functionally annotate sequences constituting metagenomic datasets. The COGNIZER framework provides multiple workflow options. A subset of these options employs a novel directed-search strategy which helps in reducing the overall compute requirements for end-users. The COGNIZER framework includes a cross-mapping database that enables end-users to simultaneously derive/infer KEGG, Pfam, GO, and SEED subsystem information from the COG annotations. RESULTS Validation experiments performed with real-world metagenomes and metatranscriptomes, generated using diverse sequencing technologies, indicate that the novel directed-search strategy employed in COGNIZER helps in reducing the compute requirements without significant loss in annotation accuracy. A comparison of COGNIZER's results with pre-computed benchmark values indicate the reliability of the cross-mapping database employed in COGNIZER. CONCLUSION The COGNIZER framework is capable of comprehensively annotating any metagenomic or metatranscriptomic dataset from varied sequencing platforms in functional terms. Multiple search options in COGNIZER provide end-users the flexibility of choosing a homology search protocol based on available compute resources. The cross-mapping database in COGNIZER is of high utility since it enables end-users to directly infer/derive KEGG, Pfam, GO, and SEED subsystem annotations from COG categorizations. Furthermore, availability of COGNIZER as a stand-alone scalable implementation is expected to make it a valuable annotation tool in the field of metagenomic research. AVAILABILITY AND IMPLEMENTATION A Linux implementation of COGNIZER is freely available for download from the following links: http://metagenomics.atc.tcs.com/cognizer, https://metagenomics.atc.tcs.com/function/cognizer.
Collapse
Affiliation(s)
- Tungadri Bose
- Bio-Sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, 54-B, Hadapsar Industrial Estate, Pune, 411013, Maharashtra, India
| | - Mohammed Monzoorul Haque
- Bio-Sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, 54-B, Hadapsar Industrial Estate, Pune, 411013, Maharashtra, India
| | - CVSK Reddy
- Bio-Sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, 54-B, Hadapsar Industrial Estate, Pune, 411013, Maharashtra, India
| | - Sharmila S. Mande
- Bio-Sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, 54-B, Hadapsar Industrial Estate, Pune, 411013, Maharashtra, India
- * E-mail:
| |
Collapse
|
45
|
Peabody MA, Van Rossum T, Lo R, Brinkman FSL. Evaluation of shotgun metagenomics sequence classification methods using in silico and in vitro simulated communities. BMC Bioinformatics 2015; 16:363. [PMID: 26537885 PMCID: PMC4634789 DOI: 10.1186/s12859-015-0788-5] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 10/20/2015] [Indexed: 01/14/2023] Open
Abstract
Background The field of metagenomics (study of genetic material recovered directly from an environment) has grown rapidly, with many bioinformatics analysis methods being developed. To ensure appropriate use of such methods, robust comparative evaluation of their accuracy and features is needed. For taxonomic classification of sequence reads, such evaluation should include use of clade exclusion, which better evaluates a method’s accuracy when identical sequences are not present in any reference database, as is common in metagenomic analysis. To date, relatively small evaluations have been performed, with evaluation approaches like clade exclusion limited to assessment of new methods by the authors of the given method. What is needed is a rigorous, independent comparison between multiple major methods, using the same in silico and in vitro test datasets, with and without approaches like clade exclusion, to better characterize accuracy under different conditions. Results An overview of the features of 38 bioinformatics methods is provided, evaluating accuracy with a focus on 11 programs that have reference databases that can be modified and therefore most robustly evaluated with clade exclusion. Taxonomic classification of sequence reads was evaluated using both in silico and in vitro mock bacterial communities. Clade exclusion was used at taxonomic levels from species to class—identifying how well methods perform in progressively more difficult scenarios. A wide range of variability was found in the sensitivity, precision, overall accuracy, and computational demand for the programs evaluated. In experiments where distilled water was spiked with only 11 bacterial species, frequently dozens to hundreds of species were falsely predicted by the most popular programs. The different features of each method (forces predictions or not, etc.) are summarized, and additional analysis considerations discussed. Conclusions The accuracy of shotgun metagenomics classification methods varies widely. No one program clearly outperformed others in all evaluation scenarios; rather, the results illustrate the strengths of different methods for different purposes. Researchers must appreciate method differences, choosing the program best suited for their particular analysis to avoid very misleading results. Use of standardized datasets for method comparisons is encouraged, as is use of mock microbial community controls suitable for a particular metagenomic analysis. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0788-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Michael A Peabody
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
| | - Thea Van Rossum
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
| | - Raymond Lo
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
| | - Fiona S L Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
| |
Collapse
|
46
|
Balcázar JL, Subirats J, Borrego CM. The role of biofilms as environmental reservoirs of antibiotic resistance. Front Microbiol 2015; 6:1216. [PMID: 26583011 PMCID: PMC4628128 DOI: 10.3389/fmicb.2015.01216] [Citation(s) in RCA: 235] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Accepted: 10/19/2015] [Indexed: 12/12/2022] Open
Abstract
Antibiotic resistance has become a significant and growing threat to public and environmental health. To face this problem both at local and global scales, a better understanding of the sources and mechanisms that contribute to the emergence and spread of antibiotic resistance is required. Recent studies demonstrate that aquatic ecosystems are reservoirs of resistant bacteria and antibiotic resistance genes as well as potential conduits for their transmission to human pathogens. Despite the wealth of information about antibiotic pollution and its effect on the aquatic microbial resistome, the contribution of environmental biofilms to the acquisition and spread of antibiotic resistance has not been fully explored in aquatic systems. Biofilms are structured multicellular communities embedded in a self-produced extracellular matrix that acts as a barrier to antibiotic diffusion. High population densities and proximity of cells in biofilms also increases the chances for genetic exchange among bacterial species converting biofilms in hot spots of antibiotic resistance. This review focuses on the potential effect of antibiotic pollution on biofilm microbial communities, with special emphasis on ecological and evolutionary processes underlying acquired resistance to these compounds.
Collapse
Affiliation(s)
| | | | - Carles M Borrego
- Catalan Institute for Water Research Girona, Spain ; Group of Molecular Microbial Ecology, Institute of Aquatic Ecology, University of Girona Girona, Spain
| |
Collapse
|
47
|
Wang J, Crawl D, Purawat S, Nguyen M, Altintas I. Big Data Provenance: Challenges, State of the Art and Opportunities. PROCEEDINGS : ... IEEE INTERNATIONAL CONFERENCE ON BIG DATA. IEEE INTERNATIONAL CONFERENCE ON BIG DATA 2015; 2015:2509-2516. [PMID: 29399671 DOI: 10.1109/bigdata.2015.7364047] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Ability to track provenance is a key feature of scientific workflows to support data lineage and reproducibility. The challenges that are introduced by the volume, variety and velocity of Big Data, also pose related challenges for provenance and quality of Big Data, defined as veracity. The increasing size and variety of distributed Big Data provenance information bring new technical challenges and opportunities throughout the provenance lifecycle including recording, querying, sharing and utilization. This paper discusses the challenges and opportunities of Big Data provenance related to the veracity of the datasets themselves and the provenance of the analytical processes that analyze these datasets. It also explains our current efforts towards tracking and utilizing Big Data provenance using workflows as a programming model to analyze Big Data.
Collapse
Affiliation(s)
- Jianwu Wang
- Department of Information Systems, University of Maryland, Baltimore County, Baltimore, MD, U.S.A
| | - Daniel Crawl
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, U.S.A
| | - Shweta Purawat
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, U.S.A
| | - Mai Nguyen
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, U.S.A
| | - Ilkay Altintas
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, U.S.A
| |
Collapse
|
48
|
Provirophages in the Bigelowiella genome bear testimony to past encounters with giant viruses. Proc Natl Acad Sci U S A 2015; 112:E5318-26. [PMID: 26305943 DOI: 10.1073/pnas.1506469112] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Virophages are recently discovered double-stranded DNA virus satellites that prey on giant viruses (nucleocytoplasmic large DNA viruses; NCLDVs), which are themselves parasites of unicellular eukaryotes. This coupled parasitism can result in the indirect control of eukaryotic cell mortality by virophages. However, the details of such tripartite relationships remain largely unexplored. We have discovered ∼300 predicted genes of putative virophage origin in the nuclear genome of the unicellular alga Bigelowiella natans. Physical clustering of these genes indicates that virophage genomes are integrated into the B. natans genome. Virophage inserts show high levels of similarity and synteny between each other, indicating that they are closely related. Virophage genes are transcribed not only in the sequenced B. natans strain but also in other Bigelowiella isolates, suggesting that transcriptionally active virophage inserts are widespread in Bigelowiella populations. Evidence that B. natans is also a host to NCLDV members is provided by the identification of NCLDV inserts in its genome. These putative large DNA viruses may be infected by B. natans virophages. We also identify four repeated elements sharing structural and genetic similarities with transpovirons--a class of mobile elements first discovered in giant viruses--that were probably independently inserted in the B. natans genome. We argue that endogenized provirophages may be beneficial to both the virophage and B. natans by (i) increasing the chances for the virophage to coinfect the host cell with an NCLDV prey and (ii) defending the host cell against fatal NCLDV infections.
Collapse
|
49
|
Pucciarelli S, Devaraj RR, Mancini A, Ballarini P, Castelli M, Schrallhammer M, Petroni G, Miceli C. Microbial Consortium Associated with the Antarctic Marine Ciliate Euplotes focardii: An Investigation from Genomic Sequences. MICROBIAL ECOLOGY 2015; 70:484-97. [PMID: 25704316 PMCID: PMC4494151 DOI: 10.1007/s00248-015-0568-9] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2014] [Accepted: 01/08/2015] [Indexed: 05/23/2023]
Abstract
We report the characterization of the bacterial consortium associated to Euplotes focardii, a strictly psychrophilic marine ciliate that was maintained in laboratory cultures at 4 °C after its first isolation from Terra Nova Bay, in Antarctica. By Illumina genome analyser, we obtained 11,179 contigs of potential prokaryotic origin and classified them according to the NCBI's prokaryotic attributes table. The majority of these sequences correspond to either Bacteroidetes (16 %) or Proteobacteria (78 %). The latter were dominated by gamma- (39 %, including sequences related to the pathogenic genus Francisella), and alpha-proteobacterial (30 %) sequences. Analysis of the Pfam domain family and Gene Ontology term variation revealed that the most frequent terms that appear unique to this consortium correspond to proteins involved in "transmembrane transporter activity" and "oxidoreductase activity". Furthermore, we identified genes that encode for enzymes involved in the catabolism of complex substance for energy reserves. We also characterized members of the transposase and integrase superfamilies, whose role in bacterial evolution is well documented, as well as putative antifreeze proteins. Antibiotic treatments of E. focardii cultures delayed the cell division of the ciliate. To conclude, our results indicate that this consortium is largely represented by bacteria derived from the original Antarctic sample and may contribute to the survival of E. focardii in laboratory condition. Furthermore, our results suggest that these bacteria may have a more general role in E. focardii survival in its natural cold and oxidative environment.
Collapse
Affiliation(s)
- Sandra Pucciarelli
- School of Biosciences and Veterinary Medicine, University of Camerino, Camerino, 62032, Italy,
| | | | | | | | | | | | | | | |
Collapse
|
50
|
New classes of self-cleaving ribozymes revealed by comparative genomics analysis. Nat Chem Biol 2015; 11:606-10. [PMID: 26167874 PMCID: PMC4509812 DOI: 10.1038/nchembio.1846] [Citation(s) in RCA: 144] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2015] [Accepted: 05/12/2015] [Indexed: 12/15/2022]
Abstract
Enzymes made of RNA catalyze reactions that are essential for protein synthesis and RNA processing. However, such natural ribozymes are exceedingly rare, as evident by the fact that the discovery rate for new classes has dropped to one per decade from about one per year during the 1980s. Indeed, only 11 distinct ribozyme classes have been experimentally validated to date. Recently, we recognized that self-cleaving ribozymes frequently associate with certain types of genes from bacteria. Herein this synteny was exploited to identify divergent architectures for two previously known ribozyme classes and to discover additional noncoding RNA motifs that are self-cleaving RNA candidates. Three new self-cleaving classes, named twister sister, pistol and hatchet, have been identified from this collection, suggesting that even more ribozymes remain hidden in modern cells.
Collapse
|