Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Qi J, Luo H, Hao B. CVTree: a phylogenetic tree reconstruction tool based on whole genomes. Nucleic Acids Res 2004;32:W45-7. [PMID: 15215347 PMCID: PMC441500 DOI: 10.1093/nar/gkh362] [Citation(s) in RCA: 161] [Impact Index Per Article: 8.1] [Received: 02/13/2004] [Revised: 03/03/2004] [Accepted: 03/03/2004] [Indexed: 11/14/2022] Open

Cited by Other Article(s)

Wang T, Yu ZG, Li J. CGRWDL: alignment-free phylogeny reconstruction method for viruses based on chaos game representation weighted by dynamical language model. Front Microbiol 2024;15:1339156. [PMID: 38572227 PMCID: PMC10987876 DOI: 10.3389/fmicb.2024.1339156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/15/2023] [Accepted: 02/23/2024] [Indexed: 04/05/2024] Open

Chao P, Zhang X, Zhang L, Yang A, Wang Y, Chen X. Proteomics-based vaccine targets annotation and design of multi-epitope vaccine against antibiotic-resistant Streptococcus gallolyticus. Sci Rep 2024;14:4836. [PMID: 38418560 PMCID: PMC10901886 DOI: 10.1038/s41598-024-55372-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/07/2023] [Accepted: 02/22/2024] [Indexed: 03/01/2024] Open

Guo X, Guo Y, Chen H, Liu X, He P, Li W, Zhang MQ, Dai Q. Systematic comparison of genome information processing and boundary recognition tools used for genomic island detection. Comput Biol Med 2023;166:107550. [PMID: 37826950 DOI: 10.1016/j.compbiomed.2023.107550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2023] [Revised: 09/12/2023] [Accepted: 09/28/2023] [Indexed: 10/14/2023]

Li X, Li H, Yang Z, Wu Y, Zhang M. Exploring objective feature sets in constructing the evolution relationship of animal genome sequences. BMC Genomics 2023;24:634. [PMID: 37872534 PMCID: PMC10594854 DOI: 10.1186/s12864-023-09747-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/07/2023] [Accepted: 10/17/2023] [Indexed: 10/25/2023] Open

Sengupta S, Azad RK. Leveraging comparative genomics to uncover alien genes in bacterial genomes. Microb Genom 2023;9:mgen000939. [PMID: 36748570 PMCID: PMC9973850 DOI: 10.1099/mgen.0.000939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/28/2023] Open

Ma Z, Lu YY, Wang Y, Lin R, Yang Z, Zhang F, Wang Y. Metric learning for comparing genomic data with triplet network. Brief Bioinform 2022;23:6679451. [DOI: 10.1093/bib/bbac345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/22/2022] [Revised: 07/20/2022] [Accepted: 07/26/2022] [Indexed: 11/13/2022] Open

Birth N, Dencker T, Morgenstern B. Insertions and deletions as phylogenetic signal in an alignment-free context. PLoS Comput Biol 2022;18:e1010303. [PMID: 35939516 PMCID: PMC9387925 DOI: 10.1371/journal.pcbi.1010303] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/18/2021] [Revised: 08/18/2022] [Accepted: 06/14/2022] [Indexed: 11/18/2022] Open

Abstract

Most methods for phylogenetic tree reconstruction are based on sequence alignments; they infer phylogenies from substitutions that may have occurred at the aligned sequence positions. Gaps in alignments are usually not employed as phylogenetic signal. In this paper, we explore an alignment-free approach that uses insertions and deletions (indels) as an additional source of information for phylogeny inference. For a set of four or more input sequences, we generate so-called quartet blocks of four putative homologous segments each. For pairs of such quartet blocks involving the same four sequences, we compare the distances between the two blocks in these sequences, to obtain hints about indels that may have happened between the blocks since the respective four sequences have evolved from their last common ancestor. A prototype implementation that we call Gap-SpaM is presented to infer phylogenetic trees from these data, using a quartet-tree approach or, alternatively, under the maximum-parsimony paradigm. This approach should not be regarded as an alternative to established methods, but rather as a complementary source of phylogenetic information. Interestingly, however, our software is able to produce phylogenetic trees from putative indels alone that are comparable to trees obtained with existing alignment-free methods.

Phylogenetic tree inference based on DNA or protein sequence comparison is a fundamental task in computational biology. Given a multiple alignment of a set of input sequences, most approaches compare aligned sequence positions to each other to find a suitable tree, based on a model of molecular evolution. Insertions and deletions that may have happened since the input sequences evolved from their last common ancestor are ignored by most phylogeny methods. Herein, we show that insertions and deletions can provide an additional source of information for phylogeny inference, and that such information can be obtained with a simple alignment-free approach. We provide an implementation of this idea that we call Gap-SpaM. The proposed approach is complementary to existing phylogeny methods since it is based on a completely different source of information. It is thus not meant as an alternative to those existing methods but rather as a possible additional source of information for tree inference.

Wang Y, Sun F, Lin W, Zhang S. AC-PCoA: Adjustment for confounding factors using principal coordinate analysis. PLoS Comput Biol 2022;18:e1010184. [PMID: 35830390 PMCID: PMC9278763 DOI: 10.1371/journal.pcbi.1010184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/12/2021] [Accepted: 05/08/2022] [Indexed: 12/01/2022] Open

Abstract

Confounding factors exist widely in various biological data owing to technical variations, population structures and experimental conditions. Such factors may mask the true signals and lead to spurious associations in the respective biological data, making it necessary to adjust confounding factors accordingly. However, existing confounder correction methods were mainly developed based on the original data or the pairwise Euclidean distance, either one of which is inadequate for analyzing different types of data, such as sequencing data.

In this work, we proposed a method called Adjustment for Confounding factors using Principal Coordinate Analysis, or AC-PCoA, which reduces data dimension and extracts the information from different distance measures using principal coordinate analysis, and adjusts confounding factors across multiple datasets by minimizing the associations between lower-dimensional representations and confounding variables. Application of the proposed method was further extended to classification and prediction. We demonstrated the efficacy of AC-PCoA on three simulated datasets and five real datasets. Compared to the existing methods, AC-PCoA shows better results in visualization, statistical testing, clustering, and classification.

With today’s unprecedented amount of data, researchers are challenged by the need to enhance meaningful signals without the interference of unwanted confounders hidden inside the data. Data visualization is an important step toward exploring and explaining data in order to intuitively identify the dominant patterns. Principal coordinate analysis (PCoA), as a visualization tool, allows flexible ways to define pairwise distances and project the samples into lower dimensions without changing the distances. However, when visualizing large-scale biological datasets, the true patterns are often obscured by unwanted confounding variations, either biological or technical in origin. To eliminate these confounding factors and recover underlying signals, we proposed a method called Adjustment for Confounding factors using Principal Coordinate Analysis, or AC-PCoA, and showed that it significantly outperforms existing methods in visualization through three simulation studies and five real datasets. We further showed that the low-dimensional representations given by AC-PCoA provide promising results in statistical testing, clustering, and classification as well.

An accurate alignment-free protein sequence comparator based on physicochemical properties of amino acids. Sci Rep 2022;12:11158. [PMID: 35778592 PMCID: PMC9247937 DOI: 10.1038/s41598-022-15266-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/27/2021] [Accepted: 06/21/2022] [Indexed: 11/08/2022] Open

Kröber E, Kanukollu S, Wende S, Bringel F, Kolb S. A putatively new family of alphaproteobacterial chloromethane degraders from a deciduous forest soil revealed by stable isotope probing and metagenomics. Environ Microbiome 2022;17:24. [PMID: 35527282 PMCID: PMC9080209 DOI: 10.1186/s40793-022-00416-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/10/2021] [Accepted: 04/19/2022] [Indexed: 06/14/2023]

Abstract

BACKGROUND

Chloromethane (CH₃Cl) is the most abundant halogenated organic compound in the atmosphere and substantially responsible for the destruction of the stratospheric ozone layer. Since anthropogenic CH₃Cl sources have become negligible with the application of the Montreal Protocol (1987), natural sources, such as vegetation and soils, have increased proportionally in the global budget. CH₃Cl-degrading methylotrophs occurring in soils might be an important and overlooked sink.

RESULTS AND CONCLUSIONS

The objective of our study was to link the biotic CH₃Cl sink with the identity of active microorganisms and their biochemical pathways for CH₃Cl degradation in a deciduous forest soil. When tested in laboratory microcosms, biological CH₃Cl consumption occurred in leaf litter, senescent leaves, and organic and mineral soil horizons. The highest consumption rates, around 2 mmol CH₃Cl g⁻¹ dry weight h⁻¹, were measured in organic soil and senescent leaves, suggesting that top soil layers are active (micro-)biological CH₃Cl degradation compartments of forest ecosystems. The DNA of these [¹³C]-CH₃Cl-degrading microbial communities was labelled using stable isotope probing (SIP), and the corresponding taxa and their metabolic pathways were studied using high-throughput metagenomic sequencing analysis. A [¹³C]-labelled metagenome-assembled genome closely related to the family Beijerinckiaceae may represent a new methylotroph family of Alphaproteobacteria, which is found in metagenome databases of forest soil samples worldwide. Gene markers of the only known pathway for aerobic CH₃Cl degradation, via the methyltransferase system encoded by the CH₃Cl utilisation genes (cmu), were undetected in the DNA-SIP metagenome data, suggesting that the biological CH₃Cl sink in this deciduous forest soil operates by a cmu-independent metabolism.

Aledo JC. Phylogenies from unaligned proteomes using sequence environments of amino acid residues. Sci Rep 2022;12:7497. [PMID: 35523825 PMCID: PMC9076898 DOI: 10.1038/s41598-022-11370-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/15/2022] [Accepted: 04/21/2022] [Indexed: 11/09/2022] Open

Xue Y, Bao Y, Zhang Z, Zhao W, Xiao J, He S, Zhang G, Li Y, Zhao G, Chen R, Zeng J, Zhang Y, Shang Y, Mai J, Shi S, Lu M, Bu C, Zhang Z, Du Z, Xiao J, Wang Y, Kang H, Xu T, Hao L, Bao Y, Jia P, Jiang S, Qian Q, Zhu T, Shang Y, Zong W, Jin T, Zhang Y, Zou D, Bao Y, Xiao J, Zhang Z, Jiang S, Du Q, Feng C, Ma L, Zhang S, Wang A, Dong L, Wang Y, Zou D, Zhang Z, Liu W, Yan X, Ling Y, Zhao G, Zhou Z, Zhang G, Kang W, Jin T, Zhang T, Ma S, Yan H, Liu Z, Ji Z, Cai Y, Wang S, Song M, Ren J, Zhou Q, Qu J, Zhang W, Bao Y, Liu G, Chen X, Chen T, Zhang S, Sun Y, Yu C, Tang B, Zhu J, Dong L, Zhai S, Sun Y, Chen Q, Yang X, Zhang X, Sang Z, Wang Y, Zhao Y, Chen H, Lan L, Wang Y, Zhao W, Ma Y, Jia Y, Zheng X, Chen M, Zhang Y, Zou D, Zhu T, Xu T, Chen M, Niu G, Zong W, Pan R, Jing W, Sang J, Liu C, Xiong Y, Sun Y, Zhai S, Chen H, Zhao W, Xiao J, Bao Y, Hao L, Zhang M, Wang G, Zou D, Yi L, Zhao W, Zong W, Wu S, Xiong Z, Li R, Zong W, Kang H, Xiong Z, Ma Y, Jin T, Gong Z, Yi L, Zhang M, Wu S, Wang G, Li R, Liu L, Li Z, Liu C, Zou D, Li Q, Feng C, Jing W, Luo S, Ma L, Wang J, Shi Y, Zhou H, Zhang P, Song T, Li Y, He S, Xiong Z, Yang F, Li M, Zhao W, Wang G, Li Z, Ma Y, Zou D, Zong W, Kang H, Jia Y, Zheng X, Li R, Tian D, Liu X, Li C, Teng X, Song S, Liu L, Zhang Y, Niu G, Li Q, Li Z, Zhu T, Feng C, Liu X, Zhang Y, Xu T, Chen R, Teng X, Zhang R, Zou D, Ma L, Xu F, Wang Y, Ling Y, Zhou C, Wang H, Teschendorff AE, He Y, Zhang G, Yang Z, Song S, Ma L, Zou D, Tian D, Li C, Zhu J, Li L, Li N, Gong Z, Chen M, Wang A, Ma Y, Teng X, Cui Y, Duan G, Zhang M, Jin T, Wu G, Huang T, Jin E, Zhao W, Kang H, Wang Z, Du Z, Zhang Y, Li R, Zeng J, Hao L, Jiang S, Chen H, Li M, Xiao J, Zhang Z, Zhao W, Xue Y, Bao Y, Ning W, Xue Y, Tang B, Liu Y, Sun Y, Duan G, Cui Y, Zhou Q, Dong L, Jin E, Liu X, Zhang L, Mao B, Zhang S, Zhang Y, Wang G, Zhao W, Wang Z, Zhu Q, Li X, Zhu J, Tian D, Kang H, Li C, Zhang S, Song S, Li M, Zhao W, Liu Y, Wang Z, Luo H, Zhu J, Wu X, Tian D, Li C, Zhao W, Jing H, Zhu J, Tang B, 
Zou D, Liu L, Pan Y, Liu C, Chen M, Liu X, Zhang Y, Li Z, Feng C, Du Q, Chen R, Zhu T, Ma L, Zou D, Jiang S, Zhang Z, Gong Z, Zhu J, Li C, Jiang S, Ma L, Tang B, Zou D, Chen M, Sun Y, Shi L, Song S, Zhang Z, Li M, Xiao J, Xue Y, Bao Y, Du Z, Zhao W, Li Z, Du Q, Jiang S, Ma L, Zhang Z, Xiong Z, Li M, Zou D, Zong W, Li R, Chen M, Du Z, Zhao W, Bao Y, Ma Y, Zhang X, Lan L, Xue Y, Bao Y, Jiang S, Feng C, Zhao W, Xiao J, Bao Y, Zhang Z, Zuo Z, Ren J, Zhang X, Xiao Y, Li X, Zhang X, Xiao Y, Li X, Liu D, Zhang C, Xue Y, Zhao Z, Jiang T, Wu W, Zhao F, Meng X, Chen M, Peng D, Xue Y, Luo H, Gao F, Ning W, Xue Y, Lin S, Xue Y, Liu C, Guo A, Yuan H, Su T, Zhang YE, Zhou Y, Chen M, Guo G, Fu S, Tan X, Xue Y, Zhang W, Xue Y, Luo M, Guo A, Xie Y, Ren J, Zhou Y, Chen M, Guo G, Wang C, Xue Y, Liao X, Gao X, Wang J, Xie G, Guo A, Yuan C, Chen M, Tian F, Yang D, Gao G, Tang D, Xue Y, Wu W, Chen M, Gou Y, Han C, Xue Y, Cui Q, Li X, Li CY, Luo X, Ren J, Zhang X, Xiao Y, Li X. Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2022. Nucleic Acids Res 2022;50:D27-D38. [PMID: 34718731 PMCID: PMC8728233 DOI: 10.1093/nar/gkab951] [Citation(s) in RCA: 297] [Impact Index Per Article: 148.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 09/29/2021] [Accepted: 10/08/2021] [Indexed: 12/21/2022] Open

Chen NWG, Ruh M, Darrasse A, Foucher J, Briand M, Costa J, Studholme DJ, Jacques M. Common bacterial blight of bean: a model of seed transmission and pathological convergence. Mol Plant Pathol 2021;22:1464-1480. [PMID: 33942466 PMCID: PMC8578827 DOI: 10.1111/mpp.13067] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 12/21/2020] [Revised: 03/22/2021] [Accepted: 03/22/2021] [Indexed: 05/31/2023]

Abstract

BACKGROUND

Xanthomonas citri pv. fuscans (Xcf) and Xanthomonas phaseoli pv. phaseoli (Xpp) are the causal agents of common bacterial blight of bean (CBB), an important disease worldwide that remains difficult to control. These pathogens belong to distinct species within the Xanthomonas genus and have undergone a dynamic evolutionary history including the horizontal transfer of genes encoding factors probably involved in adaptation to and pathogenicity on common bean. Seed transmission is a key point of the CBB disease cycle, favouring both vertical transmission of the pathogen and worldwide distribution of the disease through global seed trade.

TAXONOMY

Kingdom: Bacteria; phylum: Proteobacteria; class: Gammaproteobacteria; order: Lysobacterales (also known as Xanthomonadales); family: Lysobacteraceae (also known as Xanthomonadaceae); genus: Xanthomonas; species: X. citri pv. fuscans and X. phaseoli pv. phaseoli (Xcf-Xpp).

HOST RANGE

The main host of Xcf-Xpp is the common bean (Phaseolus vulgaris). Lima bean (Phaseolus lunatus) and members of the Vigna genus (Vigna aconitifolia, Vigna angularis, Vigna mungo, Vigna radiata, and Vigna umbellata) are also natural hosts of Xcf-Xpp. Natural occurrence of Xcf-Xpp has been reported for a handful of other legumes such as Calopogonium sp., Pueraria sp., pea (Pisum sativum), Lablab purpureus, Macroptilium lathyroides, and Strophostyles helvola. There are conflicting reports concerning the natural occurrence of CBB agents on tepary bean (Phaseolus acutifolius) and cowpea (Vigna unguiculata subsp. unguiculata).

SYMPTOMS

CBB symptoms occur on all aerial parts of beans, that is, seedlings, leaves, stems, pods, and seeds. Symptoms initially appear as water-soaked spots evolving into necrosis on leaves, pustules on pods, and cankers on twigs. In severe infections, defoliation and wilting may occur.

DISTRIBUTION

CBB is distributed worldwide, meaning that it is frequently encountered in most places where bean is cultivated in the Americas, Asia, Africa, and Oceania, except for arid tropical areas. Xcf-Xpp are regulated nonquarantine pathogens in Europe and are listed in the A2 list by the European and Mediterranean Plant Protection Organization (EPPO).

GENOME

The genome consists of a single circular chromosome plus one to four extrachromosomal plasmids of various sizes, for a total mean size of 5.27 Mb with 64.7% GC content and an average predicted number of 4,181 coding sequences.

DISEASE CONTROL

Management of CBB is based on integrated approaches that comprise measures aimed at avoiding Xcf-Xpp introduction through infected seeds, cultural practices to limit Xcf-Xpp survival between host crops, whenever possible the use of tolerant or resistant bean genotypes, and chemical treatments, mainly restricted to copper compounds. The use of pathogen-free seeds is essential in an effective management strategy and requires appropriate sampling, detection, and identification methods.

USEFUL WEBSITES

https://gd.eppo.int/taxon/XANTPH, https://gd.eppo.int/taxon/XANTFF, and http://www.cost.eu/COST_Actions/ca/CA16107.

Wu YQ, Yu ZG, Tang RB, Han GS, Anh VV. An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction. Front Genet 2021;12:766496. [PMID: 34745231 PMCID: PMC8568955 DOI: 10.3389/fgene.2021.766496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2021] [Accepted: 09/29/2021] [Indexed: 11/30/2022] Open

Geptop 2.0: Accurately Select Essential Genes from the List of Protein-Coding Genes in Prokaryotic Genomes. Methods Mol Biol 2021. [PMID: 34709630 DOI: 10.1007/978-1-0716-1720-5_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/02/2023]

Ramanathan N, Ramamurthy J, Natarajan G. Numerical Characterization of DNA Sequences for Alignment-free Sequence Comparison - A Review. Comb Chem High Throughput Screen 2021;25:365-380. [PMID: 34382516 DOI: 10.2174/1386207324666210811101437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/30/2020] [Revised: 06/16/2021] [Accepted: 06/24/2021] [Indexed: 11/22/2022]

CVTree: A Parallel Alignment-free Phylogeny and Taxonomy Tool based on Composition Vectors of Genomes. Genomics Proteomics Bioinformatics 2021;19:662-667. [PMID: 34119695 PMCID: PMC9040009 DOI: 10.1016/j.gpb.2021.03.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 11/22/2020] [Revised: 02/23/2021] [Accepted: 03/06/2021] [Indexed: 11/21/2022]

Muggia L, Ametrano CG, Sterflinger K, Tesei D. An Overview of Genomics, Phylogenomics and Proteomics Approaches in Ascomycota. Life (Basel) 2020;10:E356. [PMID: 33348904 PMCID: PMC7765829 DOI: 10.3390/life10120356] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 11/12/2020] [Revised: 12/10/2020] [Accepted: 12/12/2020] [Indexed: 12/26/2022] Open

Abstract

Fungi are among the most successful eukaryotes on Earth: they have evolved strategies to survive in the most diverse environments and stressful conditions and have been selected and exploited for multiple aims by humans. The characteristic features intrinsic to Fungi have required evolutionary changes and adaptations at deep molecular levels. Omics approaches, nowadays including genomics, metagenomics, phylogenomics, transcriptomics, metabolomics, and proteomics, have enormously advanced the understanding of fungal diversity at diverse taxonomic levels, under changeable conditions and in still under-investigated environments. These approaches can be applied both to environmental communities and to individual organisms, either in nature or in axenic culture, and have led traditional morphology-based fungal systematics to increasingly implement molecular-based approaches. The advent of next-generation sequencing technologies was key to boosting advances in fungal genomics and proteomics research. Much effort has also been directed towards the development of methodologies for optimal genomic DNA and protein extraction and separation. To date, the number of proteomics investigations in Ascomycetes exceeds that carried out in any other fungal group. This is primarily due to the preponderance of their involvement in plant and animal diseases and multiple industrial applications, and therefore the need to understand the biological basis of the infectious process to develop mechanisms for biological control, as well as to detect key proteins with roles in stress survival. Here we present as comprehensive an overview as possible of the major advances, mainly of the past decade, in the fields of genomics (including phylogenomics) and proteomics of Ascomycota, focusing particularly on those reporting on opportunistic pathogenic, extremophilic, polyextremotolerant and lichenized fungi. We also present a review of the most widely used genome sequencing technologies and methods for DNA sequence and protein analyses applied so far to fungi.

Song K. Classifying the Lifestyle of Metagenomically-Derived Phages Sequences Using Alignment-Free Methods. Front Microbiol 2020;11:567769. [PMID: 33304326 PMCID: PMC7693541 DOI: 10.3389/fmicb.2020.567769] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 05/30/2020] [Accepted: 10/22/2020] [Indexed: 01/20/2023] Open

Yang Z, Li H, Jia Y, Zheng Y, Meng H, Bao T, Li X, Luo L. Intrinsic laws of k-mer spectra of genome sequences and evolution mechanism of genomes. BMC Evol Biol 2020;20:157. [PMID: 33228538 PMCID: PMC7684957 DOI: 10.1186/s12862-020-01723-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 06/04/2020] [Accepted: 11/10/2020] [Indexed: 11/17/2022] Open

Bonnici V, Maresi E, Giugno R. Challenges in gene-oriented approaches for pangenome content discovery. Brief Bioinform 2020;22:5901976. [PMID: 32893299 DOI: 10.1093/bib/bbaa198] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 03/28/2020] [Revised: 05/14/2020] [Accepted: 08/04/2020] [Indexed: 01/17/2023] Open

Systematic Analysis of REBASE Identifies Numerous Type I Restriction-Modification Systems with Duplicated, Distinct hsdS Specificity Genes That Can Switch System Specificity by Recombination. mSystems 2020;5:e00497-20. [PMID: 32723795 PMCID: PMC7394358 DOI: 10.1128/msystems.00497-20] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 12/14/2022] Open

Abstract

N⁶-Adenine DNA methyltransferases associated with some Type I and Type III restriction-modification (R-M) systems are able to undergo phase variation, randomly switching expression ON or OFF by varying the length of locus-encoded simple sequence repeats (SSRs). This variation of methyltransferase expression results in genome-wide methylation differences and global changes in gene expression. These epigenetic regulatory systems are called phasevarions, phase-variable regulons, and are widespread in bacteria. A distinct switching system has also been described in Type I R-M systems, based on recombination-driven changes in hsdS genes, which dictate the DNA target site. In order to determine the prevalence of recombination-driven phasevarions, we generated a program called RecombinationRepeatSearch to interrogate REBASE and identify the presence and number of inverted repeats of hsdS downstream of Type I R-M loci. We report that 3.9% of Type I R-M systems have duplicated variable hsdS genes containing inverted repeats capable of phase variation. We report the presence of these systems in the major pathogens Enterococcus faecalis and Listeria monocytogenes, which could have important implications for pathogenesis and vaccine development. These data suggest that in addition to SSR-driven phasevarions, many bacteria have independently evolved phase-variable Type I R-M systems via recombination between multiple, variable hsdS genes.

IMPORTANCE Many bacterial species contain DNA methyltransferases that have random on/off switching of expression. These systems, called phasevarions (phase-variable regulons), control the expression of multiple genes by global methylation changes. In every previously characterized phasevarion, genes involved in pathobiology, antibiotic resistance, and potential vaccine candidates are randomly varied in their expression, commensurate with methyltransferase switching. Our systematic study to determine the extent of phasevarions controlled by invertible Type I R-M systems will provide valuable information for understanding how bacteria regulate genes and is key to the study of physiology, virulence, and vaccine development; therefore, it is critical to identify and characterize phase-variable methyltransferases controlling phasevarions.

Mismatch-tolerant, alignment-free sequence classification using multiple spaced seeds and multiindex Bloom filters. Proc Natl Acad Sci U S A 2020;117:16961-16968. [PMID: 32641514 PMCID: PMC7382288 DOI: 10.1073/pnas.1903436117] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 11/18/2022] Open

Leal NC, Campos TL, Rezende AM, Docena C, Mendes-Marques CL, de Sá Cavalcanti FL, Wallau GL, Rocha IV, Cavalcanti CLB, Veras DL, Alves LR, Andrade-Figueiredo M, de Barros MPS, de Almeida AMP, de Morais MMC, Leal-Balbino TC, Xavier DE, de-Melo-Neto OP. Comparative Genomics of Acinetobacter baumannii Clinical Strains From Brazil Reveals Polyclonal Dissemination and Selective Exchange of Mobile Genetic Elements Associated With Resistance Genes. Front Microbiol 2020;11:1176. [PMID: 32655514 PMCID: PMC7326025 DOI: 10.3389/fmicb.2020.01176] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 01/03/2020] [Accepted: 05/08/2020] [Indexed: 12/13/2022] Open

Abstract

Acinetobacter baumannii is an opportunistic bacterial pathogen infecting immunocompromised patients and has gained attention worldwide due to its increased antimicrobial resistance. Here, we report a comparative whole-genome sequencing and analysis coupled with an assessment of antibiotic resistance of 46 Acinetobacter strains (45 A. baumannii plus one Acinetobacter nosocomialis) originating from five hospitals in the city of Recife, Brazil, between 2010 and 2014. An average of 3,809 genes was identified per genome, although only 2,006 genes were single copy orthologs or core genes conserved across all sequenced strains, with an average of 42 new genes found per strain. We evaluated genetic distance through a phylogenetic analysis and MLST as well as the presence of antibiotic resistance genes, virulence markers and mobile genetic elements (MGE). The phylogenetic analysis recovered distinct monophyletic A. baumannii groups corresponding to five known STs (ST1, ST15, ST25, ST79, and ST113) and one novel ST (ST881, related to ST1). A large number of ST-specific genes were found, with the ST79 strains having the largest number of genes in common that were missing from the other STs. Multiple genes associated with resistance to β-lactams, aminoglycosides and other antibiotics were found. Some of those were clearly mapped to defined MGEs, and an analysis of those revealed known elements as well as a novel Tn7-Tn3 transposon with a clear ST-specific distribution. An association of selected resistance/virulence markers with specific STs was indeed observed, as well as the recent spread of the OXA-253 carbapenemase encoding gene. Virulence genes associated with the synthesis of the capsular antigens were noticeably more variable in the ST113 and ST79 strains. Indeed, several resistance and virulence genes were common to the ST79 and ST113 strains only, despite a greater genetic distance between them, suggesting common means of genetic exchange. Our comparative analysis reveals the spread of multiple STs and the genomic plasticity of A. baumannii from different hospitals in a single metropolitan area. It also highlights differences in the spread of resistance markers and other MGEs between the investigated STs, with implications for the monitoring and treatment of Acinetobacter in ongoing and future outbreaks.


Affiliation(s)

Nilma C Leal Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Túlio L Campos Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Antonio M Rezende Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Cássia Docena Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Carina L Mendes-Marques Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Felipe L de Sá Cavalcanti Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil.,Department of Pathology, Institute of Biological Sciences, University of Pernambuco, Recife, Brazil
Gabriel L Wallau Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Igor V Rocha Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Carmelita L B Cavalcanti Laboratory of Immunopathology Keizo Asami, Federal University of Pernambuco, Recife, Brazil
Dyana L Veras Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Lilian R Alves Department of Tropical Medicine, Federal University of Pernambuco, Recife, Brazil
Mariana Andrade-Figueiredo Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Maria P Silva de Barros Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Alzira M Paiva de Almeida Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Marcia M Camargo de Morais Department of Pathology, Institute of Biological Sciences, University of Pernambuco, Recife, Brazil
Tereza C Leal-Balbino Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Danilo E Xavier Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil
Osvaldo P de-Melo-Neto Aggeu Magalhães Institute (IAM), Fundação Oswaldo Cruz (Fiocruz), Recife, Brazil


Dong J, Liu S, Zhang Y, Dai Y, Wu Q. A New Alignment-Free Whole Metagenome Comparison Tool and Its Application on Gut Microbiomes of Wild Giant Pandas. Front Microbiol 2020;11:1061. [PMID: 32612579 PMCID: PMC7309450 DOI: 10.3389/fmicb.2020.01061] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2020] [Accepted: 04/29/2020] [Indexed: 11/13/2022] Open

Li J, Gu T, Li L, Wu X, Shen L, Yu R, Liu Y, Qiu G, Zeng W. Complete genome sequencing and comparative genomic analyses of Bacillus sp. S3, a novel hyper Sb(III)-oxidizing bacterium. BMC Microbiol 2020;20:106. [PMID: 32354325 PMCID: PMC7193398 DOI: 10.1186/s12866-020-01737-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 02/25/2020] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Antimonite [Sb(III)]-oxidizing bacteria have great potential in the environmental bioremediation of Sb-polluted sites. Bacillus sp. S3, which was previously isolated from antimony-contaminated soil, displayed high Sb(III) resistance and Sb(III) oxidation efficiency. However, genomic information on Bacillus sp. S3 and knowledge of its evolutionary features are very scarce.

RESULTS

Here, we identified a 5,436,472 bp chromosome with 40.30% GC content and a 241,339 bp plasmid with 36.74% GC content in the complete genome of Bacillus sp. S3. Genomic annotation showed that Bacillus sp. S3 contained a key aioB gene potentially encoding As(III)/Sb(III) oxidase, which was not shared with other Bacillus strains. Furthermore, a wide variety of genes associated with Sb(III) and other heavy metal(loid)s were also ascertained in Bacillus sp. S3, reflecting its adaptive advantage for growth in the harsh eco-environment. Based on the analysis of phylogenetic relationship and the average nucleotide identities (ANI), Bacillus sp. S3 was shown to be a novel species within the Bacillus genus. The majority of mobile genetic elements (MGEs) were distributed on chromosomes within the Bacillus genus. Pan-genome analysis showed that the 45 genomes contained 554 core genes, and many unique genes were identified in the analyzed genomes. Whole genomic alignment showed that the Bacillus genus underwent frequent large-scale evolutionary events. In addition, the origin and evolution analysis of Sb(III)-resistance genes revealed the evolutionary relationships and horizontal gene transfer (HGT) events among the Bacillus genus. The assessment of the functionality of heavy metal(loid) resistance genes emphasized their indispensable role in the harsh eco-environment of the Bacillus genus. Real-time quantitative PCR (RT-qPCR) analysis indicated that Sb(III)-related genes were all induced under the Sb(III) stress, while the arsC gene was down-regulated.

CONCLUSIONS

The results of this study shed light on the molecular mechanisms by which Bacillus sp. S3 copes with Sb(III), extended our understanding of the evolutionary relationships between Bacillus sp. S3 and other closely related species, and further enriched the Sb(III) resistance genetic data sources.


Affiliation(s)

Jiaokun Li School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China
Tianyuan Gu School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China
Liangzhi Li School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China
Xueling Wu School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China
Li Shen School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China
Runlan Yu School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China
Yuandong Liu School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China
Guanzhou Qiu School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China
Weimin Zeng School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China.,Key Laboratory of Biometallurgy, Ministry of Education, Central South University, Changsha, 410083, China


A comparative genomics study of 23 Aspergillus species from section Flavi. Nat Commun 2020;11:1106. [PMID: 32107379 PMCID: PMC7046712 DOI: 10.1038/s41467-019-14051-y] [Citation(s) in RCA: 92] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Accepted: 12/02/2019] [Indexed: 02/01/2023] Open

Röhling S, Linne A, Schellhorn J, Hosseini M, Dencker T, Morgenstern B. The number of k-mer matches between two DNA sequences as a function of k and applications to estimate phylogenetic distances. PLoS One 2020;15:e0228070. [PMID: 32040534 PMCID: PMC7010260 DOI: 10.1371/journal.pone.0228070] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 01/08/2020] [Indexed: 12/14/2022] Open

Salwan R, Sharma V. Molecular and biotechnological aspects of secondary metabolites in actinobacteria. Microbiol Res 2020;231:126374. [DOI: 10.1016/j.micres.2019.126374] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2019] [Revised: 11/10/2019] [Accepted: 11/11/2019] [Indexed: 12/21/2022]

Agüero-Chapin G, Galpert D, Molina-Ruiz R, Ancede-Gallardo E, Pérez-Machado G, De la Riva GA, Antunes A. Graph Theory-Based Sequence Descriptors as Remote Homology Predictors. Biomolecules 2019;10:E26. [PMID: 31878100 PMCID: PMC7022958 DOI: 10.3390/biom10010026] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 12/16/2019] [Accepted: 12/18/2019] [Indexed: 12/23/2022] Open

Tang K, Ren J, Sun F. Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression. Genome Biol 2019;20:266. [PMID: 31801606 PMCID: PMC6891986 DOI: 10.1186/s13059-019-1872-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Accepted: 10/29/2019] [Indexed: 11/27/2022] Open

Song K, Ren J, Sun F. Reads Binning Improves Alignment-Free Metagenome Comparison. Front Genet 2019;10:1156. [PMID: 31824565 PMCID: PMC6881972 DOI: 10.3389/fgene.2019.01156] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Accepted: 10/22/2019] [Indexed: 12/26/2022] Open

Zhou Y, Zhang W, Wu H, Huang K, Jin J. A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms. BMC Genomics 2019;20:754. [PMID: 31638897 PMCID: PMC6805505 DOI: 10.1186/s12864-019-6119-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 09/20/2019] [Indexed: 12/03/2022] Open

Abstract

Background

Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thus, it is necessary to develop a novel approach to address this problem.

Results

Here, we initially determine that the “tetranucleotide-derived z-value Pearson correlation coefficient” (TETRA) approach is representative of other published statistical methods. Then, we devise a novel method called “Tetranucleotide-derived Z-value Manhattan Distance” (TZMD) and compare it with the TETRA approach. Our results show that TZMD reflects the maximal genome difference, while TETRA does not in most conditions, demonstrating in theory that TZMD provides improved resolution. Additionally, our analysis of real data shows that TZMD improves species differentiation and clearly differentiates similar organisms, including similar species belonging to the same genospecies, subspecies and intraspecific strains, most of which cannot be distinguished by TETRA. Furthermore, TZMD is able to determine clonal strains with the TZMD = 0 criterion, which intrinsically encompasses identical composition, high average nucleotide identity and high percentage of shared genomes.

Conclusions

Our extensive assessment demonstrates that TZMD has high resolution. This study is the first to propose a composition-based method for differentiating bacteria at the strain level and to demonstrate that composition is also strain specific. TZMD is a powerful tool and the first easy-to-use approach for differentiating clonal and non-clonal strains. Therefore, as the first composition-based algorithm for strain typing, TZMD will facilitate bacterial studies in the future.


Huang GD, Liu XM, Huang TL, Xia LC. The statistical power of k-mer based aggregative statistics for alignment-free detection of horizontal gene transfer. Synth Syst Biotechnol 2019;4:150-156. [PMID: 31508512 PMCID: PMC6723412 DOI: 10.1016/j.synbio.2019.08.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Revised: 07/14/2019] [Accepted: 08/05/2019] [Indexed: 12/21/2022] Open

Pei S, Dong R, He RL, Yau SST. Large-Scale Genome Comparison Based on Cumulative Fourier Power and Phase Spectra: Central Moment and Covariance Vector. Comput Struct Biotechnol J 2019;17:982-994. [PMID: 31384399 PMCID: PMC6661692 DOI: 10.1016/j.csbj.2019.07.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Revised: 06/24/2019] [Accepted: 07/10/2019] [Indexed: 01/04/2023] Open

The Shared and Specific Genes and a Comparative Genomics Analysis within Three Hanseniaspora Strains. Int J Genomics 2019;2019:7910865. [PMID: 31281829 PMCID: PMC6589277 DOI: 10.1155/2019/7910865] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Revised: 02/17/2019] [Accepted: 04/16/2019] [Indexed: 11/21/2022] Open

Alanjary M, Steinke K, Ziemert N. AutoMLST: an automated web server for generating multi-locus species trees highlighting natural product potential. Nucleic Acids Res 2019;47:W276-W282. [PMID: 30997504 PMCID: PMC6602446 DOI: 10.1093/nar/gkz282] [Citation(s) in RCA: 229] [Impact Index Per Article: 45.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Revised: 03/29/2019] [Accepted: 04/10/2019] [Indexed: 12/31/2022] Open

Lu YY, Tang K, Ren J, Fuhrman JA, Waterman MS, Sun F. CAFE: aCcelerated Alignment-FrEe sequence analysis. Nucleic Acids Res 2017;45:W554-W559. [PMID: 28472388 PMCID: PMC5793812 DOI: 10.1093/nar/gkx351] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Accepted: 04/20/2017] [Indexed: 12/13/2022] Open

Wen QF, Liu S, Dong C, Guo HX, Gao YZ, Guo FB. Geptop 2.0: An Updated, More Precise, and Faster Geptop Server for Identification of Prokaryotic Essential Genes. Front Microbiol 2019;10:1236. [PMID: 31214154 PMCID: PMC6558110 DOI: 10.3389/fmicb.2019.01236] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Accepted: 05/17/2019] [Indexed: 12/16/2022] Open

Leimeister CA, Schellhorn J, Dörrer S, Gerth M, Bleidorn C, Morgenstern B. Prot-SpaM: fast alignment-free phylogeny reconstruction based on whole-proteome sequences. Gigascience 2019;8:giy148. [PMID: 30535314 PMCID: PMC6436989 DOI: 10.1093/gigascience/giy148] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Revised: 09/10/2018] [Accepted: 11/20/2018] [Indexed: 11/20/2022] Open

Prabha R, Singh DP. Cyanobacterial phylogenetic analysis based on phylogenomics approaches render evolutionary diversification and adaptation: an overview of representative orders. 3 Biotech 2019;9:87. [PMID: 30800598 DOI: 10.1007/s13205-019-1635-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Accepted: 02/11/2019] [Indexed: 12/12/2022] Open

Abstract

Phylogenetic studies based on a definite set of marker genes usually reconstruct evolutionary relationships among prokaryotic species. Based on specific target sequences, such studies represent variations and allow identification of similarities or dissimilarities in organisms. With the advent of completely sequenced genomes and the accumulation of information on whole prokaryotic genomes, phylogenetic reconstructions should be considered more reliable if they are based on entire genomes. We applied phylogenomics approaches to completely sequenced cyanobacterial genomes to reconstruct relationships among species that represented major taxonomic classes and belonged to distinctly different habitats (freshwater, marine, soils, and rocks). We did not rely solely on the 16S rDNA ribosomal gene to describe the phylogeny of all representative classes of cyanobacterial species. Instead, we combined molecular marker and phylogenomics approaches (genome alignment, gene content and gene order, composition vector and protein domain content) to accurately infer the phylogenetic relationships of species. We have shown that this approach reflects the impact of evolution on the organisms and considers connections with ecological adaptation of cyanobacteria in different habitats. Analysis revealed that the members from marine habitats occupy a different profile than those from freshwater. The impact of GC content and genomic repetitiveness on the diversification of cyanobacterial species and their possible role in adaptation were also reflected. Members occupying similar habitats cover more evolutionary distance together and also evolve various strategies for adaptation and survival, either through genomic repetitiveness, preferences for genes of particular functions, or modified GC content. Genomes undergo different changes for their adaptation to diverse habitats.


Sarmashghi S, Bohmann K, Gilbert MTP, Bafna V, Mirarab S. Skmer: assembly-free and alignment-free sample identification using genome skims. Genome Biol 2019;20:34. [PMID: 30760303 PMCID: PMC6374904 DOI: 10.1186/s13059-019-1632-4] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2018] [Accepted: 01/16/2019] [Indexed: 01/10/2023] Open

PVTree: A Sequential Pattern Mining Method for Alignment Independent Phylogeny Reconstruction. Genes (Basel) 2019;10:genes10020073. [PMID: 30678245 PMCID: PMC6410268 DOI: 10.3390/genes10020073] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Revised: 01/04/2019] [Accepted: 01/14/2019] [Indexed: 11/21/2022] Open

Polyphyly in 16S rRNA-based LVTree Versus Monophyly in Whole-genome-based CVTree. Genomics Proteomics Bioinformatics 2018;16:310-319. [PMID: 30550857 PMCID: PMC6364046 DOI: 10.1016/j.gpb.2018.06.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Revised: 05/11/2018] [Accepted: 06/25/2018] [Indexed: 11/23/2022]

Sahay S, Shome R, Sankarasubramanian J, Vishnu US, Prajapati A, Natesan K, Shome BR, Rahman H, Rajendhran J. Genome sequence analysis of the Indian strain Mannheimia haemolytica serotype A2 from ovine pneumonic pasteurellosis. Ann Microbiol 2018. [DOI: 10.1007/s13213-018-1410-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022] Open

Tang K, Ren J, Cronn R, Erickson DL, Milligan BG, Parker-Forney M, Spouge JL, Sun F. Alignment-free genome comparison enables accurate geographic sourcing of white oak DNA. BMC Genomics 2018;19:896. [PMID: 30526482 PMCID: PMC6288960 DOI: 10.1186/s12864-018-5253-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Accepted: 11/15/2018] [Indexed: 01/14/2023] Open

Insights into the genome sequence of ovine Pasteurella multocida type A strain associated with pneumonic pasteurellosis. Small Rumin Res 2018. [DOI: 10.1016/j.smallrumres.2018.10.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Bonnici V, Giugno R, Manca V. PanDelos: a dictionary-based method for pan-genome content discovery. BMC Bioinformatics 2018;19:437. [PMID: 30497358 PMCID: PMC6266927 DOI: 10.1186/s12859-018-2417-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open

Abstract

Background

Pan-genome approaches afford the discovery of homology relations in a set of genomes by determining how gene families are distributed among them. The retrieval of a complete gene distribution among a class of genomes is an NP-hard problem because computational costs increase with the number of analyzed genomes; in fact, all-against-all gene comparisons are required to completely solve the problem. In the presence of phylogenetically distant genomes, due to the variability introduced in gene duplication and transmission, the task of recognizing homologous genes becomes even more difficult. A challenge in this field is that of designing fast and adaptive similarity measures in order to find a suitable pan-genome structure of homology relations.

Results

We present PanDelos, a standalone tool for the discovery of pan-genome contents among phylogenetically distant genomes. The methodology is based on information theory and network analysis. It is parameter-free because thresholds are automatically deduced from the context. PanDelos avoids sequence alignment by introducing a measure based on k-mer multiplicity. The k-mer length is defined according to general arguments rather than empirical considerations. Homology candidate relations are integrated into a global network and groups of homologous genes are extracted by applying a community detection algorithm.

Conclusions

PanDelos outperforms existing approaches, Roary and EDGAR, in terms of running time and quality of content discovery. Tests were run on collections of real genomes, previously used in analogous studies, and on synthetic benchmarks that represent a fully trusted ground truth. The software is available at https://github.com/GiugnoLab/PanDelos.



Chen NWG, Serres-Giardi L, Ruh M, Briand M, Bonneau S, Darrasse A, Barbe V, Gagnevin L, Koebnik R, Jacques MA. Horizontal gene transfer plays a major role in the pathological convergence of Xanthomonas lineages on common bean. BMC Genomics 2018;19:606. [PMID: 30103675 PMCID: PMC6090828 DOI: 10.1186/s12864-018-4975-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2018] [Accepted: 07/31/2018] [Indexed: 12/18/2022] Open

Comparative studies of alignment, alignment-free and SVM based approaches for predicting the hosts of viruses based on viral sequences. Sci Rep 2018;8:10032. [PMID: 29968780 PMCID: PMC6030160 DOI: 10.1038/s41598-018-28308-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 06/15/2018] [Indexed: 12/05/2022] Open