1
|
Forés-Martos J, Forte A, García-Martínez J, Pérez-Ortín JE. A Trans-Omics Comparison Reveals Common Gene Expression Strategies in Four Model Organisms and Exposes Similarities and Differences between Them. Cells 2021; 10:334. [PMID: 33562654 PMCID: PMC7914595 DOI: 10.3390/cells10020334] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/29/2021] [Accepted: 02/01/2021] [Indexed: 12/01/2022] Open
Abstract
The ultimate goal of gene expression regulation is on the protein level. However, because the amounts of mRNAs and proteins are controlled by their synthesis and degradation rates, the cellular amount of a given protein can be attained by following different strategies. By studying omics data for six expression variables (mRNA and protein amounts, plus their synthesis and decay rates), we previously demonstrated the existence of common expression strategies (CESs) for functionally related genes in the yeast Saccharomyces cerevisiae. Here we extend that study to two other eukaryotes: the yeast Schizosaccharomyces pombe and cultured human HeLa cells. We also use genomic data from the model prokaryote Escherichia coli as an external reference. We show that six-variable profiles (6VPs) can be constructed for every gene and that these 6VPs are similar for genes with similar functions in all the studied organisms. The differences in 6VPs between organisms can be used to establish their phylogenetic relationships. The analysis of the correlations among the six variables supports the hypothesis that most gene expression control occurs in actively growing organisms at the transcription rate level, and that translation plays a minor role. We propose that living organisms use CESs for the genes acting on the same physiological pathways, especially for those belonging to stable macromolecular complexes, but CESs have been modeled by evolution to adapt to the specific life circumstances of each organism.
Collapse
Affiliation(s)
- Jaume Forés-Martos
- Instituto de Biotecnología y Biomedicina (Biotecmed), Universitat de València, C/Dr. Moliner 50, E46100 Burjassot, Spain;
| | - Anabel Forte
- Departamento de Estadística e Investigación Operativa, Facultad de Matemáticas, Universitat de València, C/Dr. Moliner 50, E46100 Burjassot, Spain;
| | - José García-Martínez
- Instituto de Biotecnología y Biomedicina (Biotecmed), Universitat de València, C/Dr. Moliner 50, E46100 Burjassot, Spain;
| | - José E. Pérez-Ortín
- Instituto de Biotecnología y Biomedicina (Biotecmed), Universitat de València, C/Dr. Moliner 50, E46100 Burjassot, Spain;
| |
Collapse
|
2
|
Abstract
With the increasing number of sequenced genomes and their comparisons, the detection of orthologs is crucial for reliable functional annotation and evolutionary analyses of genes and species. Yet, the dynamic remodeling of genome content through gain, loss, transfer of genes, and segmental and whole-genome duplication hinders reliable orthology detection. Moreover, the lack of direct functional evidence and the questionable quality of some available genome sequences and annotations present additional difficulties to assess orthology. This article reviews the existing computational methods and their potential accuracy in the high-throughput era of genome sequencing and anticipates open questions in terms of methodology, reliability, and computation. Appropriate taxon sampling together with combination of methods based on similarity, phylogeny, synteny, and evolutionary knowledge that may help detecting speciation events appears to be the most accurate strategy. This review also raises perspectives on the potential determination of orthology throughout the whole species phylogeny.
Collapse
Affiliation(s)
- Fredj Tekaia
- Institut Pasteur, Unit of Structural Microbiology, CNRS URA 3528 and University Paris Diderot, Sorbonne Paris Cité, Paris, France
| |
Collapse
|
3
|
House CH, Pellegrini M, Fitz-Gibbon ST. Genome-wide gene order distances support clustering the gram-positive bacteria. Front Microbiol 2015; 5:785. [PMID: 25653643 PMCID: PMC4299520 DOI: 10.3389/fmicb.2014.00785] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Accepted: 12/21/2014] [Indexed: 11/29/2022] Open
Abstract
Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes and determining if the chosen orthologs were in the same order. The raw distances were then corrected for gene order convergence using an adaptation of the Jukes-Cantor model, as well as using the common distance correction D' = -ln(1-D). First, we compared the distances found via the order of six orthologs to distances found based on ortholog gene content and small subunit rRNA sequences. The Jukes-Cantor gene order distances are reasonably well correlated with the divergence of rRNA (R (2) = 0.24), especially at rRNA Jukes-Cantor distances of less than 0.2 (R (2) = 0.52). Gene content is only weakly correlated with rRNA divergence (R (2) = 0.04) over all distances, however, it is especially strongly correlated at rRNA Jukes-Cantor distances of less than 0.1 (R (2) = 0.67). This initial work suggests that gene order may be useful in conjunction with other methods to help understand the relatedness of genomes. Using the gene order distances in 143 genomes, the relations of prokaryotes were studied using neighbor joining and agreement subtrees. We then repeated our study of the relations of prokaryotes using gene order in 172 complete genomes better representing a wider-diversity of prokaryotes. Consistently, our trees show the Actinobacteria as a sister group to the bulk of the Firmicutes. In fact, the robustness of gene order support was found to be considerably greater for uniting these two phyla than for uniting any of the proteobacterial classes together. The results are supportive of the idea that Actinobacteria and Firmicutes are closely related, which in turn implies a single origin for the gram-positive cell.
Collapse
Affiliation(s)
- Christopher H. House
- Penn State Astrobiology Research Center and Department of Geosciences, The Pennsylvania State UniversityUniversity Park, PA, USA
| | - Matteo Pellegrini
- Department of Molecular, Cell, and Developmental Biology, University of California, Los AngelesLos Angeles, CA, USA
- Department of Molecular, Cell, and Developmental Biology, Institute of Genomics and Proteomics, University of California, Los AngelesLos Angeles, CA, USA
| | - Sorel T. Fitz-Gibbon
- Department of Molecular, Cell, and Developmental Biology, University of California, Los AngelesLos Angeles, CA, USA
- Department of Molecular, Cell, and Developmental Biology, Institute of Genomics and Proteomics, University of California, Los AngelesLos Angeles, CA, USA
| |
Collapse
|
4
|
Abstract
Myosins are key components of the eukaryotic cytoskeleton, providing motility for a broad diversity of cargoes. Therefore, understanding the origin and evolutionary history of myosin classes is crucial to address the evolution of eukaryote cell biology. Here, we revise the classification of myosins using an updated taxon sampling that includes newly or recently sequenced genomes and transcriptomes from key taxa. We performed a survey of eukaryotic genomes and phylogenetic analyses of the myosin gene family, reconstructing the myosin toolkit at different key nodes in the eukaryotic tree of life. We also identified the phylogenetic distribution of myosin diversity in terms of number of genes, associated protein domains and number of classes in each taxa. Our analyses show that new classes (i.e., paralogs) and domain architectures were continuously generated throughout eukaryote evolution, with a significant expansion of myosin abundance and domain architectural diversity at the stem of Holozoa, predating the origin of animal multicellularity. Indeed, single-celled holozoans have the most complex myosin complement among eukaryotes, with paralogs of most myosins previously considered animal specific. We recover a dynamic evolutionary history, with several lineage-specific expansions (e.g., the myosin III-like gene family diversification in choanoflagellates), convergence in protein domain architectures (e.g., fungal and animal chitin synthase myosins), and important secondary losses. Overall, our evolutionary scheme demonstrates that the ancestral eukaryote likely had a complex myosin repertoire that included six genes with different protein domain architectures. Finally, we provide an integrative and robust classification, useful for future genomic and functional studies on this crucial eukaryotic gene family.
Collapse
Affiliation(s)
- Arnau Sebé-Pedrós
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta, Barcelona, Catalonia, Spain
| | | | | | | |
Collapse
|
5
|
Grouzdev DS, Kuznetsov BB, Keppen OI, Krasil’nikova EN, Lebedeva NV, Ivanovsky RN. Reconstruction of bacteriochlorophyll biosynthesis pathways in the filamentous anoxygenic phototrophic bacterium Oscillochloris trichoides DG-6 and evolution of anoxygenic phototrophs of the order Chloroflexales. Microbiology (Reading) 2015; 161:120-130. [DOI: 10.1099/mic.0.082313-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Denis S. Grouzdev
- Faculty of Biology, Moscow State University, Moscow, Russia
- Bioengineering Center, Russian Academy of Sciences, Moscow, Russia
| | | | - Olga I. Keppen
- Faculty of Biology, Moscow State University, Moscow, Russia
| | | | | | | |
Collapse
|
6
|
Abstract
Bacterial genomes are remarkably stable from one generation to the next but are plastic on an evolutionary time scale, substantially shaped by horizontal gene transfer, genome rearrangement, and the activities of mobile DNA elements. This implies the existence of a delicate balance between the maintenance of genome stability and the tolerance of genome instability. In this review, we describe the specialized genetic elements and the endogenous processes that contribute to genome instability. We then discuss the consequences of genome instability at the physiological level, where cells have harnessed instability to mediate phase and antigenic variation, and at the evolutionary level, where horizontal gene transfer has played an important role. Indeed, this ability to share DNA sequences has played a major part in the evolution of life on Earth. The evolutionary plasticity of bacterial genomes, coupled with the vast numbers of bacteria on the planet, substantially limits our ability to control disease.
Collapse
|
7
|
Abstract
The origin of complex centralized brains is one of the major evolutionary transitions in the history of animals. Monophyly (i.e. presence of a centralized nervous system in urbilateria) vs polyphyly (i.e. multiple origins by parallel centralization of nervous systems within several lineages) are two historically conflicting scenarios to explain such transitions. However, recent phylogenomic and cladistic analysis suggests that complex brains may have independently evolved at least 9 times within different animal lineages. Indeed, even within the phylum Mollusca cephalization might have occurred at least 5 times. Emerging molecular data further suggest that at the genomic level such transitions might have been achieved by changes in expression of just a few transcriptional factors - not surprising since such events might happen multiple times over 700 million years of animal evolution. Both cladistic and genomic analyses also imply that neurons themselves evolved more than once. Ancestral polarized secretory cells were likely involved in coordination of ciliated locomotion in early animals, and these cells can be considered as evolutionary precursors of neurons within different lineages. Under this scenario, the origins of neurons can be linked to adaptations to stress/injury factors in the form of integrated regeneration-type cellular response with secretory signaling peptides as early neurotransmitters. To further reconstruct the parallel evolution of nervous systems genomic approaches are essential to probe enigmatic neurons of basal metazoans, selected lophotrochozoans (e.g. phoronids, brachiopods) and deuterostomes.
Collapse
Affiliation(s)
- L L Moroz
- The Whitney Laboratory for Marine Bioscience, University of Florida, 9505 Ocean Shore Blvd. St. Augustine Florida 32080, USA.
| |
Collapse
|
8
|
Meinel T, Krause A. Meta-analysis of general bacterial subclades in whole-genome phylogenies using tree topology profiling. Evol Bioinform Online 2012; 8:489-525. [PMID: 22915837 PMCID: PMC3422217 DOI: 10.4137/ebo.s9642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
In the last two decades, a large number of whole-genome phylogenies have been inferred to reconstruct the Tree of Life (ToL). Underlying data models range from gene or functionality content in species to phylogenetic gene family trees and multiple sequence alignments of concatenated protein sequences. Diversity in data models together with the use of different tree reconstruction techniques, disruptive biological effects and the steadily increasing number of genomes have led to a huge diversity in published phylogenies. Comparison of those and, moreover, identification of the impact of inference properties (underlying data model, inference technique) on particular reconstructions is almost impossible. In this work, we introduce tree topology profiling as a method to compare already published whole-genome phylogenies. This method requires visual determination of the particular topology in a drawn whole-genome phylogeny for a set of particular bacterial clans. For each clan, neighborhoods to other bacteria are collected into a catalogue of generalized alternative topologies. Particular topology alternatives found for an ordered list of bacterial clans reveal a topology profile that represents the analyzed phylogeny. To simulate the inhomogeneity of published gene content phylogenies we generate a set of seven phylogenies using different inference techniques and the SYSTERS-PhyloMatrix data model. After tree topology profiling on in total 54 selected published and newly inferred phylogenies, we separate artefactual from biologically meaningful phylogenies and associate particular inference results (phylogenies) with inference background (inference techniques as well as data models). Topological relationships of particular bacterial species groups are presented. With this work we introduce tree topology profiling into the scientific field of comparative phylogenomics.
Collapse
Affiliation(s)
- Thomas Meinel
- Charité-University Medicine Berlin, Institute for Physiology, Structural Bioinformatics Group, Thielallee 71, 14195 Berlin, Germany
| | | |
Collapse
|
9
|
Abstract
We examine the Tree of Life (TOL) as an evolutionary hypothesis and a heuristic. The original TOL hypothesis has failed but a new "statistical TOL hypothesis" is promising. The TOL heuristic usefully organizes data without positing fundamental evolutionary truth.
Collapse
Affiliation(s)
- Maureen A O'Malley
- Department of Philosophy, Quadrangle A14, University of Sydney, NSW 2006, Australia
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD20894, USA
| |
Collapse
|
10
|
Hendrickson RC, Wang C, Hatcher EL, Lefkowitz EJ. Orthopoxvirus genome evolution: the role of gene loss. Viruses 2010; 2:1933-67. [PMID: 21994715 DOI: 10.3390/v2091933] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2010] [Revised: 08/25/2010] [Accepted: 09/01/2010] [Indexed: 12/26/2022] Open
Abstract
Poxviruses are highly successful pathogens, known to infect a variety of hosts. The family Poxviridae includes Variola virus, the causative agent of smallpox, which has been eradicated as a public health threat but could potentially reemerge as a bioterrorist threat. The risk scenario includes other animal poxviruses and genetically engineered manipulations of poxviruses. Studies of orthologous gene sets have established the evolutionary relationships of members within the Poxviridae family. It is not clear, however, how variations between family members arose in the past, an important issue in understanding how these viruses may vary and possibly produce future threats. Using a newly developed poxvirus-specific tool, we predicted accurate gene sets for viruses with completely sequenced genomes in the genus Orthopoxvirus. Employing sensitive sequence comparison techniques together with comparison of syntenic gene maps, we established the relationships between all viral gene sets. These techniques allowed us to unambiguously identify the gene loss/gain events that have occurred over the course of orthopoxvirus evolution. It is clear that for all existing Orthopoxvirus species, no individual species has acquired protein-coding genes unique to that species. All existing species contain genes that are all present in members of the species Cowpox virus and that cowpox virus strains contain every gene present in any other orthopoxvirus strain. These results support a theory of reductive evolution in which the reduction in size of the core gene set of a putative ancestral virus played a critical role in speciation and confining any newly emerging virus species to a particular environmental (host or tissue) niche.
Collapse
|
11
|
Klenk HP, Göker M. En route to a genome-based classification of Archaea and Bacteria? Syst Appl Microbiol 2010; 33:175-82. [PMID: 20409658 DOI: 10.1016/j.syapm.2010.03.003] [Citation(s) in RCA: 250] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2009] [Revised: 03/10/2010] [Accepted: 03/17/2010] [Indexed: 11/23/2022]
Abstract
Given the considerable promise whole-genome sequencing offers for phylogeny and classification, it is surprising that microbial systematics and genomics have not yet been reconciled. This might be due to the intrinsic difficulties in inferring reasonable phylogenies from genomic sequences, particularly in the light of the significant amount of lateral gene transfer in prokaryotic genomes. However, recent studies indicate that the species tree and the hierarchical classification based on it are still meaningful concepts, and that state-of-the-art phylogenetic inference methods are able to provide reliable estimates of the species tree to the benefit of taxonomy. Conversely, we suspect that the current lack of completely sequenced genomes for many of the major lineages of prokaryotes and for most type strains is a major obstacle in progress towards a genome-based classification of microorganisms. We conclude that phylogeny-driven microbial genome sequencing projects such as the Genomic Encyclopaedia of Archaea and Bacteria (GEBA) project are likely to rectify this situation.
Collapse
|