1
|
Hartmann T, Bannach M, Middendorf M. Sorting Signed Permutations by Inverse Tandem Duplication Random Losses. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2177-2188. [PMID: 31095495 DOI: 10.1109/tcbb.2019.2917198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Gene order evolution of unichromosomal genomes, for example mitochondrial genomes, has been modelled mostly by four major types of genome rearrangements: inversions, transpositions, inverse transpositions, and tandem duplication random losses. Generalizing models that include all those rearrangements while admitting computational tractability are rare. In this paper, we study such a rearrangement model, namely the inverse tandem duplication random loss (iTDRL) model, where an iTDRL duplicates and inverts a continuous segment of a gene order followed by the random loss of one of the redundant copies of each gene. The iTDRL rearrangement has currently been proposed by several authors suggesting it to be a possible mechanisms of mitochondrial gene order evolution. We initiate the algorithmic study of this new model of genome rearrangement by proving that a shortest rearrangement scenario that transforms one given gene order into another given gene order can be obtained in quasilinear time. Furthermore, we show that the length of such a scenario, i.e., the minimum number of iTDRLs in the transformation, can be computed in linear time.
Collapse
|
2
|
Papetti C, Babbucci M, Dettai A, Basso A, Lucassen M, Harms L, Bonillo C, Heindler FM, Patarnello T, Negrisolo E. Not Frozen in the Ice: Large and Dynamic Rearrangements in the Mitochondrial Genomes of the Antarctic Fish. Genome Biol Evol 2021; 13:6133229. [PMID: 33570582 PMCID: PMC7936035 DOI: 10.1093/gbe/evab017] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/22/2021] [Indexed: 12/21/2022] Open
Abstract
The vertebrate mitochondrial genomes generally present a typical gene order. Exceptions are uncommon and important to study the genetic mechanisms of gene order rearrangements and their consequences on phylogenetic output and mitochondrial function. Antarctic notothenioid fish carry some peculiar rearrangements of the mitochondrial gene order. In this first systematic study of 28 species, we analyzed known and undescribed mitochondrial genome rearrangements for a total of eight different gene orders within the notothenioid fish. Our reconstructions suggest that transpositions, duplications, and inversion of multiple genes are the most likely mechanisms of rearrangement in notothenioid mitochondrial genomes. In Trematominae, we documented an extremely rare inversion of a large genomic segment of 5,300 bp that partially affected the gene compositional bias but not the phylogenetic output. The genomic region delimited by nad5 and trnF, close to the area of the Control Region, was identified as the hot spot of variation in Antarctic fish mitochondrial genomes. Analyzing the sequence of several intergenic spacers and mapping the arrangements on a newly generated phylogeny showed that the entire history of the Antarctic notothenioids is characterized by multiple, relatively rapid, events of disruption of the gene order. We hypothesized that a pre-existing genomic flexibility of the ancestor of the Antarctic notothenioids may have generated a precondition for gene order rearrangement, and the pressure of purifying selection could have worked for a rapid restoration of the mitochondrial functionality and compactness after each event of rearrangement.
Collapse
Affiliation(s)
- Chiara Papetti
- Department of Biology, University of Padova, Padova 35121,Italy.,Consorzio Nazionale Interuniversitario per le Scienze del Mare (CoNISMa), Roma 00196, Italy
| | - Massimiliano Babbucci
- Department of Comparative Biomedicine and Food Science, University of Padova, Legnaro 35020, Italy
| | - Agnes Dettai
- Institut de Systematique, Evolution, Biodiversité (ISYEB) Muséum national d'Histoire naturelle-CNRS-Sorbonne Université-EPHE, MNHN, Paris 75005, France
| | - Andrea Basso
- Department of Comparative Biomedicine and Food Science, University of Padova, Legnaro 35020, Italy
| | - Magnus Lucassen
- Helmholtz Centre for Polar and Marine Research, Alfred Wegener Institute, Am Handelshafen 12, Bremerhaven 27570, Germany
| | - Lars Harms
- Helmholtz Centre for Polar and Marine Research, Alfred Wegener Institute, Am Handelshafen 12, Bremerhaven 27570, Germany.,Helmholtz Institute for Functional Marine Biodiversity, University of Oldenburg (HIFMB), Ammerlsity of Oldenburg (HIFMOldenburg 26129, Germany
| | - Celine Bonillo
- Service de Systématique Moléculaire, UMS2700 Acquisition et Analyse de Données (2AD), MNHN, Paris 75005, France
| | | | - Tomaso Patarnello
- Department of Comparative Biomedicine and Food Science, University of Padova, Legnaro 35020, Italy
| | - Enrico Negrisolo
- Department of Comparative Biomedicine and Food Science, University of Padova, Legnaro 35020, Italy.,CRIBI Interdepartmental Research Centre for Innovative Biotechnologies, University of Padova, viale G. Colombo 3, Padova 35121, Italy
| |
Collapse
|
3
|
Hartmann T, Bernt M, Middendorf M. An Exact Algorithm for Sorting by Weighted Preserving Genome Rearrangements. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:52-62. [PMID: 29994030 DOI: 10.1109/tcbb.2018.2831661] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The preserving Genome Sorting Problem (pGSP) asks for a shortest sequence of rearrangement operations that transforms a given gene order into another given gene order by using rearrangement operations that preserve common intervals, i.e., groups of genes that form an interval in both given gene orders. The wpGSP is the weighted version of the problem were each type of rearrangement operation has a weight and a minimum weight sequence of rearrangement operations is sought. An exact algorithm - called CREx2 - is presented, which solves the wpGSP for arbitrary gene orders and the following types of rearrangement operations: inversions, transpositions, inverse transpositions, and tandem duplication random loss operations. CREx2 has a (worst case) exponential runtime, but a linear runtime for problem instances where the common intervals are organized in a linear structure. The efficiency of CREx2 and its usefulness for phylogenetic analysis is shown empirically for gene orders of fungal mitochondrial genomes.
Collapse
|
4
|
Hartmann T, Bernt M, Middendorf M. EqualTDRL: illustrating equivalent tandem duplication random loss rearrangements. BMC Bioinformatics 2018; 19:192. [PMID: 29843612 PMCID: PMC5975268 DOI: 10.1186/s12859-018-2170-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Accepted: 04/30/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND To study the differences between two unichromosomal circular genomes, e.g., mitochondrial genomes, under the tandem duplication random loss (TDRL) rearrangement it is important to consider the whole set of potential TDRL rearrangement events that could have taken place. The reason is that for two given circular gene orders there can exist different TDRL rearrangements that transform one of the gene orders into the other. Hence, a TDRL event cannot always be reconstructed only from the knowledge of the circular gene order before a TDRL event and the circular gene order after it. RESULTS We present the program EqualTDRL that computes and illustrates the complete set of TDRLs for pairs of circular gene orders that differ by only one TDRL. EqualTDRL considers the circularity of the given genomes and certain restrictions on the TDRL rearrangements. Examples for the latter are sequences of genes that have to be conserved during a TDRL or pairs of genes that frame intergenic regions which might represent remnants of duplicated genes. Additionally, EqualTDRL allows to determine the set of TDRLs that are minimum with respect to the number of duplicated genes. CONCLUSION EqualTDRL supports scientists to study the complete set of TDRLs that possibly could have taken place in the evolution of mitochondrial genomes. EqualTDRL is implemented in C++ using the ggplot2 package of the open source programming language R and is freely available from http://pacosy.informatik.uni-leipzig.de/equaltdrl .
Collapse
Affiliation(s)
- Tom Hartmann
- Swarm Intelligence and Complex Systems Group, Faculty of Mathematics and Computer Science, Leipzig University, Augustusplatz 10, Leipzig, D-04109 Germany
| | - Matthias Bernt
- Helmholtz Centre for Environmental Research - UFZ, Permoserstraße 15, Leipzig, D-04318 Germany
| | - Martin Middendorf
- Swarm Intelligence and Complex Systems Group, Faculty of Mathematics and Computer Science, Leipzig University, Augustusplatz 10, Leipzig, D-04109 Germany
| |
Collapse
|
5
|
A complete logical approach to resolve the evolution and dynamics of mitochondrial genome in bilaterians. PLoS One 2018; 13:e0194334. [PMID: 29547666 PMCID: PMC5856267 DOI: 10.1371/journal.pone.0194334] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Accepted: 03/01/2018] [Indexed: 01/12/2023] Open
Abstract
Investigating how recombination might modify gene order during the evolution has become a routine part of mitochondrial genome analysis. A new method of genomic maps analysis based on formal logic is described. The purpose of this method is to 1) use mitochondrial gene order of current taxa as datasets 2) calculate rearrangements between all mitochondrial gene orders and 3) reconstruct phylogenetic relationships according to these calculated rearrangements within a tree under the assumption of maximum parsimony. Unlike existing methods mainly based on the probabilistic approach, the main strength of this new approach is that it calculates all the exact tree solutions with completeness and provides logical consequences as highly robust results. Moreover, this method infers all possible hypothetical ancestors and reconstructs character states for all internal nodes of the trees. We started by testing our method using the deuterostomes as a study case. Then, with sponges as an outgroup, we investigated the evolutionary history of mitochondrial genomes of 47 bilaterian phyla and emphasised the peculiar case of chaetognaths. This pilot work showed that the use of formal logic in a hypothetico-deductive background such as phylogeny (where experimental testing of hypotheses is impossible) is very promising to explore mitochondrial gene order in deuterostomes and should be applied to many other bilaterian clades.
Collapse
|
6
|
Hellmuth M, Hernandez-Rosales M, Long Y, Stadler PF. Inferring phylogenetic trees from the knowledge of rare evolutionary events. J Math Biol 2017; 76:1623-1653. [PMID: 29218395 DOI: 10.1007/s00285-017-1194-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2016] [Revised: 11/16/2017] [Indexed: 10/18/2022]
Abstract
Rare events have played an increasing role in molecular phylogenetics as potentially homoplasy-poor characters. In this contribution we analyze the phylogenetic information content from a combinatorial point of view by considering the binary relation on the set of taxa defined by the existence of a single event separating two taxa. We show that the graph-representation of this relation must be a tree. Moreover, we characterize completely the relationship between the tree of such relations and the underlying phylogenetic tree. With directed operations such as tandem-duplication-random-loss events in mind we demonstrate how non-symmetric information constrains the position of the root in the partially reconstructed phylogeny.
Collapse
Affiliation(s)
- Marc Hellmuth
- Department of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Straße 47, 17487, Greifswald, Germany.,Center for Bioinformatics, Saarland University, Building E 2.1, P.O. Box 151150, 66041, Saarbrücken, Germany
| | - Maribel Hernandez-Rosales
- CONACYT-Instituto de Matemáticas, UNAM Juriquilla, Blvd. Juriquilla 3001, 76230, Juriquilla, Querétaro, QRO, Mexico
| | - Yangjing Long
- Department of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Straße 47, 17487, Greifswald, Germany. .,School of Mathematical Sciences, Central China Normal University, Luoyu Road 152, Wuhan, 430079, Hubei, China.
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, Interdisciplinary Center of Bioinformatics, German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Center for Scalable Data Services and Solutions, and Leipzig Research Center for Civilization Diseases, Leipzig University, Härtelstraße 16-18, 04107, Leipzig, Germany.,Max-Planck-Institute for Mathematics in the Sciences, Inselstraße 22, 04103, Leipzig, Germany.,Inst. f. Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090, Vienna, Austria.,Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM, 87501, USA
| |
Collapse
|
7
|
Partially local three-way alignments and the sequence signatures of mitochondrial genome rearrangements. Algorithms Mol Biol 2017; 12:22. [PMID: 28852417 PMCID: PMC5569537 DOI: 10.1186/s13015-017-0113-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Accepted: 08/11/2017] [Indexed: 12/01/2022] Open
Abstract
Background Genomic DNA frequently undergoes rearrangement of the gene order that can be localized by comparing the two DNA sequences. In mitochondrial genomes different mechanisms are likely at work, at least some of which involve the duplication of sequence around the location of the apparent breakpoints. We hypothesize that these different mechanisms of genome rearrangement leave distinctive sequence footprints. In order to study such effects it is important to locate the breakpoint positions with precision. Results We define a partially local sequence alignment problem that assumes that following a rearrangement of a sequence F, two fragments L, and R are produced that may exactly fit together to match F, leave a gap of deleted DNA between L and R, or overlap with each other. We show that this alignment problem can be solved by dynamic programming in cubic space and time. We apply the new method to evaluate rearrangements of animal mitogenomes and find that a surprisingly large fraction of these events involved local sequence duplications. Conclusions The partially local sequence alignment method is an effective way to investigate the mechanism of genomic rearrangement events. While applied here only to mitogenomes there is no reason why the method could not be used to also consider rearrangements in nuclear genomes. Electronic supplementary material The online version of this article (doi:10.1186/s13015-017-0113-0) contains supplementary material, which is available to authorized users.
Collapse
|
8
|
Basso A, Babbucci M, Pauletto M, Riginella E, Patarnello T, Negrisolo E. The highly rearranged mitochondrial genomes of the crabs Maja crispata and Maja squinado (Majidae) and gene order evolution in Brachyura. Sci Rep 2017; 7:4096. [PMID: 28642542 PMCID: PMC5481413 DOI: 10.1038/s41598-017-04168-9] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 05/11/2017] [Indexed: 11/09/2022] Open
Abstract
We sequenced the mitochondrial genomes of the spider crabs Maja crispata and Maja squinado (Majidae, Brachyura). Both genomes contain the whole set of 37 genes characteristic of Bilaterian genomes, encoded on both α- and β-strands. Both species exhibit the same gene order, which is unique among known animal genomes. In particular, all the genes located on the β-strand form a single block. This gene order was analysed together with the other nine gene orders known for the Brachyura. Our study confirms that the most widespread gene order (BraGO) represents the plesiomorphic condition for Brachyura and was established at the onset of this clade. All other gene orders are the result of transformational pathways originating from BraGO. The different gene orders exhibit variable levels of genes rearrangements, which involve only tRNAs or all types of genes. Local homoplastic arrangements were identified, while complete gene orders remain unique and represent signatures that can have a diagnostic value. Brachyura appear to be a hot-spot of gene order diversity within the phylum Arthropoda. Our analysis, allowed to track, for the first time, the fully evolutionary pathways producing the Brachyuran gene orders. This goal was achieved by coupling sophisticated bioinformatic tools with phylogenetic analysis.
Collapse
Affiliation(s)
- Andrea Basso
- University of Padova, Department of Comparative Biomedicine and Food Science (BCA), 35020, Agripolis, Legnaro (PD), Italy
| | - Massimiliano Babbucci
- University of Padova, Department of Comparative Biomedicine and Food Science (BCA), 35020, Agripolis, Legnaro (PD), Italy
| | - Marianna Pauletto
- University of Padova, Department of Comparative Biomedicine and Food Science (BCA), 35020, Agripolis, Legnaro (PD), Italy
| | - Emilio Riginella
- University of Padova, Department of Biology, 35131, Padova, Italy
| | - Tomaso Patarnello
- University of Padova, Department of Comparative Biomedicine and Food Science (BCA), 35020, Agripolis, Legnaro (PD), Italy
| | - Enrico Negrisolo
- University of Padova, Department of Comparative Biomedicine and Food Science (BCA), 35020, Agripolis, Legnaro (PD), Italy.
| |
Collapse
|