1
|
Mitochondrial genome plasticity of mammalian species. BMC Genomics 2024; 25:278. [PMID: 38486136 PMCID: PMC10941376 DOI: 10.1186/s12864-024-10201-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 03/08/2024] [Indexed: 03/17/2024] Open
Abstract
There is an ongoing process in which mitochondrial sequences are being integrated into the nuclear genome. The importance of these sequences has already been revealed in cancer biology, forensic, phylogenetic studies and in the evolution of the eukaryotic genetic information. Human and numerous model organisms' genomes were described from those sequences point of view. Furthermore, recent studies were published on the patterns of these nuclear localised mitochondrial sequences in different taxa.However, the results of the previously released studies are difficult to compare due to the lack of standardised methods and/or using few numbers of genomes. Therefore, in this paper our primary goal is to establish a uniform mining pipeline to explore these nuclear localised mitochondrial sequences.Our results show that the frequency of several repetitive elements is higher in the flanking regions of these sequences than expected. A machine learning model reveals that the flanking regions' repetitive elements and different structural characteristics are highly influential during the integration process.In this paper, we introduce a general mining pipeline for all mammalian genomes. The workflow is publicly available and is believed to serve as a validated baseline for future research in this field. We confirm the widespread opinion, on - as to our current knowledge - the largest dataset, that structural circumstances and events corresponding to repetitive elements are highly significant. An accurate model has also been trained to predict these sequences and their corresponding flanking regions.
Collapse
|
2
|
Bringing to light nuclear-mitochondrial insertions in the genomes of nocturnal predatory birds. Mol Phylogenet Evol 2023; 181:107722. [PMID: 36720422 DOI: 10.1016/j.ympev.2023.107722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 01/24/2023] [Accepted: 01/24/2023] [Indexed: 01/31/2023]
Abstract
Mito-nuclear insertions, or NUMTs, relate to genetic material of mitochondrial origin that have been transferred to the nuclear DNA molecule. The increasing amounts of genomic data currently being produced presents an opportunity to investigate this type of patterns in genome evolution of non-model organisms. Identifying NUMTs across a range of closely related taxa allows one to generalize patterns of insertion and maintenance in autosomes, which is ultimately relevant to the understanding of genome biology and evolution. Here we collected existing pairwise genome-mitogenome data of the order Strigiformes, a group that includes all the nocturnal bird predators. We identified NUMTs by applying percent similarity thresholds after blasting mitochondrial genomes against nuclear genome assemblies. We identified NUMTsin all genomes with numbers ranging from 4 in Bubo bubo to 24 in Ciccaba nigrolineata. Statistical analyses revealed NUMT size to negatively correlate with NUMT's sequence similarity to with original mtDNA region. Lastly, characterizing these nuclear insertions of mitochondrial origin in a comparative genomics framework produced variable phylogenetic patterns, suggesting in some cases that insertions might pre-date speciation events within Strigiformes.
Collapse
|
3
|
Comparison of detection methods and genome quality when quantifying nuclear mitochondrial insertions in vertebrate genomes. Front Genet 2022; 13:984513. [PMID: 36482890 PMCID: PMC9723244 DOI: 10.3389/fgene.2022.984513] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Accepted: 11/03/2022] [Indexed: 01/27/2024] Open
Abstract
The integration of mitochondrial genome fragments into the nuclear genome is well documented, and the transfer of these mitochondrial nuclear pseudogenes (numts) is thought to be an ongoing evolutionary process. With the increasing number of eukaryotic genomes available, genome-wide distributions of numts are often surveyed. However, inconsistencies in genome quality can reduce the accuracy of numt estimates, and methods used for identification can be complicated by the diverse sizes and ages of numts. Numts have been previously characterized in rodent genomes and it was postulated that they might be more prevalent in a group of voles with rapidly evolving karyotypes. Here, we examine 37 rodent genomes, and an additional 26 vertebrate genomes, while also considering numt detection methods. We identify numts using DNA:DNA and protein:translated-DNA similarity searches and compare numt distributions among rodent and vertebrate taxa to assess whether some groups are more susceptible to transfer. A combination of protein sequence comparisons (protein:translated-DNA) and BLASTN genomic DNA searches detect 50% more numts than genomic DNA:DNA searches alone. In addition, higher-quality RefSeq genomes produce lower estimates of numts than GenBank genomes, suggesting that lower quality genome assemblies can overestimate numts abundance. Phylogenetic analysis shows that mitochondrial transfers are not associated with karyotypic diversity among rodents. Surprisingly, we did not find a strong correlation between numt counts and genome size. Estimates using DNA: DNA analyses can underestimate the amount of mitochondrial DNA that is transferred to the nucleus.
Collapse
|
4
|
NUMTs Can Imitate Biparental Transmission of mtDNA-A Case in Drosophila melanogaster. Genes (Basel) 2022; 13:genes13061023. [PMID: 35741785 PMCID: PMC9222939 DOI: 10.3390/genes13061023] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 05/27/2022] [Accepted: 06/02/2022] [Indexed: 11/16/2022] Open
Abstract
mtDNA sequences can be incorporated into the nuclear genome and produce nuclear mitochondrial fragments (NUMTs), which resemble mtDNA in their sequence but are transmitted biparentally, like the nuclear genome. NUMTs can be mistaken as real mtDNA and may lead to the erroneous impression that mtDNA is biparentally transmitted. Here, we report a case of mtDNA heteroplasmy in a Drosophila melanogaster DGRP line, in which the one haplotype was biparentally transmitted in an autosomal manner. Given the sequence identity of this haplotype with the mtDNA, the crossing experiments led to uncertainty about whether heteroplasmy was real or an artifact due to a NUMT. More specific experiments revealed that there is a large NUMT insertion in the X chromosome of a specific DGRP line, imitating biparental inheritance of mtDNA. Our result suggests that studies on mtDNA heteroplasmy and on mtDNA inheritance should first exclude the possibility of NUMT interference in their data.
Collapse
|
5
|
Mitochondrial Pseudogenes Suggest Repeated Inter-Species Hybridization among Direct Human Ancestors. Genes (Basel) 2022; 13:810. [PMID: 35627195 PMCID: PMC9140377 DOI: 10.3390/genes13050810] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 04/12/2022] [Accepted: 04/16/2022] [Indexed: 12/02/2022] Open
Abstract
The hypothesis that the evolution of humans involves hybridization between diverged species has been actively debated in recent years. We present the following novel evidence in support of this hypothesis: the analysis of nuclear pseudogenes of mtDNA ("NUMTs"). NUMTs are considered "mtDNA fossils" as they preserve sequences of ancient mtDNA and thus carry unique information about ancestral populations. Our comparison of a NUMT sequence shared by humans, chimpanzees, and gorillas with their mtDNAs implies that, around the time of divergence between humans and chimpanzees, our evolutionary history involved the interbreeding of individuals whose mtDNA had diverged as much as ~4.5 Myr prior. This large divergence suggests a distant interspecies hybridization. Additionally, analysis of two other NUMTs suggests that such events occur repeatedly. Our findings suggest a complex pattern of speciation in primate/human ancestors and provide one potential explanation for the mosaic nature of fossil morphology found at the emergence of the hominin lineage. A preliminary version of this manuscript was uploaded to the preprint server BioRxiv in 2017 (10.1101/134502).
Collapse
|
6
|
Genus-Wide Characterization of Nuclear Mitochondrial DNAs in Bumblebee (Hymenoptera: Apidae) Genomes. INSECTS 2021; 12:insects12110963. [PMID: 34821764 PMCID: PMC8625877 DOI: 10.3390/insects12110963] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 10/03/2021] [Accepted: 10/13/2021] [Indexed: 11/23/2022]
Abstract
Simple Summary The DNA of mitochondria can be transferred into the nucleus and form nuclear mitochondrial DNAs (NUMTs). In this study, we identified and characterized NUMTs in genus-wide bumblebee species. The number of NUMTs in bumblebee is much lower than those in its closely related taxon, honeybee. The insertion sites of NUMTs in bumblebee are not random, with AT-rich regions harboring more NUMTs. In addition, NUMTs derived from the mitochondrial COX1 gene are most abundant in the nuclear genome. While the majority of NUMTs seem unfunctional in the bumblebee, some NUMTs show functional clues, which could fuse with their flanking sequences to form novel proteins. Our results shed light on the molecular features of NUMTs and uncover their contribution to genome innovation in the bumblebee. Abstract In eukaryotes, DNA of mitochondria is transferred into the nucleus and forms nuclear mitochondrial DNAs (NUMTs). Taking advantage of the abundant genomic resources for bumblebees, in this study, we de novo generated mitochondrial genomes (mitogenomes) for 11 bumblebee species. Then, we identified and characterized NUMTs in genus-wide bumblebee species. The number of identified NUMTs varies across those species, with numbers ranging from 32 to 72, and nuclear genome size is not positively related to NUMT number. The insertion sites of NUMTs in the nuclear genome are not random, with AT-rich regions harboring more NUMTs. In addition, our results suggest that NUMTs derived from the mitochondrial COX1 gene are most abundant in the bumblebee nuclear genome. Although the majority of NUMTs are found within intergenic regions, some NUMTs do reside within genic regions. Transcripts that contain both the NUMT sequence and its flanking non-NUMT sequences could be found in the bumblebee transcriptome, suggesting a potential domestication of NUMTs in the bumblebee. Taken together, our results shed light on the molecular features of NUMTs in the bumblebee and uncover their contribution to genome innovation.
Collapse
|
7
|
Validated removal of nuclear pseudogenes and sequencing artefacts from mitochondrial metabarcode data. Mol Ecol Resour 2021; 21:1772-1787. [PMID: 33503286 DOI: 10.1111/1755-0998.13337] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 12/08/2020] [Accepted: 01/11/2021] [Indexed: 01/04/2023]
Abstract
Metabarcoding of Metazoa using mitochondrial genes may be confounded by both the accumulation of PCR and sequencing artefacts and the co-amplification of nuclear mitochondrial pseudogenes (NUMTs). The application of read abundance thresholds and denoising methods is efficient in reducing noise accompanying authentic mitochondrial amplicon sequence variants (ASVs). However, these procedures do not fully account for the complex nature of concomitant sequences and the highly variable DNA contribution of specimens in a metabarcoding sample. We propose, as a complement to denoising, the metabarcoding Multidimensional Abundance Threshold Evaluation (metaMATE) framework, a novel approach that allows comprehensive examination of multiple dimensions of abundance filtering and the evaluation of the prevalence of unwanted concomitant sequences in denoised metabarcoding datasets. metaMATE requires a denoised set of ASVs as input, and designates a subset of ASVs as being either authentic (mitochondrial DNA haplotypes) or nonauthentic ASVs (NUMTs and erroneous sequences) by comparison to external reference data and by analysing nucleotide substitution patterns. metaMATE (i) facilitates the application of read abundance filtering strategies, which are structured with regard to sequence library and phylogeny and applied for a range of increasing abundance threshold values, and (ii) evaluates their performance by quantifying the prevalence of nonauthentic ASVs and the collateral effects on the removal of authentic ASVs. The output from metaMATE facilitates decision-making about required filtering stringency and can be used to improve the reliability of intraspecific genetic information derived from metabarcode data. The framework is implemented in the metaMATE software (available at https://github.com/tjcreedy/metamate).
Collapse
|
8
|
Identification of Mitochondrial DNA ( NUMTs) in the Nuclear Genome of Daphnia magna. Int J Mol Sci 2020; 21:E8725. [PMID: 33218217 PMCID: PMC7699184 DOI: 10.3390/ijms21228725] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Revised: 11/06/2020] [Accepted: 11/16/2020] [Indexed: 01/30/2023] Open
Abstract
This is the first study in which the Daphnia magna (D. magna) nuclear genome (nDNA) obtained from the GenBank database was analyzed for pseudogene sequences of mitochondrial origin. To date, there is no information about pseudogenes localized in D. magna genome. This study aimed to identify NUMTs, their length, homology, and location for potential use in evolutionary studies and to check whether their occurrence causes co-amplification during mitochondrial genome (mtDNA) analyses. Bioinformatic analysis showed 1909 fragments of the mtDNA of D. magna, of which 1630 were located in ten linkage groups (LG) of the nDNA. The best-matched NUMTs covering >90% of the gene sequence have been identified for two mt-tRNA genes, and they may be functional nuclear RNA molecules. Isolating the total DNA in mtDNA studies, co-amplification of nDNA fragments is unlikely in the case of amplification of the whole tRNA genes as well as fragments of other genes. It was observed that TRNA-MET fragments had the highest level of sequence homology, thus they could be evolutionarily the youngest. The lowest homology was found in the D-loop-derived pseudogene. It may probably be the oldest NUMT incorporated into the nDNA; however, further analysis is necessary.
Collapse
|
9
|
Detection of mitochondria-pertinent components in exosomes. Mitochondrion 2020; 55:100-110. [PMID: 32980480 PMCID: PMC7669644 DOI: 10.1016/j.mito.2020.09.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 06/27/2020] [Accepted: 09/18/2020] [Indexed: 01/01/2023]
Abstract
We screened cell line and plasma-derived exosomes for molecules that localize to mitochondria or that reflect mitochondrial integrity. SH-SY5Y cell-derived exosomes contained humanin, citrate synthase, and fibroblast growth factor 21 protein, and plasma-derived exosomes contained humanin, voltage-dependent anion-selective channel 1, and transcription factor A protein. Nuclear mitochondrial (NUMT) DNA complicated analyses of mitochondrial DNA (mtDNA), which otherwise suggested exosomes contain at most very low amounts of extended mtDNA sequences but likely contain degraded pieces of mtDNA. Cell and plasma-derived exosomes contained several mtDNA-derived mRNA sequences, including those for ND2, CO2, and humanin. These results can guide exosome-focused, mitochondria-pertinent biomarker development.
Collapse
|
10
|
Distinguishing mitochondrial DNA and NUMT sequences amplified with the precision ID mtDNA whole genome panel. Mitochondrion 2020; 55:122-133. [PMID: 32949792 DOI: 10.1016/j.mito.2020.09.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Revised: 08/31/2020] [Accepted: 09/04/2020] [Indexed: 11/22/2022]
Abstract
Nuclear mitochondrial DNA segments (NUMTs) are generated via transfer of portions of the mitochondrial genome into the nuclear genome. Given their common origin, there is the possibility that both the mitochondrial and NUMT segments may co-amplify using the same set of primers. Thus, analysis of the variation of the mitochondrial genome must take into account this co-amplification of mitochondrial and NUMT sequences. The study herein builds on data from the study by Strobl et al. (Strobl et al., 2019), in which multiple point heteroplasmies were called with an "N" to prevent labeling NUMT sequences mimicking mitochondrial heteroplasmy and being interpreted as true mitochondrial in origin sequence variants. Each of these point heteroplasmies was studied in greater detail, both molecularly and bioinformatically, to determine whether NUMT or true mitochondrial DNA variation was present. The bioinformatic and molecular tools available to help distinguish between NUMT and mitochondrial DNA and the effect of NUMT sequences on interpretation were discussed.
Collapse
|
11
|
The Genomic Origins of Small Mitochondrial RNAs: Are They Transcribed by the Mitochondrial DNA or by Mitochondrial Pseudogenes within the Nucleus ( NUMTs)? Genome Biol Evol 2020; 11:1883-1896. [PMID: 31218347 PMCID: PMC6619488 DOI: 10.1093/gbe/evz132] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/15/2019] [Indexed: 02/06/2023] Open
Abstract
Several studies have linked mitochondrial genetic variation to phenotypic modifications; albeit the identity of the mitochondrial polymorphisms involved remains elusive. The search for these polymorphisms led to the discovery of small noncoding RNAs, which appear to be transcribed by the mitochondrial DNA ("small mitochondrial RNAs"). This contention is, however, controversial because the nuclear genome of most animals harbors mitochondrial pseudogenes (NUMTs) of identical sequence to regions of mtDNA, which could alternatively represent the source of these RNAs. To discern the likely contributions of the mitochondrial and nuclear genome to transcribing these small mitochondrial RNAs, we leverage data from six vertebrate species exhibiting markedly different levels of NUMT sequence. We explore whether abundances of small mitochondrial RNAs are associated with levels of NUMT sequence across species, or differences in tissue-specific mtDNA content within species. Evidence for the former would support the hypothesis these RNAs are primarily transcribed by NUMT sequence, whereas evidence for the latter would provide strong evidence for the counter hypothesis that these RNAs are transcribed directly by the mtDNA. No association exists between the abundance of small mitochondrial RNAs and NUMT levels across species. Moreover, a sizable proportion of transcripts map exclusively to the mtDNA sequence, even in species with highest NUMT levels. Conversely, tissue-specific abundances of small mitochondrial RNAs are strongly associated with the mtDNA content. These results support the hypothesis that small mitochondrial RNAs are primarily transcribed by the mitochondrial genome and that this capacity is conserved across Amniota and, most likely, across most metazoan lineages.
Collapse
|
12
|
Ice-Age Climate Adaptations Trap the Alpine Marmot in a State of Low Genetic Diversity. Curr Biol 2019; 29:1712-1720.e7. [PMID: 31080084 PMCID: PMC6538971 DOI: 10.1016/j.cub.2019.04.020] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Revised: 02/16/2019] [Accepted: 04/09/2019] [Indexed: 12/30/2022]
Abstract
Some species responded successfully to prehistoric changes in climate [1, 2], while others failed to adapt and became extinct [3]. The factors that determine successful climate adaptation remain poorly understood. We constructed a reference genome and studied physiological adaptations in the Alpine marmot (Marmota marmota), a large ground-dwelling squirrel exquisitely adapted to the "ice-age" climate of the Pleistocene steppe [4, 5]. Since the disappearance of this habitat, the rodent persists in large numbers in the high-altitude Alpine meadow [6, 7]. Genome and metabolome showed evidence of adaptation consistent with cold climate, affecting white adipose tissue. Conversely, however, we found that the Alpine marmot has levels of genetic variation that are among the lowest for mammals, such that deleterious mutations are less effectively purged. Our data rule out typical explanations for low diversity, such as high levels of consanguineous mating, or a very recent bottleneck. Instead, ancient demographic reconstruction revealed that genetic diversity was lost during the climate shifts of the Pleistocene and has not recovered, despite the current high population size. We attribute this slow recovery to the marmot's adaptive life history. The case of the Alpine marmot reveals a complicated relationship between climatic changes, genetic diversity, and conservation status. It shows that species of extremely low genetic diversity can be very successful and persist over thousands of years, but also that climate-adapted life history can trap a species in a persistent state of low genetic diversity.
Collapse
|
13
|
Alignment-free approaches for predicting novel Nuclear Mitochondrial Segments ( NUMTs) in the human genome. Gene 2019; 691:141-152. [PMID: 30630097 DOI: 10.1016/j.gene.2018.12.040] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Revised: 12/07/2018] [Accepted: 12/14/2018] [Indexed: 10/27/2022]
Abstract
The nuclear human genome harbors sequences of mitochondrial origin, indicating an ancestral transfer of DNA from the mitogenome. Several Nuclear Mitochondrial Segments (NUMTs) have been detected by alignment-based sequence similarity search, as implemented in the Basic Local Alignment Search Tool (BLAST). Identifying NUMTs is important for the comprehensive annotation and understanding of the human genome. Here we explore the possibility of detecting NUMTs in the human genome by alignment-free sequence similarity search, such as k-mers (k-tuples, k-grams, oligos of length k) distributions. We find that when k=6 or larger, the k-mer approach and BLAST search produce almost identical results, e.g., detect the same set of NUMTs longer than 3 kb. However, when k=5 or k=4, certain signals are only detected by the alignment-free approach, and these may indicate yet unrecognized, and potentially more ancestral NUMTs. We introduce a "Manhattan plot" style representation of NUMT predictions across the genome, which are calculated based on the reciprocal of the Jensen-Shannon divergence between the nuclear and mitochondrial k-mer frequencies. The further inspection of the k-mer-based NUMT predictions however shows that most of them contain long-terminal-repeat (LTR) annotations, whereas BLAST-based NUMT predictions do not. Thus, similarity of the mitogenome to LTR sequences is recognized, which we validate by finding the mitochondrial k-mer distribution closer to those for transposable sequences and specifically, close to some types of LTR.
Collapse
|
14
|
The Trouble with MEAM2: Implications of Pseudogenes on Species Delimitation in the Globally Invasive Bemisia tabaci (Hemiptera: Aleyrodidae) Cryptic Species Complex. Genome Biol Evol 2018; 9:2732-2738. [PMID: 28985301 PMCID: PMC5647793 DOI: 10.1093/gbe/evx173] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/29/2017] [Indexed: 11/23/2022] Open
Abstract
Molecular species identification using suboptimal PCR primers can over-estimate species diversity due to coamplification of nuclear mitochondrial (NUMT) DNA/pseudogenes. For the agriculturally important whitefly Bemisia tabaci cryptic pest species complex, species identification depends primarily on characterization of the mitochondrial DNA cytochrome oxidase I (mtDNA COI) gene. The lack of robust PCR primers for the mtDNA COI gene can undermine correct species identification which in turn compromises management strategies. This problem is identified in the B. tabaci Africa/Middle East/Asia Minor clade which comprises the globally invasive Mediterranean (MED) and Middle East Asia Minor I (MEAM1) species, Middle East Asia Minor 2 (MEAM2), and the Indian Ocean (IO) species. Initially identified from the Indian Ocean island of Réunion, MEAM2 has since been reported from Japan, Peru, Turkey and Iraq. We identified MEAM2 individuals from a Peruvian population via Sanger sequencing of the mtDNA COI gene. In attempting to characterize the MEAM2 mitogenome, we instead characterized mitogenomes of MEAM1. We also report on the mitogenomes of MED, AUS, and IO thereby increasing genomic resources for members of this complex. Gene synteny (i.e., same gene composition and orientation) was observed with published B. tabaci cryptic species mitogenomes. Pseudogene fragments matching MEAM2 partial mtDNA COI gene exhibited low frequency single nucleotide polymorphisms that matched low copy number DNA fragments (<3%) of MEAM1 genomes, whereas presence of internal stop codons, loss of expected stop codons and poor primer annealing sites, all suggested MEAM2 as a pseudogene artifact and so not a real species.
Collapse
|
15
|
First Report of a Mitochondrial Pseudogene in Agnathan Vertebrates (Cyclostomata: Petromyzontidae). J Mol Evol 2018; 86:187-189. [PMID: 29564489 DOI: 10.1007/s00239-018-9835-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Accepted: 03/15/2018] [Indexed: 11/28/2022]
Abstract
We report herein the characterization of a nuclear paralog of a fragment of the mitochondrial genome (a numt) in two closely related species of lampreys (Ichthyomyzon spp.). Although numts have been characterized in several vertebrate taxa, numts have yet to be reported for fishes in general. Given the phylogenetic position of lampreys relative to other vertebrates, the presence of numts within the lamprey genome is either evidence of an ancestral trait lost in other fishes but uniquely retained in agnathans and amniotes, or (more intriguingly) a product of the genome rearrangements these animals undergo during development.
Collapse
|
16
|
Abstract
In the human the peptide Humanin is produced from the small Humanin gene which is embedded as a gene-within-a-gene in the 16S ribosomal molecule of the mitochondrial DNA (mtDNA). The peptide itself appears to be significant in the prevention of cell death in many tissues and improve cognition in animal models. By using simple data mining techniques, it is possible to show that 99.4% of the human Humanin sequences in the GenBank database are unaffected by mutations. However, in other vertebrates, pseudogenization of the Humanin gene is a common feature; occurring apparently randomly in some species and not others. The persistence, or loss, of a functional Humanin gene may be an important factor in laboratory animals, especially if they are being used as animal models in studies of Alzheimer's disease (AD). The exact reason why Humanin underwent pseudogenization in some vertebrate species during their evolution remains to be determined. This study was originally planned to review the available information about Humanin and it was a surprise to be able to show that pseudogenization has occurred in a gene in the mtDNA and is not restricted solely to chromosomal genes.
Collapse
|
17
|
Data on the time of integration of the human mitochondrial pseudogenes ( NUMTs) into the nuclear genome. Data Brief 2017; 13:536-544. [PMID: 28702491 PMCID: PMC5491396 DOI: 10.1016/j.dib.2017.05.024] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Revised: 05/01/2017] [Accepted: 05/09/2017] [Indexed: 11/29/2022] Open
Abstract
The data and methods presented in this article are supplementing the research article "Integration of mtDNA pseudogenes into the nuclear genome coincides with speciation of the human genus. A hypothesis", DOI: 10.1016/j.mito.2016.12.001 (Gunbin et al., 2017) [1]. Mitochondrial DNA is known to get inserted into nuclear DNA to form NUMTs, i.e. nuclear DNA pseudogenes of the mtDNA. We present here the sequences of selected NUMTs, in which time of integration can be determined with sufficient precision. We report their chromosomal positions , their position within the great ape mtDNA phylogeny, and their times of integration into the nuclear genome. The methods used to generate the data and to control their quality are also presented. The dataset is made publicly available to enable critical or extended analyzes.
Collapse
|
18
|
Migration of mitochondrial DNA in the nuclear genome of colorectal adenocarcinoma. Genome Med 2017; 9:31. [PMID: 28356157 PMCID: PMC5370490 DOI: 10.1186/s13073-017-0420-6] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2016] [Accepted: 03/09/2017] [Indexed: 12/31/2022] Open
Abstract
Background Colorectal adenocarcinomas are characterized by abnormal mitochondrial DNA (mtDNA) copy number and genomic instability, but a molecular interaction between mitochondrial and nuclear genome remains unknown. Here we report the discovery of increased copies of nuclear mtDNA (NUMT) in colorectal adenocarcinomas, which supports link between mtDNA and genomic instability in the nucleus. We name this phenomenon of nuclear occurrence of mitochondrial component as numtogenesis. We provide a description of NUMT abundance and distribution in tumor versus matched blood-derived normal genomes. Methods Whole-genome sequence data were obtained for colon adenocarcinoma and rectum adenocarcinoma patients participating in The Cancer Genome Atlas, via the Cancer Genomics Hub, using the GeneTorrent file acquisition tool. Data were analyzed to determine NUMT proportion and distribution on a genome-wide scale. A NUMT suppressor gene was identified by comparing numtogenesis in other organisms. Results Our study reveals that colorectal adenocarcinoma genomes, on average, contains up to 4.2-fold more somatic NUMTs than matched normal genomes. Women colorectal tumors contained more NUMT than men. NUMT abundance in tumor predicted parallel abundance in blood. NUMT abundance positively correlated with GC content and gene density. Increased numtogenesis was observed with higher mortality. We identified YME1L1, a human homolog of yeast YME1 (yeast mitochondrial DNA escape 1) to be frequently mutated in colorectal tumors. YME1L1 was also mutated in tumors derived from other tissues. We show that inactivation of YME1L1 results in increased transfer of mtDNA in the nuclear genome. Conclusions Our study demonstrates increased somatic transfer of mtDNA in colorectal tumors. Our study also reveals sex-based differences in frequency of NUMT occurrence and that NUMT in blood reflects NUMT in tumors, suggesting NUMT may be used as a biomarker for tumorigenesis. We identify YME1L1 as the first NUMT suppressor gene in human and demonstrate that inactivation of YME1L1 induces migration of mtDNA to the nuclear genome. Our study reveals that numtogenesis plays an important role in the development of cancer. Electronic supplementary material The online version of this article (doi:10.1186/s13073-017-0420-6) contains supplementary material, which is available to authorized users.
Collapse
|
19
|
Integration of mtDNA pseudogenes into the nuclear genome coincides with speciation of the human genus. A hypothesis. Mitochondrion 2016; 34:20-23. [PMID: 27979772 DOI: 10.1016/j.mito.2016.12.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Revised: 10/14/2016] [Accepted: 12/09/2016] [Indexed: 11/30/2022]
Abstract
Fragments of mitochondrial DNA are known to get inserted into nuclear DNA to form NUMTs, i.e. nuclear pseudogenes of the mtDNA. The insertion of a NUMT is a rare event. Hundreds of pseudogenes have been cataloged in the human genome. NUMTs are, in essence, a special type of mutation with their own internal timer, which is synchronized with an established molecular clock, the mtDNA. Thus insertion of NUMTs can be timed with respect to evolution milestones such as the emergence of new species. We asked whether NUMTs were inserted uniformly over time or preferentially during certain periods of evolution, as implied by the "punctuated evolution" model. To our surprise, the NUMT insertion times do appear nonrandom with at least one cluster positioned at around 2.8 million years ago (Ma). Interestingly, 2.8Ma closely corresponds to the time of emergence of the genus Homo, and to a well-documented period of major climate change ca. 2.9-2.5Ma. It is tempting to hypothesize that the insertion of NUMTs is related to the speciation process. NUMTs could be either "riders", i.e., their insertion could be facilitated by the overall higher genome rearrangement activity during speciation, or "drivers", i.e. they may more readily get fixed in the population due to positive selection associated with speciation. If correct, the hypothesis would support the idea that evolution of our genus may have happened in a rapid, punctuated manner.
Collapse
|
20
|
The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome. BMC Genomics 2016; 17:1017. [PMID: 27955616 PMCID: PMC5153897 DOI: 10.1186/s12864-016-3375-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Accepted: 12/05/2016] [Indexed: 02/03/2023] Open
Abstract
Background Low-abundance mutations in mitochondrial populations (mutations with minor allele frequency ≤ 1%), are associated with cancer, aging, and neurodegenerative disorders. While recent progress in high-throughput sequencing technology has significantly improved the heteroplasmy identification process, the ability of this technology to detect low-abundance mutations can be affected by the presence of similar sequences originating from nuclear DNA (nDNA). To determine to what extent nDNA can cause false positive low-abundance heteroplasmy calls, we have identified mitochondrial locations of all subsequences that are common or similar (one mismatch allowed) between nDNA and mitochondrial DNA (mtDNA). Results Performed analysis revealed up to a 25-fold variation in the lengths of longest common and longest similar (one mismatch allowed) subsequences across the mitochondrial genome. The size of the longest subsequences shared between nDNA and mtDNA in several regions of the mitochondrial genome were found to be as low as 11 bases, which not only allows using these regions to design new, very specific PCR primers, but also supports the hypothesis of the non-random introduction of mtDNA into the human nuclear DNA. Conclusion Analysis of the mitochondrial locations of the subsequences shared between nDNA and mtDNA suggested that even very short (36 bases) single-end sequencing reads can be used to identify low-abundance variation in 20.4% of the mitochondrial genome. For longer (76 and 150 bases) reads, the proportion of the mitochondrial genome where nDNA presence will not interfere found to be 44.5 and 67.9%, when low-abundance mutations at 100% of locations can be identified using 417 bases long single reads. This observation suggests that the analysis of low-abundance variations in mitochondria population can be extended to a variety of large data collections such as NCBI Sequence Read Archive, European Nucleotide Archive, The Cancer Genome Atlas, and International Cancer Genome Consortium. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3375-x) contains supplementary material, which is available to authorized users.
Collapse
|
21
|
Cytogenetic and Sequence Analyses of Mitochondrial DNA Insertions in Nuclear Chromosomes of Maize. G3-GENES GENOMES GENETICS 2015; 5:2229-39. [PMID: 26333837 PMCID: PMC4632043 DOI: 10.1534/g3.115.020677] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The transfer of mitochondrial DNA (mtDNA) into nuclear genomes is a regularly occurring process that has been observed in many species. Few studies, however, have focused on the variation of nuclear-mtDNA sequences (NUMTs) within a species. This study examined mtDNA insertions within chromosomes of a diverse set of Zea mays ssp. mays (maize) inbred lines by the use of fluorescence in situ hybridization. A relatively large NUMT on the long arm of chromosome 9 (9L) was identified at approximately the same position in four inbred lines (B73, M825, HP301, and Oh7B). Further examination of the similarly positioned 9L NUMT in two lines, B73 and M825, indicated that the large size of these sites is due to the presence of a majority of the mitochondrial genome; however, only portions of this NUMT (~252 kb total) were found in the publically available B73 nuclear sequence for chromosome 9. Fiber-fluorescence in situ hybridization analysis estimated the size of the B73 9L NUMT to be ~1.8 Mb and revealed that the NUMT is methylated. Two regions of mtDNA (2.4 kb and 3.3 kb) within the 9L NUMT are not present in the B73 mitochondrial NB genome; however, these 2.4-kb and 3.3-kb segments are present in other Zea mitochondrial genomes, including that of Zea mays ssp. parviglumis, a progenitor of domesticated maize.
Collapse
|
22
|
Analysis of the complete mitochondrial genome and characterization of diverse NUMTs of Macaca leonina. Gene 2015; 571:279-85. [PMID: 26151895 DOI: 10.1016/j.gene.2015.06.085] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2015] [Revised: 06/08/2015] [Accepted: 06/26/2015] [Indexed: 11/19/2022]
Abstract
As a non-human primate, the pig-tailed macaque has received wide attention because it can be infected by HIV-1. In this study, we determined the complete mtDNA sequence of the northern pig-tailed macaque (Macaca leonina). Unexpectedly, during the amplification of the mtDNA control region (D-loop region) we observed several D-loop-like sequences, which were NUMTs (nuclear mitochondrial sequences) and a total of 14 D-loop-like NUMT haplotypes were later identified in five individuals. The neighbor-joining tree and estimated divergence time based on these D-loop-like NUMT sequences of M. leonina provide some insights into the understanding of the evolutionary history of NUMTs. D-loop-like haplotypes G and H, which also exist in the nuclear genome of mulatta, appear to have been translocated into the nuclear genome before the divergence of M. mulatta and M. leonina. The other D-loop-like NUMT haplotypes were translocated into the nuclear genome of M. leonina after the divergence of the two species. Later sequence conversion was predicted to occur among these 14 D-loop-like NUMT haplotypes. The overall structure of the mtDNA of M. leonina was found to be similar to that seen in other mammalian mitochondrial genomes. Phylogenetic analysis based on the maximum likelihood method shows M. leonina clustered with Macaca silenus among the analyzed mammalian species.
Collapse
|
23
|
Exceptionally large mitochondrial fragments to the nucleus in sequenced mollusk genomes. Mitochondrial DNA A DNA Mapp Seq Anal 2014; 27:1409-10. [PMID: 25103445 DOI: 10.3109/19401736.2014.947604] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The available genome sequences of three mollusks (Biomphalaria glabrata, Aplysia californica and Crassostrea gigas) were first used to investigate the nuclear mitochondrial DNAs (NUMTs) in mollusks. The analysis showed that the NUMT contents were high in B. glabrata (17.738 Kb) and C. gigas (17.192 Kb), of which all or almost all mtDNA sequences were transferred to the nucleus, whereas NUMTs are rare (584 bp) in A. californica. The length of NUMTs was 61 to 5492 bp for B. glabrata, 1711 to 15,481 bp for C. gigas, and 124 to 460 bp for A. californica. The largest C. gigas NUMT covered 84.9% (15,481 bp) of its mitochondrial genome, which is rarely found in invertebrates so far. No correlation was found between NUMT content and genome size in the three sequenced mollusk genomes.
Collapse
|
24
|
|