1
|
Hazkani-Covo E, Martin WF. Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health. Genome Biol Evol 2017; 9:1190-1203. [PMID: 28444372 PMCID: PMC5570036 DOI: 10.1093/gbe/evx078] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/20/2017] [Indexed: 12/28/2022] Open
Abstract
Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health.
Collapse
Affiliation(s)
- Einat Hazkani-Covo
- Department of Natural and Life Sciences, The Open University of Israel, Ra'anana, Israel
| | - William F Martin
- Institute of Molecular Evolution, Heinrich-Heine University, Düsseldorf, Germany
| |
Collapse
|
2
|
Cytogenetic and Sequence Analyses of Mitochondrial DNA Insertions in Nuclear Chromosomes of Maize. G3-GENES GENOMES GENETICS 2015; 5:2229-39. [PMID: 26333837 PMCID: PMC4632043 DOI: 10.1534/g3.115.020677] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The transfer of mitochondrial DNA (mtDNA) into nuclear genomes is a regularly occurring process that has been observed in many species. Few studies, however, have focused on the variation of nuclear-mtDNA sequences (NUMTs) within a species. This study examined mtDNA insertions within chromosomes of a diverse set of Zea mays ssp. mays (maize) inbred lines by the use of fluorescence in situ hybridization. A relatively large NUMT on the long arm of chromosome 9 (9L) was identified at approximately the same position in four inbred lines (B73, M825, HP301, and Oh7B). Further examination of the similarly positioned 9L NUMT in two lines, B73 and M825, indicated that the large size of these sites is due to the presence of a majority of the mitochondrial genome; however, only portions of this NUMT (~252 kb total) were found in the publically available B73 nuclear sequence for chromosome 9. Fiber-fluorescence in situ hybridization analysis estimated the size of the B73 9L NUMT to be ~1.8 Mb and revealed that the NUMT is methylated. Two regions of mtDNA (2.4 kb and 3.3 kb) within the 9L NUMT are not present in the B73 mitochondrial NB genome; however, these 2.4-kb and 3.3-kb segments are present in other Zea mitochondrial genomes, including that of Zea mays ssp. parviglumis, a progenitor of domesticated maize.
Collapse
|
3
|
Nergadze SG, Lupotto M, Pellanda P, Santagostino M, Vitelli V, Giulotto E. Mitochondrial DNA insertions in the nuclear horse genome. Anim Genet 2015; 41 Suppl 2:176-85. [PMID: 21070293 DOI: 10.1111/j.1365-2052.2010.02130.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
The insertion of mitochondrial DNA in the nuclear genome generates numts, nuclear sequences of mitochondrial origin. In the horse reference genome, we identified 82 numts and showed that the entire horse mitochondrial DNA is represented as numts without gross bias. Numts were inserted in the horse nuclear genome at random sites and were probably generated during the repair of DNA double-strand breaks. We then analysed 12 numt loci in 20 unrelated horses and found that null alleles, lacking the mitochondrial DNA insertion, were present at six of these loci. At some loci, the null allele is prevalent in the sample analysed, suggesting that, in the horse population, the number of numt loci may be higher than 82 present in the reference genome. Contrary to humans, the insertion polymorphism of numts is extremely frequent in the horse population, supporting the hypothesis that the genome of this species is in a rapidly evolving state.
Collapse
Affiliation(s)
- S G Nergadze
- Dipartimento di Genetica e Microbiologia Adriano Buzzati-Traverso, Università di Pavia, Via Ferrata 1, 27100 Pavia, Italy
| | | | | | | | | | | |
Collapse
|
4
|
Cooper DN, Bacolla A, Férec C, Vasquez KM, Kehrer-Sawatzki H, Chen JM. On the sequence-directed nature of human gene mutation: the role of genomic architecture and the local DNA sequence environment in mediating gene mutations underlying human inherited disease. Hum Mutat 2011; 32:1075-99. [PMID: 21853507 DOI: 10.1002/humu.21557] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2011] [Accepted: 06/17/2011] [Indexed: 12/21/2022]
Abstract
Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher order features of the genomic architecture. The human genome is now recognized to contain "pervasive architectural flaws" in that certain DNA sequences are inherently mutation prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here, we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of noncanonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair and may serve to increase mutation frequencies in generalized fashion (i.e., both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease.
Collapse
Affiliation(s)
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom.
| | | | | | | | | | | |
Collapse
|
5
|
An isolated case of lissencephaly caused by the insertion of a mitochondrial genome-derived DNA sequence into the 5' untranslated region of the PAFAH1B1 (LIS1) gene. Hum Genomics 2011; 4:384-93. [PMID: 20846927 PMCID: PMC3525223 DOI: 10.1186/1479-7364-4-6-384] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
A 130 base pair (bp) insertion (g.-8delCins130) into the 5' untranslated region of the PAFAH1B1 (LIS1) gene, seven nucleotides upstream of the translational initiation site, was detected in an isolated case of lissencephaly. The inserted DNA sequence exhibited perfect homology to two non-contiguous regions of the mitochondrial genome (8479 to 8545 and 8775 to 8835, containing portions of two genes, ATP8 and ATP6), as well as near-perfect homology (1 bp mismatch) to a nuclear mitochondrial pseudogene (NUMT) sequence located on chromosome 1p36. This lesion was not evident on polymerase chain reaction (PCR) sequence analysis of either parent, indicating that the mutation had occurred de novo in the patient. Experiments designed to distinguish between a mitochondrial and a nuclear genomic origin for the inserted DNA sequence were, however, inconclusive. Mitochondrial genome sequences from both the patient and his parents were sequenced and found to be identical to the sequence inserted into the PAFAH1B1 gene. Analysis of parental PCR products from the chromosome 1-specific NUMT were also consistent with the interpretation that the inserted sequence had originated directly from the mitochondrial genome. The chromosome 1-specific NUMT in the patient proved to be refractory to PCR analysis, however, suggesting that this region of chromosome 1 could have been deleted or rearranged. Although it remains by far the most likely scenario, in the absence of DNA sequence information from the patient's own chromosome 1-specific NUMT, we cannot unequivocally confirm that the 130 bp insertion originated from mitochondrial genome rather than from the NUMT.
Collapse
|
6
|
Abstract
The natural transfer of DNA from mitochondria to the nucleus generates nuclear copies of mitochondrial DNA (numts) and is an ongoing evolutionary process, as genome sequences attest. In humans, five different numts cause genetic disease and a dozen human loci are polymorphic for the presence of numts, underscoring the rapid rate at which mitochondrial sequences reach the nucleus over evolutionary time. In the laboratory and in nature, numts enter the nuclear DNA via non-homolgous end joining (NHEJ) at double-strand breaks (DSBs). The frequency of numt insertions among 85 sequenced eukaryotic genomes reveal that numt content is strongly correlated with genome size, suggesting that the numt insertion rate might be limited by DSB frequency. Polymorphic numts in humans link maternally inherited mitochondrial genotypes to nuclear DNA haplotypes during the past, offering new opportunities to associate nuclear markers with mitochondrial markers back in time.
Collapse
|
7
|
Karim MA, Craig RL, Wang X, Hale TC, Elbein SC. Analysis of FOXO1A as a candidate gene for type 2 diabetes. Mol Genet Metab 2006; 88:171-7. [PMID: 16497530 DOI: 10.1016/j.ymgme.2006.01.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/14/2005] [Revised: 01/06/2006] [Accepted: 01/10/2006] [Indexed: 11/22/2022]
Abstract
The human forkhead box O1A (FOXO1A) gene on chromosome 13q14.1 is a key transcription factor in insulin signaling in liver and adipose tissue and plays a central role in the regulation of key pancreatic beta-cell genes including IPF1. We hypothesized that sequence variants of FOXO1A contribute to the observed defects in hepatic and peripheral insulin action and altered beta-cell compensation that characterize type 2 diabetes (T2DM). To test this hypothesis, we screened the three exons, 3' untranslated region, and 5' flanking region for sequence variants in Caucasian and African-American individuals with early onset (<45 years) T2DM. We identified only six variants; none altered the coding sequence, and except for one variant in the 3' untranslated region, they were rare or absent in Caucasians. To increase coverage of the gene, we selected seven additional variants in the large first intron and 5' flanking region, thus providing 13 variants that spanned 116.4kb. Based on frequency and linkage disequilibrium patterns in a subset of individuals, we selected eight SNPs to type in a Caucasian population comprising 192 unrelated nondiabetic control individuals and 192 individuals with T2DM, and 10 SNPs to type in 182 controls and 352 diabetic individuals of African-American ancestry. No variant was associated with T2DM (African-Americans, p>0.08; Caucasians, p>0.09). Of the 8 Caucasian SNPs, six comprised a single haplotype block spanning over 100kb and including most of the large first intron. In contrast, no block was observed among SNPs typed in African-Americans. No haplotype was associated with T2DM. FOXO1A variation is rare and is unlikely to contribute to T2DM in either Caucasian or African-American populations.
Collapse
Affiliation(s)
- Mohammad A Karim
- Division of Endocrinology and Metabolism, Department of Medicine, College of Medicine, University of Arkansas for Medical Sciences, Little Rock, AR, USA
| | | | | | | | | |
Collapse
|
8
|
Chen JM, Chuzhanova N, Stenson PD, Férec C, Cooper DN. Meta-analysis of gross insertions causing human genetic disease: novel mutational mechanisms and the role of replication slippage. Hum Mutat 2006; 25:207-21. [PMID: 15643617 DOI: 10.1002/humu.20133] [Citation(s) in RCA: 128] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Although gross insertions (>20 bp) comprise <1% of disease-causing mutations, they nevertheless represent an important category of pathological lesion. In an attempt to study these insertions in a systematic way, 158 gross insertions ranging in size between 21 bp and approximately 10 kb were identified using the Human Gene Mutation Database (www.hgmd.org). A careful meta-analytical study revealed extensive diversity in terms of the nature of the inserted DNA sequence and has provided new insights into the underlying mutational mechanisms. Some 70% of gross insertions were found to represent sequence duplications of different types (tandem, partial tandem, or complex). Although most of the tandem duplications were explicable by simple replication slippage, the three complex duplications appear to result from multiple slippage events. Some 11% of gross insertions were attributable to nonpolyglutamine repeat expansions (including octapeptide repeat expansions in the prion protein gene [PRNP] and polyalanine tract expansions) and evidence is presented to support the contention that these mutations are also caused by replication slippage rather than by unequal crossing over. Some 17% of gross insertions, all >or=276 bp in length, were found to be due to LINE-1 (L1) retrotransposition involving different types of element (L1 trans-driven Alu, L1 direct, and L1 trans-driven SVA). A second example of pathological mitochondrial-nuclear sequence transfer was identified in the USH1C gene but appears to arise via a novel mechanism, trans-replication slippage. Finally, evidence for another novel mechanism of human genetic disease, involving the possible capture of DNA oligonucleotides, is presented in the context of a 26-bp insertion into the ERCC6 gene.
Collapse
Affiliation(s)
- Jian-Min Chen
- INSERM (Institut National de la Santé et de la Recherche Médicale) U613-Génétique Moléculaire et Génétique Epidémiologique, Université de Bretagne Occidentale, Centre Hospitalier Universitaire, Brest, France.
| | | | | | | | | |
Collapse
|