1
|
Abstract
Artificial recombination of two or more mitochondrial DNA fragments from different samples would constitute a serious cause of error in forensic DNA typing, and yet one can demonstrate that such events have happened in the preparation of several published mtDNA databases. Focussed database searches, phylogenetic analysis, and network representations can highlight mosaic patterns and thus pinpoint sample mix-up. Therefore, we suggest that this approach should be applied to data prior to publication in order to uncover such errors in time.
Collapse
Affiliation(s)
- H J Bandelt
- Fachbereich Mathematik, Universität Hamburg, Bundesstrasse 55, 20146 Hamburg, Germany.
| | | | | |
Collapse
|
2
|
Bandelt HJ, Alves-Silva J, Guimarães PE, Santos MS, Brehm A, Pereira L, Coppa A, Larruga JM, Rengo C, Scozzari R, Torroni A, Prata MJ, Amorim A, Prado VF, Pena SD. Phylogeography of the human mitochondrial haplogroup L3e: a snapshot of African prehistory and Atlantic slave trade. Ann Hum Genet 2001; 65:549-63. [PMID: 11851985 DOI: 10.1017/s0003480001008892] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The mtDNA haplogroup L3e, which is identified by the restriction site +2349 MboI within the Afro-Eurasian superhaplogroup L3 (-3592 HpaI), is omnipresent in Africa but virtually absent in Eurasia (except for neighbouring areas with limited genetic exchange). L3e was hitherto poorly characterised in terms of HVS-I motifs, as the ancestral HVS-I type of L3e cannot be distinguished from the putative HVS-I ancestor of the entire L3 (differing from the CRS by a transition at np 16223). An MboI screening at np 2349 of a large number of Brazilian and Caribbean mtDNAs (encompassing numerous mtDNAs of African ancestry), now reveals that L3e is subdivided into four principal clades, each characterised by a single mutation in HVS-I, with additional support coming from HVS-II and partial RFLP analysis. The apparently oldest of these clades (transition at np 16327) occurs mainly in central Africa and was probably carried to southern Africa with the Bantu expansion(s). The most frequent clade (transition at np 16320) testifies to a pronounced expansion event in the mid-Holocene and seems to be prominent in many Bantu groups from all of Africa. In contrast, one clade (transition at np 16264) is essentially restricted to Atlantic western Africa (including Cabo Verde). We propose a tentative L3e phylogeny that is based on 197 HVS-I sequences. We conclude that haplogroup L3e originated in central or eastern Africa about 46,000 (+/-14,000) years ago, and was a hitchhiker of much later dispersal and local expansion events, with the rise of food production and iron smelting. Enforced migration of African slaves to the Americas translocated L3e mitochondria, the descendants of which in Brazil and the Caribbean still reflect their different regional African ancestries.
Collapse
Affiliation(s)
- H J Bandelt
- Fachbereich Mathematik, Universität Hamburg, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Abstract
Sequencing and documenting a sample of homologous DNA stretches is prone to copying errors in a way rather analogous to the biological replication process. Previous attempts at obtaining representative mtDNA sequences, typically of the control region, for evolutionary studies or forensic purposes have yielded rather unsatisfactory results in many cases. The key ingredient in pinpointing problems with given data is the phylogenetic analysis of closely related mtDNAs within the framework of an established worldwide phylogeny that is supported by coding region information. We develop some general rules by which likely errors in data tables can readily be detected without rereading whole sequences repeatedly. Following these guidelines, one can expect to lower the error rate by at least an order of magnitude, although it will still be hard to beat the mitochondrial gamma polymerase in precision.
Collapse
Affiliation(s)
- H J Bandelt
- Fachbereich Mathematik. Universität Hamburg, Germany
| | | | | | | |
Collapse
|
4
|
Bandelt HJ, Macaulay V, Richards M. Median networks: speedy construction and greedy reduction, one simulation, and two case studies from human mtDNA. Mol Phylogenet Evol 2000; 16:8-28. [PMID: 10877936 DOI: 10.1006/mpev.2000.0792] [Citation(s) in RCA: 114] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Molecular data sets characterized by few phylogenetically informative characters with a broad spectrum of mutation rates, such as intraspecific control-region sequence variation of human mitochondrial DNA (mtDNA), can be usefully visualized in the form of median networks. Here we provide a step-by-step guide to the construction of such networks by hand. We improve upon a previously implemented algorithm by outlining an efficient parametrized strategy amenable to large data sets, greedy reduction, which makes it possible to reconstruct some of the confounding recurrent mutations. This entails some postprocessing as well, which assists in capturing more parsimonious solutions. To simplify the creation of the resulting network by hand, we describe a speedy approach to network construction, based on a careful planning of the processing order. A coalescent simulation tailored to human mtDNA variation in Eurasia testifies to the usefulness of reduced median networks, while highlighting notorious problems faced by all phylogenetic methods in this context. Finally, we discuss two case studies involving the comparison of characters in the two hypervariable segments of the human mtDNA control region in the light of the worldwide control-region sequence database, as well as additional restriction fragment length polymorphism information. We conclude that only a minority of the mutations that hit the second segment occur at sites that would have a mutation rate comparable to those at most sites in the first segment. Discarding the known "noisy" sites of the second segment enhances the analysis.
Collapse
Affiliation(s)
- H J Bandelt
- Fachbereich Mathematik, Universität Hamburg, Bundesstrasse 55, Hamburg, D-20146, Germany
| | | | | |
Collapse
|
5
|
Quintana-Murci L, Semino O, Bandelt HJ, Passarino G, McElreavey K, Santachiara-Benerecetti AS. Genetic evidence of an early exit of Homo sapiens sapiens from Africa through eastern Africa. Nat Genet 1999; 23:437-41. [PMID: 10581031 DOI: 10.1038/70550] [Citation(s) in RCA: 377] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The out-of-Africa scenario has hitherto provided little evidence for the precise route by which modern humans left Africa. Two major routes of dispersal have been hypothesized: one through North Africa into the Levant, documented by fossil remains, and one through Ethiopia along South Asia, for which little, if any, evidence exists. Mitochondrial DNA (mtDNA) can be used to trace maternal ancestry. The geographic distribution and variation of mtDNAs can be highly informative in defining potential range expansions and migration routes in the distant past. The mitochondrial haplogroup M, first regarded as an ancient marker of East-Asian origin, has been found at high frequency in India and Ethiopia, raising the question of its origin. (A haplogroup is a group of haplotypes that share some sequence variations.) Its variation and geographical distribution suggest that Asian haplogroup M separated from eastern-African haplogroup M more than 50,000 years ago. Two other variants (489C and 10873C) also support a single origin of haplogroup M in Africa. These findings, together with the virtual absence of haplogroup M in the Levant and its high frequency in the South-Arabian peninsula, render M the first genetic indicator for the hypothesized exit route from Africa through eastern Africa/western India. This was possibly the only successful early dispersal event of modern humans out of Africa.
Collapse
Affiliation(s)
- L Quintana-Murci
- Dipartimento di Genetica e Microbiologia, University of Pavia, Italy.
| | | | | | | | | | | |
Collapse
|
6
|
Rando JC, Cabrera VM, Larruga JM, Hernández M, González AM, Pinto F, Bandelt HJ. Phylogeographic patterns of mtDNA reflecting the colonization of the Canary Islands. Ann Hum Genet 1999; 63:413-28. [PMID: 10735583 DOI: 10.1046/j.1469-1809.1999.6350413.x] [Citation(s) in RCA: 95] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Although the Canary Islands were settled by humans, possibly of Berber origin, as late as 2500 years ago, the precise course and numbers of early migrations to the archipelago remain controversial. We have therefore analysed mtDNA variation (HVS-I as well as selected RFLP sites) in 300 individuals from the seven Canary Islands. The distribution and variation across the islands in a specific mtDNA clade of Northwest African ancestry suggest that there was one dominant initial settlement process that affected all the islands, from east to west. This indicates that a certain genetic affinity of present-day Canary Islanders to Northwest African Berbers mainly stems from the autochthonous population rather than slaves captured on the neighbouring African coast. The slave trade after the European conquest left measurable, though minor, traces in the mtDNA pool of the Canary Islands, which in its majority testifies to the European immigration.
Collapse
Affiliation(s)
- J C Rando
- Departamento de Genética, Universidad de La Laguna, Tenerife, Spain
| | | | | | | | | | | | | |
Collapse
|
7
|
Abstract
Reconstructing phylogenies from intraspecific data (such as human mitochondrial DNA variation) is often a challenging task because of large sample sizes and small genetic distances between individuals. The resulting multitude of plausible trees is best expressed by a network which displays alternative potential evolutionary paths in the form of cycles. We present a method ("median joining" [MJ]) for constructing networks from recombination-free population data that combines features of Kruskal's algorithm for finding minimum spanning trees by favoring short connections, and Farris's maximum-parsimony (MP) heuristic algorithm, which sequentially adds new vertices called "median vectors", except that our MJ method does not resolve ties. The MJ method is hence closely related to the earlier approach of Foulds, Hendy, and Penny for estimating MP trees but can be adjusted to the level of homoplasy by setting a parameter epsilon. Unlike our earlier reduced median (RM) network method, MJ is applicable to multistate characters (e.g., amino acid sequences). An additional feature is the speed of the implemented algorithm: a sample of 800 worldwide mtDNA hypervariable segment I sequences requires less than 3 h on a Pentium 120 PC. The MJ method is demonstrated on a Tibetan mitochondrial DNA RFLP data set.
Collapse
Affiliation(s)
- H J Bandelt
- Mathematisches Seminar, Universität Hamburg, Germany
| | | | | |
Collapse
|
8
|
Brown MD, Hosseini SH, Torroni A, Bandelt HJ, Allen JC, Schurr TG, Scozzari R, Cruciani F, Wallace DC. mtDNA haplogroup X: An ancient link between Europe/Western Asia and North America? Am J Hum Genet 1998; 63:1852-61. [PMID: 9837837 PMCID: PMC1377656 DOI: 10.1086/302155] [Citation(s) in RCA: 209] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
On the basis of comprehensive RFLP analysis, it has been inferred that approximately 97% of Native American mtDNAs belong to one of four major founding mtDNA lineages, designated haplogroups "A"-"D." It has been proposed that a fifth mtDNA haplogroup (haplogroup X) represents a minor founding lineage in Native Americans. Unlike haplogroups A-D, haplogroup X is also found at low frequencies in modern European populations. To investigate the origins, diversity, and continental relationships of this haplogroup, we performed mtDNA high-resolution RFLP and complete control region (CR) sequence analysis on 22 putative Native American haplogroup X and 14 putative European haplogroup X mtDNAs. The results identified a consensus haplogroup X motif that characterizes our European and Native American samples. Among Native Americans, haplogroup X appears to be essentially restricted to northern Amerindian groups, including the Ojibwa, the Nuu-Chah-Nulth, the Sioux, and the Yakima, although we also observed this haplogroup in the Na-Dene-speaking Navajo. Median network analysis indicated that European and Native American haplogroup X mtDNAs, although distinct, nevertheless are distantly related to each other. Time estimates for the arrival of X in North America are 12,000-36,000 years ago, depending on the number of assumed founders, thus supporting the conclusion that the peoples harboring haplogroup X were among the original founders of Native American populations. To date, haplogroup X has not been unambiguously identified in Asia, raising the possibility that some Native American founders were of Caucasian ancestry.
Collapse
Affiliation(s)
- M D Brown
- Center for Molecular Medicine, Emory University School of Medicine, Atlanta, GA, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Rando JC, Pinto F, González AM, Hernández M, Larruga JM, Cabrera VM, Bandelt HJ. Mitochondrial DNA analysis of northwest African populations reveals genetic exchanges with European, near-eastern, and sub-Saharan populations. Ann Hum Genet 1998; 62:531-50. [PMID: 10363131 DOI: 10.1046/j.1469-1809.1998.6260531.x] [Citation(s) in RCA: 149] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Genetic studies have emphasized the contrast between North African and sub-Saharan populations, but the particular affinities of the North African mtDNA pool to that of Europe, the Near East, and sub-Saharan Africa have not previously been investigated. We have analysed 268 mtDNA control-region sequences from various Northwest African populations including several Senegalese groups and compared these with the mtDNA database. We have identified a few mitochondrial motifs that are geographically specific and likely predate the distribution and diversification of modern language families in North and West Africa. A certain mtDNA motif (16172C, 16219G), previously found in Algerian Berbers at high frequency, is apparently omnipresent in Northwest Africa and may reflect regional continuity of more than 20,000 years. The majority of the maternal ancestors of the Berbers must have come from Europe and the Near East since the Neolithic. The Mauritanians and West-Saharans, in contrast, bear substantial though not dominant mtDNA affinity with sub-Saharans.
Collapse
Affiliation(s)
- J C Rando
- Departamento de Genética, Universidad de La Laguna, Tenerife.
| | | | | | | | | | | | | |
Collapse
|
10
|
Abstract
For most of the past century, prehistorians have had to rely on the fossil and archaeological records in order to reconstruct the past. In the last few decades, this evidence has been substantially supplemented from classical human genetics. More recently, phylogenetic analyses of DNA sequences that incorporate geographical information have provided a high-resolution tool for the investigation of prehistoric demographic events, such as founder effects and population expansions. These events can be dated using a molecular clock when the mutation rate and founder haplotypes are known. We have previously applied such methods to sequence data from the mitochondrial DNA control region, to suggest that most extant mitochondrial sequences in western Europe have a local ancestry in the Early Upper Palaeolithic, with a smaller proportion arriving from the Near East in the Neolithic. Here, we describe a cladistic notation for mitochondrial variation and expand upon our earlier analysis to present a more detailed portrait of the European mitochondrial record.
Collapse
Affiliation(s)
- M B Richards
- Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, United Kingdom.
| | | | | | | |
Collapse
|
11
|
Torroni A, Bandelt HJ, D'Urbano L, Lahermo P, Moral P, Sellitto D, Rengo C, Forster P, Savontaus ML, Bonné-Tamir B, Scozzari R. mtDNA analysis reveals a major late Paleolithic population expansion from southwestern to northeastern Europe. Am J Hum Genet 1998; 62:1137-52. [PMID: 9545392 PMCID: PMC1377079 DOI: 10.1086/301822] [Citation(s) in RCA: 269] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
mtDNA sequence variation was studied in 419 individuals from nine Eurasian populations, by high-resolution RFLP analysis, and it was followed by sequencing of the control region of a subset of these mtDNAs and a detailed survey of previously published data from numerous other European populations. This analysis revealed that a major Paleolithic population expansion from the "Atlantic zone" (southwestern Europe) occurred 10,000-15,000 years ago, after the Last Glacial Maximum. As an mtDNA marker for this expansion we identified haplogroup V, an autochthonous European haplogroup, which most likely originated in the northern Iberian peninsula or southwestern France at about the time of the Younger Dryas. Its sister haplogroup, H, which is distributed throughout the entire range of Caucasoid populations and which originated in the Near East approximately 25,000-30,000 years ago, also took part in this expansion, thus rendering it by far the most frequent (40%-60%) haplogroup in western Europe. Subsequent migrations after the Younger Dryas eventually carried those "Atlantic" mtDNAs into central and northern Europe. This scenario, already implied by archaeological records, is given overwhelming support from both the distribution of the autochthonous European Y chromosome type 15, as detected by the probes 49a/f, and the synthetic maps of nuclear data.
Collapse
Affiliation(s)
- A Torroni
- Dipartimento di Genetica e Biologia Molecolare, Universitá di Roma "La Sapienza," Rome, Italy.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
|
13
|
|
14
|
Abstract
mtDNA studies support an African origin for modern Eurasians, but expansion events within Africa have not previously been investigated. We have therefore analyzed 407 mtDNA control-region sequences from 13 African ethnic groups. A number of sequences (13%) were highly divergent and coalesced on the "mitochondrial Eve" in Africans. The remaining sequences also ultimately coalesced on this sequence but fell into four major clusters whose starlike phylogenies testify to demographic expansions. The oldest of these African expansions dates to approximately 60,000-80,000 years ago. Eurasian sequences are derived from essentially one sequence within this ancient cluster, even though a diverse mitochondrial pool was present in Africa at the time.
Collapse
Affiliation(s)
- E Watson
- School of Biological Sciences, Massey University, Palmerston North, New Zealand.
| | | | | | | |
Collapse
|
15
|
Forster P, Harding R, Torroni A, Bandelt HJ. Origin and evolution of Native American mtDNA variation: a reappraisal. Am J Hum Genet 1996; 59:935-45. [PMID: 8808611 PMCID: PMC1914796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
The timing and number of prehistoric migrations involved in the settlement of the American continent is subject to intense debate. Here, we reanalyze Native American control region mtDNA data and demonstrate that only an appropriate phylogenetic analysis accompanied by an appreciation of demographic factors allows us to discern different migrations and to estimate their ages. Reappraising 574 mtDNA control region sequences from aboriginal Siberians and Native Americans, we confirm in agreement with linguistic, archaeological and climatic evidence that (i) the major wave of migration brought one population, ancestral to the Amerinds, from northeastern Siberia to America 20,000-25,000 years ago and (ii) a rapid expansion of a Beringian source population took place at the end of the Younger Dryas glacial phase approximately 11,300 years ago, ancestral to present Eskimo and Na-Dene populations.
Collapse
Affiliation(s)
- P Forster
- Heinrich-Pette-Institut für Experimentelle Virologie und Immunologie an der Universität Hamburg, Germany
| | | | | | | |
Collapse
|
16
|
Richards M, Côrte-Real H, Forster P, Macaulay V, Wilkinson-Herbots H, Demaine A, Papiha S, Hedges R, Bandelt HJ, Sykes B. Paleolithic and neolithic lineages in the European mitochondrial gene pool. Am J Hum Genet 1996; 59:185-203. [PMID: 8659525 PMCID: PMC1915109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Phylogenetic and diversity analysis of the mtDNA control region sequence variation of 821 individuals from Europe and the Middle East distinguishes five major lineage groups with different internal diversities and divergence times. Consideration of the diversities and geographic distribution of these groups within Europe and the Middle East leads to the conclusion that ancestors of the great majority of modern, extant lineages entered Europe during the Upper Paleolithic. A further set of lineages arrived from the Middle East much later, and their age and geographic distribution within Europe correlates well with archaeological evidence for two culturally and geographically distinct Neolithic colonization events that are associated with the spread of agriculture. It follows from this interpretation that the major extant lineages throughout Europe predate the Neolithic expansion and that the spread of agriculture was a substantially indigenous development accompanied by only a relatively minor component of contemporary Middle Eastern agriculturalists. There is no evidence of any surviving Neanderthal lineages among modern Europeans.
Collapse
Affiliation(s)
- M Richards
- Department of Cellular Science, Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Abstract
Analysis of variation in the hypervariable region of mitochondrial DNA (mtDNA) has emerged as an important tool for studying human evolution and migration. However, attempts to reconstruct optimal intraspecific mtDNA phylogenies frequently fail because parallel mutation events partly obscure the true evolutionary pathways. This makes it inadvisable to present a single phylogenetic tree at the expense of neglecting equally acceptable ones. As an alternative, we propose a novel network approach for portraying mtDNA relationships. For small sample sizes (< approximately 50), an unmodified median network contains all most parsimonious trees, displays graphically the full information content of the sequence data, and can easily be generated by hand. For larger sample sizes, we reduce the complexity of the network by identifying parallelisms. This reduction procedure is guided by a compatibility argument and an additional source of phylogenetic information: the frequencies of the mitochondrial haplotypes. As a spin-off, our approach can also assist in identifying sequencing errors, which manifest themselves in implausible network substructures. We illustrate the advantages of our approach with several examples from existing data sets.
Collapse
Affiliation(s)
- H J Bandelt
- Mathematisches Seminar, Universität Hamburg, Germany
| | | | | | | |
Collapse
|
18
|
Abstract
In order to analyze the structure inherent to a matrix of dissimilarities (such as evolutionary distances) we propose to use a new technique called split decomposition. This method accurately dissects the given dissimilarity measure as a sum of elementary "split" metrics plus a (small) residue. The split summands identify related groups which are susceptible to further interpretation when casted against the available biological information. Reanalysis of previously published ribosomal RNA data sets using split decomposition illustrate the potential of this approach.
Collapse
Affiliation(s)
- H J Bandelt
- Mathematisches Seminar, Universität Hamburg, Federal Republic of Germany
| | | |
Collapse
|
19
|
Abstract
A new and apparently rather useful and natural concept in cluster analysis is studied: given a similarity measure on a set of objects, a sub-set is regarded as a cluster if any two objects a, b inside this sub-set have greater similarity than any third object outside has to at least one of a, b. These clusters then form a closure system which can be described as a hypergraph without triangles. Conversely, given such a system, one may attach some weight to each cluster and then compose a similarity measure additively, by letting the similarity of a pair be the sum of weights of the clusters containing that particular pair. The original clusters can be reconstructed from the obtained similarity measure. This clustering model is thus located between the general additive clustering model of Shepard and Arabie (1979) and the standard hierarchical model. Potential applications include fitting dendrograms with few additional nonnested clusters and simultaneous representation of some families of multiple dendrograms (in particular, two-dendrogram solutions), as well as assisting the search for phylogenetic relationships by proposing a somewhat larger system of possibly relevant "family groups", from which an appropriate choice (based on additional insight or individual preferences) remains to be made.
Collapse
|