1
|
Can irisin be developed as the molecular evolutionary clock based on the origin and functions? Gen Comp Endocrinol 2024; 352:114515. [PMID: 38582177 DOI: 10.1016/j.ygcen.2024.114515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 12/21/2023] [Accepted: 04/03/2024] [Indexed: 04/08/2024]
Abstract
Irisin, a myokine identified in 2012, has garnered research interest for its capacity to induce browning of adipocytes and improve metabolic parameters. As such, the potential therapeutic applications of this exercise-induced peptide continue to be explored. Though present across diverse animal species, sequence analysis has revealed subtle variation in the irisin protein. In this review, we consider the effects of irisin on disease states in light of its molecular evolution. We summarize current evidence for irisin's influence on pathologies and discuss how sequence changes may inform development of irisin-based therapies. Furthermore, we propose that the phylogenetic variations in irisin could potentially be leveraged as a molecular clock to elucidate evolutionary relationships.
Collapse
|
2
|
ClockstaRX: Testing Molecular Clock Hypotheses With Genomic Data. Genome Biol Evol 2024; 16:evae064. [PMID: 38526019 PMCID: PMC10999959 DOI: 10.1093/gbe/evae064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 01/11/2024] [Accepted: 03/21/2024] [Indexed: 03/26/2024] Open
Abstract
Phylogenomic data provide valuable opportunities for studying evolutionary rates and timescales. These analyses require theoretical and statistical tools based on molecular clocks. We present ClockstaRX, a flexible platform for exploring and testing evolutionary rate signals in phylogenomic data. Here, information about evolutionary rates in branches across gene trees is placed in Euclidean space, allowing data transformation, visualization, and hypothesis testing. ClockstaRX implements formal tests for identifying groups of loci and branches that make a large contribution to patterns of rate variation. This information can then be used to test for drivers of genomic evolutionary rates or to inform models for molecular dating. Drawing on the results of a simulation study, we recommend forms of data exploration and filtering that might be useful prior to molecular-clock analyses.
Collapse
|
3
|
A comprehensive genus-level phylogeny and biogeographical history of the Lythraceae based on whole plastome sequences. ANNALS OF BOTANY 2023; 132:293-318. [PMID: 37439499 PMCID: PMC10583215 DOI: 10.1093/aob/mcad091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 07/05/2023] [Indexed: 07/14/2023]
Abstract
BACKGROUND AND AIMS The Lythraceae are a mainly subtropical to tropical family of the order Myrtales with 28 currently accepted genera and approximately 600 species. There is currently no well-supported phylogenetic and biogeographical hypothesis of the Lythraceae incorporating all currently accepted genera, which we sought to provide. METHODS Plastomes of representative species of 18 distinct Lythraceae genera were sequenced and annotated. Together with existing sequences, plastomes of all 28 currently accepted genera in the Lythraceae were brought together for the first time. The plastomes were aligned and a Bayesian phylogenetic hypothesis was produced. We then conducted a time-calibrated Bayesian analysis and a biogeographical analysis. KEY RESULTS Plastome-based Bayesian and maximum-likelihood phylogenetic trees are generally congruent with recent nuclear phylogenomic data and resolve two deeply branching major clades in the Lythraceae. One major clade concentrates shrubby and arboreal South American and African genera that inhabit seasonally dry environments, with larger, often winged seeds, adapted to dispersal by the wind. The second major clade concentrates North American, Asian, African and several near-cosmopolitan herbaceous, shrubby and arboreal genera, often inhabiting humid or aquatic environments, with smaller seeds possessing structures that facilitate dispersal by water. CONCLUSIONS We hypothesize that the Lythraceae dispersed early in the Late Cretaceous from South American to North American continents, with subsequent expansion in the Late Cretaceous of a North American lineage through Laurasia to Africa via a boreotropical route. Two later expansions of South American clades to Africa in the Palaeocene and Eocene, respectively, are also hypothesized. Transoceanic dispersal in the family is possibly facilitated by adaptations to aquatic environments that are common to many extant genera of the Lythraceae, where long-distance dispersal and vicariance may be invoked to explain several remarkable disjunct distributions in Lythraceae clades.
Collapse
|
4
|
No phylogenomic support for a Cenozoic origin of the "living fossil" Isoetes. AMERICAN JOURNAL OF BOTANY 2023; 110:e16108. [PMID: 36401556 PMCID: PMC10108322 DOI: 10.1002/ajb2.16108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 11/11/2022] [Accepted: 11/11/2022] [Indexed: 06/16/2023]
Abstract
PREMISE The isoetalean lineage has a rich fossil record that extends to the Devonian, but the age of the living clade is unclear. Recent results indicate that it is young, from the Cenozoic, whereas earlier work based on less data from a denser taxon sampling yielded Mesozoic median ages. METHODS We analyzed node ages in Isoetes using two genomic data sets (plastome and nuclear ribosomal cistron), three clock models implemented in MrBayes (ILN, WN, and TK02 models), and a conservative approach to calibration. RESULTS While topological results were consistently resolved in Isoetes estimated crown group ages range from the latest Paleozoic (mid-Permian) to the Mesozoic depending on data type and clock model. The oldest estimates were retrieved using the autocorrelated TK02 clock model. An (early) Cenozoic age was only obtained under one specific condition (plastome data analyzed with the uncorrelated ILN clock model). That same plastome data set also yielded the oldest (mid-Permian) age estimate when analyzed with the autocorrelated TK02 clock model. Adding the highly divergent, recently established sister species Isoetes wormaldii to the data set approximately doubled the average median node depth to the Isoetes crown group. CONCLUSIONS There is no consistent support for a Cenozoic origin of the living clade Isoetes. We obtained seemingly well-founded, yet strongly deviating results depending on data type and clock model. The single most important future improvement is probably to add calibration points, which requires an improved understanding of the isoetalean fossil record or alternative bases for calibration.
Collapse
|
5
|
Environmental niche and flight intensity are associated with molecular evolutionary rates in a large avian radiation. BMC Ecol Evol 2022; 22:95. [PMID: 35918644 PMCID: PMC9347078 DOI: 10.1186/s12862-022-02047-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 07/20/2022] [Indexed: 11/27/2022] Open
Abstract
Background Metabolic activity and environmental energy are two of the most studied putative drivers of molecular evolutionary rates. Their extensive study, however, has resulted in mixed results and has rarely included the exploration of interactions among various factors impacting molecular evolutionary rates across large clades. Taking the diverse avian family Furnariidae as a case study, we examined the association between several estimates of molecular evolutionary rates with proxies of metabolic demands imposed by flight (wing loading and wing shape) and proxies of environmental energy across the geographic ranges of species (temperature and UV radiation). Results We found weak evidence of a positive effect of environmental and morphological variables on mitochondrial substitution rates. Additionally, we found that temperature and UV radiation interact to explain molecular rates at nucleotide sites affected by selection and population size (non-synonymous substitutions), contrary to the expectation of their impact on sites associated with mutation rates (synonymous substitutions). We also found a negative interaction between wing shape (as described by the hand-wing index) and body mass explaining mitochondrial molecular rates, suggesting molecular signatures of positive selection or reduced population sizes in small-bodied species with greater flight activity. Conclusions Our results suggest that the demands of flight and environmental energy pose multiple evolutionary pressures on the genome either by driving mutation rates or via their association with natural selection or population size. Data from whole genomes and detailed physiology across taxa will bring a more complete picture of the impact of metabolism, population size, and the environment on avian genome evolution. Supplementary Information The online version contains supplementary material available at 10.1186/s12862-022-02047-0.
Collapse
|
6
|
Phylogeographic analysis of the Bantu language expansion supports a rainforest route. Proc Natl Acad Sci U S A 2022; 119:e2112853119. [PMID: 35914165 PMCID: PMC9372543 DOI: 10.1073/pnas.2112853119] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Southern Africa has been shaped by the large-scale expansion of Bantu populations fueled by agriculture: Currently, 240 million people speak one of the more than 500 Bantu languages. However, the timing and geographic routes undergone by the Bantu populations remain largely unknown. We use cutting-edge phylogeographic techniques to show that Bantu populations migrated through the Central African tropical rainforest around 4,400 y ago. This adds to the growing evidence that agricultural expansions can successfully overcome ecological challenges as they unfold. The Bantu expansion transformed the linguistic, economic, and cultural composition of sub-Saharan Africa. However, the exact dates and routes taken by the ancestors of the speakers of the more than 500 current Bantu languages remain uncertain. Here, we use the recently developed “break-away” geographical diffusion model, specially designed for modeling migrations, with “augmented” geographic information, to reconstruct the Bantu language family expansion. This Bayesian phylogeographic approach with augmented geographical data provides a powerful way of linking linguistic, archaeological, and genetic data to test hypotheses about large language family expansions. We compare four hypotheses: an early major split north of the rainforest; a migration through the Sangha River Interval corridor around 2,500 BP; a coastal migration around 4,000 BP; and a migration through the rainforest before the corridor opening, at 4,000 BP. Our results produce a topology and timeline for the Bantu language family, which supports the hypothesis of an expansion through Central African tropical forests at 4,420 BP (4,040 to 5,000 95% highest posterior density interval), well before the Sangha River Interval was open.
Collapse
|
7
|
Investigating the reliability of molecular estimates of evolutionary time when substitution rates and speciation rates vary. BMC Ecol Evol 2022; 22:61. [PMID: 35538412 PMCID: PMC9088092 DOI: 10.1186/s12862-022-02015-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 04/14/2022] [Indexed: 11/17/2022] Open
Abstract
Background An accurate timescale of evolutionary history is essential to testing hypotheses about the influence of historical events and processes, and the timescale for evolution is increasingly derived from analysis of DNA sequences. But variation in the rate of molecular evolution complicates the inference of time from DNA. Evidence is growing for numerous factors, such as life history and habitat, that are linked both to the molecular processes of mutation and fixation and to rates of macroevolutionary diversification. However, the most widely used methods rely on idealised models of rate variation, such as the uncorrelated and autocorrelated clocks, and molecular dating methods are rarely tested against complex models of rate change. One relationship that is not accounted for in molecular dating is the potential for interaction between molecular substitution rates and speciation, a relationship that has been supported by empirical studies in a growing number of taxa. If these relationships are as widespread as current evidence suggests, they may have a significant influence on molecular dates. Results We simulate phylogenies and molecular sequences under three different realistic rate variation models—one in which speciation rates and substitution rates both vary but are unlinked, one in which they covary continuously and one punctuated model in which molecular change is concentrated in speciation events, using empirical case studies to parameterise realistic simulations. We test three commonly used “relaxed clock” molecular dating methods against these realistic simulations to explore the degree of error in molecular dates under each model. We find average divergence time inference errors ranging from 12% of node age for the unlinked model when reconstructed under an uncorrelated rate prior using BEAST 2, to up to 91% when sequences evolved under the punctuated model are reconstructed under an autocorrelated prior using PAML. Conclusions We demonstrate the potential for substantial errors in molecular dates when both speciation rates and substitution rates vary between lineages. This study highlights the need for tests of molecular dating methods against realistic models of rate variation generated from empirical parameters and known relationships. Supplementary Information The online version contains supplementary material available at 10.1186/s12862-022-02015-8.
Collapse
|
8
|
Molecular data confirm Triatoma pallidipennis Stål, 1872 (Hemiptera: Reduviidae: Triatominae) as a novel cryptic species complex. Acta Trop 2022; 229:106382. [PMID: 35189124 DOI: 10.1016/j.actatropica.2022.106382] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 02/17/2022] [Accepted: 02/17/2022] [Indexed: 12/11/2022]
Abstract
Triatoma pallidipennis constitues one of the most important Chagas disease vector in Mexico. Previous studies based on molecular data suggest T. pallidipennis as a complex of cryptic species. For that reason, we analyzed the phylogenetic relationships of T. pallidipennis using DNA sequences from the mitochondrial ND4 gene and the ITS-2 gene. In addition, the divergence times were estimated, and possible new taxa were delimited with three species delimitation methods. Finally, genetic distances and possible connectivity routes based on shared haplotypes were obtained among the T. pallidipennis populations. Five haplogroups (possible cryptic species) were found, based on delimitation methods and genetic distances. Haplogroup divergence began about 3 Ma, in the Pleistocene. Moreover, none of the haplogroups showed potential connectivity routes between them, evidencing lack of gene flow. Our results suggest the existence of a new cryptic species complex within what is currently recognized as a T. pallidipennis.
Collapse
|
9
|
Spontaneous rate of clonal single nucleotide mutations in Daphnia galeata. PLoS One 2022; 17:e0265632. [PMID: 35363773 PMCID: PMC8975155 DOI: 10.1371/journal.pone.0265632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 03/05/2022] [Indexed: 11/30/2022] Open
Abstract
Mutations are the ultimate source of heritable variation and therefore the fuel for evolution, but direct estimates of mutation rates exist only for few species. We estimated the spontaneous single nucleotide mutation rate among clonal generations in the waterflea Daphnia galeata with a short-term mutation accumulation approach. Individuals from eighteen mutation accumulation lines over five generations were deep sequenced to count de novo mutations that were not present in a pool of F1 individuals, representing the parental genotype. We identified 12 new nucleotide mutations in 90 clonal generational passages. This resulted in an estimated single nucleotide mutation rate of 0.745 x 10-9 (95% c.f. 0.39 x 10-9-1.26 x 10-9), which is slightly lower than recent estimates for other Daphnia species. We discuss the implications for the population genetics of Cladocerans.
Collapse
|
10
|
exTREEmaTIME: a method for incorporating uncertainty into divergence time estimates. Biol Open 2022; 11:274355. [PMID: 35147180 PMCID: PMC8845097 DOI: 10.1242/bio.059181] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 12/09/2021] [Indexed: 11/20/2022] Open
Abstract
We present a method of divergence time estimation (exTREEmaTIME) that aims to effectively account for uncertainty in divergence time estimates. The method requires a minimal set of assumptions, and, based on these assumptions, estimates the oldest possible divergence times and youngest possible divergence times that are consistent with the assumptions. We use a series of simulations and empirical analyses to illustrate that exTREEmaTIME is effective at representing uncertainty. We then describe how exTREEmaTIME can act as a basis to determine the implications of the more stringent assumptions that are incorporated into other methods of divergence time estimation that produce more precise estimates. This is critically important given that many of the assumptions that are incorporated into these methods are highly complex, difficult to justify biologically, and as such can lead to estimates that are highly inaccurate. This article has an associated First Person interview with the first author of the paper.
Collapse
|
11
|
Divergence time estimation of Galliformes based on the best gene shopping scheme of ultraconserved elements. BMC Ecol Evol 2021; 21:209. [PMID: 34809586 PMCID: PMC8609756 DOI: 10.1186/s12862-021-01935-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 11/08/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Divergence time estimation is fundamental to understanding many aspects of the evolution of organisms, such as character evolution, diversification, and biogeography. With the development of sequence technology, improved analytical methods, and knowledge of fossils for calibration, it is possible to obtain robust molecular dating results. However, while phylogenomic datasets show great promise in phylogenetic estimation, the best ways to leverage the large amounts of data for divergence time estimation has not been well explored. A potential solution is to focus on a subset of data for divergence time estimation, which can significantly reduce the computational burdens and avoid problems with data heterogeneity that may bias results. RESULTS In this study, we obtained thousands of ultraconserved elements (UCEs) from 130 extant galliform taxa, including representatives of all genera, to determine the divergence times throughout galliform history. We tested the effects of different "gene shopping" schemes on divergence time estimation using a carefully, and previously validated, set of fossils. Our results found commonly used clock-like schemes may not be suitable for UCE dating (or other data types) where some loci have little information. We suggest use of partitioning (e.g., PartitionFinder) and selection of tree-like partitions may be good strategies to select a subset of data for divergence time estimation from UCEs. Our galliform time tree is largely consistent with other molecular clock studies of mitochondrial and nuclear loci. With our increased taxon sampling, a well-resolved topology, carefully vetted fossil calibrations, and suitable molecular dating methods, we obtained a high quality galliform time tree. CONCLUSIONS We provide a robust galliform backbone time tree that can be combined with more fossil records to further facilitate our understanding of the evolution of Galliformes and can be used as a resource for comparative and biogeographic studies in this group.
Collapse
|
12
|
Next-generation cophylogeny: unravelling eco-evolutionary processes. Trends Ecol Evol 2021; 36:907-918. [PMID: 34243958 DOI: 10.1016/j.tree.2021.06.006] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 06/09/2021] [Accepted: 06/11/2021] [Indexed: 11/19/2022]
Abstract
A fundamental question in evolutionary biology is how microevolutionary processes translate into species diversification. Cophylogeny provides an appropriate framework to address this for symbiotic associations, but historically has been primarily limited to unveiling patterns. We argue that it is essential to integrate advances from ecology and evolutionary biology into cophylogeny, to gain greater mechanistic insights and transform cophylogeny into a platform to advance understanding of interspecific interactions and diversification more widely. We discuss key directions, such as incorporating trait reconstruction and considering multiple scales of network organization, and highlight recent developments for implementation. A new quantitative framework is proposed to allow integration of relevant information, such as quantitative traits and assessment of the contribution of individual mechanisms to cophylogenetic patterns.
Collapse
|
13
|
Large-Scale Phylogenomic Analyses Reveal the Monophyly of Bryophytes and Neoproterozoic Origin of Land Plants. Mol Biol Evol 2021; 38:3332-3344. [PMID: 33871608 PMCID: PMC8321542 DOI: 10.1093/molbev/msab106] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The relationships among the four major embryophyte lineages (mosses, liverworts, hornworts, vascular plants) and the timing of the origin of land plants are enigmatic problems in plant evolution. Here, we resolve the monophyly of bryophytes by improving taxon sampling of hornworts and eliminating the effect of synonymous substitutions. We then estimate the divergence time of crown embryophytes based on three fossil calibration strategies, and reveal that maximum calibration constraints have a major effect on estimating the time of origin of land plants. Moreover, comparison of priors and posteriors provides a guide for evaluating the optimal calibration strategy. By considering the reliability of fossil calibrations and the influences of molecular data, we estimate that land plants originated in the Precambrian (980–682 Ma), much older than widely recognized. Our study highlights the important contribution of molecular data when faced with contentious fossil evidence, and that fossil calibrations used in estimating the timescale of plant evolution require critical scrutiny.
Collapse
|
14
|
The implications of interrelated assumptions on estimates of divergence times and rates of diversification. Syst Biol 2021; 70:1181-1199. [PMID: 33760070 DOI: 10.1093/sysbio/syab021] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 03/16/2021] [Accepted: 03/22/2021] [Indexed: 11/15/2022] Open
Abstract
Phylogenies are increasingly being used as a basis to provide insight into macroevolutionary history. Here, we use simulation experiments and empirical analyses to evaluate methods that use phylogenies as a basis to make estimates of divergence times and rates of diversification. This is the first study to present a comprehensive assessment of the key variables that underpin analyses in this field - including substitution rates, speciation rates, and extinction, plus character sampling and taxon sampling. We show that in unrealistically simplistic cases (where substitution rates and speciation rates are constant, and where there is no extinction), increased character and taxon sampling lead to more accurate and precise parameter estimates. By contrast, in more complex but realistic cases (where substitution rates, speciation rates, and extinction rates vary), gains in accuracy and precision from increased character and taxon sampling are far more limited. The lack of accuracy and precision even occurs when using methods that are designed to account for more complex cases, such as relaxed clocks, fossil calibrations, and models that allow speciation rates and extinction rates to vary. The problem also persists when analysing genomic scale datasets. These results suggest two interrelated problems that occur when the processes that generated the data are more complex. First, methodological assumptions are more likely to be violated. Second, limitations in the information content of the data become more important.
Collapse
|
15
|
Abstract
Phylogenetic trees inferred from sequence data often have branch lengths measured in the expected number of substitutions and therefore, do not have divergence times estimated. These trees give an incomplete view of evolutionary histories since many applications of phylogenies require time trees. Many methods have been developed to convert the inferred branch lengths from substitution unit to time unit using calibration points, but none is universally accepted as they are challenged in both scalability and accuracy under complex models. Here, we introduce a new method that formulates dating as a nonconvex optimization problem where the variance of log-transformed rate multipliers is minimized across the tree. On simulated and real data, we show that our method, wLogDate, is often more accurate than alternatives and is more robust to various model assumptions.
Collapse
|
16
|
The Implications of Lineage-Specific Rates for Divergence Time Estimation. Syst Biol 2021; 69:660-670. [PMID: 31808929 PMCID: PMC7302051 DOI: 10.1093/sysbio/syz080] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 11/28/2019] [Accepted: 12/01/2019] [Indexed: 11/29/2022] Open
Abstract
Rate variation adds considerable complexity to divergence time estimation in molecular phylogenies. Here, we evaluate the impact of lineage-specific rates—which we define as among-branch-rate-variation that acts consistently across the entire genome. We compare its impact to residual rates—defined as among-branch-rate-variation that shows a different pattern of rate variation at each sampled locus, and gene-specific rates—defined as variation in the average rate across all branches at each sampled locus. We show that lineage-specific rates lead to erroneous divergence time estimates, regardless of how many loci are sampled. Further, we show that stronger lineage-specific rates lead to increasing error. This contrasts to residual rates and gene-specific rates, where sampling more loci significantly reduces error. If divergence times are inferred in a Bayesian framework, we highlight that error caused by lineage-specific rates significantly reduces the probability that the 95% highest posterior density includes the correct value, and leads to sensitivity to the prior. Use of a more complex rate prior—which has recently been proposed to model rate variation more accurately—does not affect these conclusions. Finally, we show that the scale of lineage-specific rates used in our simulation experiments is comparable to that of an empirical data set for the angiosperm genus Ipomoea. Taken together, our findings demonstrate that lineage-specific rates cause error in divergence time estimates, and that this error is not overcome by analyzing genomic scale multilocus data sets. [Divergence time estimation; error; rate variation.]
Collapse
|
17
|
Abstract
Understanding and representing uncertainty is crucial in academic research because it enables studies to build on the conclusions of previous studies, leading to robust advances in a particular field. Here, we evaluate the nature of uncertainty and the manner by which it is represented in divergence time estimation, a field that is fundamental to many aspects of macroevolutionary research, and where there is evidence that uncertainty has been seriously underestimated. We address this issue in the context of methods used in divergence time estimation, and with respect to the manner by which time-calibrated phylogenies are interpreted. With respect to methods, we discuss how the assumptions underlying different methods may not adequately reflect uncertainty about molecular evolution, the fossil record, or diversification rates. Therefore, divergence time estimates may not adequately reflect uncertainty and may be directly contradicted by subsequent findings. For the interpretation of time-calibrated phylogenies, we discuss how the use of time-calibrated phylogenies for reconstructing general evolutionary timescales leads to inferences about macroevolution that are highly sensitive to methodological limitations in how uncertainty is accounted for. By contrast, we discuss how the use of time-calibrated phylogenies to test specific hypotheses leads to inferences about macroevolution that are less sensitive to methodological limitations. Given that many biologists wish to use time-calibrated phylogenies to reconstruct general evolutionary timescales, we conclude that the development of methods of divergence time estimation that adequately account for uncertainty is necessary. [Divergence time estimation; macroevolution; uncertainty.].
Collapse
|
18
|
Phylogenetic diversity metrics from molecular phylogenies: modelling expected degree of error under realistic rate variation. DIVERS DISTRIB 2020. [DOI: 10.1111/ddi.13179] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
19
|
Phylogeny of the North-Central American clade of blood-sucking reduviid bugs of the tribe Triatomini (Hemiptera: Triatominae) based on the mitochondrial genome. INFECTION GENETICS AND EVOLUTION 2020; 84:104373. [DOI: 10.1016/j.meegid.2020.104373] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Revised: 05/18/2020] [Accepted: 05/19/2020] [Indexed: 12/19/2022]
|
20
|
Molecular dating for phylogenies containing a mix of populations and species by using Bayesian and RelTime approaches. Mol Ecol Resour 2020; 21:122-136. [PMID: 32881388 DOI: 10.1111/1755-0998.13249] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2019] [Revised: 08/14/2020] [Accepted: 08/19/2020] [Indexed: 12/11/2022]
Abstract
Simultaneous molecular dating of population and species divergences is essential in many biological investigations, including phylogeography, phylodynamics and species delimitation studies. In these investigations, multiple sequence alignments consist of both intra- and interspecies samples (mixed samples). As a result, the phylogenetic trees contain interspecies, interpopulation and within-population divergences. Bayesian relaxed clock methods are often employed in these analyses, but they assume the same tree prior for both inter- and intraspecies branching processes and require specification of a clock model for branch rates (independent vs. autocorrelated rates models). We evaluated the impact of a single tree prior on Bayesian divergence time estimates by analysing computer-simulated data sets. We also examined the effect of the assumption of independence of evolutionary rate variation among branches when the branch rates are autocorrelated. Bayesian approach with coalescent tree priors generally produced excellent molecular dates and highest posterior densities with high coverage probabilities. We also evaluated the performance of a non-Bayesian method, RelTime, which does not require the specification of a tree prior or a clock model. RelTime's performance was similar to that of the Bayesian approach, suggesting that it is also suitable to analyse data sets containing both populations and species variation when its computational efficiency is needed.
Collapse
|
21
|
Deep-Time Demographic Inference Suggests Ecological Release as Driver of Neoavian Adaptive Radiation. DIVERSITY-BASEL 2020. [DOI: 10.3390/d12040164] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Assessing the applicability of theory to major adaptive radiations in deep time represents an extremely difficult problem in evolutionary biology. Neoaves, which includes 95% of living birds, is believed to have undergone a period of rapid diversification roughly coincident with the Cretaceous–Paleogene (K-Pg) boundary. We investigate whether basal neoavian lineages experienced an ecological release in response to ecological opportunity, as evidenced by density compensation. We estimated effective population sizes (Ne) of basal neoavian lineages by combining coalescent branch lengths (CBLs) and the numbers of generations between successive divergences. We used a modified version of Accurate Species TRee Algorithm (ASTRAL) to estimate CBLs directly from insertion–deletion (indel) data, as well as from gene trees using DNA sequence and/or indel data. We found that some divergences near the K-Pg boundary involved unexpectedly high gene tree discordance relative to the estimated number of generations between speciation events. The simplest explanation for this result is an increase in Ne, despite the caveats discussed herein. It appears that at least some early neoavian lineages, similar to the ancestor of the clade comprising doves, mesites, and sandgrouse, experienced ecological release near the time of the K-Pg mass extinction.
Collapse
|
22
|
A Simulation-Based Evaluation of Tip-Dating Under the Fossilized Birth-Death Process. Syst Biol 2020; 69:325-344. [PMID: 31132125 PMCID: PMC7175741 DOI: 10.1093/sysbio/syz038] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Revised: 05/13/2019] [Accepted: 05/17/2019] [Indexed: 11/25/2022] Open
Abstract
Bayesian molecular dating is widely used to study evolutionary timescales. This procedure usually involves phylogenetic analysis of nucleotide sequence data, with fossil-based calibrations applied as age constraints on internal nodes of the tree. An alternative approach is tip-dating, which explicitly includes fossil data in the analysis. This can be done, for example, through the joint analysis of molecular data from present-day taxa and morphological data from both extant and fossil taxa. In the context of tip-dating, an important development has been the fossilized birth-death process, which allows non-contemporaneous tips and sampled ancestors while providing a model of lineage diversification for the prior on the tree topology and internal node times. However, tip-dating with fossils faces a number of considerable challenges, especially, those associated with fossil sampling and evolutionary models for morphological characters. We conducted a simulation study to evaluate the performance of tip-dating using the fossilized birth-death model. We simulated fossil occurrences and the evolution of nucleotide sequences and morphological characters under a wide range of conditions. Our analyses of these data show that the number and the maximum age of fossil occurrences have a greater influence than the degree of among-lineage rate variation or the number of morphological characters on estimates of node times and the tree topology. Tip-dating with the fossilized birth-death model generally performs well in recovering the relationships among extant taxa but has difficulties in correctly placing fossil taxa in the tree and identifying the number of sampled ancestors. The method yields accurate estimates of the ages of the root and crown group, although the precision of these estimates varies with the probability of fossil occurrence. The exclusion of morphological characters results in a slight overestimation of node times, whereas the exclusion of nucleotide sequences has a negative impact on inference of the tree topology. Our results provide an overview of the performance of tip-dating using the fossilized birth-death model, which will inform further development of the method and its application to key questions in evolutionary biology.
Collapse
|
23
|
Reliable Confidence Intervals for RelTime Estimates of Evolutionary Divergence Times. Mol Biol Evol 2020; 37:280-290. [PMID: 31638157 DOI: 10.1093/molbev/msz236] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Confidence intervals (CIs) depict the statistical uncertainty surrounding evolutionary divergence time estimates. They capture variance contributed by the finite number of sequences and sites used in the alignment, deviations of evolutionary rates from a strict molecular clock in a phylogeny, and uncertainty associated with clock calibrations. Reliable tests of biological hypotheses demand reliable CIs. However, current non-Bayesian methods may produce unreliable CIs because they do not incorporate rate variation among lineages and interactions among clock calibrations properly. Here, we present a new analytical method to calculate CIs of divergence times estimated using the RelTime method, along with an approach to utilize multiple calibration uncertainty densities in dating analyses. Empirical data analyses showed that the new methods produce CIs that overlap with Bayesian highest posterior density intervals. In the analysis of computer-simulated data, we found that RelTime CIs show excellent average coverage probabilities, that is, the actual time is contained within the CIs with a 94% probability. These developments will encourage broader use of computationally efficient RelTime approaches in molecular dating analyses and biological hypothesis testing.
Collapse
|
24
|
Linking Branch Lengths across Sets of Loci Provides the Highest Statistical Support for Phylogenetic Inference. Mol Biol Evol 2019; 37:1202-1210. [DOI: 10.1093/molbev/msz291] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
AbstractEvolution leaves heterogeneous patterns of nucleotide variation across the genome, with different loci subject to varying degrees of mutation, selection, and drift. In phylogenetics, the potential impacts of partitioning sequence data for the assignment of substitution models are well appreciated. In contrast, the treatment of branch lengths has received far less attention. In this study, we examined the effects of linking and unlinking branch-length parameters across loci or subsets of loci. By analyzing a range of empirical data sets, we find consistent support for a model in which branch lengths are proportionate between subsets of loci: gene trees share the same pattern of branch lengths, but form subsets that vary in their overall tree lengths. These models had substantially better statistical support than models that assume identical branch lengths across gene trees, or those in which genes form subsets with distinct branch-length patterns. We show using simulations and empirical data that the complexity of the branch-length model with the highest support depends on the length of the sequence alignment and on the numbers of taxa and loci in the data set. Our findings suggest that models in which branch lengths are proportionate between subsets have the highest statistical support under the conditions that are most commonly seen in practice. The results of our study have implications for model selection, computational efficiency, and experimental design in phylogenomics.
Collapse
|
25
|
Post K-Pg diversification of the mammalian order Eulipotyphla as suggested by phylogenomic analyses of ultra-conserved elements. Mol Phylogenet Evol 2019; 141:106605. [PMID: 31479732 DOI: 10.1016/j.ympev.2019.106605] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 08/26/2019] [Accepted: 08/27/2019] [Indexed: 11/26/2022]
Abstract
The origin of the mammalian order Eulipotyphla has been debated intensively with arguments around whether they began diversifying before or after the Cretaceous-Palaeogene (K-Pg) boundary at 66 Ma. Here, we used an in-solution nucleotide capture method and next generation DNA sequencing to determine the sequence of hundreds of ultra-conserved elements (UCEs), and conducted phylogenomic and molecular dating analyses for the four extant eulipotyphlan lineages-Erinaceidae, Solenodontidae, Soricidae, and Talpidae. Concatenated maximum-likelihood analyses with single or partitioned models and a coalescent species-tree analysis showed that divergences among the four major eulipotyphlan lineages occurred within a short period of evolutionary time, but did not resolve the interrelationships among them. Alternative suboptimal phylogenetic hypotheses received consistently the same amount of support from different UCE loci, and were not significantly different from the maximum likelihood tree topology, suggesting the prevalence of stochastic lineage sorting. Molecular dating analyses that incorporated among-lineage evolutionary rate differences supported a scenario where the four eulipotyphlan families diversified between 57.8 and 63.2 Ma. Given short branch lengths with low support values, traces of rampant genome-wide stochastic lineage sorting, and post K-Pg diversification, we concluded that the crown eulipotyphlan lineages arose through a rapid diversification after the K-Pg boundary when novel niches were created by the mass extinction of species.
Collapse
|
26
|
The Estimated Pacemaker for Great Apes Supports the Hominoid Slowdown Hypothesis. Evol Bioinform Online 2019; 15:1176934319855988. [PMID: 31223232 PMCID: PMC6566470 DOI: 10.1177/1176934319855988] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 05/17/2019] [Indexed: 11/16/2022] Open
Abstract
The recent surge of genomic data has prompted the investigation of substitution rate variation across the genome, as well as among lineages. Evolutionary trees inferred from distinct genomic regions may display branch lengths that differ between loci by simple proportionality constants, indicating that rate variation follows a pacemaker model, which may be attributed to lineage effects. Analyses of genes from diverse biological clades produced contrasting results, supporting either this model or alternative scenarios where multiple pacemakers exist. So far, an evaluation of the pacemaker hypothesis for all great apes has never been carried out. In this work, we tested whether the evolutionary rates of hominids conform to pacemakers, which were inferred accounting for gene tree/species tree discordance. For higher precision, substitution rates in branches were estimated with a calibration-free approach, the relative rate framework. A predominant evolutionary trend in great apes was evidenced by the recovery of a large pacemaker, encompassing most hominid genomic regions. In addition, the majority of genes followed a pace of evolution that was closely related to the strict molecular clock. However, slight rate decreases were recovered in the internal branches leading to humans, corroborating the hominoid slowdown hypothesis. Our findings suggest that in great apes, life history traits were the major drivers of substitution rate variation across the genome.
Collapse
|
27
|
East African cichlid lineages (Teleostei: Cichlidae) might be older than their ancient host lakes: new divergence estimates for the east African cichlid radiation. BMC Evol Biol 2019; 19:94. [PMID: 31023223 PMCID: PMC6482553 DOI: 10.1186/s12862-019-1417-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 03/31/2019] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Cichlids are a prime model system in evolutionary research and several of the most prominent examples of adaptive radiations are found in the East African Lakes Tanganyika, Malawi and Victoria, all part of the East African cichlid radiation (EAR). In the past, great effort has been invested in reconstructing the evolutionary and biogeographic history of cichlids (Teleostei: Cichlidae). In this study, we present new divergence age estimates for the major cichlid lineages with the main focus on the EAR based on a dataset encompassing representative taxa of almost all recognized cichlid tribes and ten mitochondrial protein genes. We have thoroughly re-evaluated both fossil and geological calibration points, and we included the recently described fossil †Tugenchromis pickfordi in the cichlid divergence age estimates. RESULTS Our results estimate the origin of the EAR to Late Eocene/Early Oligocene (28.71 Ma; 95% HPD: 24.43-33.15 Ma). More importantly divergence ages of the most recent common ancestor (MRCA) of several Tanganyika cichlid tribes were estimated to be substantially older than the oldest estimated maximum age of the Lake Tanganyika: Trematocarini (16.13 Ma, 95% HPD: 11.89-20.46 Ma), Bathybatini (20.62 Ma, 95% HPD: 16.88-25.34 Ma), Lamprologini (15.27 Ma; 95% HPD: 12.23-18.49 Ma). The divergence age of the crown haplochromine H-lineage is estimated to 22.8 Ma (95% HPD: 14.40-26.32 Ma) and of the Lake Malawi radiation to 4.07 Ma (95% HDP: 2.93-5.26 Ma). In addition, we recovered a novel lineage within the Lamprologini tribe encompassing only Lamprologus of the lower and central Congo drainage with its divergence estimated to the Late Miocene or early Pliocene. Furthermore we recovered two novel mitochondrial haplotype lineages within the Haplochromini tribe: 'Orthochromis' indermauri and 'Haplochormis' vanheusdeni. CONCLUSIONS Divergence time estimates of the MRCA of several Tanganyika cichlid tribes predate the age of the extant Lake Tanganyika basin, and hence are in line with the recently formulated "Melting-Pot Tanganyika" hypothesis. The radiation of the 'Lower Congo Lamprologus clade' might be linked with the Pliocene origin of the modern lower Congo rapids as has been shown for other Lower Congo cichlid assemblages. Finally, the age of origin of the Lake Malawi cichlid flock agrees well with the oldest age estimate for lacustrine conditions in Lake Malawi.
Collapse
|
28
|
Contrasting patterns of diversification between Amazonian and Atlantic forest clades of Neotropical lianas (Amphilophium, Bignonieae) inferred from plastid genomic data. Mol Phylogenet Evol 2019; 133:92-106. [DOI: 10.1016/j.ympev.2018.12.021] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Revised: 11/20/2018] [Accepted: 12/16/2018] [Indexed: 01/23/2023]
|
29
|
Six Impossible Things before Breakfast: Assumptions, Models, and Belief in Molecular Dating. Trends Ecol Evol 2019; 34:474-486. [PMID: 30904189 DOI: 10.1016/j.tree.2019.01.017] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 01/29/2019] [Accepted: 01/31/2019] [Indexed: 01/16/2023]
Abstract
Confidence in molecular dating analyses has grown with the increasing sophistication of the methods. Some problematic cases where molecular dates disagreed with paleontological estimates appear to have been resolved with a growing agreement between molecules and fossils. But we cannot relax just yet. The growing analytical sophistication of many molecular dating methods relies on an increasingly large number of assumptions about evolutionary history and processes. Many of these assumptions are based on statistical tractability rather than being informed by improved understanding of molecular evolution, yet changing the assumptions can influence molecular dates. How can we tell if the answers we get are driven more by the assumptions we make than by the molecular data being analyzed?
Collapse
|
30
|
Bayesian Estimation of Species Divergence Times Using Correlated Quantitative Characters. Syst Biol 2019; 68:967-986. [DOI: 10.1093/sysbio/syz015] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Revised: 02/16/2019] [Accepted: 02/20/2019] [Indexed: 11/12/2022] Open
Abstract
Abstract
Discrete morphological data have been widely used to study species evolution, but the use of quantitative (or continuous) morphological characters is less common. Here, we implement a Bayesian method to estimate species divergence times using quantitative characters. Quantitative character evolution is modeled using Brownian diffusion with character correlation and character variation within populations. Through simulations, we demonstrate that ignoring the population variation (or population “noise”) and the correlation among characters leads to biased estimates of divergence times and rate, especially if the correlation and population noise are high. We apply our new method to the analysis of quantitative characters (cranium landmarks) and molecular data from carnivoran mammals. Our results show that time estimates are affected by whether the correlations and population noise are accounted for or ignored in the analysis. The estimates are also affected by the type of data analyzed, with analyses of morphological characters only, molecular data only, or a combination of both; showing noticeable differences among the time estimates. Rate variation of morphological characters among the carnivoran species appears to be very high, with Bayesian model selection indicating that the independent-rates model fits the morphological data better than the autocorrelated-rates model. We suggest that using morphological continuous characters, together with molecular data, can bring a new perspective to the study of species evolution. Our new model is implemented in the MCMCtree computer program for Bayesian inference of divergence times.
Collapse
|
31
|
An In Silico Comparison of Protocols for Dated Phylogenomics. Syst Biol 2018; 67:633-650. [PMID: 29319797 DOI: 10.1093/sysbio/syx089] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2015] [Accepted: 10/24/2017] [Indexed: 01/02/2023] Open
Abstract
In the age of genome-scale DNA sequencing, choice of molecular marker arguably remains an important decision in planning a phylogenetic study. Using published genomes from 23 primate species, we make a standardized comparison of four of the most frequently used protocols in phylogenomics, viz., targeted sequence-enrichment using ultraconserved element and exon-capture probes, and restriction-site-associated DNA sequencing (RADseq and ddRADseq). Here, we present a procedure to perform in silico extractions from genomes and create directly comparable data sets for each class of marker. We then compare these data sets in terms of both phylogenetic resolution and ability to consistently and precisely estimate clade ages using fossil-calibrated molecular-clock models. Furthermore, we were also able to directly compare these results to previously published data sets from Sanger-sequenced nuclear exons and mitochondrial genomes under the same analytical conditions. Our results show-although with the exception of the mitochondrial genome data set and the smallest ddRADseq data set-that for uncontroversial nodes all data classes performed equally well, that is they recovered the same well supported topology. However, for one difficult-to-resolve node comprising a rapid diversification, we report well supported but conflicting topologies among the marker classes consistent with the mismodeling of gene tree heterogeneity as demonstrated by species tree analyses of single nucleotide polymorphisms. Likewise, clade age estimates showed consistent discrepancies between data sets under strict and relaxed clock models; for recent nodes, clade ages estimated by nuclear exon data sets were younger than those of the UCE, RADseq and mitochondrial data, but vice versa for the deepest nodes in the primate phylogeny. This observation is explained by temporal differences in phylogenetic informativeness (PI), with the data sets with strong PI peaks toward the present underestimating the deepest node ages. Finally, we conclude by emphasizing that while huge numbers of loci are probably not required for uncontroversial phylogenetic questions-for which practical considerations such as ease of data generation, sharing, and aggregating, therefore become increasingly important-accurately modeling heterogeneous data remains as relevant as ever for the more recalcitrant problems.
Collapse
|
32
|
The molecular clock and evolutionary timescales. Biochem Soc Trans 2018; 46:1183-1190. [PMID: 30154097 DOI: 10.1042/bst20180186] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Revised: 07/17/2018] [Accepted: 07/24/2018] [Indexed: 11/17/2022]
Abstract
The molecular clock provides a valuable means of estimating evolutionary timescales from genetic and biochemical data. Proposed in the early 1960s, it was first applied to amino acid sequences and immunological measures of genetic distances between species. The molecular clock has undergone considerable development over the years, and it retains profound relevance in the genomic era. In this mini-review, we describe the history of the molecular clock, its impact on evolutionary theory, the challenges brought by evidence of evolutionary rate variation among species, and the statistical models that have been developed to account for these heterogeneous rates of genetic change. We explain how the molecular clock can be used to infer rates and timescales of evolution, and we list some of the key findings that have been obtained when molecular clocks have been applied to genomic data. Despite the numerous challenges that it has faced over the decades, the molecular clock continues to offer the most effective method of resolving the details of the evolutionary timescale of the Tree of Life.
Collapse
|
33
|
A tangle of forms and phylogeny: Extensive morphological homoplasy and molecular clock heterogeneity in Bonnetina and related tarantulas. Mol Phylogenet Evol 2018; 127:55-73. [DOI: 10.1016/j.ympev.2018.05.013] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Revised: 04/25/2018] [Accepted: 05/13/2018] [Indexed: 12/13/2022]
|
34
|
Time-calibrated molecular phylogeny reveals a Miocene–Pliocene diversification in the Amazon miniature killifish genus Fluviphylax (Cyprinodontiformes: Cyprinodontoidei). ORG DIVERS EVOL 2018. [DOI: 10.1007/s13127-018-0373-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
35
|
Using Phylogenomic Data to Explore the Effects of Relaxed Clocks and Calibration Strategies on Divergence Time Estimation: Primates as a Test Case. Syst Biol 2018; 67:594-615. [PMID: 29342307 PMCID: PMC6005039 DOI: 10.1093/sysbio/syy001] [Citation(s) in RCA: 94] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2017] [Revised: 12/26/2017] [Accepted: 01/05/2018] [Indexed: 11/13/2022] Open
Abstract
Primates have long been a test case for the development of phylogenetic methods for divergence time estimation. Despite a large number of studies, however, the timing of origination of crown Primates relative to the Cretaceous-Paleogene (K-Pg) boundary and the timing of diversification of the main crown groups remain controversial. Here, we analysed a data set of 372 taxa (367 Primates and 5 outgroups, 3.4 million aligned base pairs) that includes nine primate genomes. We systematically explore the effect of different interpretations of fossil calibrations and molecular clock models on primate divergence time estimates. We find that even small differences in the construction of fossil calibrations can have a noticeable impact on estimated divergence times, especially for the oldest nodes in the tree. Notably, choice of molecular rate model (autocorrelated or independently distributed rates) has an especially strong effect on estimated times, with the independent rates model producing considerably more ancient age estimates for the deeper nodes in the phylogeny. We implement thermodynamic integration, combined with Gaussian quadrature, in the program MCMCTree, and use it to calculate Bayes factors for clock models. Bayesian model selection indicates that the autocorrelated rates model fits the primate data substantially better, and we conclude that time estimates under this model should be preferred. We show that for eight core nodes in the phylogeny, uncertainty in time estimates is close to the theoretical limit imposed by fossil uncertainties. Thus, these estimates are unlikely to be improved by collecting additional molecular sequence data. All analyses place the origin of Primates close to the K-Pg boundary, either in the Cretaceous or straddling the boundary into the Palaeogene.
Collapse
|
36
|
|
37
|
Strategies for Partitioning Clock Models in Phylogenomic Dating: Application to the Angiosperm Evolutionary Timescale. Genome Biol Evol 2018; 9:2752-2763. [PMID: 29036288 PMCID: PMC5647803 DOI: 10.1093/gbe/evx198] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/25/2017] [Indexed: 12/14/2022] Open
Abstract
Evolutionary timescales can be inferred from molecular sequence data using a Bayesian phylogenetic approach. In these methods, the molecular clock is often calibrated using fossil data. The uncertainty in these fossil calibrations is important because it determines the limiting posterior distribution for divergence-time estimates as the sequence length tends to infinity. Here, we investigate how the accuracy and precision of Bayesian divergence-time estimates improve with the increased clock-partitioning of genome-scale data into clock-subsets. We focus on a data set comprising plastome-scale sequences of 52 angiosperm taxa. There was little difference among the Bayesian date estimates whether we chose clock-subsets based on patterns of among-lineage rate heterogeneity or relative rates across genes, or by random assignment. Increasing the degree of clock-partitioning usually led to an improvement in the precision of divergence-time estimates, but this increase was asymptotic to a limit presumably imposed by fossil calibrations. Our clock-partitioning approaches yielded highly precise age estimates for several key nodes in the angiosperm phylogeny. For example, when partitioning the data into 20 clock-subsets based on patterns of among-lineage rate heterogeneity, we inferred crown angiosperms to have arisen 198–178 Ma. This demonstrates that judicious clock-partitioning can improve the precision of molecular dating based on phylogenomic data, but the meaning of this increased precision should be considered critically.
Collapse
|
38
|
The Microtus voles: Resolving the phylogeny of one of the most speciose mammalian genera using genomics. Mol Phylogenet Evol 2018; 125:85-92. [PMID: 29574272 DOI: 10.1016/j.ympev.2018.03.017] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2017] [Revised: 01/03/2018] [Accepted: 03/14/2018] [Indexed: 11/24/2022]
Abstract
Sequential rapid radiations pose some of the greatest difficulties in phylogenetics, especially when analysing only a small number of genetic markers. Given that most of the speciation events occur in quick succession at various points in time, this creates particular challenges in determining phylogenetic relationships, i.e. branching order and divergence times. With the development of high throughput sequencing, thousands of markers can now readily be used to tackle these issues. Microtus is a speciose genus currently composed of 65 species that evolved over the last 2 million years. Although it is a well-studied group, there is still phylogenetic uncertainty at various divergence levels. Building upon previous studies that generally used small numbers of mitochondrial and/or nuclear loci, in this genomic-scale study we used both mitochondrial and nuclear data to study the rapid radiation within Microtus, using partial mitogenomes and genotyping-by-sequencing (GBS) on seven species representing five Microtus subgenera and the main biogeographic ranges where this group occurs. Both types of genome (mitochondrial and nuclear) generated similar tree topologies, with a basal split of the Nearctic (M. ochrogaster) and Holarctic (M. oeconomus) species, and then a subdivision of the five Palearctic species into two subgroups. These data support the occurrence of two European radiations, one North American radiation, and a later expansion of M. oeconomus from Asia to both Europe and North America. We further resolved the positioning of M. cabrerae as sister group of M. agrestis and refute the claim that M. cabrerae should be elevated to its own genus (Iberomys). Finally, the data support ongoing speciation events, especially within M. agrestis, with high levels of genetic divergence between the three Evolutionarily Significant Units (ESUs) previously identified. Similar high levels of divergence were also found among ESUs within M. oeconomus and M. arvalis.
Collapse
|
39
|
Ice age unfrozen: severe effect of the last interglacial, not glacial, climate change on East Asian avifauna. BMC Evol Biol 2017; 17:244. [PMID: 29212454 PMCID: PMC5719578 DOI: 10.1186/s12862-017-1100-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2017] [Accepted: 11/28/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The glacial-interglacial cycles in the Pleistocene caused repeated range expansion and contraction of species in several regions in the world. However, it remains uncertain whether such climate oscillations had similar impact on East Asian biota, despite its widely recognized importance in global biodiversity. Here we use both molecular and ecological niche profiles on 11 East Asian avian species with various elevational ranges to reveal their response to the late Pleistocene climate changes. RESULTS The ecological niche models (ENM) consistently showed that these avian species might substantially contract their ranges to the south during the Last Interglacial period (LIG) and expanded their northern range margins through the Last Glacial Maximum (LGM), leading to the LGM ranges observed for all 11 species. Consistently, coalescent simulations based on 25-30 nuclear genes retrieved signatures of significant population growth through the last glacial period across all species studied. Climate statistics suggested that high climatic variability during the LIG and a relatively mild climate at the LGM potentially explained the historical population dynamics of these birds. CONCLUSIONS This is the first study based on multiple species and both lines of ecological niche profiles and genetic data to characterize the unique response of East Asian biota to late Pleistocene climate. The present study highlights regional differences in the evolutionary consequence of climate change during the last glacial cycle and implies that global warming might pose a great risk to species in this region given potentially higher climatic variation in the future analogous to that during the LIG.
Collapse
|
40
|
|
41
|
Analysis of Phylogenomic Tree Space Resolves Relationships Among Marsupial Families. Syst Biol 2017; 67:400-412. [DOI: 10.1093/sysbio/syx076] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2017] [Accepted: 09/08/2017] [Indexed: 02/02/2023] Open
|
42
|
Interpreting the genomic landscape of speciation: a road map for finding barriers to gene flow. J Evol Biol 2017; 30:1450-1477. [DOI: 10.1111/jeb.13047] [Citation(s) in RCA: 306] [Impact Index Per Article: 43.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Revised: 01/31/2017] [Accepted: 02/01/2017] [Indexed: 12/14/2022]
|
43
|
Molecular phylogeny and timing of diversification in South American Cynolebiini seasonal killifishes. Mol Phylogenet Evol 2017; 116:61-68. [PMID: 28754241 DOI: 10.1016/j.ympev.2017.07.020] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Revised: 07/10/2017] [Accepted: 07/24/2017] [Indexed: 11/17/2022]
Abstract
The rich biological diversity of South America has motivated a series of studies associating evolution of endemic taxa with the dramatic geologic and climatic changes that occurred during the Cainozoic. The organism here studied is the killifish tribe Cynolebiini, a group of seasonal fishes uniquely inhabiting temporary pools formed during the rainy seasons. The Cynolebiini are found in open vegetation areas inserted in the main tropical and subtropical South American phytogeographical regions east of the Andes. Here, we present the first molecular phylogeny sampling all the eight genera of the Cynolebiini, using fragments of two mitochondrial and four nuclear genes for 35 species of Cynolebiini plus 19 species as outgroups. The dataset, 4448bp, was analysed under Bayesian and maximum likelihood approaches, providing a relatively well solved tree, which retrieves high support values for the Cynolebiini and most included clades. The resulting tree was used to estimate the time of divergence in included lineages using two cyprinodontiform fossils to calibrate the tree. We further investigated historical biogeography through the likelihood-based DEC model. Our estimates indicate that divergence between the clades comprising New World and Old World aplocheiloids occurred during the Eocene, about 50Mya, much more recent than the Gondwanan fragmentation scenario assumed in previous studies. This estimation is nearly synchronous to estimated splits involving other South American and African vertebrate clades, which have been explained by transoceanic dispersal through an ancient Atlantic island chain during the Palaeogene. We estimate that Cynolebiini split from its sister group Cynopoecilini in the Oligocene, about 25Mya and that Cynolebiini started to diversify giving origin to the present genera during the Miocene, about 20-14Mya. The Cynolebiini had an ancestral origin in the Atlantic Forest and probably were not present in the open vegetation formations of central and northeastern South America until the Middle Miocene, when expansion of dry open vegetation was favoured by cool temperatures and strike seasonality. Initial splitting between the genera Cynolebias and Simpsonichthys during the Miocene (about 14Mya) is attributed to the uplift of the Central Brazilian Plateau.
Collapse
|
44
|
Direct estimation of the spontaneous mutation rate by short-term mutation accumulation lines in Chironomus riparius. Evol Lett 2017; 1:86-92. [PMID: 30283641 PMCID: PMC6121839 DOI: 10.1002/evl3.8] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Revised: 03/10/2017] [Accepted: 04/03/2017] [Indexed: 12/15/2022] Open
Abstract
Mutations are the ultimate basis of evolution, yet their occurrence rate is known only for few species. We directly estimated the spontaneous mutation rate and the mutational spectrum in the nonbiting midge C. riparius with a new approach. Individuals from ten mutation accumulation lines over five generations were deep genome sequenced to count de novo mutations that were not present in a pool of F1 individuals, representing parental genotypes. We identified 51 new single site mutations of which 25 were insertions or deletions and 26 single nucleotide mutations. This shift in the mutational spectrum compared to other organisms was explained by the high A/T content of the species. We estimated a haploid mutation rate of 2.1 × 10-9 (95% confidence interval: 1.4 × 10-9 - 3.1 × 10-9) that is in the range of recent estimates for other insects and supports the drift barrier hypothesis. We show that accurate mutation rate estimation from a high number of observed mutations is feasible with moderate effort even for nonmodel species.
Collapse
|
45
|
Correlated evolutionary rates across genomic compartments in Annonaceae. Mol Phylogenet Evol 2017; 114:63-72. [PMID: 28578201 DOI: 10.1016/j.ympev.2017.05.026] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2017] [Revised: 05/29/2017] [Accepted: 05/29/2017] [Indexed: 11/28/2022]
Abstract
The molecular clock hypothesis is an important concept in biology. Deviations from a constant rate of nucleotide substitution have been found widely among lineages, genomes, genes and individual sites. Phylogenetic research can accommodate for these differences in applying specific models of evolution. Lineage-specific rate heterogeneity however can generate bi- or multimodal distributions of substitution rates across the branches of a tree and this may mislead phylogenetic inferences with currently available models. The plant family Annonaceae is an excellent case to study lineage-specific rate heterogeneity. The two major sister subfamilies, Annonoideae and Malmeoideae, have shown great discrepancies in branch lengths. We used high-throughput sequencing data of 72 genes, 99 spacers and 16 introns from 24 chloroplast genomes and nuclear ribosomal DNA of 23 species to study the molecular rate of evolution in Annonaceae. In all analyses, longer branch lengths and/or higher substitution rates were found for the Annonoideae compared to the Malmeoideae. The Annonaceae had wide variability in chloroplast length, ranging from minimal 175,684bp to 201,723 for Annonoideae and minimal 152,357 to 170,985bp in Malmeoideae, mostly reflecting variation in inverted-repeat length. The Annonoideae showed a higher GC-content in the conserved parts of the chloroplast genome and higher omega (dN/dS)-ratios than the Malmeoideae, which could indicate less stringent purifying selection, a pattern that has been found in groups with small population sizes. This study generates new insights into the processes causing lineage-specific rate heterogeneity, which could lead to improved phylogenetic methods.
Collapse
|
46
|
Genetic Stability and Evolution of the sigB Allele, Used for Listeria Sensu Stricto Subtyping and Phylogenetic Inference. Appl Environ Microbiol 2017; 83:AEM.00306-17. [PMID: 28389543 DOI: 10.1128/aem.00306-17] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Accepted: 04/03/2017] [Indexed: 11/20/2022] Open
Abstract
Sequencing of single genes remains an important tool that allows the rapid classification of bacteria. Sequencing of a portion of sigB, which encodes a stress-responsive alternative sigma factor, has emerged as a commonly used molecular tool for the initial characterization of diverse Listeria isolates. In this study, evolutionary approaches were used to assess the validity of sigB allelic typing for Listeria For a data set of 4,280 isolates, sigB allelic typing showed a Simpson's index of diversity of 0.96. Analyses of 164 sigB allelic types (ATs) found among the 6 Listeriasensu stricto species, representing these 4,280 isolates, indicate that neither frequent homologous recombination nor positive selection significantly contributed to the evolution of sigB, confirming its genetic stability. The molecular clock test provided evidence for unequal evolution rates across clades; Listeria welshimeri displayed the lowest sigB diversity and was the only species in which sigB evolved in a clocklike manner, implying a unique natural history. Among the four L. monocytogenes lineages, sigB evolution followed a molecular clock only in lineage IV. Moreover, sigB displayed a significant negative Tajima D value in lineage II, suggesting a recent population bottleneck followed by lineage expansion. The absence of positive selection along with the violation of the molecular clock suggested a nearly neutral mechanism of Listeriasensu strictosigB evolution. While comparison with a whole-genome sequence-based phylogeny revealed that the sigB phylogeny did not correctly reflect the ancestry of L. monocytogenes lineage IV, the availability of a large sigB AT database allowed accurate species classification.IMPORTANCEsigB allelic typing has been widely used for species delineation and subtyping of Listeria However, an informative evaluation of this method from an evolutionary perspective was missing. Our data indicate that the genetic stability of sigB is affected by neither frequent homologous recombination nor positive selection, which supports that sigB allelic typing provides reliable subtyping and classification of Listeria sensu stricto strains. However, multigene data are required for accurate phylogeny reconstruction of Listeria This study thus contributes to a better understanding of the evolution of sigB and confirms the robustness of the sigB subtyping system for Listeria.
Collapse
|
47
|
The impacts of drift and selection on genomic evolution in insects. PeerJ 2017; 5:e3241. [PMID: 28462044 PMCID: PMC5410144 DOI: 10.7717/peerj.3241] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Accepted: 03/28/2017] [Indexed: 11/20/2022] Open
Abstract
Genomes evolve through a combination of mutation, drift, and selection, all of which act heterogeneously across genes and lineages. This leads to differences in branch-length patterns among gene trees. Genes that yield trees with the same branch-length patterns can be grouped together into clusters. Here, we propose a novel phylogenetic approach to explain the factors that influence the number and distribution of these gene-tree clusters. We apply our method to a genomic dataset from insects, an ancient and diverse group of organisms. We find some evidence that when drift is the dominant evolutionary process, each cluster tends to contain a large number of fast-evolving genes. In contrast, strong negative selection leads to many distinct clusters, each of which contains only a few slow-evolving genes. Our work, although preliminary in nature, illustrates the use of phylogenetic methods to shed light on the factors driving rate variation in genomic evolution.
Collapse
|
48
|
A new sequence data set of SSU rRNA gene for Scleractinia and its phylogenetic and ecological applications. Mol Ecol Resour 2017; 17:1054-1071. [DOI: 10.1111/1755-0998.12640] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2016] [Revised: 11/01/2016] [Accepted: 11/15/2016] [Indexed: 11/29/2022]
|
49
|
Nonreceding hare lines: genetic continuity since the Late Pleistocene in European mountain hares (Lepus timidus). Biol J Linn Soc Lond 2017. [DOI: 10.1093/biolinnean/blw009] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
50
|
A Possible Role for Stochastic Astrophysical Ionizing Radiation Events in the Systematic Disparity between Molecular and Fossil Dates. ASTROBIOLOGY 2017; 17:87-90. [PMID: 28026990 DOI: 10.1089/ast.2016.1527] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Major discrepancies have been noted for some time between fossil ages and molecular divergence dates for a variety of taxa. Recently, systematic trends within avian clades have been uncovered. The trends show that the disparity is much larger for mitochondrial DNA than for nuclear DNA, also that it is larger for crown fossil dates than stem fossil dates. It has been argued that this pattern is largely inconsistent with incompleteness of the fossil record as the principal driver of the disparity. A case is presented that, given the expected mutations from a fluctuating background of astrophysical radiation from such sources as supernovae, the rate of molecular clocks is variable and should increase back to a few million years, before returning to the long-term average rate. This is a possible explanation for the disparity. One test of this hypothesis is to look for an acceleration of molecular clocks at 2 to 2.5 Ma due to one or more moderately nearby supernovae known to have happened at that time. Another is to look for reduced disparity in benthic organisms of the deep ocean. In addition, due to the importance of highly penetrating muon irradiation, the disparity should be magnified for megafauna. Key Words: Extreme events in Earth history-Molecular clock-Radiation physics-Evolution. Astrobiology 17, 87-90.
Collapse
|