Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles

44
(from Reference Citation Analysis)

Article PDFs (7)

Cited by > 0 (38)

Searched Name

James H Degnan

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Statistics

Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Category

Show more Refine

Number	Citation Analysis
1	Hypernatremia in Hyperglycemia: Clinical Features and Relationship to Fractional Changes in Body Water and Monovalent Cations during Its Development. J Clin Med 2024;13:1957. [PMID: 38610721 PMCID: PMC11012913 DOI: 10.3390/jcm13071957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2024] [Revised: 03/13/2024] [Accepted: 03/22/2024] [Indexed: 04/14/2024] Open Abstract In hyperglycemia, the serum sodium concentration ([Na]S) receives influences from (a) the fluid exit from the intracellular compartment and thirst, which cause [Na]S decreases; (b) osmotic diuresis with sums of the urinary sodium plus potassium concentration lower than the baseline euglycemic [Na]S, which results in a [Na]S increase; and (c), in some cases, gains or losses of fluid, sodium, and potassium through the gastrointestinal tract, the respiratory tract, and the skin. Hyperglycemic patients with hypernatremia have large deficits of body water and usually hypovolemia and develop severe clinical manifestations and significant mortality. To assist with the correction of both the severe dehydration and the hypovolemia, we developed formulas computing the fractional losses of the body water and monovalent cations in hyperglycemia. The formulas estimate varying losses between patients with the same serum glucose concentration ([Glu]S) and [Na]S but with different sums of monovalent cation concentrations in the lost fluids. Among subjects with the same [Glu]S and [Na]S, those with higher monovalent cation concentrations in the fluids lost have higher fractional losses of body water. The sum of the monovalent cation concentrations in the lost fluids should be considered when computing the volume and composition of the fluid replacement for hyperglycemic syndromes. Collapse Key Words hyperglycemia hypernatremia osmotic diuresis potassium in fluids lost sodium in fluids lost Collapse MESH Headings Collapse Grants Collapse
2	The onset of rare earth metallosis begins with renal gadolinium-rich nanoparticles from magnetic resonance imaging contrast agent exposure. Sci Rep 2023;13:2025. [PMID: 36739294 PMCID: PMC9899216 DOI: 10.1038/s41598-023-28666-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 01/23/2023] [Indexed: 02/06/2023] Open Abstract The leitmotifs of magnetic resonance imaging (MRI) contrast agent-induced complications range from acute kidney injury, symptoms associated with gadolinium exposure (SAGE)/gadolinium deposition disease, potentially fatal gadolinium encephalopathy, and irreversible systemic fibrosis. Gadolinium is the active ingredient of these contrast agents, a non-physiologic lanthanide metal. The mechanisms of MRI contrast agent-induced diseases are unknown. Mice were treated with a MRI contrast agent. Human kidney tissues from contrast-naïve and MRI contrast agent-treated patients were obtained and analyzed. Kidneys (human and mouse) were assessed with transmission electron microscopy and scanning transmission electron microscopy with X-ray energy-dispersive spectroscopy. MRI contrast agent treatment resulted in unilamellar vesicles and mitochondriopathy in renal epithelium. Electron-dense intracellular precipitates and the outer rim of lipid droplets were rich in gadolinium and phosphorus. We conclude that MRI contrast agents are not physiologically inert. The long-term safety of these synthetic metal-ligand complexes, especially with repeated use, should be studied further. Collapse Key Words acute kidney injury experimental models of disease metals Collapse MESH Headings Collapse Grants Collapse
3	Trying Out a Million Genes to Find the Perfect Pair with RTIST. Bioinformatics 2022;38:3565-3573. [PMID: 35641003 DOI: 10.1093/bioinformatics/btac349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 05/07/2022] [Accepted: 05/17/2022] [Indexed: 11/13/2022] Open Abstract MOTIVATION Consensus methods can be used for reconstructing a species tree from several gene trees which exhibit incompatible topologies due to incomplete lineage sorting. Motivated by the fact that there are no anomalous rooted gene trees with three taxa and no anomalous unrooted gene trees with four taxa in the multispecies coalescent model, several contemporary methods form the gene tree consensus by finding the median tree with respect to the triplet or quartet distance-i.e., estimate the species tree as the tree which minimizes the sum of triplet or quartet distances to the input gene trees. These methods reformulate the solution to the consensus problem as the solution to a recursively-solved dynamic programming problem. We present an iterative, easily-parallelizable approach to finding the exact median triplet tree, and implement it as an open source software package which can also find suboptimal consensus trees within a specified triplet distance to the gene trees. The most time-consuming step for methods of this type is the creation of a weights array for all possible subtree bipartitions. By grouping the relevant calculations and array update operations of different bipartitions of the same subtree together, this implementation finds the exact median tree of many gene trees faster than comparable methods, has better scaling properties with respect to the number of gene trees, and has a smaller memory footprint. RESULTS RTIST (Rooted Triple Inference of Species Trees) finds the exact median triplet tree of a set of gene trees. Its runtime and memory footprints scale better than existing algorithms. RTIST can resolve all the non-unique median trees, as well as sub-optimal consensus trees within a user-specified triplet distance to the median. Although it is limited in the number of taxa (≤ 20), its runtime changes little when the number of gene trees is changed by several orders of magnitude. AVAILABILITY RTIST is written in C and Python. It is freely available at https://github.com/glebzhelezov/rtist. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
4	Statistical inconsistency of the unrooted minimize deep coalescence criterion. PLoS One 2021;16:e0251107. [PMID: 33970931 PMCID: PMC8109837 DOI: 10.1371/journal.pone.0251107] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Accepted: 04/20/2021] [Indexed: 11/24/2022] Open Abstract Species trees, which describe the evolutionary relationships between species, are often inferred from gene trees, which describe the ancestral relationships between sequences sampled at different loci from the species of interest. A common approach to inferring species trees from gene trees is motivated by supposing that gene tree variation is due to incomplete lineage sorting, also known as deep coalescence. One of the earliest methods motivated by deep coalescence is to find the species tree that minimizes the number of deep coalescent events needed to explain discrepancies between the species tree and input gene trees. This minimize deep coalescence (MDC) criterion can be applied in both rooted and unrooted settings. where either rooted or unrooted gene trees can be used to infer a rooted species tree. Previous work has shown that MDC is statistically inconsistent in the rooted setting, meaning that under a probabilistic model for deep coalescence, the multispecies coalescent, for some species trees, increasing the number of input gene trees does not make the method more likely to return a correct species tree. Here, we obtain analogous results in the unrooted setting, showing conditions leading to inconsistency of the MDC criterion using the multispecies coalescent model with unrooted gene trees for four taxa and five taxa. Collapse Key Words Collapse MESH Headings Algorithms Animals Biological Evolution Computer Simulation Genetic Speciation Humans Models, Genetic Models, Statistical Phylogeny Software Statistics, Nonparametric Collapse Grants R01 GM117590 NIGMS NIH HHS Foundation for the National Institutes of Health Collapse
5	Heuristics for unrooted, unranked, and ranked anomaly zones under birth-death models. Mol Phylogenet Evol 2021;161:107162. [PMID: 33831548 DOI: 10.1016/j.ympev.2021.107162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2019] [Revised: 10/21/2020] [Accepted: 03/23/2021] [Indexed: 10/21/2022] Abstract Species trees that can generate a nonmatching gene tree topology that is more probable than the topology matching the species tree are said to be in an anomaly zone. We introduce some heuristic approaches to infer whether species trees are in anomaly zones when it is difficult or impossible to compute the entire distribution of gene tree topologies. Here, probabilities of unrooted, unranked, and ranked gene tree topologies under the multispecies coalescent are used. A ranked tree can be viewed as an unranked tree with a temporal ordering of its internal nodes. Overall, considering probabilities of unrooted or unranked gene tree topologies within one nearest neighbor interchange from the species tree topology is a reasonable heuristic to infer the existence of anomalous unrooted or unranked gene trees, respectively. We investigated a test proposed by Linkem et al. (2016) which classifies a species tree as being in an unranked anomaly zone if there is a subset of four taxa in an unranked anomaly zone. We find this test to have high true positive rates, but it can also have high false positive rates. For ranked trees, because at least one of the most probable ranked gene tree topologies must have the same unranked topology as the species tree, we propose to use only those ranked gene trees that have topologies that match the unranked species tree topology. We find that the probability that the species tree is in unrooted and unranked anomaly zones tends to increase with the speciation rate, and the probability of all three types of anomaly zones increases rapidly with the number of taxa. We find that probabilities that species trees are in an anomaly zone can be quite high for moderately high speciation rates. Collapse Key Words Anomaly zones Coalescent Gene trees Heuristic methods Phylogeny Species trees Collapse MESH Headings Collapse Grants Collapse
6	Probabilities of Unranked and Ranked Anomaly Zones under Birth-Death Models. Mol Biol Evol 2021;37:1480-1494. [PMID: 31860090 DOI: 10.1093/molbev/msz305] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open Abstract A labeled gene tree topology that is more probable than the labeled gene tree topology matching a species tree is called "anomalous." Species trees that can generate such anomalous gene trees are said to be in the "anomaly zone." Here, probabilities of "unranked" and "ranked" gene tree topologies under the multispecies coalescent are considered. A ranked tree depicts not only the topological relationship among gene lineages, as an unranked tree does, but also the sequence in which the lineages coalesce. In this article, we study how the parameters of a species tree simulated under a constant-rate birth-death process can affect the probability that the species tree lies in the anomaly zone. We find that with more than five taxa, it is possible for species trees to have both anomalous unranked and ranked gene trees. The probability of being in either type of anomaly zone increases with more taxa. The probability of anomalous gene trees also increases with higher speciation rates. We observe that the probabilities of unranked anomaly zones are higher and grow much faster than those of ranked anomaly zones as the speciation rate increases. Our simulation shows that the most probable ranked gene tree is likely to have the same unranked topology as the species tree. We design the software PRANC, which computes probabilities of ranked gene tree topologies given a species tree under the coalescent model. Collapse Key Words anomalous gene tree coalescent gene tree phylogeny species tree Collapse MESH Headings Collapse Grants Collapse
7	PRANC: ML species tree estimation from the ranked gene trees under coalescence. Bioinformatics 2021;36:4819-4821. [PMID: 32609371 DOI: 10.1093/bioinformatics/btaa605] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Revised: 06/16/2020] [Accepted: 06/23/2020] [Indexed: 11/12/2022] Open Abstract SUMMARY PRANC computes the Probabilities of RANked gene tree topologies under the multispecies coalescent. A ranked gene tree is a gene tree accounting for the temporal ordering of internal nodes. PRANC can also estimate the maximum likelihood (ML) species tree from a sample of ranked or unranked gene tree topologies. It estimates the ML tree with estimated branch lengths in coalescent units. AVAILABILITY AND IMPLEMENTATION PRANC is written in C++ and freely available at github.com/anastasiiakim/PRANC. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
8	Meng and Kubatko (2009): Modeling hybridization with coalescence. Theor Popul Biol 2020;133:36-37. [DOI: 10.1016/j.tpb.2019.07.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Revised: 07/06/2019] [Accepted: 07/08/2019] [Indexed: 11/16/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
9	An initiative to implement immediate postpartum long-acting reversible contraception in rural New Mexico. Am J Obstet Gynecol 2020;222:S911.e1-S911.e7. [PMID: 31978431 DOI: 10.1016/j.ajog.2020.01.027] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 01/10/2020] [Accepted: 01/13/2020] [Indexed: 12/14/2022] Abstract BACKGROUND Over the past decade, many states have developed approaches to reimburse for immediate postpartum long-acting reversible contraception. Despite expanded coverage, few hospitals offer immediate postpartum long-acting reversible contraception. OBJECTIVES Immediate postpartum long-acting reversible contraception implementation is complex and requires a committed multidisciplinary team. After New Mexico Medicaid approved reimbursement for this service, the New Mexico Perinatal Collaborative developed and initiated an evidence-based implementation program containing several components. We sought to evaluate timing of the implementation process and facilitators and barriers to immediate postpartum long-acting reversible contraception in several New Mexico rural hospitals. The primary study outcome was time from New Mexico Perinatal Collaborative program component introduction in each hospital to the hospital's completion of the corresponding implementation step. Secondary outcomes included barriers and facilitators to immediate postpartum contraception implementation. STUDY DESIGN In this mixed-methods study, conducted from April 2017 to May 2018, we completed semistructured questionnaires and interviews with 20 key personnel from 7 New Mexico hospitals that planned to implement immediate postpartum long-acting reversible contraception. The New Mexico Perinatal Collaborative introduced program components to hospitals in a stepped-wedge design. Participants contributed baseline and follow-up data at 4 time periods detailing the steps taken towards program implementation and the timing of step completion at their hospital. Qualitative data were analyzed using directed qualitative content analysis principles based on the Consolidated Framework for Implementation Research. RESULTS Investigators conducted 43 interviews during the 14-month study period. Median time to complete steps toward implementation-patient education, clinician training, nursing education, charge capture, available supplies, and protocols or guidelines-ranged from 7 days for clinician training to 357 days to develop patient education materials. Facilitators of immediate postpartum contraception readiness were local hospital clinical champions and institutional administrative and financial stability. Of the 7 hospitals, 4 completed all Perinatal Collaborative implementation program components and 3 of those piloted immediate postpartum long-acting reversible contraception services. Two publicly funded hospitals currently offer immediate postpartum long-acting reversible contraception without verification of payment for the device or insertion. The third hospital piloted the program with 8 contraceptive devices, did not receive reimbursement due to identified flaws in Medicaid billing guidance and does not currently offer the service. The remaining 3 of the 7 hospitals declined to complete the NMPC program; the hospital that completed the program but did not pilot immediate postpartum long-acting reversible contraception did so because Medicaid billing mechanisms were incompatible with their automated billing systems. Participants consistently reported that lack of reimbursement was the major barrier to immediate postpartum long-acting reversible contraception implementation. CONCLUSION Despite the New Mexico Perinatal Collaborative's robust implementation process and hospital engagement, most hospitals did not offer immediate postpartum long-acting reversible contraception over the study period. Reimbursement obstacles prevented full service implementation. Interventions to improve immediate postpartum long-acting reversible contraception access must begin with implementation of seamless billing and reimbursement mechanisms to ensure adequate hospital payments. Collapse Key Words contraceptive implant immediate postpartum LARC implementation science intrauterine device long-acting reversible contraception payment policy postpartum contraception rural hospitals Collapse MESH Headings Collapse Grants Collapse
10	Modeling Hybridization Under the Network Multispecies Coalescent. Syst Biol 2018;67:786-799. [PMID: 29846734 PMCID: PMC6101600 DOI: 10.1093/sysbio/syy040] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2017] [Revised: 05/13/2018] [Accepted: 05/16/2018] [Indexed: 11/13/2022] Open Abstract Simultaneously modeling hybridization and the multispecies coalescent is becoming increasingly common, and inference of species networks in this context is now implemented in several software packages. This article addresses some of the conceptual issues and decisions to be made in this modeling, including whether or not to use branch lengths and issues with model identifiability. This article is based on a talk given at a Spotlight Session at Evolution 2017 meeting in Portland, Oregon. This session included several talks about modeling hybridization and gene flow in the presence of incomplete lineage sorting. Other talks given at this meeting are also included in this special issue of Systematic Biology. Collapse Key Words displayed trees gene flow likelihood phylogenetic networks trait evolution Collapse MESH Headings Gene Flow Hybridization, Genetic Models, Genetic Phylogeny Collapse Grants R01 GM117590 NIGMS NIH HHS Collapse
11	Displayed Trees Do Not Determine Distinguishability Under the Network Multispecies Coalescent. Syst Biol 2018;66:283-298. [PMID: 27780899 DOI: 10.1093/sysbio/syw097] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Accepted: 03/08/2016] [Indexed: 11/13/2022] Open Abstract Recent work in estimating species relationships from gene trees has included inferring networks assuming that past hybridization has occurred between species. Probabilistic models using the multispecies coalescent can be used in this framework for likelihood-based inference of both network topologies and parameters, including branch lengths and hybridization parameters. A difficulty for such methods is that it is not always clear whether, or to what extent, networks are identifiable-that is whether there could be two distinct networks that lead to the same distribution of gene trees. For cases in which incomplete lineage sorting occurs in addition to hybridization, we demonstrate a new representation of the species network likelihood that expresses the probability distribution of the gene tree topologies as a linear combination of gene tree distributions given a set of species trees. This representation makes it clear that in some cases in which two distinct networks give the same distribution of gene trees when sampling one allele per species, the two networks can be distinguished theoretically when multiple individuals are sampled per species. This result means that network identifiability is not only a function of the trees displayed by the networks but also depends on allele sampling within species. We additionally give an example in which two networks that display exactly the same trees can be distinguished from their gene trees even when there is only one lineage sampled per species. [gene tree, hybridization, identifiability, maximum likelihood, species tree, phylogeny.]. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
12	Species Tree Inference from Gene Splits by Unrooted STAR Methods. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:337-342. [PMID: 28113601 PMCID: PMC5388605 DOI: 10.1109/tcbb.2016.2604812] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023] Abstract The method was proposed by Liu and Yu to infer a species tree topology from unrooted topological gene trees. While its statistical consistency under the multispecies coalescent model was established only for a four-taxon tree, simulations demonstrated its good performance on gene trees inferred from sequences for many taxa. Here, we prove the statistical consistency of the method for an arbitrarily large species tree. Our approach connects to a generalization of the STAR method of Liu, Pearl, and Edwards, and a previous theoretical analysis of it. We further show utilizes only the distribution of splits in the gene trees, and not their individual topologies. Finally, we discuss how multiple samples per taxon per gene should be handled for statistical consistency. Collapse Key Words coalescent model star algorithm nj_st species tree Collapse MESH Headings Algorithms Computational Biology/methods Computer Simulation Databases, Genetic Genetics, Population Phylogeny Collapse Grants R01 GM117590 NIGMS NIH HHS Collapse
13	Split Probabilities and Species Tree Inference Under the Multispecies Coalescent Model. Bull Math Biol 2017;80:64-103. [PMID: 29127546 DOI: 10.1007/s11538-017-0363-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2017] [Accepted: 10/26/2017] [Indexed: 01/01/2023] Abstract Using topological summaries of gene trees as a basis for species tree inference is a promising approach to obtain acceptable speed on genomic-scale datasets, and to avoid some undesirable modeling assumptions. Here we study the probabilities of splits on gene trees under the multispecies coalescent model, and how their features might inform species tree inference. After investigating the behavior of split consensus methods, we investigate split invariants-that is, polynomial relationships between split probabilities. These invariants are then used to show that, even though a split is an unrooted notion, split probabilities retain enough information to identify the rooted species tree topology for trees of 5 or more taxa, with one possible 6-taxon exception. Collapse Key Words Multispecies coalescent model Species tree identifiability Split probability Collapse MESH Headings Evolution, Molecular Genetic Speciation Linear Models Mathematical Concepts Models, Genetic Phylogeny Probability Collapse Grants R01 GM117590 NIGMS NIH HHS R01 GM117590 NIH HHS Collapse
14	Magnetic Field Measurements inside a Converging Flux Conserver for Magnetized Target Fusion Applications. FUSION SCIENCE AND TECHNOLOGY 2017. [DOI: 10.13182/fst02-a196] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
15	Does Gene Tree Discordance Explain the Mismatch between Macroevolutionary Models and Empirical Patterns of Tree Shape and Branching Times? Syst Biol 2016;65:628-39. [PMID: 26968785 PMCID: PMC4911941 DOI: 10.1093/sysbio/syw019] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Accepted: 03/01/2016] [Indexed: 11/13/2022] Open Abstract Classic null models for speciation and extinction give rise to phylogenies that differ in distribution from empirical phylogenies. In particular, empirical phylogenies are less balanced and have branching times closer to the root compared to phylogenies predicted by common null models. This difference might be due to null models of the speciation and extinction process being too simplistic, or due to the empirical datasets not being representative of random phylogenies. A third possibility arises because phylogenetic reconstruction methods often infer gene trees rather than species trees, producing an incongruity between models that predict species tree patterns and empirical analyses that consider gene trees. We investigate the extent to which the difference between gene trees and species trees under a combined birth-death and multispecies coalescent model can explain the difference in empirical trees and birth-death species trees. We simulate gene trees embedded in simulated species trees and investigate their difference with respect to tree balance and branching times. We observe that the gene trees are less balanced and typically have branching times closer to the root than the species trees. Empirical trees from TreeBase are also less balanced than our simulated species trees, and model gene trees can explain an imbalance increase of up to 8% compared to species trees. However, we see a much larger imbalance increase in empirical trees, about 100%, meaning that additional features must also be causing imbalance in empirical trees. This simulation study highlights the necessity of revisiting the assumptions made in phylogenetic analyses, as these assumptions, such as equating the gene tree with the species tree, might lead to a biased conclusion. Collapse Key Words Birth–death process genealogy multispecies coalescent phylogeny Collapse MESH Headings Collapse Grants Collapse
16	There are no caterpillars in a wicked forest. Theor Popul Biol 2015;105:17-23. [DOI: 10.1016/j.tpb.2015.08.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Revised: 08/26/2015] [Accepted: 08/28/2015] [Indexed: 10/23/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
17	Hybrid-Lambda: simulation of multiple merger and Kingman gene genealogies in species networks and species trees. BMC Bioinformatics 2015;16:292. [PMID: 26373308 PMCID: PMC4571064 DOI: 10.1186/s12859-015-0721-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2015] [Accepted: 08/25/2015] [Indexed: 11/10/2022] Open Abstract BACKGROUND There has been increasing interest in coalescent models which admit multiple mergers of ancestral lineages; and to model hybridization and coalescence simultaneously. RESULTS Hybrid-Lambda is a software package that simulates gene genealogies under multiple merger and Kingman's coalescent processes within species networks or species trees. Hybrid-Lambda allows different coalescent processes to be specified for different populations, and allows for time to be converted between generations and coalescent units, by specifying a population size for each population. In addition, Hybrid-Lambda can generate simulated datasets, assuming the infinitely many sites mutation model, and compute the F ST statistic. As an illustration, we apply Hybrid-Lambda to infer the time of subdivision of certain marine invertebrates under different coalescent processes. CONCLUSIONS Hybrid-Lambda makes it possible to investigate biogeographic concordance among high fecundity species exhibiting skewed offspring distribution. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
18	Robustness to divergence time underestimation when inferring species trees from estimated gene trees. Syst Biol 2013;63:66-82. [PMID: 23988674 DOI: 10.1093/sysbio/syt059] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open Abstract To infer species trees from gene trees estimated from phylogenomic data sets, tractable methods are needed that can handle dozens to hundreds of loci. We examine several computationally efficient approaches-MP-EST, STAR, STEAC, STELLS, and STEM-for inferring species trees from gene trees estimated using maximum likelihood (ML) and Bayesian approaches. Among the methods examined, we found that topology-based methods often performed better using ML gene trees and methods employing coalescent times typically performed better using Bayesian gene trees, with MP-EST, STAR, STEAC, and STELLS outperforming STEM under most conditions. We examine why the STEM tree (also called GLASS or Maximum Tree) is less accurate on estimated gene trees by comparing estimated and true coalescence times, performing species tree inference using simulations, and analyzing a great ape data set keeping track of false positive and false negative rates for inferred clades. We find that although true coalescence times are more ancient than speciation times under the multispecies coalescent model, estimated coalescence times are often more recent than speciation times. This underestimation can lead to increased bias and lack of resolution with increased sampling (either alleles or loci) when gene trees are estimated with ML. The problem appears to be less severe using Bayesian gene-tree estimates. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
19	Species tree inference by the STAR method and its generalizations. J Comput Biol 2013;20:50-61. [PMID: 23294273 DOI: 10.1089/cmb.2012.0101] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022] Open Abstract The multispecies coalescent model describes the generation of gene trees from a rooted metric species tree and thus provides a framework for the inference of species trees from sampled gene trees. We prove that the STAR method of Liu et al. (2009) and generalizations of it, are statistically consistent methods of topological species tree inference under this model. We discuss the impact of gene tree sampling schemes for species tree inference using generalized STAR methods and reinterpret the original STAR as a consensus method based on clades. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
20	Anomalous Unrooted Gene Trees. Syst Biol 2013;62:574-90. [DOI: 10.1093/sysbio/syt023] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
21	Applied magnetic field design for the field reversed configuration compression heating experiment. THE REVIEW OF SCIENTIFIC INSTRUMENTS 2013;84:043507. [PMID: 23635196 DOI: 10.1063/1.4801952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023] Abstract Detailed calculations of the formation, guide, and mirror applied magnetic fields in the FRC compression-heating experiment (FRCHX) were conducted using a commercially available generalized finite element solver, COMSOL Multiphysics(®). In FRCHX, an applied magnetic field forms, translates, and finally captures the FRC in the liner region sufficiently long to enable compression. Large single turn coils generate the fast magnetic fields necessary for FRC formation. Solenoidal coils produce the magnetic field for translation and capture of the FRC prior to liner implosion. Due to the limited FRC lifetime, liner implosion is initiated before the FRC is injected, and the magnetic flux that diffuses into the liner is compressed. Two-dimensional axisymmetric magnetohydrodynamic simulations using MACH2 were used to specify optimal magnetic field characteristics, and this paper describes the simulations conducted to design magnetic field coils and compression hardware for FRCHX. This paper presents the vacuum solution for the magnetic field. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
22	Evaluating variations on the STAR algorithm for relative efficiency and sample sizes needed to reconstruct species trees. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2013:262-272. [PMID: 23424131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/01/2023] Abstract Many methods for inferring species trees from gene trees have been developed when incongruence among gene trees is due to incomplete lineage sorting. A method called STAR (Liu et al, 2009), assigns values to nodes in gene trees based only on topological information and uses the average value of the most recent common ancestor node for each pair of taxa to construct a distance matrix which is then used for clustering taxa into a tree. This method is very efficient computationally, scaling linearly in the number of loci and quadratically in the number of taxa, and in simulations has shown to be highly accurate for moderate to large numbers of loci as well as robust to molecular clock violations and misestimation of gene trees from sequence data. The method is based on a particular choice of numbering nodes in the gene trees; however, other choices for numbering nodes in gene trees can also lead to consistent inference of the species tree. Here, expected values and variances for average pairwise distances and differences between average pairwise distances in the distance matrix constructed by the STAR algorithm are used to analytically evaluate efficiency of different numbering schemes that are variations on the original STAR numbering for small trees. Collapse Key Words Collapse MESH Headings Algorithms Computational Biology Computer Simulation Models, Genetic Phylogeny Collapse Grants Collapse
23	A characterization of the set of species trees that produce anomalous ranked gene trees. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012;9:1558-1568. [PMID: 22868677 DOI: 10.1109/tcbb.2012.110] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023] Abstract Ranked gene trees, which consider both the gene tree topology and the sequence in which gene lineages separate, can potentially provide a new source of information for use in modeling genealogies and performing inference of species trees. Recently,we have calculated the probability distribution of ranked gene trees under the standard multispecies coalescent model for the evolution of gene lineages along the branches of a fixed species tree, demonstrating the existence of anomalous ranked gene trees (ARGTs), in which a ranked gene tree that does not match the ranked species tree can have greater probability under the model than the matching ranked gene tree. Here, we fully characterize the set of unranked species tree topologies that give rise to ARGTs, showing that this set contains all species tree topologies with five or more taxa, with the exceptions of caterpillars and pseudocaterpillars. The results have implications for the use of ranked gene trees in phylogenetic inference. Collapse Key Words Collapse MESH Headings Computational Biology/methods Genealogy and Heraldry Genes Models, Genetic Phylogeny Collapse Grants Collapse
24	Multiple merger gene genealogies in two species: Monophyly, paraphyly, and polyphyly for two examples of Lambda coalescents. Theor Popul Biol 2012;82:117-30. [PMID: 22613063 DOI: 10.1016/j.tpb.2012.05.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2012] [Revised: 05/07/2012] [Accepted: 05/09/2012] [Indexed: 10/28/2022] Abstract Probabilities of monophyly, paraphyly, and polyphyly of two-species gene genealogies are computed for modest sample sizes and compared for two different Λ coalescent processes. Coalescent processes belonging to the Λ coalescent family admit asynchronous multiple mergers of active ancestral lineages. Assigning a timescale to the time of divergence becomes a central issue when different populations have different coalescent processes running on different timescales. Clade probabilities in single populations are also computed, which can be useful for testing for taxonomic distinctiveness of an observed set of monophyletic lineages. The coalescence rates of multiple merger coalescent processes are functions of coalescent parameters. The effect of coalescent parameters on the probabilities studied depends on the coalescent process, and if the population is ancestral or derived. The probability of reciprocal monophyly tends to be somewhat lower, when associated with a Λ coalescent, under the null hypothesis that two groups come from the same population. However, even for fairly recent divergence times, the probability of monophyly tends to be higher as a function of the number of generations for coalescent processes that admit multiple mergers, and is sensitive to the parameter of one of the example processes. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
25	A polynomial time algorithm for calculating the probability of a ranked gene tree given a species tree. Algorithms Mol Biol 2012;7:7. [PMID: 22546066 PMCID: PMC3637458 DOI: 10.1186/1748-7188-7-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2011] [Accepted: 04/02/2012] [Indexed: 11/17/2022] Open Abstract Background The ancestries of genes form gene trees which do not necessarily have the same topology as the species tree due to incomplete lineage sorting. Available algorithms determining the probability of a gene tree given a species tree require exponential computational runtime. Results In this paper, we provide a polynomial time algorithm to calculate the probability of a ranked gene tree topology for a given species tree, where a ranked tree topology is a tree topology with the internal vertices being ordered. The probability of a gene tree topology can thus be calculated in polynomial time if the number of orderings of the internal vertices is a polynomial number. However, the complexity of calculating the probability of a gene tree topology with an exponential number of rankings for a given species tree remains unknown. Conclusions Polynomial algorithms for calculating ranked gene tree probabilities may become useful in developing methodology to infer species trees based on a collection of gene trees, leading to a more accurate reconstruction of ancestral species relationships. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
26	The probability of a gene tree topology within a phylogenetic network with applications to hybridization detection. PLoS Genet 2012;8:e1002660. [PMID: 22536161 PMCID: PMC3330115 DOI: 10.1371/journal.pgen.1002660] [Citation(s) in RCA: 130] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2011] [Accepted: 03/05/2012] [Indexed: 11/29/2022] Open Abstract Gene tree topologies have proven a powerful data source for various tasks, including species tree inference and species delimitation. Consequently, methods for computing probabilities of gene trees within species trees have been developed and widely used in probabilistic inference frameworks. All these methods assume an underlying multispecies coalescent model. However, when reticulate evolutionary events such as hybridization occur, these methods are inadequate, as they do not account for such events. Methods that account for both hybridization and deep coalescence in computing the probability of a gene tree topology currently exist for very limited cases. However, no such methods exist for general cases, owing primarily to the fact that it is currently unknown how to compute the probability of a gene tree topology within the branches of a phylogenetic network. Here we present a novel method for computing the probability of gene tree topologies on phylogenetic networks and demonstrate its application to the inference of hybridization in the presence of incomplete lineage sorting. We reanalyze a Saccharomyces species data set for which multiple analyses had converged on a species tree candidate. Using our method, though, we show that an evolutionary hypothesis involving hybridization in this group has better support than one of strict divergence. A similar reanalysis on a group of three Drosophila species shows that the data is consistent with hybridization. Further, using extensive simulation studies, we demonstrate the power of gene tree topologies at obtaining accurate estimates of branch lengths and hybridization probabilities of a given phylogenetic network. Finally, we discuss identifiability issues with detecting hybridization, particularly in cases that involve extinction or incomplete sampling of taxa. Species trees depict how species split and diverge. Within the branches of a species tree, gene trees, which depict the evolutionary histories of different genomic regions in the species, grow. Evolutionary analyses of the genomes of closely related organisms have highlighted the phenomenon that gene trees may disagree with each other as well as with the species tree that contains them due to deep coalescence. Furthermore, for several groups of organisms, hybridization plays an important role in their evolution and diversification. This evolutionary event also results in gene tree incongruence and gives rise to a species phylogeny that is a network. Thus, inferring the evolutionary histories of groups of organisms where hybridization is known, or suspected, to play an evolutionary role requires dealing simultaneously with hybridization and other sources of gene tree incongruence. Currently, no methods exist for doing this with general scenarios of hybridization. In this paper, we propose the first method for this task and demonstrate its performance. We revisit the analysis of a set of yeast species and another of Drosophila species, and show that evolutionary histories involving hybridization have higher support than the strictly diverging evolutionary histories estimated when not incorporating hybridization in the analysis. Collapse Key Words Collapse MESH Headings Algorithms Animals Biological Evolution Drosophila/classification Drosophila/genetics Hybridization, Genetic Models, Theoretical Phylogeny Probability Saccharomyces/classification Saccharomyces/genetics Collapse Grants R01 LM009494 NLM NIH HHS EF-0832858 PHS HHS R01LM009494 NLM NIH HHS Collapse
27	Determining species tree topologies from clade probabilities under the coalescent. J Theor Biol 2011;289:96-106. [DOI: 10.1016/j.jtbi.2011.08.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2011] [Revised: 06/08/2011] [Accepted: 08/02/2011] [Indexed: 11/28/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
28	Clades, clans, and reciprocal monophyly under neutral evolutionary models. Theor Popul Biol 2011;79:220-7. [PMID: 21420994 DOI: 10.1016/j.tpb.2011.03.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2010] [Revised: 03/08/2011] [Accepted: 03/09/2011] [Indexed: 10/18/2022] Abstract The Yule model and the coalescent model are two neutral stochastic models for generating trees in phylogenetics and population genetics, respectively. Although these models are quite different, they lead to identical distributions concerning the probability that pre-specified groups of taxa form monophyletic groups (clades) in the tree. We extend earlier work to derive exact formulae for the probability of finding one or more groups of taxa as clades in a rooted tree, or as 'clans' in an unrooted tree. Our findings are relevant for calculating the statistical significance of observed monophyly and reciprocal monophyly in phylogenetics. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
29	Coalescent histories on phylogenetic networks and detection of hybridization despite incomplete lineage sorting. Syst Biol 2011;60:138-49. [PMID: 21248369 DOI: 10.1093/sysbio/syq084] [Citation(s) in RCA: 144] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open Abstract Analyses of the increasingly available genomic data continue to reveal the extent of hybridization and its role in the evolutionary diversification of various groups of species. We show, through extensive coalescent-based simulations of multilocus data sets on phylogenetic networks, how divergence times before and after hybridization events can result in incomplete lineage sorting with gene tree incongruence signatures identical to those exhibited by hybridization. Evolutionary analysis of such data under the assumption of a species tree model can miss all hybridization events, whereas analysis under the assumption of a species network model would grossly overestimate hybridization events. These issues necessitate a paradigm shift in evolutionary analysis under these scenarios, from a model that assumes a priori a single source of gene tree incongruence to one that integrates multiple sources in a unifying framework. We propose a framework of coalescence within the branches of a phylogenetic network and show how this framework can be used to detect hybridization despite incomplete lineage sorting. We apply the model to simulated data and show that the signature of hybridization can be revealed as long as the interval between the divergence times of the species involved in hybridization is not too small. We reanalyze a data set of 106 loci from 7 in-group Saccharomyces species for which a species tree with no hybridization has been reported in the literature. Our analysis supports the hypothesis that hybridization occurred during the evolution of this group, explaining a large amount of the incongruence in the data. Our findings show that an integrative approach to gene tree incongruence and its reconciliation is needed. Our framework will help in systematically analyzing genomic data for the occurrence of hybridization and elucidating its evolutionary role. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
30	Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent. J Math Biol 2010;62:833-62. [PMID: 20652704 DOI: 10.1007/s00285-010-0355-7] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2009] [Revised: 06/25/2010] [Indexed: 11/29/2022] Abstract Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals-each with many genes-splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for 5 or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
31	Fast and consistent estimation of species trees using supermatrix rooted triples. Mol Biol Evol 2009;27:552-69. [PMID: 19833741 DOI: 10.1093/molbev/msp250] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open Abstract Concatenated sequence alignments are often used to infer species-level relationships. Previous studies have shown that analysis of concatenated data using maximum likelihood (ML) can produce misleading results when loci have differing gene tree topologies due to incomplete lineage sorting. Here, we develop a polynomial time method that utilizes the modified mincut supertree algorithm to construct an estimated species tree from inferred rooted triples of concatenated alignments. We term this method SuperMatrix Rooted Triple (SMRT) and use the notation SMRT-ML when rooted triples are inferred by ML. We use simulations to investigate the performance of SMRT-ML under Jukes-Cantor and general time-reversible substitution models for four- and five-taxon species trees and also apply the method to an empirical data set of yeast genes. We find that SMRT-ML converges to the correct species tree in many cases in which ML on the full concatenated data set fails to do so. SMRT-ML can be conservative in that its output tree is often partially unresolved for problematic clades. We show analytically that when the species tree is clocklike and mutations occur under the Cavender-Farris-Neyman substitution model, as the number of genes increases, SMRT-ML is increasingly likely to infer the correct species tree even when the most likely gene tree does not match the species tree. SMRT-ML is therefore a computationally efficient and statistically consistent estimator of the species tree when gene trees are distributed according to the multispecies coalescent model. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
32	Gene tree discordance, phylogenetic inference and the multispecies coalescent. Trends Ecol Evol 2009;24:332-40. [PMID: 19307040 DOI: 10.1016/j.tree.2009.01.009] [Citation(s) in RCA: 1072] [Impact Index Per Article: 71.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2008] [Revised: 12/17/2008] [Accepted: 01/05/2009] [Indexed: 01/29/2023] Abstract Collapse Key Words Collapse MESH Headings Genetic Speciation Genetic Variation Models, Biological Phylogeny Collapse Grants Collapse
33	Properties of consensus methods for inferring species trees from gene trees. Syst Biol 2009;58:35-54. [PMID: 20525567 PMCID: PMC2909780 DOI: 10.1093/sysbio/syp008] [Citation(s) in RCA: 122] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2008] [Revised: 07/07/2008] [Accepted: 10/22/2008] [Indexed: 11/14/2022] Open Abstract Consensus methods provide a useful strategy for summarizing information from a collection of gene trees. An important application of consensus methods is to combine gene trees to estimate a species tree. To investigate the theoretical properties of consensus trees that would be obtained from large numbers of loci evolving according to a basic evolutionary model, we construct consensus trees from rooted gene trees that occur in proportion to gene-tree probabilities derived from coalescent theory. We consider majority-rule, rooted triple (R()), and greedy consensus trees obtained from known, rooted gene trees, both in the asymptotic case as numbers of gene trees approach infinity and for finite numbers of genes. Our results show that for some combinations of species-tree branch lengths, increasing the number of independent loci can make the rooted majority-rule consensus tree more likely to be at least partially unresolved. However, the probability that the R() consensus tree has the species-tree topology approaches 1 as the number of gene trees approaches infinity. Although the greedy consensus algorithm can be the quickest to converge on the correct species-tree topology when increasing the number of gene trees, it can also be positively misleading. The majority-rule consensus tree is not a misleading estimator of the species-tree topology, and the R() consensus tree is a statistically consistent estimator of the species-tree topology. Our results therefore suggest a method for using multiple loci to infer the species-tree topology, even when it is discordant with the most likely gene tree. Collapse Key Words* anomalous gene tree coalescence discordance lineage sorting phylogenetics statistical consistency Collapse MESH Headings Algorithms Biological Evolution Computational Biology/methods Genetic Speciation Models, Genetic Phylogeny Collapse Grants T32 HG000040 NHGRI NIH HHS T32 GM070449 NIGMS NIH HHS Collapse
34	Genomics and genome-wide association studies: an integrative approach to expression QTL mapping. Genomics 2008;92:129-33. [PMID: 18586451 DOI: 10.1016/j.ygeno.2008.05.012] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2008] [Revised: 05/13/2008] [Accepted: 05/28/2008] [Indexed: 01/29/2023] Abstract Expression QTL mapping by integrating genome-wide gene expression and genotype data is a promising approach to identifying functional genetic variation, but is hampered by the large number of multiple comparisons inherent in such studies. A novel approach to addressing multiple testing problems in genome-wide family-based association studies is screening candidate markers using heritability or conditional power. We apply these methods in settings in which microarray gene expression data are used as phenotypes, screening for SNPs near the expressed genes. We perform association analyses for phenotypes using a univariate approach. We also perform simulations on trios with large numbers of causal SNPs to determine the optimal number of markers to use in a screen. We demonstrate that our family-based screening approach performs well in the analysis of integrative genomic datasets and that screening using either heritability or conditional power produces similar, though not identical, results. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
35	Genotype, haplotype and copy-number variation in worldwide human populations. Nature 2008;451:998-1003. [PMID: 18288195 DOI: 10.1038/nature06742] [Citation(s) in RCA: 611] [Impact Index Per Article: 38.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2007] [Accepted: 01/29/2008] [Indexed: 11/09/2022] Abstract Genome-wide patterns of variation across individuals provide a powerful source of data for uncovering the history of migration, range expansion, and adaptation of the human species. However, high-resolution surveys of variation in genotype, haplotype and copy number have generally focused on a small number of population groups. Here we report the analysis of high-quality genotypes at 525,910 single-nucleotide polymorphisms (SNPs) and 396 copy-number-variable loci in a worldwide sample of 29 populations. Analysis of SNP genotypes yields strongly supported fine-scale inferences about population structure. Increasing linkage disequilibrium is observed with increasing geographic distance from Africa, as expected under a serial founder effect for the out-of-Africa spread of human populations. New approaches for haplotype analysis produce inferences about population structure that complement results based on unphased SNPs. Despite a difference from SNPs in the frequency spectrum of the copy-number variants (CNVs) detected--including a comparatively large number of CNVs in previously unexamined populations from Oceania and the Americas--the global distribution of CNVs largely accords with population structure analyses for SNP data sets of similar size. Our results produce new inferences about inter-population variation, support the utility of CNVs in human population-genetic research, and serve as a genomic resource for human-genetic studies in diverse worldwide populations. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
36	EFBAT: exact family-based association tests. BMC Genet 2007;8:86. [PMID: 18096066 PMCID: PMC2234422 DOI: 10.1186/1471-2156-8-86] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2007] [Accepted: 12/20/2007] [Indexed: 11/18/2022] Open Abstract Background Family-based association tests are important tools for investigating genetic risk factors of complex diseases. These tests are especially valuable for being robust to population structure. We introduce a tool, EFBAT, which performs exact family-based tests of association for X-chromosome and autosomal biallelic markers. Results The program EFBAT extends a network algorithm previously applied to autosomal markers to include the X-chromosome and to perform tests of association under the null hypotheses "no association, no linkage" and "no association in the presence of linkage" under additive, dominant and recessive genetic models. These tests are valid regardless of patterns of missing familial data. Conclusion The general framework for performing exact family-based association tests has been usefully extended to the X-chromosome, particularly for the hypothesis of "no association in the presence of linkage" and for different genetic models. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
37	Inconsistency of phylogenetic estimates from concatenated data under coalescence. Syst Biol 2007;56:17-24. [PMID: 17366134 DOI: 10.1080/10635150601146041] [Citation(s) in RCA: 666] [Impact Index Per Article: 39.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open Abstract Although multiple gene sequences are becoming increasingly available for molecular phylogenetic inference, the analysis of such data has largely relied on inference methods designed for single genes. One of the common approaches to analyzing data from multiple genes is concatenation of the individual gene data to form a single supergene to which traditional phylogenetic inference procedures - e.g., maximum parsimony (MP) or maximum likelihood (ML) - are applied. Recent empirical studies have demonstrated that concatenation of sequences from multiple genes prior to phylogenetic analysis often results in inference of a single, well-supported phylogeny. Theoretical work, however, has shown that the coalescent can produce substantial variation in single-gene histories. Using simulation, we combine these ideas to examine the performance of the concatenation approach under conditions in which the coalescent produces a high level of discord among individual gene trees and show that it leads to statistically inconsistent estimation in this setting. Furthermore, use of the bootstrap to measure support for the inferred phylogeny can result in moderate to strong support for an incorrect tree under these conditions. These results highlight the importance of incorporating variation in gene histories into multilocus phylogenetics. Collapse Key Words Collapse MESH Headings Classification/methods Computer Simulation Genes/genetics Likelihood Functions Models, Genetic Phylogeny Research Design Collapse Grants MH59532 NIMH NIH HHS Collapse
38	Experimental Design of a Magnetic Flux Compression Experiment. JOURNAL OF FUSION ENERGY 2006. [DOI: 10.1007/s10894-006-9038-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
39	Discordance of species trees with their most likely gene trees. PLoS Genet 2006;2:e68. [PMID: 16733550 PMCID: PMC1464820 DOI: 10.1371/journal.pgen.0020068] [Citation(s) in RCA: 565] [Impact Index Per Article: 31.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2006] [Accepted: 03/23/2006] [Indexed: 12/01/2022] Open Abstract Because of the stochastic way in which lineages sort during speciation, gene trees may differ in topology from each other and from species trees. Surprisingly, assuming that genetic lineages follow a coalescent model of within-species evolution, we find that for any species tree topology with five or more species, there exist branch lengths for which gene tree discordance is so common that the most likely gene tree topology to evolve along the branches of a species tree differs from the species phylogeny. This counterintuitive result implies that in combining data on multiple loci, the straightforward procedure of using the most frequently observed gene tree topology as an estimate of the species tree topology can be asymptotically guaranteed to produce an incorrect estimate. We conclude with suggestions that can aid in overcoming this new obstacle to accurate genomic inference of species phylogenies. Different genomic regions evolving along the branches of a tree of species relationships can have different evolutionary histories. Consequently, estimates of species trees from genetic data may be influenced by the particular choice of genomic regions used in an analysis. Recent work has focused on circumventing this problem by combining information from multiple regions to attempt to produce accurate species tree estimates. The authors show that the use of multiple genomic regions for species tree inference is subject to a surprising new difficulty, the problem of “anomalous gene trees.” Not only can individual genes or genomic regions have genealogical histories that differ in shape, or topology, from a species tree, the gene tree topology most likely to evolve can differ from the species tree topology. As a result, the “democratic vote” procedure of using the most frequently observed gene tree topology as an estimate of the species tree topology can converge on the wrong species tree as more genes are added. As it becomes more feasible to simultaneously investigate many regions of a genome, species tree inference algorithms will need to begin taking the problem of anomalous gene trees into consideration. Collapse Key Words Collapse MESH Headings Algorithms Animals Computational Biology/methods Evolution, Molecular Genetic Speciation Genetics Genetics, Population Genomics/methods Humans Models, Genetic Models, Statistical Phylogeny Species Specificity Stochastic Processes Collapse Grants R01 MH059532 NIMH NIH HHS MH59532 NIMH NIH HHS Collapse
40	Gene tree distributions under the coalescent process. Evolution 2005;59:24-37. [PMID: 15792224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023] Abstract Under the coalescent model for population divergence, lineage sorting can cause considerable variability in gene trees generated from any given species tree. In this paper, we derive a method for computing the distribution of gene tree topologies given a bifurcating species tree for trees with an arbitrary number of taxa in the case that there is one gene sampled per species. Applications for gene tree distributions include determining exact probabilities of topological equivalence between gene trees and species trees and inferring species trees from multiple datasets. In addition, we examine the shapes of gene tree distributions and their sensitivity to changes in branch lengths, species tree shape, and tree size. The method for computing gene tree distributions is implemented in the computer program COAL. Collapse Key Words Collapse MESH Headings Genetic Speciation Genetics, Population Models, Genetic Phylogeny Probability Software Collapse Grants Collapse
41	GENE TREE DISTRIBUTIONS UNDER THE COALESCENT PROCESS. Evolution 2005. [DOI: 10.1554/04-385] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
42	GENE TREE DISTRIBUTIONS UNDER THE COALESCENT PROCESS. Evolution 2005. [DOI: 10.1111/j.0014-3820.2005.tb00891.x] [Citation(s) in RCA: 259] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
43	Electromagnetic implosion of spherical liner. PHYSICAL REVIEW LETTERS 1995;74:98-101. [PMID: 10057708 DOI: 10.1103/physrevlett.74.98] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
44	Fast, large-signal, free-standing foil bolometer for measuring ultrasoft x-ray burst fluence. THE REVIEW OF SCIENTIFIC INSTRUMENTS 1979;50:1223. [PMID: 18699364 DOI: 10.1063/1.1135690] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023] Abstract A fast ( approximately 300 ns), large-signal ( greater, similar1 V), free-standing foil bolometer was developed for measuring ultrasoft x-ray burst fluences. The results of bolometer measurements of the radiation output of an imploding foil liner plasma indicate yields of several tens of kJ, assuming isotropic emission. This is in substantial agreement with filtered metal photocathode (x-ray diode) measurements. The bolometer design, response function, and comparison with x-ray photodiode data are discussed. This type of bolometer is particularly applicable to radiation measurements of high-energy, destructive pulsed plasmas such as high-energy imploding liner plasmas. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse