1
|
Adams R, Lozano JR, Duncan M, Green J, Assis R, DeGiorgio M. A Tale of Too Many Trees: A Conundrum for Phylogenetic Regression. Mol Biol Evol 2025; 42:msaf032. [PMID: 39930867 PMCID: PMC11884811 DOI: 10.1093/molbev/msaf032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Revised: 12/20/2024] [Accepted: 01/21/2025] [Indexed: 03/08/2025] Open
Abstract
Just exactly which tree(s) should we assume when testing evolutionary hypotheses? This question has plagued comparative biologists for decades. Though all phylogenetic comparative methods require input trees, we seldom know with certainty whether even a perfectly estimated tree (if this is possible in practice) is appropriate for our studied traits. Yet, we also know that phylogenetic conflict is ubiquitous in modern comparative biology, and we are still learning about its dangers when testing evolutionary hypotheses. Here, we investigate the consequences of tree-trait mismatch for phylogenetic regression in the presence of gene tree-species tree conflict. Our simulation experiments reveal excessively high false positive rates for mismatched models with both small and large trees, simple and complex traits, and known and estimated phylogenies. In some cases, we find evidence of a directionality of error: assuming a species tree for traits that evolved according to a gene tree sometimes fares worse than the opposite. We also explored the impacts of tree choice using an expansive, cross-species gene expression dataset as an arguably "best-case" scenario in which one may have a better chance of matching tree with trait. Offering a potential path forward, we found promise in the application of a robust estimator as a potential, albeit imperfect, solution to some issues raised by tree mismatch. Collectively, our results emphasize the importance of careful study design for comparative methods, highlighting the need to fully appreciate the role of accurate and thoughtful phylogenetic modeling.
Collapse
Affiliation(s)
- Richard Adams
- Department of Entomology and Plant Pathology, University of Arkansas, Fayetteville, AR, USA
- Center for Agricultural Data Analytics, University of Arkansas, Fayetteville, AR, USA
| | - Jenniffer Roa Lozano
- Department of Entomology and Plant Pathology, University of Arkansas, Fayetteville, AR, USA
- Center for Agricultural Data Analytics, University of Arkansas, Fayetteville, AR, USA
| | - Mataya Duncan
- Department of Entomology and Plant Pathology, University of Arkansas, Fayetteville, AR, USA
- Center for Agricultural Data Analytics, University of Arkansas, Fayetteville, AR, USA
| | - Jack Green
- Department of Entomology and Plant Pathology, University of Arkansas, Fayetteville, AR, USA
- Center for Agricultural Data Analytics, University of Arkansas, Fayetteville, AR, USA
| | - Raquel Assis
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA
- Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, FL, USA
| | - Michael DeGiorgio
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA
| |
Collapse
|
2
|
Herrig DK, Ridenbaugh RD, Vertacnik KL, Everson KM, Sim SB, Geib SM, Weisrock DW, Linnen CR. Whole Genomes Reveal Evolutionary Relationships and Mechanisms Underlying Gene-Tree Discordance in Neodiprion Sawflies. Syst Biol 2024; 73:839-860. [PMID: 38970484 DOI: 10.1093/sysbio/syae036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 07/04/2024] [Accepted: 07/05/2024] [Indexed: 07/08/2024] Open
Abstract
Rapidly evolving taxa are excellent models for understanding the mechanisms that give rise to biodiversity. However, developing an accurate historical framework for comparative analysis of such lineages remains a challenge due to ubiquitous incomplete lineage sorting (ILS) and introgression. Here, we use a whole-genome alignment, multiple locus-sampling strategies, and summary-tree and single nucleotide polymorphism-based species-tree methods to infer a species tree for eastern North American Neodiprion species, a clade of pine-feeding sawflies (Order: Hymenopteran; Family: Diprionidae). We recovered a well-supported species tree that-except for three uncertain relationships-was robust to different strategies for analyzing whole-genome data. Nevertheless, underlying gene-tree discordance was high. To understand this genealogical variation, we used multiple linear regression to model site concordance factors estimated in 50-kb windows as a function of several genomic predictor variables. We found that site concordance factors tended to be higher in regions of the genome with more parsimony-informative sites, fewer singletons, less missing data, lower GC content, more genes, lower recombination rates, and lower D-statistics (less introgression). Together, these results suggest that ILS, introgression, and genotyping error all shape the genomic landscape of gene-tree discordance in Neodiprion. More generally, our findings demonstrate how combining phylogenomic analysis with knowledge of local genomic features can reveal mechanisms that produce topological heterogeneity across genomes.
Collapse
Affiliation(s)
- Danielle K Herrig
- Department of Biology, University of Kentucky, 195 Huguelet Dr., Lexington, KY 40508, USA
| | - Ryan D Ridenbaugh
- Department of Biology, University of Kentucky, 195 Huguelet Dr., Lexington, KY 40508, USA
| | - Kim L Vertacnik
- Department of Biology, University of Kentucky, 195 Huguelet Dr., Lexington, KY 40508, USA
| | - Kathryn M Everson
- Department of Natural Resources and Environmental Science, University of Nevada, 1664 N. Virginia St., Reno, NV 89557, USA
- Department of Integrative Biology, Oregon State University, 4575 SW Research Way, Corvallis, OR 97333, USA
| | - Sheina B Sim
- USDA-ARS Daniel K. Inouye US Pacific Basin Agricultural Research Center, Tropical Pest Genetics and Molecular Biology Research Unit, 64 Nowelo St., Hilo, HI 96720, USA
| | - Scott M Geib
- USDA-ARS Daniel K. Inouye US Pacific Basin Agricultural Research Center, Tropical Pest Genetics and Molecular Biology Research Unit, 64 Nowelo St., Hilo, HI 96720, USA
| | - David W Weisrock
- Department of Biology, University of Kentucky, 195 Huguelet Dr., Lexington, KY 40508, USA
| | - Catherine R Linnen
- Department of Biology, University of Kentucky, 195 Huguelet Dr., Lexington, KY 40508, USA
| |
Collapse
|
3
|
Braichenko S, Borges R, Kosiol C. Polymorphism-Aware Models in RevBayes: Species Trees, Disentangling Balancing Selection, and GC-Biased Gene Conversion. Mol Biol Evol 2024; 41:msae138. [PMID: 38980178 PMCID: PMC11272101 DOI: 10.1093/molbev/msae138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 04/19/2024] [Accepted: 07/06/2024] [Indexed: 07/10/2024] Open
Abstract
The role of balancing selection is a long-standing evolutionary puzzle. Balancing selection is a crucial evolutionary process that maintains genetic variation (polymorphism) over extended periods of time; however, detecting it poses a significant challenge. Building upon the Polymorphism-aware phylogenetic Models (PoMos) framework rooted in the Moran model, we introduce a PoMoBalance model. This novel approach is designed to disentangle the interplay of mutation, genetic drift, and directional selection (GC-biased gene conversion), along with the previously unexplored balancing selection pressures on ultra-long timescales comparable with species divergence times by analyzing multi-individual genomic and phylogenetic divergence data. Implemented in the open-source RevBayes Bayesian framework, PoMoBalance offers a versatile tool for inferring phylogenetic trees as well as quantifying various selective pressures. The novel aspect of our approach in studying balancing selection lies in polymorphism-aware phylogenetic models' ability to account for ancestral polymorphisms and incorporate parameters that measure frequency-dependent selection, allowing us to determine the strength of the effect and exact frequencies under selection. We implemented validation tests and assessed the model on the data simulated with SLiM and a custom Moran model simulator. Real sequence analysis of Drosophila populations reveals insights into the evolutionary dynamics of regions subject to frequency-dependent balancing selection, particularly in the context of sex-limited color dimorphism in Drosophila erecta.
Collapse
Affiliation(s)
- Svitlana Braichenko
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
- Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien 1210, Austria
| | - Carolin Kosiol
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
| |
Collapse
|
4
|
Catalan A, Höhna S, Lower SE, Duchen P. Inferring the demographic history of the North American firefly Photinus pyralis. J Evol Biol 2022; 35:1488-1499. [PMID: 36168726 DOI: 10.1111/jeb.14094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 06/13/2022] [Accepted: 07/11/2022] [Indexed: 11/28/2022]
Abstract
The firefly Photinus pyralis inhabits a wide range of latitudinal and ecological niches, with populations living from temperate to tropical habitats. Despite its broad distribution, its demographic history is unknown. In this study, we modelled and inferred different demographic scenarios for North American populations of P. pyralis, which were collected from Texas to New Jersey. We used a combination of ABC techniques (for multi-population/colonization analyses) and likelihood inference (dadi, StairwayPlot2, PoMo) for single-population demographic inference, which proved useful with our RAD data. We uncovered that the most ancestral North American population lays in Texas, which further colonized the Central region of the US and more recently the North Eastern coast. Our study confidently rejects a demographic scenario where the North Eastern populations colonized more southern populations until reaching Texas. To estimate the age of divergence between of P. pyralis, which provides deeper insights into the history of the entire species, we assembled a multi-locus phylogenetic data covering the genus Photinus. We uncovered that the phylogenetic node leading to P. pyralis lies at the end of the Miocene. Importantly, modelling the demographic history of North American P. pyralis serves as a null model of nucleotide diversity patterns in a widespread native insect species, which will serve in future studies for the detection of adaptation events in this firefly species, as well as a comparison for future studies of other North American insect taxa.
Collapse
Affiliation(s)
- Ana Catalan
- Division of Evolutionary Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Sebastian Höhna
- GeoBio-Center, Ludwig-Maximilians-Universität München, Munich, Germany.,Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Sarah E Lower
- Department of Biology, Bucknell University, Lewisburg, PA, USA
| | - Pablo Duchen
- Institute for Organismal and Molecular Evolutionary Biology, Johannes Gutenberg University of Mainz, Mainz, Germany
| |
Collapse
|
5
|
Borges R, Boussau B, Höhna S, Pereira RJ, Kosiol C. Polymorphism‐aware estimation of species trees and evolutionary forces from genomic sequences with
RevBayes. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna Wien Austria
| | - Bastien Boussau
- Université de Lyon, Université Claude Bernard Lyon 1 Villeurbanne France
| | - Sebastian Höhna
- GeoBio‐Center, Ludwig‐Maximilians‐Universität München Munich Germany
- Department of Earth and Environmental Sciences, Paleontology & Geobiology Ludwig‐Maximilians‐Universität München Munich Germany
| | - Ricardo J. Pereira
- Division of Evolutionary Biology, Department of Biology II Ludwig‐Maximilians‐Universität München Martinsried Germany
| | - Carolin Kosiol
- Institut für Populationsgenetik, Vetmeduni Vienna Wien Austria
- Centre for Biological Diversity University of St Andrews St Andrews UK
| |
Collapse
|
6
|
Hao CL, Arken K, Kadir M, Zhang WR, Rong MJ, Wei NW, Liu YJ, Yue C. The complete mitochondrial genomes of Paradiplozoon yarkandense and Paradiplozoon homoion confirm that Diplozoidae evolve at an elevated rate. Parasit Vectors 2022; 15:149. [PMID: 35477556 PMCID: PMC9044634 DOI: 10.1186/s13071-022-05275-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 04/04/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Diplozoidae are monogenean (Monogenea: Polyopisthocotylea) fish parasites characterised by a unique life history: two larvae permanently fuse into an X-shaped "Siamese" organism. Taxonomy and phylogeny of Diplozoidae and Polyopisthocotylea remain unresolved due to the unavailability of molecular markers with sufficiently high resolution. Mitogenomes may be a suitable candidate, but there are currently only 12 available for the Polyopisthocotylea (three for Diplozoidae). The only available study of diplozoid mitogenomes found unique base composition patterns and elevated evolution rates in comparison with other Monogenean mitogenomes. METHODS To further explore their evolution and generate molecular data for evolutionary studies, we sequenced the complete mitogenomes of two Diplozoidae species, Paradiplozoon homoion and Paradiplozoon yarkandense, and conducted a number of comparative mitogenomic analyses with other polyopisthocotyleans. RESULTS We found further evidence that mitogenomes of Diplozoidae evolve at a unique, elevated rate, which was reflected in their exceptionally long branches, large sizes, unique base composition, skews, and very low gene sequence similarity levels between the two newly sequenced species. They also exhibited remarkably large overlaps between some genes. Phylogenetic analysis of Polyopisthocotylea resolved all major taxa as monophyletic, and Mazocraeidea was split into two major clades: (Diplozoidae) + (all four remaining families: Diclidophoridae, Chauhaneidae, Mazocraeidae and Microcotylidae). It also provided further confirmation that the genus Paradiplozoon is paraphyletic and requires a taxonomic revision, so the two species may have to be renamed Indodiplozoon homoion and Diplozoon yarkandense comb. nov. CONCLUSIONS Although our findings indicate that mitogenomes may be a promising tool for resolving the phylogeny of Polyopisthocotylea, elevated evolutionary rates of Diplozoidae may cause phylogenetic artefacts, so future studies should pay caution to this problem. Furthermore, as the reason for their elevated evolution remains unknown, Diplozoidae are a remarkably interesting lineage for other types of evolutionary mitogenomic studies.
Collapse
Affiliation(s)
- Cui-Lan Hao
- College of Veterinary Medicine, Xinjiang Agricultural University, Urumqi, 830052, Xinjiang, China
| | - Kadirden Arken
- College of Veterinary Medicine, Xinjiang Agricultural University, Urumqi, 830052, Xinjiang, China
| | - Munira Kadir
- College of Veterinary Medicine, Xinjiang Agricultural University, Urumqi, 830052, Xinjiang, China
| | - Wen-Run Zhang
- College of Veterinary Medicine, Xinjiang Agricultural University, Urumqi, 830052, Xinjiang, China
| | - Meng-Jie Rong
- College of Veterinary Medicine, Xinjiang Agricultural University, Urumqi, 830052, Xinjiang, China
| | - Nian-Wen Wei
- College of Veterinary Medicine, Xinjiang Agricultural University, Urumqi, 830052, Xinjiang, China
| | - Yan-Jun Liu
- College of Veterinary Medicine, Xinjiang Agricultural University, Urumqi, 830052, Xinjiang, China
| | - Cheng Yue
- College of Veterinary Medicine, Xinjiang Agricultural University, Urumqi, 830052, Xinjiang, China.
| |
Collapse
|