1
|
Rincón-Barrado M, Villaverde T, Perez MF, Sanmartín I, Riina R. The sweet tabaiba or there and back again: phylogeographical history of the Macaronesian Euphorbia balsamifera. ANNALS OF BOTANY 2024; 133:883-904. [PMID: 38197716 PMCID: PMC11082519 DOI: 10.1093/aob/mcae001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 03/01/2024] [Indexed: 01/11/2024]
Abstract
BACKGROUND AND AIMS Biogeographical relationships between the Canary Islands and north-west Africa are often explained by oceanic dispersal and geographical proximity. Sister-group relationships between Canarian and eastern African/Arabian taxa, the 'Rand Flora' pattern, are rare among plants and have been attributed to the extinction of north-western African populations. Euphorbia balsamifera is the only representative species of this pattern that is distributed in the Canary Islands and north-west Africa; it is also one of few species present in all seven islands. Previous studies placed African populations of E. balsamifera as sister to the Canarian populations, but this relationship was based on herbarium samples with highly degraded DNA. Here, we test the extinction hypothesis by sampling new continental populations; we also expand the Canarian sampling to examine the dynamics of island colonization and diversification. METHODS Using target enrichment with genome skimming, we reconstructed phylogenetic relationships within E. balsamifera and between this species and its disjunct relatives. A single nucleotide polymorphism dataset obtained from the target sequences was used to infer population genetic diversity patterns. We used convolutional neural networks to discriminate among alternative Canary Islands colonization scenarios. KEY RESULTS The results confirmed the Rand Flora sister-group relationship between western E. balsamifera and Euphorbia adenensis in the Eritreo-Arabian region and recovered an eastern-western geographical structure among E. balsamifera Canarian populations. Convolutional neural networks supported a scenario of east-to-west island colonization, followed by population extinctions in Lanzarote and Fuerteventura and recolonization from Tenerife and Gran Canaria; a signal of admixture between the eastern island and north-west African populations was recovered. CONCLUSIONS Our findings support the Surfing Syngameon Hypothesis for the colonization of the Canary Islands by E. balsamifera, but also a recent back-colonization to the continent. Populations of E. balsamifera from northwest Africa are not the remnants of an ancestral continental stock, but originated from migration events from Lanzarote and Fuerteventura. This is further evidence that oceanic archipelagos are not a sink for biodiversity, but may be a source of new genetic variability.
Collapse
Affiliation(s)
- Mario Rincón-Barrado
- Real Jardín Botánico (RJB), CSIC, Madrid, 28014, Spain
- Centro Nacional de Biotecnología (CNB), CSIC, Madrid, 28049, Spain
| | - Tamara Villaverde
- Universidad Rey Juan Carlos (URJC), Área de Biodiversidad y Conservación, Móstoles, 28933, Spain
| | - Manolo F Perez
- Institut de Systématique, Evolution, Biodiversité (ISYEB – URM 7205 CNRS), Muséum National d’Histoire Naturelle, SU, EPHE & UA, Paris, France
| | | | - Ricarda Riina
- Real Jardín Botánico (RJB), CSIC, Madrid, 28014, Spain
| |
Collapse
|
2
|
Silva FLD, de Medeiros BAS, Farrell BD. Once upon a fly: The biogeographical odyssey of Labrundinia (Chironomidae, Tanypodinae), an aquatic non-biting midge towards diversification. Mol Phylogenet Evol 2024; 194:108025. [PMID: 38342160 DOI: 10.1016/j.ympev.2024.108025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Revised: 01/17/2024] [Accepted: 02/04/2024] [Indexed: 02/13/2024]
Abstract
Labrundinia is a highly recognizable lineage in the Pentaneurini tribe (Diptera, Chironomidae). The distinct predatory free-swimming larvae of this genus are typically present in unpolluted aquatic environments, such as small streams, ponds, lakes, and bays. They can be found on the bottom mud, clinging to rocks and wood, and dwelling among aquatic vegetation. Labrundinia has been extensively studied in ecological research and comprises 39 species, all but one of which has been described from regions outside the Palearctic. Earlier phylogenetic studies have suggested that the initial diversification of the genus likely occurred in the Neotropical Region, with its current presence in the Nearctic Region and southern South America being the result of subsequent dispersal events. Through the integration of molecular and morphological data in a calibrated phylogeny, we reveal a complex and nuanced evolutionary history for Labrundinia, providing insights into its biogeographical and diversification patterns. In this comprehensive study, we analyze a dataset containing 46 Labrundinia species, totaling 10,662 characters, consisting of 10,616 nucleotide sites and 46 morphological characters. The molecular data was generated mainly by anchored enrichment hybrid methods. Using this comprehensive dataset, we inferred the phylogeny of the group based on a total evidence matrix. Subsequently, we employed the generated tree for time calibration and further analysis of biogeography and diversification patterns. Our findings reveal multiple dispersal events out of the Neotropics, where the group originated in the late Cretaceous approximately 72 million years ago (69-78 Ma). We further reveal that the genus experienced an early burst of diversification rates during the Paleocene, which gradually decelerated towards the present-day. We also find that the Neotropics have played a pivotal role in the evolution of Labrundinia by serving as both a cradle and a museum. By "cradle," we mean that the region has been a hotspot for the origin and diversification of new Labrundinia lineages, while "museum" refers to the region's ability to preserve ancestral lineages over extended periods. In summary, our findings indicate that the Neotropics have been a key source of genetic diversity for Labrundinia, resulting in the development of distinctive adaptations and characteristics within the genus. This evidence highlights the crucial role that these regions have played in shaping the evolutionary trajectory of Labrundinia.
Collapse
Affiliation(s)
- Fabio Laurindo da Silva
- Laboratory of Aquatic Insect Biodiversity and Ecology, Department of Zoology, Institute of Biosciences, University of São Paulo, São Paulo, Brazil; Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, USA.
| | - Bruno A S de Medeiros
- Field Museum of Natural History, Negaunee Integrative Research Center, Chicago, USA; Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, USA
| | - Brian D Farrell
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, USA
| |
Collapse
|
3
|
Raza M, Ortiz EM, Schwung L, Shigita G, Schaefer H. Resolving the phylogeny of Thladiantha (Cucurbitaceae) with three different target capture pipelines. BMC Ecol Evol 2023; 23:75. [PMID: 38087247 PMCID: PMC10714463 DOI: 10.1186/s12862-023-02185-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 12/05/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND Despite recent advances, reliable tools to simultaneously handle different types of sequencing data (e.g., target capture, genome skimming) for phylogenomics are still scarce. Here, we evaluate the performance of the recently developed pipeline Captus in comparison with the well-known target capture pipelines HybPiper and SECAPR. As test data, we analyzed newly generated sequences for the genus Thladiantha (Cucurbitaceae) for which no well-resolved phylogeny estimate has been available so far, as well as simulated reads derived from the genome of Arabidopsis thaliana. RESULTS Our pipeline comparisons are based on (1) the time needed for data assembly and locus extraction, (2) locus recovery per sample, (3) the number of informative sites in nucleotide alignments, and (4) the topology of the nuclear and plastid phylogenies. Additionally, the simulated reads derived from the genome of Arabidopsis thaliana were used to evaluate the accuracy and completeness of the recovered loci. In terms of computation time, locus recovery per sample, and informative sites, Captus outperforms HybPiper and SECAPR. The resulting topologies of Captus and SECAPR are identical for coalescent trees but differ when trees are inferred from concatenated alignments. The HybPiper phylogeny is similar to Captus in both methods. The nuclear genes recover a deep split of Thladiantha in two clades, but this is not supported by the plastid data. CONCLUSIONS Captus is the best choice among the three pipelines in terms of computation time and locus recovery. Even though there is no significant topological difference between the Thladiantha species trees produced by the three pipelines, Captus yields a higher number of gene trees in agreement with the topology of the species tree (i.e., fewer genes in conflict with the species tree topology).
Collapse
Affiliation(s)
- Mustafa Raza
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany
| | - Edgardo M Ortiz
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany
| | - Lea Schwung
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany
| | - Gentaro Shigita
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany
| | - Hanno Schaefer
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany.
| |
Collapse
|
4
|
Pezzini FF, Ferrari G, Forrest LL, Hart ML, Nishii K, Kidner CA. Target capture and genome skimming for plant diversity studies. APPLICATIONS IN PLANT SCIENCES 2023; 11:e11537. [PMID: 37601316 PMCID: PMC10439825 DOI: 10.1002/aps3.11537] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 06/16/2023] [Accepted: 07/10/2023] [Indexed: 08/22/2023]
Abstract
Recent technological advances in long-read high-throughput sequencing and assembly methods have facilitated the generation of annotated chromosome-scale whole-genome sequence data for evolutionary studies; however, generating such data can still be difficult for many plant species. For example, obtaining high-molecular-weight DNA is typically impossible for samples in historical herbarium collections, which often have degraded DNA. The need to fast-freeze newly collected living samples to conserve high-quality DNA can be complicated when plants are only found in remote areas. Therefore, short-read reduced-genome representations, such as target capture and genome skimming, remain important for evolutionary studies. Here, we review the pros and cons of each technique for non-model plant taxa. We provide guidance related to logistics, budget, the genomic resources previously available for the target clade, and the nature of the study. Furthermore, we assess the available bioinformatic analyses, detailing best practices and pitfalls, and suggest pathways to combine newly generated data with legacy data. Finally, we explore the possible downstream analyses allowed by the type of data generated using each technique. We provide a practical guide to help researchers make the best-informed choice regarding reduced genome representation for evolutionary studies of non-model plants in cases where whole-genome sequencing remains impractical.
Collapse
Affiliation(s)
| | - Giada Ferrari
- Royal Botanic Garden Edinburgh Edinburgh United Kingdom
| | | | | | - Kanae Nishii
- Royal Botanic Garden Edinburgh Edinburgh United Kingdom
| | - Catherine A Kidner
- Royal Botanic Garden Edinburgh Edinburgh United Kingdom
- School of Biological Sciences University of Edinburgh Edinburgh United Kingdom
| |
Collapse
|
5
|
Zhou W, Furey NM, Soisook P, Thong VD, Lim BK, Rossiter SJ, Mao X. Diversification and introgression in four chromosomal taxa of the Pearson's horseshoe bat (Rhinolophus pearsoni) group. Mol Phylogenet Evol 2023; 183:107784. [PMID: 37040825 DOI: 10.1016/j.ympev.2023.107784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 03/11/2023] [Accepted: 04/07/2023] [Indexed: 04/13/2023]
Abstract
Chromosomal variation among closely related taxa is common in both plants and animals, and can reduce rates of introgression as well as promote reproductive isolation and speciation. In mammals, studies relating introgression to chromosomal variation have tended to focus on a few model systems and typically characterized levels of introgression using small numbers of loci. Here we took a genome-wide approach to examine how introgression rates vary among four closely related horseshoe bats (Rhinolophus pearsoni group) that possess different diploid chromosome numbers (2n = 42, 44, 46, and 60) resulting from Robertsonian (Rb) changes (fissions/fusions). Using a sequence capture we obtained orthologous loci for thousands of nuclear loci, as well as mitogenomes, and performed phylogenetic and population genetic analyses. We found that the taxon with 2n = 60 was the first to diverge in this group, and that the relationships among the three other taxa (2n = 42, 44 and 46) showed discordance across our different analyses. Our results revealed signatures of multiple ancient introgression events between the four taxa, with evidence of mitonuclar discordance in phylogenetic trees and reticulation events in their evolutionary history. Despite this, we found no evidence of recent and/or ongoing introgression between taxa. Overall, our results indicate that the effects of Rb changes on the reduction of introgression are complicated and that these may contribute to reproductive isolation and speciation in concert with other factors (e.g. phenotypic and genic divergence).
Collapse
Affiliation(s)
- Weiwei Zhou
- School of Ecological and Environmental Sciences, East China Normal University, Shanghai 200062, China
| | - Neil M Furey
- Fauna & Flora International (Cambodia), PO Box 1380, No. 19, Street 360, Boeng Keng Kong 1, Phnom Penh 12000, Cambodia
| | - Pipat Soisook
- Princess Maha Chakri Sirindhorn Natural History Museum, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand
| | - Vu D Thong
- Institute of Ecology and Biological Resources, Vietnam Academy of Science and Technology (VAST), 18 Hoang Quoc Viet Road, Cau Giay District, Hanoi, Viet Nam; Graduate University of Science and Technology, VAST, Viet Nam
| | - Burton K Lim
- Department of Natural History, Royal Ontario Museum, Toronto, Ontario M5S 2C6, Canada
| | - Stephen J Rossiter
- School of Biological and Behavioural Sciences, Queen Mary University of London, London E1 4NS, UK.
| | - Xiuguang Mao
- School of Ecological and Environmental Sciences, East China Normal University, Shanghai 200062, China.
| |
Collapse
|
6
|
Zhang Z, Xie P, Guo Y, Zhou W, Liu E, Yu Y. Easy353: A Tool to Get Angiosperms353 Genes for Phylogenomic Research. Mol Biol Evol 2022; 39:6862883. [PMID: 36458838 PMCID: PMC9757696 DOI: 10.1093/molbev/msac261] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 10/28/2022] [Accepted: 11/29/2022] [Indexed: 12/04/2022] Open
Abstract
The Angiosperms353 gene set (AGS) consists of a set of 353 universal low-copy nuclear genes that were selected by examining more than 600 angiosperm species. These genes can be used for phylogenetic studies and population genetics at multiple taxonomic scales. However, current pipelines are not able to recover Angiosperms353 genes efficiently and accurately from high-throughput sequences. Here, we developed Easy353, a reference-guided assembly tool to recover the AGS from high-throughput sequencing (HTS) data (including genome skimming, RNA-seq, and target enrichment). Easy353 is an open-source user-friendly assembler for diverse types of high-throughput data. It has a graphical user interface and a command-line interface that is compatible with all widely-used computer systems. Evaluations, based on both simulated and empirical data, suggest that Easy353 yields low rates of assembly errors.
Collapse
Affiliation(s)
- Zhen Zhang
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, P. R. China
| | - Pulin Xie
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, P. R. China
| | - Yongling Guo
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, P. R. China
| | - Wenbin Zhou
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Enyan Liu
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, P. R. China
| | - Yan Yu
- Corresponding author: E-mail:
| |
Collapse
|
7
|
Nunes R, Storer C, Doleck T, Kawahara AY, Pierce NE, Lohman DJ. Predictors of sequence capture in a large-scale anchored phylogenomics project. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.943361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023] Open
Abstract
Next-generation sequencing (NGS) technologies have revolutionized phylogenomics by decreasing the cost and time required to generate sequence data from multiple markers or whole genomes. Further, the fragmented DNA of biological specimens collected decades ago can be sequenced with NGS, reducing the need for collecting fresh specimens. Sequence capture, also known as anchored hybrid enrichment, is a method to produce reduced representation libraries for NGS sequencing. The technique uses single-stranded oligonucleotide probes that hybridize with pre-selected regions of the genome that are sequenced via NGS, culminating in a dataset of numerous orthologous loci from multiple taxa. Phylogenetic analyses using these sequences have the potential to resolve deep and shallow phylogenetic relationships. Identifying the factors that affect sequence capture success could save time, money, and valuable specimens that might be destructively sampled despite low likelihood of sequencing success. We investigated the impacts of specimen age, preservation method, and DNA concentration on sequence capture (number of captured sequences and sequence quality) while accounting for taxonomy and extracted tissue type in a large-scale butterfly phylogenomics project. This project used two probe sets to extract 391 loci or a subset of 13 loci from over 6,000 butterfly specimens. We found that sequence capture is a resilient method capable of amplifying loci in samples of varying age (0–111 years), preservation method (alcohol, papered, pinned), and DNA concentration (0.020 ng/μl - 316 ng/ul). Regression analyses demonstrate that sequence capture is positively correlated with DNA concentration. However, sequence capture and DNA concentration are negatively correlated with sample age and preservation method. Our findings suggest that sequence capture projects should prioritize the use of alcohol-preserved samples younger than 20 years old when available. In the absence of such specimens, dried samples of any age can yield sequence data, albeit with returns that diminish with increasing age.
Collapse
|
8
|
Acha S, Majure LC. A New Approach Using Targeted Sequence Capture for Phylogenomic Studies across Cactaceae. Genes (Basel) 2022; 13:genes13020350. [PMID: 35205394 PMCID: PMC8871817 DOI: 10.3390/genes13020350] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 02/04/2022] [Accepted: 02/10/2022] [Indexed: 02/04/2023] Open
Abstract
Relationships within the major clades of Cactaceae are relatively well known based on DNA sequence data mostly from the chloroplast genome. Nevertheless, some nodes along the backbone of the phylogeny, and especially generic and species-level relationships, remain poorly resolved and are in need of more informative genetic markers. In this study, we propose a new approach to solve the relationships within Cactaceae, applying a targeted sequence capture pipeline. We designed a custom probe set for Cactaceae using MarkerMiner and complemented it with the Angiosperms353 probe set. We then tested both probe sets against 36 different transcriptomes using Hybpiper preferentially retaining phylogenetically informative loci and reconstructed the relationships using RAxML-NG and Astral. Finally, we tested each probe set through sequencing 96 accessions, representing 88 species across Cactaceae. Our preliminary analyses recovered a well-supported phylogeny across Cactaceae with a near identical topology among major clade relationships as that recovered with plastome data. As expected, however, we found incongruences in relationships when comparing our nuclear probe set results to plastome datasets, especially at the generic level. Our results reveal great potential for the combination of Cactaceae-specific and Angiosperm353 probe set application to improve phylogenetic resolution for Cactaceae and for other studies.
Collapse
|
9
|
de Lima Ferreira P, Batista R, Andermann T, Groppo M, Bacon CD, Antonelli A. Target sequence capture of Barnadesioideae (Compositae) demonstrates the utility of low coverage loci in phylogenomic analyses. Mol Phylogenet Evol 2022; 169:107432. [DOI: 10.1016/j.ympev.2022.107432] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 12/21/2021] [Accepted: 01/14/2022] [Indexed: 11/26/2022]
|
10
|
Coyotee Howard C, Crowl AA, Harvey TS, Cellinese N. Peeling Back the Layers: First Phylogenomic Insights into the Ledebouriinae (Scilloideae, Asparagaceae). Mol Phylogenet Evol 2022; 169:107430. [DOI: 10.1016/j.ympev.2022.107430] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 01/11/2022] [Accepted: 01/25/2022] [Indexed: 11/26/2022]
|
11
|
Morales-Briones DF, Gehrke B, Huang CH, Liston A, Ma H, Marx HE, Tank DC, Yang Y. Analysis of Paralogs in Target Enrichment Data Pinpoints Multiple Ancient Polyploidy Events in Alchemilla s.l. (Rosaceae). Syst Biol 2021; 71:190-207. [PMID: 33978764 PMCID: PMC8677558 DOI: 10.1093/sysbio/syab032] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 04/28/2021] [Accepted: 05/03/2021] [Indexed: 12/16/2022] Open
Abstract
Target enrichment is becoming increasingly popular for phylogenomic studies. Although baits for enrichment are typically designed to target single-copy genes, paralogs are often recovered with increased sequencing depth, sometimes from a significant proportion of loci, especially in groups experiencing whole-genome duplication (WGD) events. Common approaches for processing paralogs in target enrichment data sets include random selection, manual pruning, and mainly, the removal of entire genes that show any evidence of paralogy. These approaches are prone to errors in orthology inference or removing large numbers of genes. By removing entire genes, valuable information that could be used to detect and place WGD events is discarded. Here, we used an automated approach for orthology inference in a target enrichment data set of 68 species of Alchemilla s.l. (Rosaceae), a widely distributed clade of plants primarily from temperate climate regions. Previous molecular phylogenetic studies and chromosome numbers both suggested ancient WGDs in the group. However, both the phylogenetic location and putative parental lineages of these WGD events remain unknown. By taking paralogs into consideration and inferring orthologs from target enrichment data, we identified four nodes in the backbone of Alchemilla s.l. with an elevated proportion of gene duplication. Furthermore, using a gene-tree reconciliation approach, we established the autopolyploid origin of the entire Alchemilla s.l. and the nested allopolyploid origin of four major clades within the group. Here, we showed the utility of automated tree-based orthology inference methods, previously designed for genomic or transcriptomic data sets, to study complex scenarios of polyploidy and reticulate evolution from target enrichment data sets.[Alchemilla; allopolyploidy; autopolyploidy; gene tree discordance; orthology inference; paralogs; Rosaceae; target enrichment; whole genome duplication.].
Collapse
Affiliation(s)
- Diego F Morales-Briones
- Department of Plant and Microbial Biology, University of Minnesota-Twin Cities, 1445 Gortner Avenue, St. Paul, MN 55108, USA
- Department of Biological Sciences and Institute for Bioinformatics and Evolutionary Studies, University of Idaho, 875 Perimeter Drive MS 3051, Moscow, ID 83844, USA
| | - Berit Gehrke
- University Gardens, University Museum, University of Bergen, Mildeveien 240, 5259 Hjellestad, Norway
| | - Chien-Hsun Huang
- State Key Laboratory of Genetic Engineering and Collaborative Innovation Center of Genetics and Development, Ministry of Education Key Laboratory of Biodiversity and Ecological Engineering, Institute of Plant Biology, Center of Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Aaron Liston
- Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331, USA
| | - Hong Ma
- Department of Biology, the Huck Institute of the Life Sciences, the Pennsylvania State University, 510D Mueller Laboratory, University Park, PA 16802 USA
| | - Hannah E Marx
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109-1048, USA
- Museum of Southwestern Biology and Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA
| | - David C Tank
- Department of Biological Sciences and Institute for Bioinformatics and Evolutionary Studies, University of Idaho, 875 Perimeter Drive MS 3051, Moscow, ID 83844, USA
| | - Ya Yang
- Department of Plant and Microbial Biology, University of Minnesota-Twin Cities, 1445 Gortner Avenue, St. Paul, MN 55108, USA
| |
Collapse
|
12
|
Bangs MR, Steppan SJ. A rodent anchored hybrid enrichment probe set for a range of phylogenetic utility: From order to species. Mol Ecol Resour 2021; 22:1521-1528. [PMID: 34800355 DOI: 10.1111/1755-0998.13555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 09/22/2021] [Accepted: 11/10/2021] [Indexed: 11/29/2022]
Abstract
Rodents are the largest order of mammals and contain several model organisms important to scientific research in a variety of fields, yet no large set of genomic markers have been designed for this group to date, hindering evolutionary studies into relationships of the group as a whole. Here we present a genomic probe set designed and optimized for rodents with a protocol that is easy to replicate with little laboratory investment. This design utilizes an anchored hybrid enrichment approach specifically targeting rodents to generate longer loci with a higher substitution rate than existing vertebrate probes to provide utility at various taxonomic levels. Using a test set of rodents from all five suborders, we successfully obtained alignments for 416 of the 418 target loci with an average of 1379 bp per locus and a total alignment of more than half a million base pairs. This genomic data set performed well in all phylogenetic analyses, especially in recent phylogenetic splits, with ample parsimony-informative sites within genera and even within species, showing more than four times as many single nucleotide polymorphisms per locus than a recent vertebrate ultraconserved elements study. Additional support is provided in resolving deeper clades in Rodentia. By providing this probe design, we hope that more laboratories can easily generate data for answering questions in rodents from species delimitation to understanding relationships among families in rapid radiations.
Collapse
Affiliation(s)
- Max R Bangs
- Department of Biological Sciences, Florida State University, Tallahassee, Florida, USA
| | - Scott J Steppan
- Department of Biological Sciences, Florida State University, Tallahassee, Florida, USA
| |
Collapse
|
13
|
Mulder KP, Alarcón-Ríos L, Nicieza AG, Fleischer RC, Bell RC, Velo-Antón G. Independent evolutionary transitions to pueriparity across multiple timescales in the viviparous genus Salamandra. Mol Phylogenet Evol 2021; 167:107347. [PMID: 34763070 DOI: 10.1016/j.ympev.2021.107347] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 10/21/2021] [Accepted: 11/03/2021] [Indexed: 02/07/2023]
Abstract
The ability to bear live offspring, viviparity, has evolved multiple times across the tree of life and is a remarkable adaptation with profound life-history and ecological implications. Within amphibians the ancestral reproductive mode is oviparity followed by a larval life stage, but viviparity has evolved independently in all three amphibian orders. Two types of viviparous reproduction can be distinguished in amphibians; larviparity and pueriparity. Larviparous amphibians deliver larvae into nearby ponds and streams, while pueriparous amphibians deliver fully developed juveniles and thus do not require waterbodies for reproduction. Among amphibians, the salamander genus Salamandra is remarkable as it exhibits both inter- and intraspecific variation in the occurrence of larviparity and pueriparity. While the evolutionary relationships among Salamandra lineages have been the focus of several recent studies, our understanding of how often and when transitions between modes occurred is still incomplete. Furthermore, in species with intraspecific variation, the reproductive mode of a given population can only be confirmed by direct observation of births and thus the prevalence of pueriparous populations is also incompletely documented. We used sequence capture to obtain 1,326 loci from 94 individuals from across the geographic range of the genus, focusing on potential reproductive mode transition zones. We also report additional direct observations of pueriparous births for 20 new locations and multiple lineages. We identify at least five independent transitions from the ancestral mode of larviparity to pueriparity among and within species, occurring at different evolutionary timescales ranging from the Pliocene to the Holocene. Four of these transitions occurred within species. Based on a distinct set of markers and analyses, we also confirm previous findings of introgression between species and the need for taxonomic revisions in the genus. We discuss the implications of our findings with respect to the evolution of this complex trait, and the potential of using five independent convergent transitions for further studies on the ecological context in which pueriparity evolves and the genetic architecture of this specialized reproductive mode.
Collapse
Affiliation(s)
- Kevin P Mulder
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal; BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, 4485-661 Vairão, Portugal; Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, 1000 Constitution Avenue NW, Washington, DC 20560, USA; Center for Conservation Genomics, Smithsonian Conservation Biology Institute, National Zoological Park, 3001 Connecticut Avenue NW, Washington, DC 20008, USA.
| | - Lucía Alarcón-Ríos
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal; BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, 4485-661 Vairão, Portugal; Departamento de Biología de Organismos y Sistemas, Universidad de Oviedo UO, Oviedo, Spain
| | - Alfredo G Nicieza
- Departamento de Biología de Organismos y Sistemas, Universidad de Oviedo UO, Oviedo, Spain; Biodiversity Research Institute (IMIB), University of Oviedo-Principality of Asturias-CSIC, 33600 Mieres, Spain
| | - Robert C Fleischer
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute, National Zoological Park, 3001 Connecticut Avenue NW, Washington, DC 20008, USA
| | - Rayna C Bell
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, 1000 Constitution Avenue NW, Washington, DC 20560, USA; Herpetology Department, California Academy of Sciences, 55 Music Concourse Drive, San Francisco, CA 94118, USA
| | - Guillermo Velo-Antón
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal; BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, 4485-661 Vairão, Portugal; Universidade de Vigo, Grupo GEA, Departamento de Ecoloxía e Bioloxía Animal, Vigo, Spain.
| |
Collapse
|
14
|
G Ribeiro P, Torres Jiménez MF, Andermann T, Antonelli A, Bacon CD, Matos-Maraví P. A bioinformatic platform to integrate target capture and whole genome sequences of various read depths for phylogenomics. Mol Ecol 2021; 30:6021-6035. [PMID: 34674330 PMCID: PMC9298010 DOI: 10.1111/mec.16240] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 09/24/2021] [Accepted: 10/16/2021] [Indexed: 11/28/2022]
Abstract
The increasing availability of short‐read whole genome sequencing (WGS) provides unprecedented opportunities to study ecological and evolutionary processes. Although loci of interest can be extracted from WGS data and combined with target sequence data, this requires suitable bioinformatic workflows. Here, we test different assembly and locus extraction strategies and implement them into secapr, a pipeline that processes short‐read data into multilocus alignments for phylogenetics and molecular ecology analyses. We integrate the processing of data from low‐coverage WGS (<30×) and target sequence capture into a flexible framework, while optimizing de novo contig assembly and loci extraction. Specifically, we test different assembly strategies by contrasting their ability to recover loci from targeted butterfly protein‐coding genes, using four data sets: a WGS data set across different average coverages (10×, 5× and 2×) and a data set for which these loci were enriched prior to sequencing via target sequence capture. Using the resulting de novo contigs, we account for potential errors within contigs and infer phylogenetic trees to evaluate the ability of each assembly strategy to recover species relationships. We demonstrate that choosing multiple sizes of kmer simultaneously for assembly results in the highest yield of extracted loci from de novo assembled contigs, while data sets derived from sequencing read depths as low as 5× recovers the expected species relationships in phylogenetic trees. By making the tested assembly approaches available in the secapr pipeline, we hope to inspire future studies to incorporate complementary data and make an informed choice on the optimal assembly strategy.
Collapse
Affiliation(s)
- Pedro G Ribeiro
- Biology Centre of the Czech Academy of Sciences, Institute of Entomology, České Budějovice, Czech Republic.,Faculty of Science, University of South Bohemia, České Budějovice, Czech Republic
| | - María Fernanda Torres Jiménez
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden.,Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Tobias Andermann
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden.,Gothenburg Global Biodiversity Centre, Gothenburg, Sweden.,Department of Biology, University of Fribourg, Fribourg, Switzerland.,Swiss Institute of Bioinformatics, Fribourg, Switzerland
| | - Alexandre Antonelli
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden.,Gothenburg Global Biodiversity Centre, Gothenburg, Sweden.,Royal Botanical Gardens Kew, Richmond, UK.,Department of Plant Sciences, University of Oxford, Oxford, UK
| | - Christine D Bacon
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden.,Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Pável Matos-Maraví
- Biology Centre of the Czech Academy of Sciences, Institute of Entomology, České Budějovice, Czech Republic.,Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| |
Collapse
|
15
|
O'Connell KA, Mulder KP, Wynn A, de Queiroz K, Bell RC. Genomic library preparation and hybridization capture of formalin-fixed tissues and allozyme supernatant for population genomics and considerations for combining capture- and RADseq-based single nucleotide polymorphism data sets. Mol Ecol Resour 2021; 22:487-502. [PMID: 34329532 DOI: 10.1111/1755-0998.13481] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 06/10/2021] [Accepted: 07/14/2021] [Indexed: 12/17/2022]
Abstract
Until recently many historical museum specimens were largely inaccessible to genomic inquiry, but high-throughput sequencing (HTS) approaches have allowed researchers to successfully sequence genomic DNA from dried and fluid-preserved museum specimens. In addition to preserved specimens, many museums contain large series of allozyme supernatant samples, but the amenability of these samples to HTS has not yet been assessed. Here, we compared the performance of a target-capture approach using alternative sources of genomic DNA from 10 specimens of spring salamanders (Plethodontidae: Gyrinophilus porphyriticus) collected between 1985 and 1990: allozyme supernatants, allozyme homogenate pellets and formalin-fixed tissues. We designed capture probes based on double-digest restriction-site associated sequencing (RADseq) derived loci from frozen blood samples available for seven of the specimens and assessed the success and consistency of capture and RADseq approaches. This study design enabled direct comparisons of data quality and potential biases among the different data sets for phylogenomic and population genomic analyses. We found that in phylogenetic analyses, all enrichment types for a given specimen clustered together. In principal component space all capture-based samples clustered together, but RADseq samples did not cluster with corresponding capture-based samples. Single nucleotide polymorphism calls were on average 18.3% different between enrichment types for a given individual, but these discrepancies were primarily due to differences in heterozygous/homozygous single nucleotide polymorphism calls. We demonstrate that both allozyme supernatant and formalin-fixed samples can be successfully used for population genomic analyses and we discuss ways to identify and reduce biases associated with combining capture and RADseq data.
Collapse
Affiliation(s)
- Kyle A O'Connell
- Global Genome Initiative, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,Department of Biological Sciences, The George Washington University, Washington, District of Columbia, USA.,Biomedical Data Science Lab, Deloitte Consulting LLP, Arlington, Virginia, USA
| | - Kevin P Mulder
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Vairão, Portugal.,Center for Conservation Genomics, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, District of Columbia, USA
| | - Addison Wynn
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA
| | - Kevin de Queiroz
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA
| | - Rayna C Bell
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,Department of Herpetology, California Academy of Sciences, San Francisco, California, USA
| |
Collapse
|
16
|
McLay TGB, Birch JL, Gunn BF, Ning W, Tate JA, Nauheimer L, Joyce EM, Simpson L, Schmidt‐Lebuhn AN, Baker WJ, Forest F, Jackson CJ. New targets acquired: Improving locus recovery from the Angiosperms353 probe set. APPLICATIONS IN PLANT SCIENCES 2021; 9:APS311420. [PMID: 34336399 PMCID: PMC8312740 DOI: 10.1002/aps3.11420] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 03/15/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Universal target enrichment kits maximize utility across wide evolutionary breadth while minimizing the number of baits required to create a cost-efficient kit. The Angiosperms353 kit has been successfully used to capture loci throughout the angiosperms, but the default target reference file includes sequence information from only 6-18 taxa per locus. Consequently, reads sequenced from on-target DNA molecules may fail to map to references, resulting in fewer on-target reads for assembly, and reducing locus recovery. METHODS We expanded the Angiosperms353 target file, incorporating sequences from 566 transcriptomes to produce a 'mega353' target file, with each locus represented by 17-373 taxa. This mega353 file is a drop-in replacement for the original Angiosperms353 file in HybPiper analyses. We provide tools to subsample the file based on user-selected taxon groups, and to incorporate other transcriptome or protein-coding gene data sets. RESULTS Compared to the default Angiosperms353 file, the mega353 file increased the percentage of on-target reads by an average of 32%, increased locus recovery at 75% length by 49%, and increased the total length of the concatenated loci by 29%. DISCUSSION Increasing the phylogenetic density of the target reference file results in improved recovery of target capture loci. The mega353 file and associated scripts are available at: https://github.com/chrisjackson-pellicle/NewTargets.
Collapse
Affiliation(s)
- Todd G. B. McLay
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
- Centre for Australian National Biodiversity ResearchCSIROCanberraAustralia
| | - Joanne L. Birch
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
| | - Bee F. Gunn
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
| | - Weixuan Ning
- School of Fundamental SciencesMassey UniversityPalmerston NorthNew Zealand
| | - Jennifer A. Tate
- School of Fundamental SciencesMassey UniversityPalmerston NorthNew Zealand
| | - Lars Nauheimer
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | - Elizabeth M. Joyce
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | - Lalita Simpson
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | | | | | - Félix Forest
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
| | - Chris J. Jackson
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
| |
Collapse
|
17
|
Nauheimer L, Weigner N, Joyce E, Crayn D, Clarke C, Nargar K. HybPhaser: A workflow for the detection and phasing of hybrids in target capture data sets. APPLICATIONS IN PLANT SCIENCES 2021; 9:APS311441. [PMID: 34336402 PMCID: PMC8312746 DOI: 10.1002/aps3.11441] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 04/28/2021] [Indexed: 05/24/2023]
Abstract
PREMISE Hybrids contain divergent alleles that can confound phylogenetic analyses but can provide insights into reticulated evolution when identified and phased. We developed a workflow to detect hybrids in target capture data sets and phase reads into parental lineages using a similarity and phylogenetic framework. METHODS We used Angiosperms353 target capture data for Nepenthes, including known hybrids to test the novel workflow. Reference mapping was used to assess heterozygous sites across the data set and to detect hybrid accessions and paralogous genes. Hybrid samples were phased by mapping reads to multiple references and sorting reads according to similarity. Phased accessions were included in the phylogenetic framework. RESULTS All known Nepenthes hybrids and nine additional samples had high levels of heterozygous sites, had reads associated with multiple divergent clades, and were phased into accessions resembling divergent haplotypes. Phylogenetic analysis including phased accessions increased clade support and confirmed parental lineages of hybrids. DISCUSSION HybPhaser provides a novel approach to detect and phase hybrids in target capture data sets, which can provide insights into reticulations by revealing origins of hybrids and reduce conflicting signal, leading to more robust phylogenetic analyses.
Collapse
Affiliation(s)
- Lars Nauheimer
- Australian Tropical HerbariumJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
- Centre for Tropical Bioinformatics and Molecular BiologyJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
- Centre for Tropical Environmental Sustainability ScienceJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
| | - Nicholas Weigner
- Australian Tropical HerbariumJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
| | - Elizabeth Joyce
- Australian Tropical HerbariumJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
- Centre for Tropical Environmental Sustainability ScienceJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
| | - Darren Crayn
- Australian Tropical HerbariumJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
- Centre for Tropical Bioinformatics and Molecular BiologyJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
- Centre for Tropical Environmental Sustainability ScienceJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
| | - Charles Clarke
- Australian Tropical HerbariumJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
- Cairns Botanic GardensCollins AvenueEdge HillQueensland4870Australia
| | - Katharina Nargar
- Australian Tropical HerbariumJames Cook UniversityMcGregor RoadSmithfieldQueensland4878Australia
- National Research Collections AustraliaCommonwealth Industrial and Scientific Research Organisation (CSIRO)GPO Box 1700CanberraAustralian Capital Territory2601Australia
| |
Collapse
|
18
|
Zhou W, Soghigian J, Xiang QYJ. A New Pipeline for Removing Paralogs in Target Enrichment Data. Syst Biol 2021; 71:410-425. [PMID: 34146111 PMCID: PMC8974407 DOI: 10.1093/sysbio/syab044] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 06/04/2021] [Accepted: 06/12/2021] [Indexed: 12/30/2022] Open
Abstract
Target enrichment (such as Hyb-Seq) is a well-established high throughput sequencing
method that has been increasingly used for phylogenomic studies. Unfortunately, current
widely used pipelines for analysis of target enrichment data do not have a vigorous
procedure to remove paralogs in target enrichment data. In this study, we develop a
pipeline we call Putative Paralogs Detection (PPD) to better address putative paralogs
from enrichment data. The new pipeline is an add-on to the existing HybPiper pipeline, and
the entire pipeline applies criteria in both sequence similarity and heterozygous sites at
each locus in the identification of paralogs. Users may adjust the thresholds of sequence
identity and heterozygous sites to identify and remove paralogs according to the level of
phylogenetic divergence of their group of interest. The new pipeline also removes highly
polymorphic sites attributed to errors in sequence assembly and gappy regions in the
alignment. We demonstrated the value of the new pipeline using empirical data generated
from Hyb-Seq and the Angiosperms353 kit for two woody genera Castanea
(Fagaceae, Fagales) and Hamamelis (Hamamelidaceae, Saxifragales).
Comparisons of data sets showed that the PPD identified many more putative paralogs than
the popular method HybPiper. Comparisons of tree topologies and divergence times showed
evident differences between data from HybPiper and data from our new PPD pipeline. We
further evaluated the accuracy and error rates of PPD by BLAST mapping of putative
paralogous and orthologous sequences to a reference genome sequence of Castanea
mollissima. Compared to HybPiper alone, PPD identified substantially more
paralogous gene sequences that mapped to multiple regions of the reference genome (31
genes for PPD compared with 4 genes for HybPiper alone). In conjunction with HybPiper,
paralogous genes identified by both pipelines can be removed resulting in the construction
of more robust orthologous gene data sets for phylogenomic and divergence time analyses.
Our study demonstrates the value of Hyb-Seq with data derived from the Angiosperms353
probe set for elucidating species relationships within a genus, and argues for the
importance of additional steps to filter paralogous genes and poorly aligned regions
(e.g., as occur through assembly errors), such as our new PPD pipeline described in this
study. [Angiosperms353; Castanea; divergence time;
Hamamelis; Hyb-Seq, paralogs, phylogenomics.]
Collapse
Affiliation(s)
- Wenbin Zhou
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27965, USA
| | - John Soghigian
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC 27965, USA
| | - Qiu-Yun Jenny Xiang
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27965, USA
| |
Collapse
|
19
|
Helmstetter AJ, Amoussou BEN, Bethune K, Kamdem NG, Glèlè Kakaï R, Sonké B, Couvreur TLP. Phylogenomic approaches reveal how climate shapes patterns of genetic diversity in an African rain forest tree species. Mol Ecol 2020; 29:3560-3573. [PMID: 32743910 DOI: 10.1111/mec.15572] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 07/08/2020] [Accepted: 07/13/2020] [Indexed: 12/23/2022]
Abstract
The world's second largest expanse of tropical rainforest is in Central Africa, and it harbours enormous species diversity. Population genetic studies have consistently revealed significant structure across Central African rainforest plants. In particular, previous studies have repeatedly demonstrated a north-south genetic discontinuity around the equatorial line, in a continuous expanse of rainforest where a climatic inversion is documented. Here, we took a phylogeographic approach by sequencing 351 nuclear markers in 112 individuals across the distribution of the African rainforest tree species Annickia affinis (Annonaceae). We showed for the first time that the north-south divide is the result of a single, major colonization event across the climatic inversion from an ancestral population located in Gabon. We suggested that differences in ecological niche of populations located on either side of this inversion may have contributed to this phylogenetic discontinuity. We found evidence for inland dispersal, predominantly in northern areas, and variable demographic histories among genetic clusters, indicating that populations responded differently to past climate change. We show how newly developed genomic tools can provide invaluable insights into our understanding of tropical rainforest evolutionary dynamics.
Collapse
Affiliation(s)
| | - Biowa E N Amoussou
- IRD, UMR DIADE, Université de Montpellier, Montpellier, France.,Faculté des Sciences Agronomiques, Université d'Abomey-Calavi, Cotonou, Benin
| | - Kevin Bethune
- IRD, UMR DIADE, Université de Montpellier, Montpellier, France
| | - Narcisse G Kamdem
- Laboratoire de Botanique Systématique et d'Ecologie, Département des Sciences Biologiques, Ecole Normale Supérieure, Université de Yaoundé I, Yaoundé, Cameroon
| | - Romain Glèlè Kakaï
- Laboratoire de Biomathématiques et d'Estimations Forestières, Université d'Abomey-Calavi, Cotonou, Bénin
| | - Bonaventure Sonké
- Laboratoire de Botanique Systématique et d'Ecologie, Département des Sciences Biologiques, Ecole Normale Supérieure, Université de Yaoundé I, Yaoundé, Cameroon
| | - Thomas L P Couvreur
- IRD, UMR DIADE, Université de Montpellier, Montpellier, France.,Laboratoire de Botanique Systématique et d'Ecologie, Département des Sciences Biologiques, Ecole Normale Supérieure, Université de Yaoundé I, Yaoundé, Cameroon
| |
Collapse
|
20
|
Portik DM, Wiens JJ. Do Alignment and Trimming Methods Matter for Phylogenomic (UCE) Analyses? Syst Biol 2020; 70:440-462. [PMID: 32797207 DOI: 10.1093/sysbio/syaa064] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Revised: 08/02/2020] [Accepted: 08/03/2020] [Indexed: 11/14/2022] Open
Abstract
Alignment is a crucial issue in molecular phylogenetics because different alignment methods can potentially yield very different topologies for individual genes. But it is unclear if the choice of alignment methods remains important in phylogenomic analyses, which incorporate data from hundreds or thousands of genes. For example, problematic biases in alignment might be multiplied across many loci, whereas alignment errors in individual genes might become irrelevant. The issue of alignment trimming (i.e., removing poorly aligned regions or missing data from individual genes) is also poorly explored. Here, we test the impact of 12 different combinations of alignment and trimming methods on phylogenomic analyses. We compare these methods using published phylogenomic data from ultraconserved elements (UCEs) from squamate reptiles (lizards and snakes), birds, and tetrapods. We compare the properties of alignments generated by different alignment and trimming methods (e.g., length, informative sites, missing data). We also test whether these data sets can recover well-established clades when analyzed with concatenated (RAxML) and species-tree methods (ASTRAL-III), using the full data ($\sim $5000 loci) and subsampled data sets (10% and 1% of loci). We show that different alignment and trimming methods can significantly impact various aspects of phylogenomic data sets (e.g., length, informative sites). However, these different methods generally had little impact on the recovery and support values for well-established clades, even across very different numbers of loci. Nevertheless, our results suggest several "best practices" for alignment and trimming. Intriguingly, the choice of phylogenetic methods impacted the phylogenetic results most strongly, with concatenated analyses recovering significantly more well-established clades (with stronger support) than the species-tree analyses. [Alignment; concatenated analysis; phylogenomics; sequence length heterogeneity; species-tree analysis; trimming].
Collapse
Affiliation(s)
- Daniel M Portik
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA.,California Academy of Sciences, San Francisco, CA 94118, USA
| | - John J Wiens
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
21
|
O'Connell KA, Oaks JR, Hamidy A, Shaney KJ, Kurniawan N, Smith EN, Fujita MK. Impacts of the Toba eruption and montane forest expansion on diversification in Sumatran parachuting frogs (Rhacophorus). Mol Ecol 2020; 29:2994-3009. [PMID: 32633832 DOI: 10.1111/mec.15541] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 06/11/2020] [Accepted: 06/12/2020] [Indexed: 01/09/2023]
Abstract
Catastrophic events, such as volcanic eruptions, can have profound impacts on the demographic histories of resident taxa. Due to its presumed effect on biodiversity, the Pleistocene eruption of super-volcano Toba has received abundant attention. We test the effects of the Toba eruption on the diversification, genetic diversity, and demography of three co-distributed species of parachuting frogs (Genus Rhacophorus) on Sumatra. We generate target-capture data (~950 loci and ~440,000 bp) for three species of parachuting frogs and use these data paired with previously generated double digest restriction-site associated DNA (ddRADseq) data to estimate population structure and genetic diversity, to test for population size changes using demographic modelling, and to estimate the temporal clustering of size change events using a full-likelihood Bayesian method. We find that populations around Toba exhibit reduced genetic diversity compared with southern populations, and that northern populations exhibit a shift in effective population size around the time of the eruption (~80 kya). However, we infer a stronger signal of expansion in southern populations around ~400 kya, and at least two of the northern populations may have also expanded at this time. Taken together, these findings suggest that the Toba eruption precipitated population declines in northern populations, but that the demographic history of these three species was also strongly impacted by mid-Pleistocene forest expansion during glacial periods. We propose local rather than regional effects of the Toba eruption, and emphasize the dynamic nature of diversification on the Sunda Shelf.
Collapse
Affiliation(s)
- Kyle A O'Connell
- Global Genome Initiative, National Museum of Natural History, Smithsonian Institute, Washington, DC, USA.,Division of Amphibians and Reptiles, Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institute, Washington, DC, USA.,Department of Biology and Amphibian and Reptile Diversity Research Center, The University of Texas at Arlington, Arlington, TX, USA.,Department of Biological Sciences, The George Washington University, Washington, DC, USA
| | - Jamie R Oaks
- Department of Biological Sciences and Museum of Natural History, Auburn University, Auburn, Alabama, USA
| | - Amir Hamidy
- Zoology Division, Museum Zoologicum Bogoriense, Research Center for Biology, Indonesian Institute of Sciences. Gd, Bogor, West Java, Indonesia
| | - Kyle J Shaney
- Institute of Ecology, National Autonomous University of Mexico, Mexico City, Mexico
| | - Nia Kurniawan
- Department of Biology, Universitas Brawijaya, Malang, East Java, Indonesia
| | - Eric N Smith
- Department of Biology and Amphibian and Reptile Diversity Research Center, The University of Texas at Arlington, Arlington, TX, USA
| | - Matthew K Fujita
- Department of Biology and Amphibian and Reptile Diversity Research Center, The University of Texas at Arlington, Arlington, TX, USA
| |
Collapse
|
22
|
Jantzen JR, Amarasinghe P, Folk RA, Reginato M, Michelangeli FA, Soltis DE, Cellinese N, Soltis PS. A two-tier bioinformatic pipeline to develop probes for target capture of nuclear loci with applications in Melastomataceae. APPLICATIONS IN PLANT SCIENCES 2020; 8:e11345. [PMID: 32477841 PMCID: PMC7249273 DOI: 10.1002/aps3.11345] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 12/20/2019] [Indexed: 05/21/2023]
Abstract
PREMISE Putatively single-copy nuclear (SCN) loci, which are identified using genomic resources of closely related species, are ideal for phylogenomic inference. However, suitable genomic resources are not available for many clades, including Melastomataceae. We introduce a versatile approach to identify SCN loci for clades with few genomic resources and use it to develop probes for target enrichment in the distantly related Memecylon and Tibouchina (Melastomataceae). METHODS We present a two-tiered pipeline. First, we identified putatively SCN loci using MarkerMiner and transcriptomes from distantly related species in Melastomataceae. Published loci and genes of functional significance were then added (384 total loci). Second, using HybPiper, we retrieved 689 homologous template sequences for these loci using genome-skimming data from within the focal clades. RESULTS We sequenced 193 loci common to Memecylon and Tibouchina. Probes designed from 56 template sequences successfully targeted sequences in both clades. Probes designed from genome-skimming data within a focal clade were more successful than probes designed from other sources. DISCUSSION Our pipeline successfully identified and targeted SCN loci in Memecylon and Tibouchina, enabling phylogenomic studies in both clades and potentially across Melastomataceae. This pipeline could be easily applied to other clades with few genomic resources.
Collapse
Affiliation(s)
- Johanna R. Jantzen
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Prabha Amarasinghe
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Ryan A. Folk
- Department of Biological SciencesMississippi State UniversityStarkvilleMississippi39762USA
| | - Marcelo Reginato
- Institute of Systematic BotanyThe New York Botanical GardenBronxNew York10458USA
- Universidade Federal do Rio Grande do SulPorto AlegreRio Grande do Sul90040‐060Brazil
| | | | - Douglas E. Soltis
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Nico Cellinese
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Pamela S. Soltis
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| |
Collapse
|
23
|
Helmstetter AJ, Kamga SM, Bethune K, Lautenschläger T, Zizka A, Bacon CD, Wieringa JJ, Stauffer F, Antonelli A, Sonké B, Couvreur TLP. Unraveling the Phylogenomic Relationships of the Most Diverse African Palm Genus Raphia (Calamoideae, Arecaceae). PLANTS (BASEL, SWITZERLAND) 2020; 9:E549. [PMID: 32340211 PMCID: PMC7238857 DOI: 10.3390/plants9040549] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2020] [Revised: 03/27/2020] [Accepted: 04/08/2020] [Indexed: 11/16/2022]
Abstract
Palms are conspicuous floristic elements across the tropics. In continental Africa, even though there are less than 70 documented species, they are omnipresent across the tropical landscape. The genus Raphia has 20 accepted species in Africa and one species endemic to the Neotropics. It is the most economically important genus of African palms with most of its species producing food and construction material. Raphia is divided into five sections based on inflorescence morphology. Nevertheless, the taxonomy of Raphia is problematic with no intra-generic phylogenetic study available. We present a phylogenetic study of the genus using a targeted exon capture approach sequencing of 56 individuals representing 18 out of the 21 species. Our results recovered five well supported clades within the genus. Three sections correspond to those based on inflorescence morphology. R. regalis is strongly supported as sister to all other Raphia species and is placed into a newly described section: Erectae. Overall, morphological based identifications agreed well with our phylogenetic analyses, with 12 species recovered as monophyletic based on our sampling. Species delimitation analyses recovered 17 or 23 species depending on the confidence level used. Species delimitation is especially problematic in the Raphiate and Temulentae sections. In addition, our clustering analysis using SNP data suggested that individual clusters matched geographic distribution. The Neotropical species R. taedigera is supported as a distinct species, rejecting the hypothesis of a recent introduction into South America. Our analyses support the hypothesis that the Raphia individuals from Madagascar are potentially a distinct species different from the widely distributed R. farinifera. In conclusion, our results support the infra generic classification of Raphia based on inflorescence morphology, which is shown to be phylogenetically useful. Classification and species delimitation within sections remains problematic even with our phylogenomic approach. Certain widely distributed species could potentially contain cryptic species. More in-depth studies should be undertaken using morphometrics, increased sampling, and more variable markers. Our study provides a robust phylogenomic framework that enables further investigation on the biogeographic history, morphological evolution, and other eco-evolutionary aspects of this charismatic, socially, and economically important palm genus.
Collapse
Affiliation(s)
| | - Suzanne Mogue Kamga
- Laboratoire de Botanique systématique et d’Ecologie, Department of Biological Sciences, University of Yaoundé I, Higher Teacher Training College, Yaoundé B.P. 047, Cameroon; (S.M.K.); (B.S.)
| | - Kevin Bethune
- IRD, DIADE, University Montpellier, 34394 Montpellier, France; (A.J.H.); (K.B.)
| | - Thea Lautenschläger
- Institute of Botany, Department of Biology, Faculty of Science, Technische Universität Dresden, 01062 Dresden, Germany;
| | - Alexander Zizka
- German Center for Integrative Biodiversity Research (iDiv) Halle-Leipzig-Jena, 04103 Leipzig, Germany;
| | - Christine D. Bacon
- Department of Biological and Environmental Sciences and Gothenburg Global Biodiversity Centre, University of Gothenburg, 405 30 Gothenburg, Sweden; (C.D.B.); (A.A.)
- Gothenburg Global Biodiversity Centre, Box 461, SE 40530 Goteborg, Sweden
| | - Jan J. Wieringa
- Naturalis Biodiversity Center, Darwinweg 2, 2333 CR Leiden, The Netherlands;
| | - Fred Stauffer
- Department of Botany and Plant Biology, Conservatory and Botanical Garden of the City of Geneva, University of Geneva, 1205 Geneva, Switzerland;
| | - Alexandre Antonelli
- Department of Biological and Environmental Sciences and Gothenburg Global Biodiversity Centre, University of Gothenburg, 405 30 Gothenburg, Sweden; (C.D.B.); (A.A.)
- Gothenburg Global Biodiversity Centre, Box 461, SE 40530 Goteborg, Sweden
- Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Bonaventure Sonké
- Laboratoire de Botanique systématique et d’Ecologie, Department of Biological Sciences, University of Yaoundé I, Higher Teacher Training College, Yaoundé B.P. 047, Cameroon; (S.M.K.); (B.S.)
| | | |
Collapse
|
24
|
Andermann T, Torres Jiménez MF, Matos-Maraví P, Batista R, Blanco-Pastor JL, Gustafsson ALS, Kistler L, Liberal IM, Oxelman B, Bacon CD, Antonelli A. A Guide to Carrying Out a Phylogenomic Target Sequence Capture Project. Front Genet 2020; 10:1407. [PMID: 32153629 PMCID: PMC7047930 DOI: 10.3389/fgene.2019.01407] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 12/24/2019] [Indexed: 12/17/2022] Open
Abstract
High-throughput DNA sequencing techniques enable time- and cost-effective sequencing of large portions of the genome. Instead of sequencing and annotating whole genomes, many phylogenetic studies focus sequencing effort on large sets of pre-selected loci, which further reduces costs and bioinformatic challenges while increasing coverage. One common approach that enriches loci before sequencing is often referred to as target sequence capture. This technique has been shown to be applicable to phylogenetic studies of greatly varying evolutionary depth. Moreover, it has proven to produce powerful, large multi-locus DNA sequence datasets suitable for phylogenetic analyses. However, target capture requires careful considerations, which may greatly affect the success of experiments. Here we provide a simple flowchart for designing phylogenomic target capture experiments. We discuss necessary decisions from the identification of target loci to the final bioinformatic processing of sequence data. We outline challenges and solutions related to the taxonomic scope, sample quality, and available genomic resources of target capture projects. We hope this review will serve as a useful roadmap for designing and carrying out successful phylogenetic target capture studies.
Collapse
Affiliation(s)
- Tobias Andermann
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Maria Fernanda Torres Jiménez
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Pável Matos-Maraví
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Institute of Entomology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czechia
| | - Romina Batista
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Programa de Pós-Graduação em Genética, Conservação e Biologia Evolutiva, PPG GCBEv–Instituto Nacional de Pesquisas da Amazônia—INPA Campus II, Manaus, Brazil
- Coordenação de Zoologia, Museu Paraense Emílio Goeldi, Belém, Brazil
| | - José L. Blanco-Pastor
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- INRAE, Centre Nouvelle-Aquitaine-Poitiers, Lusignan, France
| | | | - Logan Kistler
- Department of Anthropology, National Museum of Natural History, Smithsonian Institution, Washington, DC, United States
| | - Isabel M. Liberal
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
| | - Bengt Oxelman
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Christine D. Bacon
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Alexandre Antonelli
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Royal Botanic Gardens, Kew, Richmond-Surrey, United Kingdom
| |
Collapse
|
25
|
Matos-Maraví P, Duarte Ritter C, Barnes CJ, Nielsen M, Olsson U, Wahlberg N, Marquina D, Sääksjärvi I, Antonelli A. Biodiversity seen through the perspective of insects: 10 simple rules on methodological choices and experimental design for genomic studies. PeerJ 2019; 7:e6727. [PMID: 31106048 PMCID: PMC6499058 DOI: 10.7717/peerj.6727] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2018] [Accepted: 03/06/2019] [Indexed: 12/18/2022] Open
Abstract
Massively parallel DNA sequencing opens up opportunities for bridging multiple temporal and spatial dimensions in biodiversity research, thanks to its efficiency to recover millions of nucleotide polymorphisms. Here, we identify the current status, discuss the main challenges, and look into future perspectives on biodiversity genomics focusing on insects, which arguably constitute the most diverse and ecologically important group among all animals. We suggest 10 simple rules that provide a succinct step-by-step guide and best-practices to anyone interested in biodiversity research through the study of insect genomics. To this end, we review relevant literature on biodiversity and evolutionary research in the field of entomology. Our compilation is targeted at researchers and students who may not yet be specialists in entomology or molecular biology. We foresee that the genomic revolution and its application to the study of non-model insect lineages will represent a major leap to our understanding of insect diversity.
Collapse
Affiliation(s)
- Pável Matos-Maraví
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Institute of Entomology, Biology Centre CAS, České Budějovice, Czech Republic
| | - Camila Duarte Ritter
- Department of Eukaryotic Microbiology, University of Duisburg-Essen, Essen, Germany
| | | | - Martin Nielsen
- Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
- Section for Evolutionary Genomics, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Urban Olsson
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | | | - Daniel Marquina
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
- Department of Zoology, Stockholm University, Stockholm, Sweden
| | | | - Alexandre Antonelli
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Royal Botanical Garden, Kew, Richmond, Surrey, UK
| |
Collapse
|
26
|
Böhne A, Weber AAT, Rajkov J, Rechsteiner M, Riss A, Egger B, Salzburger W. Repeated Evolution Versus Common Ancestry: Sex Chromosome Evolution in the Haplochromine Cichlid Pseudocrenilabrus philander. Genome Biol Evol 2019; 11:439-458. [PMID: 30649313 PMCID: PMC6375353 DOI: 10.1093/gbe/evz003] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/07/2019] [Indexed: 12/15/2022] Open
Abstract
Why sex chromosomes turn over and remain undifferentiated in some taxa, whereas they degenerate in others, is still an area of ongoing research. The recurrent occurrence of homologous and homomorphic sex chromosomes in distantly related taxa suggests their independent evolution or continued recombination since their first emergence. Fishes display a great diversity of sex-determining systems. Here, we focus on sex chromosome evolution in haplochromines, the most species-rich lineage of cichlid fishes. We investigate sex-specific signatures in the Pseudocrenilabrus philander species complex, which belongs to a haplochromine genus found in many river systems and ichthyogeographic regions in northern, eastern, central, and southern Africa. Using whole-genome sequencing and population genetic, phylogenetic, and read-coverage analyses, we show that one population of P. philander has an XX-XY sex-determining system on LG7 with a large region of suppressed recombination. However, in a second bottlenecked population, we did not find any sign of a sex chromosome. Interestingly, LG7 also carries an XX-XY system in the phylogenetically more derived Lake Malawi haplochromine cichlids. Although the genomic regions determining sex are the same in Lake Malawi cichlids and P. philander, we did not find evidence for shared ancestry, suggesting that LG7 evolved as sex chromosome at least twice in haplochromine cichlids. Hence, our work provides further evidence for the labile nature of sex determination in fishes and supports the hypothesis that the same genomic regions can repeatedly and rapidly be recruited as sex chromosomes in more distantly related lineages.
Collapse
Affiliation(s)
- Astrid Böhne
- Department of Environmental Sciences, Zoological Institute, University of Basel, Switzerland
| | - Alexandra Anh-Thu Weber
- Department of Environmental Sciences, Zoological Institute, University of Basel, Switzerland
- Museums Victoria, Melbourne, Victoria, Australia
| | - Jelena Rajkov
- Department of Environmental Sciences, Zoological Institute, University of Basel, Switzerland
| | - Michael Rechsteiner
- Department of Environmental Sciences, Zoological Institute, University of Basel, Switzerland
| | - Andrin Riss
- Department of Environmental Sciences, Zoological Institute, University of Basel, Switzerland
| | - Bernd Egger
- Department of Environmental Sciences, Zoological Institute, University of Basel, Switzerland
- Program Man Society Environment, University of Basel, Switzerland
| | - Walter Salzburger
- Department of Environmental Sciences, Zoological Institute, University of Basel, Switzerland
| |
Collapse
|