1
|
McKibben MTW, Finch G, Barker MS. Species-tree topology impacts the inference of ancient whole-genome duplications across the angiosperm phylogeny. AMERICAN JOURNAL OF BOTANY 2024; 111:e16378. [PMID: 39039654 DOI: 10.1002/ajb2.16378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 06/11/2024] [Accepted: 06/12/2024] [Indexed: 07/24/2024]
Abstract
PREMISE The history of angiosperms is marked by repeated rounds of ancient whole-genome duplications (WGDs). Here we used state-of-the-art methods to provide an up-to-date view of the distribution of WGDs in the history of angiosperms that considers both uncertainty introduced by different WGD inference methods and different underlying species-tree hypotheses. METHODS We used the distribution synonymous divergences (Ks) of paralogs and orthologs from transcriptomic and genomic data to infer and place WGDs across two hypothesized angiosperm phylogenies. We further tested these WGD hypotheses with syntenic inferences and Bayesian models of duplicate gene gain and loss. RESULTS The predicted number of WGDs in the history of angiosperms (~170) based on the current taxon sampling is largely similar across different inference methods, but varies in the precise placement of WGDs on the phylogeny. Ks-based methods often yield alternative hypothesized WGD placements due to variation in substitution rates among lineages. Phylogenetic models of duplicate gene gain and loss are more robust to topological variation. However, errors in species-tree inference can still produce spurious WGD hypotheses, regardless of method used. CONCLUSIONS Here we showed that different WGD inference methods largely agree on an average of 3.5 WGD in the history of individual angiosperm species. However, the precise placement of WGDs on the phylogeny is subject to the WGD inference method and tree topology. As researchers continue to test hypotheses regarding the impacts ancient WGDs have on angiosperm evolution, it is important to consider the uncertainty of the phylogeny as well as WGD inference methods.
Collapse
Affiliation(s)
- Michael T W McKibben
- Department of Ecology & Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| | - Geoffrey Finch
- Department of Ecology & Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| | - Michael S Barker
- Department of Ecology & Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
2
|
Halabi K, Shafir A, Mayrose I. PloiDB: the plant ploidy database. THE NEW PHYTOLOGIST 2023; 240:918-927. [PMID: 37337836 DOI: 10.1111/nph.19057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 05/16/2023] [Indexed: 06/21/2023]
Abstract
See also the Commentary on this article by Spoelhof et al., 240: 909–911.
Collapse
Affiliation(s)
- Keren Halabi
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv-Yafo, 69978, Israel
| | - Anat Shafir
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv-Yafo, 69978, Israel
| | - Itay Mayrose
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv-Yafo, 69978, Israel
| |
Collapse
|
3
|
Marlétaz F, Timoshevskaya N, Timoshevskiy V, Simakov O, Parey E, Gavriouchkina D, Suzuki M, Kubokawa K, Brenner S, Smith J, Rokhsar DS. The hagfish genome and the evolution of vertebrates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.17.537254. [PMID: 37131617 PMCID: PMC10153176 DOI: 10.1101/2023.04.17.537254] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
As the only surviving lineages of jawless fishes, hagfishes and lampreys provide a critical window into early vertebrate evolution. Here, we investigate the complex history, timing, and functional role of genome-wide duplications in vertebrates in the light of a chromosome-scale genome of the brown hagfish Eptatretus atami. Using robust chromosome-scale (paralogon-based) phylogenetic methods, we confirm the monophyly of cyclostomes, document an auto-tetraploidization (1RV) that predated the origin of crown group vertebrates ~517 Mya, and establish the timing of subsequent independent duplications in the gnathostome and cyclostome lineages. Some 1RV gene duplications can be linked to key vertebrate innovations, suggesting that this early genomewide event contributed to the emergence of pan-vertebrate features such as neural crest. The hagfish karyotype is derived by numerous fusions relative to the ancestral cyclostome arrangement preserved by lampreys. These genomic changes were accompanied by the loss of genes essential for organ systems (eyes, osteoclast) that are absent in hagfish, accounting in part for the simplification of the hagfish body plan; other gene family expansions account for hagfishes' capacity to produce slime. Finally, we characterise programmed DNA elimination in somatic cells of hagfish, identifying protein-coding and repetitive elements that are deleted during development. As in lampreys, the elimination of these genes provides a mechanism for resolving genetic conflict between soma and germline by repressing germline/pluripotency functions. Reconstruction of the early genomic history of vertebrates provides a framework for further exploration of vertebrate novelties.
Collapse
Affiliation(s)
- Ferdinand Marlétaz
- Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
- Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | | | | | - Oleg Simakov
- Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
- Department of Molecular Evolution and Development, University of Vienna, Vienna, Austria
| | - Elise Parey
- Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Daria Gavriouchkina
- Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
- Present address: UK Dementia Research Institute, University College London, London, UK
| | - Masakazu Suzuki
- Department of Science, Graduate School of Integrated Science and Technology, Shizuoka University, Shizuoka, Japan
| | - Kaoru Kubokawa
- Ocean Research Institute, The University of Tokyo, Tokyo, Japan
| | - Sydney Brenner
- Comparative and Medical Genomics Laboratory, Institute of Molecular and Cell Biology, A*STAR, Biopolis, Singapore 138673, Singapore
- Deceased
| | - Jeramiah Smith
- Department of Biology, University of Kentucky, Lexington, KY, USA
| | - Daniel S Rokhsar
- Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
4
|
Chen H, Zwaenepoel A. Inference of Ancient Polyploidy from Genomic Data. Methods Mol Biol 2023; 2545:3-18. [PMID: 36720805 DOI: 10.1007/978-1-0716-2561-3_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Whole-genome sequence data have revealed that numerous eukaryotic organisms derive from distant polyploid ancestors, even when these same organisms are genetically and karyotypically diploid. Such ancient whole-genome duplications (WGDs) have been important for long-term genome evolution and are often speculatively associated with important evolutionary events such as key innovations, adaptive radiations, or survival after mass extinctions. Clearly, reliable methods for unveiling ancient WGDs are key toward furthering understanding of the long-term evolutionary significance of polyploidy. In this chapter, we describe a set of basic established comparative genomics approaches for the inference of ancient WGDs from genomic data based on empirical age distributions and collinearity analyses, explain the principles on which they are based, and illustrate a basic workflow using the software "wgd," geared toward a typical exploratory analysis of a newly obtained genome sequence.
Collapse
Affiliation(s)
- Hengchi Chen
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Arthur Zwaenepoel
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
| |
Collapse
|
5
|
Qiao X, Zhang S, Paterson AH. Pervasive genome duplications across the plant tree of life and their links to major evolutionary innovations and transitions. Comput Struct Biotechnol J 2022; 20:3248-3256. [PMID: 35782740 PMCID: PMC9237934 DOI: 10.1016/j.csbj.2022.06.026] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 06/12/2022] [Accepted: 06/12/2022] [Indexed: 01/09/2023] Open
Abstract
Whole-genome duplication (WGD) has occurred repeatedly during plant evolution and diversification, providing genetic layers for evolving new functions and phenotypes. Advances in long-read sequencing technologies have enabled sequencing and assembly of over 1000 plant genomes spanning nearly 800 species, in which a large set of ancient WGDs has been uncovered. Here, we review the recently reported WGDs that occurred in major plant lineages and key evolutionary positions, and highlight their contributions to morphological innovation and adaptive evolution. Current gaps and challenges in integrating enormous volumes of sequenced plant genomes, accurately inferring WGDs, and developing web-based analysis tools are emphasized. Looking to the future, ambitious genome sequencing projects and global efforts may substantially recapitulate the plant tree of life based on broader sampling of phylogenetic diversity, reveal much of the timetable of ancient WGDs, and address the biological significance of WGDs in plant adaptation and radiation.
Collapse
Affiliation(s)
- Xin Qiao
- Centre of Pear Engineering Technology Research, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
| | - Shaoling Zhang
- Centre of Pear Engineering Technology Research, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
| | - Andrew H. Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30605, USA
| |
Collapse
|
6
|
Fukunaga T, Iwasaki W. Mirage: estimation of ancestral gene-copy numbers by considering different evolutionary patterns among gene families. BIOINFORMATICS ADVANCES 2021; 1:vbab014. [PMID: 36700099 PMCID: PMC9710636 DOI: 10.1093/bioadv/vbab014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Revised: 07/22/2021] [Accepted: 07/28/2021] [Indexed: 01/28/2023]
Abstract
Motivation Reconstruction of gene copy number evolution is an essential approach for understanding how complex biological systems have been organized. Although various models have been proposed for gene copy number evolution, existing evolutionary models have not appropriately addressed the fact that different gene families can have very different gene gain/loss rates. Results In this study, we developed Mirage (MIxtuRe model for Ancestral Genome Estimation), which allows different gene families to have flexible gene gain/loss rates. Mirage can use three models for formulating heterogeneous evolution among gene families: the discretized Γ model, probability distribution-free model and pattern mixture (PM) model. Simulation analysis showed that Mirage can accurately estimate heterogeneous gene gain/loss rates and reconstruct gene-content evolutionary history. Application to empirical datasets demonstrated that the PM model fits genome data from various taxonomic groups better than the other heterogeneous models. Using Mirage, we revealed that metabolic function-related gene families displayed frequent gene gains and losses in all taxa investigated. Availability and implementation The source code of Mirage is freely available at https://github.com/fukunagatsu/Mirage. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Tokyo 1690051, Japan,Department of Computer Science, Graduate School of Information Science and Technology, The University of Tokyo, Tokyo 1130032, Japan,To whom correspondence should be addressed. or
| | - Wataru Iwasaki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba 2770882, Japan,Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo 1130032, Japan,Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba 2770882, Japan,Atmosphere and Ocean Research Institute, The University of Tokyo, Chiba 2770882, Japan,Institute for Quantitative Biosciences, The University of Tokyo, Tokyo 1130032, Japan,Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Tokyo 1130032, Japan,To whom correspondence should be addressed. or
| |
Collapse
|
7
|
Zhao T, Zwaenepoel A, Xue JY, Kao SM, Li Z, Schranz ME, Van de Peer Y. Whole-genome microsynteny-based phylogeny of angiosperms. Nat Commun 2021; 12:3498. [PMID: 34108452 PMCID: PMC8190143 DOI: 10.1038/s41467-021-23665-0] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 05/10/2021] [Indexed: 02/05/2023] Open
Abstract
Plant genomes vary greatly in size, organization, and architecture. Such structural differences may be highly relevant for inference of genome evolution dynamics and phylogeny. Indeed, microsynteny-the conservation of local gene content and order-is recognized as a valuable source of phylogenetic information, but its use for the inference of large phylogenies has been limited. Here, by combining synteny network analysis, matrix representation, and maximum likelihood phylogenetic inference, we provide a way to reconstruct phylogenies based on microsynteny information. Both simulations and use of empirical data sets show our method to be accurate, consistent, and widely applicable. As an example, we focus on the analysis of a large-scale whole-genome data set for angiosperms, including more than 120 available high-quality genomes, representing more than 50 different plant families and 30 orders. Our 'microsynteny-based' tree is largely congruent with phylogenies proposed based on more traditional sequence alignment-based methods and current phylogenetic classifications but differs for some long-contested and controversial relationships. For instance, our synteny-based tree finds Vitales as early diverging eudicots, Saxifragales within superasterids, and magnoliids as sister to monocots. We discuss how synteny-based phylogenetic inference can complement traditional methods and could provide additional insights into some long-standing controversial phylogenetic relationships.
Collapse
Affiliation(s)
- Tao Zhao
- State Key Laboratory of Crop Stress Biology for Arid Areas/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, China.
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
- Center for Plant Systems Biology, VIB, Ghent, Belgium.
| | - Arthur Zwaenepoel
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Jia-Yu Xue
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing, China
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
| | - Shu-Min Kao
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Zhen Li
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - M Eric Schranz
- Biosystematics Group, Wageningen University and Research, Wageningen, The Netherlands
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
- Center for Plant Systems Biology, VIB, Ghent, Belgium.
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing, China.
- Center for Microbial Ecology and Genomics, Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa.
| |
Collapse
|