1
Hu L, Li X, Li C, Wang L, Han L, Ni W, Zhou P, Hu S. Characterization of a novel multifunctional glycoside hydrolase family in the metagenome-assembled genomes of horse gut. Gene 2024; 927:148758. [PMID: 38977109 DOI: 10.1016/j.gene.2024.148758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/19/2024] [Revised: 05/29/2024] [Accepted: 07/05/2024] [Indexed: 07/10/2024]
Abstract
The gut microbiota is a treasure trove of carbohydrate-active enzymes (CAZymes). To explore novel and efficient CAZymes, we analyzed 4,142 metagenome-assembled genomes (MAGs) of the horse gut microbiota and found that the MAG117.bin13 genome (Bacteroides fragilis) contains the highest number of polysaccharide utilisation loci (PULs), indicating a high capacity for carbohydrate degradation. Bioinformatics analysis indicates that the PUL regions of the MAG117.bin13 genome encode many hypothetical proteins, which are important sources for exploring novel CAZymes. Among them, we discovered a hypothetical protein (595 amino acids) that exhibits potential CAZyme activity but has low similarity to known CAZymes; we named it BfLac2275. We purified the protein using prokaryotic expression and studied its enzymatic function. Substrate hydrolysis experiments showed that BfLac2275 can degrade α-lactose (156.94 U/mg), maltose (92.59 U/mg), raffinose (86.81 U/mg), and hyaluronic acid (5.71 U/mg). Enzyme activity is optimal at pH 5.0 and 30 °C, indicating that the hypothetical protein BfLac2275 is a novel, multifunctional CAZyme among the glycoside hydrolases (GHs). These properties indicate that BfLac2275 has broad application prospects in fields such as plant polysaccharide decomposition, the food industry, animal feed additives, and enzyme preparations. This study not only serves as a reference for exploring novel CAZymes encoded by gut microbiota but also provides an example for the functional annotation of hypothetical genes in metagenome-assembled genomes.
Affiliation(s)
- Lingling Hu
  - College of Life Sciences, Shihezi University, Shihezi, Xinjiang 832003, China
- Xiaoyue Li
  - College of Life Sciences, Shihezi University, Shihezi, Xinjiang 832003, China
- Cunyuan Li
  - College of Life Sciences, Shihezi University, Shihezi, Xinjiang 832003, China
- Limin Wang
  - State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural and Reclamation Science, Xinjiang 832003, China
- Lin Han
  - College of Life Sciences, Shihezi University, Shihezi, Xinjiang 832003, China
- Wei Ni
  - College of Life Sciences, Shihezi University, Shihezi, Xinjiang 832003, China
- Ping Zhou
  - State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural and Reclamation Science, Xinjiang 832003, China
- Shengwei Hu
  - College of Life Sciences, Shihezi University, Shihezi, Xinjiang 832003, China
2
Han Y, Xie Y, Hao Z, Mao J, Wang X, Chang Y, Tian Y. The Mitochondrial Genome of Ylistrum japonicum (Bivalvia, Pectinidae) and Its Phylogenetic Analysis. Int J Mol Sci 2024; 25:8755. [PMID: 39201441 PMCID: PMC11354973 DOI: 10.3390/ijms25168755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/03/2024] [Revised: 08/06/2024] [Accepted: 08/09/2024] [Indexed: 09/02/2024] Open
Abstract
Ylistrum japonicum is a commercially valuable scallop known for its long-distance swimming abilities. Despite its economic importance, genetic and genomic research on this species is limited. This study presents the first complete mitochondrial genome of Y. japonicum. The mitochondrial genome is 19,475 bp long and encompasses 13 protein-coding genes, three ribosomal RNA genes, and 23 transfer RNA genes. Two distinct phylogenetic analyses were used to explore the phylogenetic position of Y. japonicum within the family Pectinidae: a mitochondrial analysis of 15 Pectinidae species plus additional outgroup taxa, and a single-gene analysis based on 16S rRNA. The two resulting trees provide clearer insight into the evolutionary placement of Y. japonicum within the family. Our analysis reveals that Ylistrum is a basal lineage to the Pectininae clade, distinct from its previously assigned tribe, Amusiini. This study offers critical insights into the genetic makeup and evolutionary history of Y. japonicum, enhancing our knowledge of this economically vital species.
Affiliation(s)
- Yaqing Chang
  - Key Laboratory of Mariculture & Stock Enhancement in North China Sea, Ministry of Agriculture and Rural Affairs, Dalian Ocean University, Dalian 116023, China; (Y.H.); (Y.X.); (Z.H.); (J.M.); (X.W.)
- Ying Tian
  - Key Laboratory of Mariculture & Stock Enhancement in North China Sea, Ministry of Agriculture and Rural Affairs, Dalian Ocean University, Dalian 116023, China; (Y.H.); (Y.X.); (Z.H.); (J.M.); (X.W.)
3
Li T, Zhong S, Wen B, Jiang L, Zhang M. Comparative Analysis of Complete Mitochondrial Genomes of Five Chromodorididae Species (Nudibranchia:Doridina). Biochem Genet 2024:10.1007/s10528-024-10878-3. [PMID: 38954214 DOI: 10.1007/s10528-024-10878-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/13/2024] [Accepted: 06/24/2024] [Indexed: 07/04/2024]
Abstract
The mitochondrial genome is an important molecular marker for exploring the phylogenetic relationships of species and revealing molecular evolution. In the present study, 5 mitogenomes of Chromodorididae (Chromodoris lochi, Chromodoris colemani, Chromodoris elisabethina, Chromodoris annae and Hypselodoris whitei) were systematically investigated. The lengths of the mitogenome sequences were 14248 bp, 14257 bp, 14252 bp, 14254 bp and 14856 bp, respectively. Most protein-coding genes (PCGs) were initiated with the common ATG codon and terminated with TAA or TAG. We calculated Ka/Ks values for all 13 PCGs of the Chromodorididae species; all ratios were less than 1, indicating purifying selection. Phylogenetic relationships were reconstructed by Bayesian inference (BI) and maximum likelihood (ML) methods based on the complete mitogenomes of 50 species, primarily from the family Chromodorididae (Doridina), and 2 outgroups. This phylogenetic tree provides an additional reference for the classification of the suborder Doridina. Gene rearrangement analysis suggested a more conserved gene order in the superfamily Chromodoridoidea. These results and the newly sequenced mitogenomes will contribute to a better understanding of Chromodorididae and provide a reference for further phylogenetic studies.
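As a rough illustration of how Ka/Ks ratios such as those mentioned in this abstract are conventionally read, the sketch below classifies a ratio as purifying, neutral, or positive selection. The function name, the tolerance around 1, and the example values are hypothetical and are not taken from the paper.

```python
def classify_selection(ka: float, ks: float, tol: float = 0.05) -> str:
    """Interpret a Ka/Ks ratio under the usual convention.

    ka:  nonsynonymous substitutions per nonsynonymous site
    ks:  synonymous substitutions per synonymous site
    tol: hypothetical tolerance for calling a ratio 'near 1'
    """
    if ks <= 0:
        raise ValueError("Ks must be positive to form a ratio")
    omega = ka / ks
    if abs(omega - 1.0) <= tol:
        return "neutral"
    return "purifying" if omega < 1.0 else "positive"


# Made-up per-gene (Ka, Ks) values, for illustration only.
example = {"cox1": (0.01, 0.50), "nad6": (0.12, 0.40)}
labels = {gene: classify_selection(ka, ks) for gene, (ka, ks) in example.items()}
print(labels)
```

A ratio below 1 for every gene, as the abstract reports, would put every gene in the "purifying" class under this convention.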
Affiliation(s)
- Tianyu Li
  - National Engineering Research Center of Marine Facilities Aquaculture, College of Fisheries, Zhejiang Ocean University, Zhoushan, 316022, China
- Shengping Zhong
  - Guangxi Key Laboratory of Marine Drugs, Institute of Marine Drugs, Guangxi University of Chinese Medicine, Nanning, 530200, China
- Beihua Wen
  - School of Marine Sciences, School of Resources, Environment and Materials, Guangxi University, Nanning, 530004, China
- Lihua Jiang
  - National Engineering Research Center of Marine Facilities Aquaculture, College of Fisheries, Zhejiang Ocean University, Zhoushan, 316022, China
- Man Zhang
  - School of Marine Sciences, School of Resources, Environment and Materials, Guangxi University, Nanning, 530004, China
4
Yan Y, da Fonseca RR, Rahbek C, Borregaard MK, Davis CC. A new nuclear phylogeny of the tea family (Theaceae) unravels rapid radiations in genus Camellia. Mol Phylogenet Evol 2024; 196:108089. [PMID: 38679302 DOI: 10.1016/j.ympev.2024.108089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/23/2023] [Revised: 03/08/2024] [Accepted: 04/25/2024] [Indexed: 05/01/2024]
Abstract
Molecular analyses of rapidly radiating groups often reveal incongruence between gene trees. This mainly results from incomplete lineage sorting, introgression, and gene tree estimation error, which complicate the estimation of phylogenetic relationships. In this study, we reconstruct the phylogeny of Theaceae using 348 nuclear loci from 68 individuals and two outgroup taxa. Sequence data were obtained by target enrichment using the recently released Angiosperm 353 universal probe set applied to herbarium specimens. The robustness of the topologies to variation in data quality was established under a range of different filtering schemes, using both coalescent and concatenation approaches. Our results confirmed most of the previously hypothesized relationships among tribes and genera, while clarifying additional interspecific relationships within the rapidly radiating genus Camellia. We recovered a remarkably high degree of gene tree heterogeneity indicative of rapid radiation in the group and observed cytonuclear conflicts, especially within Camellia. This was especially pronounced around short branches, which we primarily associate with gene tree estimation error. Our analysis also indicates that incomplete lineage sorting (ILS) contributed to gene-tree conflicts and accounted for approximately 14 % of the explained variation, whereas inferred introgression levels were low. Our study advances the understanding of the evolution of this important plant family and provides guidance on the application of target capture methods and the evaluation of key processes that influence phylogenetic discordances.
Affiliation(s)
- Yujing Yan
  - Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Department of Organismic and Evolutionary Biology, Harvard University Herbaria, 22 Divinity Ave, Cambridge, MA 02138, USA
- Rute R da Fonseca
  - Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
- Carsten Rahbek
  - Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Department of Life Sciences, Imperial College London, Silkwood Park campus, Ascot SL5 7PY, UK; Danish Institute for Advanced Study, University of Southern Denmark, 5230 Odense M, Denmark
- Michael K Borregaard
  - Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
- Charles C Davis
  - Department of Organismic and Evolutionary Biology, Harvard University Herbaria, 22 Divinity Ave, Cambridge, MA 02138, USA
5
Gupta A, Mirarab S, Turakhia Y. Accurate, scalable, and fully automated inference of species trees from raw genome assemblies using ROADIES. bioRxiv 2024:2024.05.27.596098. [PMID: 38854139 PMCID: PMC11160643 DOI: 10.1101/2024.05.27.596098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/11/2024]
Abstract
Inference of species trees plays a crucial role in advancing our understanding of evolutionary relationships and has immense significance for diverse biological and medical applications. Extensive genome sequencing efforts are currently in progress across a broad spectrum of life forms, holding the potential to unravel the intricate branching patterns within the tree of life. However, estimating species trees starting from raw genome sequences is quite challenging, and the current cutting-edge methodologies require a series of error-prone steps that are neither entirely automated nor standardized. In this paper, we present ROADIES, a novel pipeline for species tree inference from raw genome assemblies that is fully automated, easy to use, scalable, free from reference bias, and provides flexibility to adjust the tradeoff between accuracy and runtime. The ROADIES pipeline eliminates the need to align whole genomes, choose a single reference species, or pre-select loci such as functional genes found using cumbersome annotation steps. Moreover, it leverages recent advances in phylogenetic inference to allow multi-copy genes, eliminating the need to detect orthology. Using the genomic datasets released from large-scale sequencing consortia across three diverse life forms (placental mammals, pomace flies, and birds), we show that ROADIES infers species trees that are comparable in quality with the state-of-the-art approaches but in a fraction of the time. By incorporating optimal approaches and automating all steps from assembled genomes to species and gene trees, ROADIES is poised to improve the accuracy, scalability, and reproducibility of phylogenomic analyses.
Affiliation(s)
- Anshu Gupta
  - Department of Computer Science and Engineering, University of California, San Diego; San Diego, CA 92093, USA
- Siavash Mirarab
  - Department of Electrical and Computer Engineering, University of California, San Diego; San Diego, CA 92093, USA
- Yatish Turakhia
  - Department of Electrical and Computer Engineering, University of California, San Diego; San Diego, CA 92093, USA
6
Zou Y, Zhang Z, Zeng Y, Hu H, Hao Y, Huang S, Li B. Common Methods for Phylogenetic Tree Construction and Their Implementation in R. Bioengineering (Basel) 2024; 11:480. [PMID: 38790347 PMCID: PMC11117635 DOI: 10.3390/bioengineering11050480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/04/2024] [Revised: 05/04/2024] [Accepted: 05/07/2024] [Indexed: 05/26/2024] Open
Abstract
Phylogenetic trees reflect the evolutionary relationships between species or gene families and play a critical role in modern biological research. In this review, we summarize common methods for constructing phylogenetic trees, including distance methods, maximum parsimony, maximum likelihood, Bayesian inference, and tree-integration methods (supermatrix and supertree). We discuss the advantages, shortcomings, and applications of each method and offer relevant code for constructing phylogenetic trees from molecular data using packages and algorithms in R. This review aims to provide comprehensive guidance and reference for researchers seeking to construct phylogenetic trees while also promoting further development and innovation in this field. By offering a clear and concise overview of the different methods available, we hope to enable researchers to select the most appropriate approach for their specific research questions and datasets.
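To give a flavour of the first step in the distance methods this review covers, here is a minimal sketch that computes pairwise p-distances between aligned sequences and applies the Jukes-Cantor (JC69) correction. It is written in Python rather than R, it is not code from the review, and the function names and toy sequences are assumptions made for illustration.

```python
import math
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    # Skip alignment columns where either sequence has a gap.
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def jc69_distance(p: float) -> float:
    """Jukes-Cantor correction: d = -3/4 * ln(1 - 4p/3)."""
    if p >= 0.75:
        raise ValueError("p-distance too large for the JC69 correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def distance_matrix(seqs: dict) -> dict:
    """Pairwise JC69 distances keyed by (name_i, name_j)."""
    return {
        (i, j): jc69_distance(p_distance(seqs[i], seqs[j]))
        for i, j in combinations(sorted(seqs), 2)
    }


# Toy alignment, for illustration only.
toy = {"A": "ACGTACGTAC", "B": "ACGTACGTAT", "C": "ACGAACGTTT"}
for pair, d in distance_matrix(toy).items():
    print(pair, round(d, 4))
```

A matrix like this would then be handed to a clustering step such as neighbor joining or UPGMA, which this sketch leaves out.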
Affiliation(s)
- Yue Zou
  - College of Life Sciences, Chongqing Normal University, Chongqing 401331, China; (Y.Z.); (Z.Z.); (Y.Z.); (H.H.); (Y.H.)
- Zixuan Zhang
  - College of Life Sciences, Chongqing Normal University, Chongqing 401331, China; (Y.Z.); (Z.Z.); (Y.Z.); (H.H.); (Y.H.)
- Yujie Zeng
  - College of Life Sciences, Chongqing Normal University, Chongqing 401331, China; (Y.Z.); (Z.Z.); (Y.Z.); (H.H.); (Y.H.)
- Hanyue Hu
  - College of Life Sciences, Chongqing Normal University, Chongqing 401331, China; (Y.Z.); (Z.Z.); (Y.Z.); (H.H.); (Y.H.)
- Youjin Hao
  - College of Life Sciences, Chongqing Normal University, Chongqing 401331, China; (Y.Z.); (Z.Z.); (Y.Z.); (H.H.); (Y.H.)
- Sheng Huang
  - Animal Nutrition Institute, Chongqing Academy of Animal Science, Chongqing 402460, China
- Bo Li
  - College of Life Sciences, Chongqing Normal University, Chongqing 401331, China; (Y.Z.); (Z.Z.); (Y.Z.); (H.H.); (Y.H.)
7
Wang WQ, Liu XF, Zhu YJ, Zhu JZ, Liu C, Wang ZY, Shen XX, Allan AC, Yin XR. Identification of miRNA858 long-loop precursors in seed plants. Plant Cell 2024; 36:1637-1654. [PMID: 38114096 PMCID: PMC11062470 DOI: 10.1093/plcell/koad315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/23/2023] [Revised: 11/20/2023] [Accepted: 12/11/2023] [Indexed: 12/21/2023]
Abstract
MicroRNAs (miRNAs) are a class of nonprotein-coding short transcripts that provide a layer of post-transcriptional regulation essential to many plant biological processes. MiR858, which targets the transcripts of MYB transcription factors, can affect a range of secondary metabolic processes. Although miR858 and its 187-nt precursor have been well studied in Arabidopsis (Arabidopsis thaliana), a systematic investigation of miR858 precursors and their functions across plant species is lacking due to a problem in identifying the transcripts that generate this subclass. By re-evaluating the transcript of miR858 and relaxing the length cut-off for identifying hairpins, we found in kiwifruit (Actinidia chinensis) that miR858 has long-loop hairpins (1,100 to 2,100 nt), whose intervening sequences between miRNA generating complementary sites were longer than all previously reported miRNA hairpins. Importantly, these precursors of miR858 containing long-loop hairpins (termed MIR858L) are widespread in seed plants including Arabidopsis, varying between 350 and 5,500 nt. Moreover, we showed that MIR858L has a greater impact on proanthocyanidin and flavonol levels in both Arabidopsis and kiwifruit. We suggest that an active MIR858L-MYB regulatory module appeared in the transition of early land plants to large upright flowering plants, making a key contribution to plant secondary metabolism.
Affiliation(s)
- Wen-qiu Wang
  - College of Agriculture and Biotechnology, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
  - School of Horticulture, Anhui Agricultural University, Hefei 230036, China
- Xiao-fen Liu
  - College of Agriculture and Biotechnology, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
- Yong-jing Zhu
  - College of Agriculture and Biotechnology, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
- Jia-zhen Zhu
  - College of Agriculture and Biotechnology, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
  - The New Zealand Institute for Plant & Food Research Limited (Plant & Food Research), Mt Albert, Private Bag 92169, Auckland Mail Centre, Auckland 1142, New Zealand
- Chao Liu
  - College of Agriculture and Biotechnology, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
- Zhi-ye Wang
  - State Key Laboratory of Plant Physiology and Biochemistry, College of Life Science, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
- Xing-Xing Shen
  - College of Agriculture and Biotechnology, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
- Andrew C Allan
  - The New Zealand Institute for Plant & Food Research Limited (Plant & Food Research), Mt Albert, Private Bag 92169, Auckland Mail Centre, Auckland 1142, New Zealand
  - School of Biological Sciences, University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
- Xue-ren Yin
  - College of Agriculture and Biotechnology, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
8
Yap-Chiongco MK, Bergmeier FS, Roberts NG, Jörger KM, Kocot KM. Phylogenomic reconstruction of Solenogastres (Mollusca, Aplacophora) informs hypotheses on body size evolution. Mol Phylogenet Evol 2024; 194:108029. [PMID: 38341006 DOI: 10.1016/j.ympev.2024.108029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/18/2023] [Revised: 01/24/2024] [Accepted: 02/05/2024] [Indexed: 02/12/2024]
Abstract
Body size is a fundamental characteristic of animals that impacts every aspect of their biology from anatomical complexity to ecology. In Mollusca, Solenogastres has been considered important to understanding the group's early evolution as most morphology-based phylogenetic reconstructions placed it as an early branching molluscan lineage. Under this scenario, molluscs were thought to have evolved from a small, turbellarian-like ancestor and small (i.e., macrofaunal) body size was inferred to be plesiomorphic for Solenogastres. More recently, phylogenomic studies have shown that aplacophorans (Solenogastres + Caudofoveata) form a clade with chitons (Polyplacophora), which is sister to all other molluscs, suggesting a relatively large-bodied (i.e., megafaunal) ancestor for Mollusca. Meanwhile, recent investigations into aplacophoran phylogeny have called into question the assumption that the last common ancestor of Solenogastres was small-bodied, but sampling of meiofaunal species was limited, biasing these studies towards large-bodied taxa and leaving fundamental questions about solenogaster body size evolution unanswered. Here, we supplemented available data with transcriptomes from eight diverse meiofaunal species of Solenogastres and conducted phylogenomic analyses on datasets of up to 949 genes. Maximum likelihood analyses support the meiofaunal family Meiomeniidae as the sister group to all other solenogasters, congruent with earlier ideas of a small-bodied ancestor of Solenogastres. In contrast, Bayesian inference analyses support the large-bodied family Amphimeniidae as the sister group to all other solenogasters. Investigation of phylogenetic signal by comparing site-wise likelihood scores for the two competing hypotheses supports the Meiomeniidae-first topology. In light of these results, we performed ancestral character state reconstruction to explore the implications of both hypotheses for our understanding of solenogaster evolution and review previous hypotheses about body size evolution and its potential consequences for solenogaster biology. Both hypotheses imply that body size evolution has been highly dynamic over the course of solenogaster evolution and that their relatively static body plan has successfully allowed for evolutionary transitions between meio-, macro- and megafaunal size ranges.
Affiliation(s)
- Nickellaus G Roberts
  - Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL, USA
- Katharina M Jörger
  - SNSB-Bavarian State Collection for Zoology, Section Mollusca, Munich, Germany
- Kevin M Kocot
  - Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL, USA; Alabama Museum of Natural History, The University of Alabama, Tuscaloosa, AL, USA
9
Cyriac VP, Mohan AV, Dinesh KP, Torsekar V, Jayarajan A, Swamy P, Vijayakumar SP, Shanker K. Diversifying in the mountains: spatiotemporal diversification of frogs in the Western Ghats biodiversity hotspot. Evolution 2024; 78:701-715. [PMID: 38252792 DOI: 10.1093/evolut/qpae006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/21/2023] [Revised: 01/02/2024] [Accepted: 01/18/2024] [Indexed: 01/24/2024]
Abstract
Mountain ranges are hotspots of biodiversity. However, the mechanisms that generate biodiversity patterns in different mountainous regions and taxa are not apparent. The Western Ghats (WG) escarpment in India is a globally recognized biodiversity hotspot with high species richness and endemism. Most studies have either invoked paleoclimatic conditions or climatic stability in the southern WG refugium to explain this high diversity and endemism. However, the factors driving macroevolutionary change remain unexplored for most taxa. Here, we generated the most comprehensive dated phylogeny to date for ranoid frogs in the WG and tested the role of paleoclimatic events or climatic stability in influencing frog diversification. We found that the diversity of different ranoid frog clades in the WG either accumulated at a constant rate through time or underwent a decrease in speciation rates around 3-2.5 Ma during the Pleistocene glaciation cycles. We also find no significant difference in diversification rate estimates across elevational gradients and the three broad biogeographic zones in the WG (northern, central, and southern WG). However, time-for-speciation explained regional species richness within clades, wherein older lineages have more extant species diversity. Overall, we find that global paleoclimatic events have had little impact on WG frog diversification throughout most of its early history until the Quaternary and that the WG may have been climatically stable allowing lineages to accumulate and persist over evolutionary time.
Affiliation(s)
- Vivek Philip Cyriac
  - Centre for Ecological Sciences, Indian Institute of Science, Bengaluru, Karnataka, India
- K P Dinesh
  - Zoological Survey of India, Western Regional Centre, Pune, Maharashtra, India
- Varun Torsekar
  - Centre for Ecological Sciences, Indian Institute of Science, Bengaluru, Karnataka, India
- Aditi Jayarajan
  - Centre for Ecological Sciences, Indian Institute of Science, Bengaluru, Karnataka, India
  - Department of Biology, Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
- Priyanka Swamy
  - Centre for Ecological Sciences, Indian Institute of Science, Bengaluru, Karnataka, India
  - Department of Studies in Zoology, University of Mysore, Manasagangothri, Mysuru, Karnataka, India
- S P Vijayakumar
  - Centre for Ecological Sciences, Indian Institute of Science, Bengaluru, Karnataka, India
- Kartik Shanker
  - Centre for Ecological Sciences, Indian Institute of Science, Bengaluru, Karnataka, India
10
Steinberg PL, Liu LY, Neiman-Golden A, Patel Y, Boutros PC. Quantifying the seed sensitivity of cancer subclonal reconstruction algorithms. bioRxiv 2024:2024.02.05.579021. [PMID: 38370678 PMCID: PMC10871259 DOI: 10.1101/2024.02.05.579021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/20/2024]
Abstract
Background: Intra-tumoural heterogeneity complicates cancer prognosis and impairs treatment success. One of the ways subclonal reconstruction (SRC) quantifies intra-tumoural heterogeneity is by estimating the number of subclones present in bulk DNA sequencing data. SRC algorithms are probabilistic and need to be initialized by a random seed. However, the seeds used in bioinformatics algorithms are rarely reported in the literature. Thus, the impact of the initializing seed on SRC solutions has not been studied. To address this gap, we generated a set of ten random seeds to systematically benchmark the seed sensitivity of three probabilistic SRC algorithms: PyClone-VI, DPClust, and PhyloWGS. Results: We characterized the seed sensitivity of three algorithms across fourteen whole-genome sequences of head and neck squamous cell carcinoma and nine SRC pipelines, each composed of a single nucleotide variant caller, a copy number aberration caller and an SRC algorithm. This led to a total of 1470 subclonal reconstructions, including 1260 single-region and 210 multi-region reconstructions. The number of subclones estimated per patient varies across SRC pipelines, but all three SRC algorithms show substantial seed sensitivity: subclone estimates vary across different seeds for the same set of input using the same SRC algorithm. No seed consistently estimated the mode number of subclones across all patients for any SRC algorithm. Conclusions: These findings highlight the variability in quantifying intra-tumoural heterogeneity introduced by the seed sensitivity of probabilistic SRC algorithms. We recommend that authors, reviewers and editors adopt guidelines to both report and randomize seed choices. It may also be valuable to consider seed sensitivity in the benchmarking of newly developed SRC algorithms. These findings may be of interest in other areas of bioinformatics where seeded probabilistic algorithms are used and suggest consideration of formal seed reporting standards to enhance reproducibility.
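The kind of seed-sensitivity check this abstract describes can be sketched on a toy problem. The example below does not use PyClone-VI, DPClust, or PhyloWGS; it substitutes a small randomly initialised 1-D k-means with a penalised fit score as a stand-in estimator, runs it once per seed, and reports the modal estimate and how many seeds agree with it. All function names, the penalty, and the simulated allele frequencies are assumptions made for illustration.

```python
import math
import random
from collections import Counter


def kmeans_1d(values, k, rng, iters=20):
    """Plain 1-D k-means with random initial centres; returns the final SSE."""
    centres = rng.sample(values, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda c: abs(v - centres[c]))
            groups[nearest].append(v)
        # Keep the old centre if a cluster ends up empty.
        centres = [sum(g) / len(g) if g else centres[i] for i, g in enumerate(groups)]
    return sum(min((v - c) ** 2 for c in centres) for v in values)


def estimate_clusters(values, seed, max_k=4, penalty=2.0):
    """Toy stand-in for an SRC algorithm: pick k by a penalised fit score."""
    rng = random.Random(seed)  # all randomness flows from the reported seed
    n = len(values)
    scores = {}
    for k in range(1, max_k + 1):
        sse = kmeans_1d(values, k, rng)
        scores[k] = n * math.log(sse / n + 1e-9) + penalty * k * math.log(n)
    return min(scores, key=scores.get)


def seed_sensitivity(values, seeds):
    """Run the estimator once per seed and summarise the spread of answers."""
    estimates = {s: estimate_clusters(values, s) for s in seeds}
    counts = Counter(estimates.values())
    mode_k, mode_n = counts.most_common(1)[0]
    return estimates, mode_k, mode_n / len(estimates)


# Simulated variant allele frequencies, not real patient data.
data_rng = random.Random(0)
vafs = [data_rng.gauss(m, 0.04) for m in (0.15, 0.30, 0.45) for _ in range(30)]
estimates, mode_k, agreement = seed_sensitivity(vafs, seeds=range(1, 11))
print(estimates, mode_k, agreement)
```

Recording the seed alongside each estimate, as the returned dictionary does, is the reporting practice the authors recommend; an agreement fraction below 1 would indicate that the answer depends on the seed.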
Affiliation(s)
- Philippa L. Steinberg
  - Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, 90095, USA
  - Jonsson Comprehensive Cancer Centre, University of California, Los Angeles, Los Angeles, CA, 90024, USA
  - Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Lydia Y. Liu
  - Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, 90095, USA
  - Jonsson Comprehensive Cancer Centre, University of California, Los Angeles, Los Angeles, CA, 90024, USA
  - Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA, 90095, USA
  - Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 1L7, Canada
  - Princess Margaret Cancer Centre, University Health Network, Toronto, ON, M5G 2C1, Canada
- Anna Neiman-Golden
  - Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, 90095, USA
  - Jonsson Comprehensive Cancer Centre, University of California, Los Angeles, Los Angeles, CA, 90024, USA
  - Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Yash Patel
  - Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, 90095, USA
  - Jonsson Comprehensive Cancer Centre, University of California, Los Angeles, Los Angeles, CA, 90024, USA
  - Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Paul C. Boutros
  - Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, 90095, USA
  - Jonsson Comprehensive Cancer Centre, University of California, Los Angeles, Los Angeles, CA, 90024, USA
  - Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA, 90095, USA
  - Department of Urology, University of California, Los Angeles, Los Angeles, CA, 90095, USA
11
Henríquez-Piskulich P, Hugall AF, Stuart-Fox D. A supermatrix phylogeny of the world's bees (Hymenoptera: Anthophila). Mol Phylogenet Evol 2024; 190:107963. [PMID: 37967640 DOI: 10.1016/j.ympev.2023.107963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/26/2023] [Revised: 10/28/2023] [Accepted: 11/04/2023] [Indexed: 11/17/2023]
Abstract
The increasing availability of large molecular phylogenies has provided new opportunities to study the evolution of species traits, their origins and diversification, and biogeography; yet there are limited attempts to synthesise existing phylogenetic information for major insect groups. Bees (Hymenoptera: Anthophila) are a large group of insect pollinators that have a worldwide distribution, and a wide variation in ecology, morphology, and life-history traits, including sociality. For these reasons, as well as their major economic importance as pollinators, numerous molecular phylogenetic studies of family and genus-level relationships have been published, providing an opportunity to assemble a bee 'tree-of-life'. We used publicly available genetic sequence data, including phylogenomic data, reconciled to a taxonomic database, to produce a concatenated supermatrix phylogeny for the Anthophila comprising 4,586 bee species, representing 23% of species and 82% of genera. At family, subfamily, and tribe levels, support for expected relationships was robust, but between and within some genera relationships remain uncertain. Within families, sampling of genera ranged from 67 to 100% but species coverage was lower (17-41%). Our phylogeny mostly reproduces the relationships found in recent phylogenomic studies with a few exceptions. We provide a summary of these differences and the current state of molecular data available and its gaps. We discuss the advantages and limitations of this bee supermatrix phylogeny (available online at beetreeoflife.org), which may enable new insights into long standing questions about evolutionary drivers in bees, and potentially insects more generally.
Affiliation(s)
- Andrew F Hugall
  - School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
  - Department of Sciences, Museums Victoria, Melbourne, Victoria, Australia
- Devi Stuart-Fox
  - School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia

12
Steenwyk JL, Li Y, Zhou X, Shen XX, Rokas A. Incongruence in the phylogenomics era. Nat Rev Genet 2023; 24:834-850. [PMID: 37369847 DOI: 10.1038/s41576-023-00620-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Accepted: 05/19/2023] [Indexed: 06/29/2023]
Abstract
Genome-scale data and the development of novel statistical phylogenetic approaches have greatly aided the reconstruction of a broad sketch of the tree of life and resolved many of its branches. However, incongruence - the inference of conflicting evolutionary histories - remains pervasive in phylogenomic data, hampering our ability to reconstruct and interpret the tree of life. Biological factors, such as incomplete lineage sorting, horizontal gene transfer, hybridization, introgression, recombination and convergent molecular evolution, can lead to gene phylogenies that differ from the species tree. In addition, analytical factors, including stochastic, systematic and treatment errors, can drive incongruence. Here, we review these factors, discuss methodological advances to identify and handle incongruence, and highlight avenues for future research.
Affiliation(s)
- Jacob L Steenwyk
  - Howard Hughes Medical Institute and the Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
  - Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
  - Vanderbilt Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN, USA
- Yuanning Li
  - Institute of Marine Science and Technology, Shandong University, Qingdao, China
- Xiaofan Zhou
  - Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, China
- Xing-Xing Shen
  - Key Laboratory of Biology of Crop Pathogens and Insects of Zhejiang Province, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
- Antonis Rokas
  - Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
  - Vanderbilt Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN, USA
  - Heidelberg Institute for Theoretical Studies, Heidelberg, Germany

13
Diao R, Zhao M, Liu Y, Zhang Z, Zhong B. The advantages of crosstalk during the evolution of the BZR1-ARF6-PIF4 (BAP) module. Journal of Integrative Plant Biology 2023; 65:2631-2644. [PMID: 37552560 DOI: 10.1111/jipb.13554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/16/2023] [Accepted: 08/07/2023] [Indexed: 08/10/2023]
Abstract
The BAP module, comprising BRASSINAZOLE RESISTANT 1 (BZR1), AUXIN RESPONSE FACTOR 6 (ARF6), and PHYTOCHROME-INTERACTING FACTOR 4 (PIF4), functions as a molecular hub to orchestrate plant growth and development. In Arabidopsis thaliana, components of the BAP module physically interact to form a complex system that integrates light, brassinosteroid (BR), and auxin signals. Little is known about the origin and evolution of the BAP module. Here, we conducted comparative genomic and transcriptomic analyses to investigate the evolution and functional diversification of the BAP module. Our results suggest that the BAP module originated in land plants and that the ζ, ε, and γ whole-genome duplication/triplication events contributed to the expansion of BAP module components in seed plants. Comparative transcriptomic analysis suggested that the prototype BAP module arose in Marchantia polymorpha, experienced stepwise evolution, and became established as a mature regulatory system in seed plants. We developed a formula to calculate the signal transduction productivity of the BAP module and demonstrate that more crosstalk among components enables higher signal transduction efficiency. Our results reveal the evolutionary history of the BAP module and provide insights into the evolution of plant signaling networks and the strategies employed by plants to integrate environmental and endogenous signals.
Affiliation(s)
- Runjie Diao
  - College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Mengru Zhao
  - College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Yannan Liu
  - College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Zhenhua Zhang
  - College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Bojian Zhong
  - College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China

14
Wang Z, Sun J, Gao Y, Xue Y, Zhang Y, Li K, Zhang W, Zhang C, Zu J, Zhang L. Fusang: a framework for phylogenetic tree inference via deep learning. Nucleic Acids Res 2023; 51:10909-10923. [PMID: 37819036 PMCID: PMC10639059 DOI: 10.1093/nar/gkad805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/10/2023] [Revised: 08/17/2023] [Accepted: 09/20/2023] [Indexed: 10/13/2023] Open
Abstract
Phylogenetic tree inference is a classic fundamental task in evolutionary biology that entails inferring the evolutionary relationship of targets based on multiple sequence alignment (MSA). Maximum likelihood (ML) and Bayesian inference (BI) methods have dominated phylogenetic tree inference for many years, but BI is too slow to handle a large number of sequences. Recently, deep learning (DL) has been successfully applied to quartet phylogenetic tree inference and tentatively extended into more sequences with the quartet puzzling algorithm. However, no DL-based tools are immediately available for practical real-world applications. In this paper, we propose Fusang (http://fusang.cibr.ac.cn), a DL-based framework that achieves comparable performance to that of ML-based tools with both simulated and real datasets. More importantly, with continuous optimization, e.g. through the use of customized training datasets for real-world scenarios, Fusang has great potential to outperform ML-based tools.
Affiliation(s)
- Zhicheng Wang
  - Chinese Institute for Brain Research, Beijing 102206, China
  - Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Jinnan Sun
  - School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China
- Yuan Gao
  - Chinese Institute for Brain Research, Beijing 102206, China
- Yongwei Xue
  - Chinese Institute for Brain Research, Beijing 102206, China
- Yubo Zhang
  - Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Kuan Li
  - Chinese Institute for Brain Research, Beijing 102206, China
- Wei Zhang
  - Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
  - State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing 100871, China
- Chi Zhang
  - Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Center for Excellence in Life and Paleoenvironment, Chinese Academy of Sciences, Beijing 100044, China
- Jian Zu
  - School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China
- Li Zhang
  - Chinese Institute for Brain Research, Beijing 102206, China

15
Kumar S, Tao Q, Lamarca AP, Tamura K. Computational Reproducibility of Molecular Phylogenies. Mol Biol Evol 2023; 40:msad165. [PMID: 37467477 PMCID: PMC10370456 DOI: 10.1093/molbev/msad165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/09/2022] [Revised: 07/11/2023] [Accepted: 07/12/2023] [Indexed: 07/21/2023] Open
Abstract
Repeated runs of the same program can generate different molecular phylogenies from identical data sets under the same analytical conditions. This lack of reproducibility of inferred phylogenies casts a long shadow on downstream research employing these phylogenies in areas such as comparative genomics, systematics, and functional biology. We have assessed the relative accuracies and log-likelihoods of alternative phylogenies generated for computer-simulated and empirical data sets. Our findings indicate that these alternative phylogenies reconstruct evolutionary relationships with comparable accuracy. They also have similar log-likelihoods that are not inferior to the log-likelihoods of the true tree. We determined that the direct relationship between irreproducibility and inaccuracy is due to their common dependence on the amount of phylogenetic information in the data. While computational reproducibility can be enhanced through more extensive heuristic searches for the maximum likelihood tree, this does not lead to higher accuracy. We conclude that computational irreproducibility plays a minor role in molecular phylogenetics.
Affiliation(s)
- Sudhir Kumar
  - Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA
  - Department of Biology, Temple University, Philadelphia, PA, USA
- Qiqing Tao
  - Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA
  - Department of Biology, Temple University, Philadelphia, PA, USA
- Alessandra P Lamarca
  - Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA
  - Department of Biology, Temple University, Philadelphia, PA, USA
  - Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
- Koichiro Tamura
  - Research Center for Genomics and Bioinformatics, Tokyo Metropolitan University, Hachioji, Tokyo, Japan
  - Department of Biological Sciences, Tokyo Metropolitan University, Hachioji, Tokyo, Japan

16
Li X, Ma S, Meng Y, Wei W, Peng C, Ling C, Fan S, Liu Z. Characterization of Antagonistic Bacteria Paenibacillus polymyxa ZYPP18 and the Effects on Plant Growth. Plants (Basel, Switzerland) 2023; 12:2504. [PMID: 37447065 DOI: 10.3390/plants12132504] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 05/30/2023] [Revised: 06/24/2023] [Accepted: 06/28/2023] [Indexed: 07/15/2023]
Abstract
Paenibacillus polymyxa is a plant growth-promoting rhizobacterium (PGPR) that has significant biocontrol properties. Wheat sheath blight caused by Rhizoctonia cerealis is a major soil-borne disease of wheat that causes substantial losses in wheat production, and biological control of the disease has received extensive attention. P. polymyxa ZYPP18 was identified using morphological and molecular characterization. An antagonistic activity experiment verified that ZYPP18 inhibits the growth of R. cerealis on artificial growth media. A detached leaf assay verified that ZYPP18 inhibits the expansion of wheat sheath blight on the detached leaf. ZYPP18 has been found to possess plant growth-promoting properties, as well as the ability to solubilize phosphate and generate indole-3-acetic acid. Results from hydroponic experiments showed that wheat seedlings treated with ZYPP18 grew faster. Additionally, pot experiments and field experiments demonstrated that ZYPP18 effectively controls the occurrence of wheat sheath blight. ZYPP18 reduced the incidence of wheat sheath blight in wheat seedlings by 37.37% and 37.90%, respectively. The control effect of ZYPP18 on wheat sheath blight was 56.30% and 65.57%, respectively. These findings provide evidence that P. polymyxa ZYPP18 is an effective biological factor that can control disease and promote plant growth.
Affiliation(s)
- Xiangying Li
  - College of Plant Protection, Shandong Agricultural University, Taian 271018, China
- Sujing Ma
  - College of Plant Protection, Shandong Agricultural University, Taian 271018, China
- Yuan Meng
  - College of Plant Protection, Shandong Agricultural University, Taian 271018, China
- Wei Wei
  - College of Plant Protection, Shandong Agricultural University, Taian 271018, China
- Chen Peng
  - College of Plant Protection, Shandong Agricultural University, Taian 271018, China
- Chunli Ling
  - Ecology Institute, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, China
- Susu Fan
  - Ecology Institute, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, China
- Zhenyu Liu
  - College of Plant Protection, Shandong Agricultural University, Taian 271018, China

17
Foley NM, Mason VC, Harris AJ, Bredemeyer KR, Damas J, Lewin HA, Eizirik E, Gatesy J, Karlsson EK, Lindblad-Toh K, Springer MS, Murphy WJ, Andrews G, Armstrong JC, Bianchi M, Birren BW, Bredemeyer KR, Breit AM, Christmas MJ, Clawson H, Damas J, Di Palma F, Diekhans M, Dong MX, Eizirik E, Fan K, Fanter C, Foley NM, Forsberg-Nilsson K, Garcia CJ, Gatesy J, Gazal S, Genereux DP, Goodman L, Grimshaw J, Halsey MK, Harris AJ, Hickey G, Hiller M, Hindle AG, Hubley RM, Hughes GM, Johnson J, Juan D, Kaplow IM, Karlsson EK, Keough KC, Kirilenko B, Koepfli KP, Korstian JM, Kowalczyk A, Kozyrev SV, Lawler AJ, Lawless C, Lehmann T, Levesque DL, Lewin HA, Li X, Lind A, Lindblad-Toh K, Mackay-Smith A, Marinescu VD, Marques-Bonet T, Mason VC, Meadows JRS, Meyer WK, Moore JE, Moreira LR, Moreno-Santillan DD, Morrill KM, Muntané G, Murphy WJ, Navarro A, Nweeia M, Ortmann S, Osmanski A, Paten B, Paulat NS, Pfenning AR, Phan BN, Pollard KS, Pratt HE, Ray DA, Reilly SK, Rosen JR, Ruf I, Ryan L, Ryder OA, Sabeti PC, Schäffer DE, Serres A, Shapiro B, Smit AFA, Springer M, Srinivasan C, Steiner C, Storer JM, Sullivan KAM, Sullivan PF, Sundström E, Supple MA, Swofford R, Talbot JE, Teeling E, Turner-Maier J, Valenzuela A, Wagner F, Wallerman O, Wang C, Wang J, Weng Z, Wilder AP, Wirthlin ME, Xue JR, Zhang X. A genomic timescale for placental mammal evolution. Science 2023; 380:eabl8189. [PMID: 37104581 DOI: 10.1126/science.abl8189] [Citation(s) in RCA: 35] [Impact Index Per Article: 35.0] [Indexed: 04/29/2023]
Abstract
The precise pattern and timing of speciation events that gave rise to all living placental mammals remain controversial. We provide a comprehensive phylogenetic analysis of genetic variation across an alignment of 241 placental mammal genome assemblies, addressing prior concerns regarding limited genomic sampling across species. We compared neutral genome-wide phylogenomic signals using concatenation and coalescent-based approaches, interrogated phylogenetic variation across chromosomes, and analyzed extensive catalogs of structural variants. Interordinal relationships exhibit relatively low rates of phylogenomic conflict across diverse datasets and analytical methods. Conversely, X-chromosome versus autosome conflicts characterize multiple independent clades that radiated during the Cenozoic. Genomic time trees reveal an accumulation of cladogenic events before and immediately after the Cretaceous-Paleogene (K-Pg) boundary, implying important roles for Cretaceous continental vicariance and the K-Pg extinction in the placental radiation.
Affiliation(s)
- Nicole M Foley
  - Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, USA
- Victor C Mason
  - Institute of Cell Biology, University of Bern, Bern, Switzerland
- Andrew J Harris
  - Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, USA
  - Interdisciplinary Program in Genetics and Genomics, Texas A&M University, College Station, TX, USA
- Kevin R Bredemeyer
  - Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, USA
  - Interdisciplinary Program in Genetics and Genomics, Texas A&M University, College Station, TX, USA
- Joana Damas
  - The Genome Center, University of California, Davis, CA, USA
- Harris A Lewin
  - The Genome Center, University of California, Davis, CA, USA
  - Department of Evolution and Ecology, University of California, Davis, CA, USA
- Eduardo Eizirik
  - School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, Brazil
- John Gatesy
  - Division of Vertebrate Zoology, American Museum of Natural History, New York, NY, USA
- Elinor K Karlsson
  - Program in Bioinformatics and Integrative Biology, UMass Chan Medical School, Worcester, MA 01605, USA
  - Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
  - Program in Molecular Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
- Kerstin Lindblad-Toh
  - Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
  - Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 32 Uppsala, Sweden
- Mark S Springer
  - Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA, USA
- William J Murphy
  - Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, USA
  - Interdisciplinary Program in Genetics and Genomics, Texas A&M University, College Station, TX, USA

18
In silico environmental sampling of emerging fungal pathogens via big data analysis. Fungal Ecol 2023. [DOI: 10.1016/j.funeco.2022.101212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/24/2022]

19
Postnikova OA, Irish BM, Eisenback J, Nemchinov LG. Snake River alfalfa virus, a persistent virus infecting alfalfa (Medicago sativa L.) in Washington State, USA. Virol J 2023; 20:32. [PMID: 36803436 PMCID: PMC9938972 DOI: 10.1186/s12985-023-01991-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/07/2022] [Accepted: 02/14/2023] [Indexed: 02/21/2023] Open
Abstract
Here we report an occurrence of Snake River alfalfa virus (SRAV) in Washington State, USA. SRAV was recently identified in alfalfa (Medicago sativa L.) plants and western flower thrips in south-central Idaho and proposed to be the first flavi-like virus identified in a plant host. We argue that SRAV, based on its prevalence in alfalfa plants, readily detectable dsRNA, genome structure, presence in alfalfa seeds, and seed-mediated transmission, is a persistent new virus distantly resembling members of the family Endornaviridae.
Affiliation(s)
- Olga A Postnikova
  - Molecular Plant Pathology Laboratory, USDA/ARS, Beltsville Agricultural Research Center, Beltsville, MD, USA
  - School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, USA
- Brian M Irish
  - USDA/ARS Plant Germplasm Introduction Testing and Research Unit, Prosser, WA, USA
- Jonathan Eisenback
  - School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, USA
- Lev G Nemchinov
  - Molecular Plant Pathology Laboratory, USDA/ARS, Beltsville Agricultural Research Center, Beltsville, MD, USA

20
Finding a home for the ram’s horn squid: phylogenomic analyses support Spirula spirula (Cephalopoda: Decapodiformes) as a close relative of Oegopsida. Org Divers Evol 2022. [DOI: 10.1007/s13127-022-00583-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/14/2022]

21
Pelosi JA, Kim EH, Barbazuk WB, Sessa EB. Phylotranscriptomics Illuminates the Placement of Whole Genome Duplications and Gene Retention in Ferns. Frontiers in Plant Science 2022; 13:882441. [PMID: 35909764 PMCID: PMC9330400 DOI: 10.3389/fpls.2022.882441] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 02/23/2022] [Accepted: 06/16/2022] [Indexed: 05/31/2023]
Abstract
Ferns are the second largest clade of vascular plants with over 10,000 species, yet the generation of genomic resources for the group has lagged behind other major clades of plants. Transcriptomic data have proven to be a powerful tool to assess phylogenetic relationships, using thousands of markers that are largely conserved across the genome, and without the need to sequence entire genomes. We assembled the largest nuclear phylogenetic dataset for ferns to date, including 2884 single-copy nuclear loci from 247 transcriptomes (242 ferns, five outgroups), and investigated phylogenetic relationships across the fern tree, the placement of whole genome duplications (WGDs), and gene retention patterns following WGDs. We generated a well-supported phylogeny of ferns and identified several regions of the fern phylogeny that demonstrate high levels of gene tree-species tree conflict, which largely correspond to areas of the phylogeny that have been difficult to resolve. Using a combination of approaches, we identified 27 WGDs across the phylogeny, including 18 large-scale events (involving more than one sampled taxon) and nine small-scale events (involving only one sampled taxon). Most inferred WGDs occur within single lineages (e.g., orders, families) rather than on the backbone of the phylogeny, although two inferred events are shared by leptosporangiate ferns (excluding Osmundales) and Polypodiales (excluding Lindsaeineae and Saccolomatineae), clades which correspond to the majority of fern diversity. We further examined how retained duplicates following WGDs compared across independent events and found that functions of retained genes were largely convergent, with processes involved in binding, responses to stimuli, and certain organelles over-represented in paralogs while processes involved in transport, organelles derived from endosymbiotic events, and signaling were under-represented. 
To date, our study is the most comprehensive investigation of the nuclear fern phylogeny, though several avenues for future research remain unexplored.
Affiliation(s)
- Jessie A. Pelosi
  - Department of Biology, University of Florida, Gainesville, FL, United States
- Emily H. Kim
  - Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, United States
- W. Brad Barbazuk
  - Department of Biology, University of Florida, Gainesville, FL, United States
  - Genetics Institute, University of Florida, Gainesville, FL, United States
- Emily B. Sessa
  - Department of Biology, University of Florida, Gainesville, FL, United States

22
Uribe JE, González VL, Irisarri I, Kano Y, Herbert DG, Strong EE, Harasewych MG. A phylogenomic backbone for gastropod molluscs. Syst Biol 2022; 71:1271-1280. [PMID: 35766870 DOI: 10.1093/sysbio/syac045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2021] [Revised: 06/16/2022] [Accepted: 06/24/2022] [Indexed: 11/13/2022] Open
Abstract
Gastropods have survived several mass extinctions during their evolutionary history resulting in extraordinary diversity in morphology, ecology, and developmental modes, which complicate the reconstruction of a robust phylogeny. Currently, gastropods are divided into six subclasses: Caenogastropoda, Heterobranchia, Neomphaliones, Neritimorpha, Patellogastropoda, and Vetigastropoda. Phylogenetic relationships among these taxa historically lack consensus, despite numerous efforts using morphological and molecular information. We generated sequence data for transcriptomes derived from twelve taxa belonging to clades with little or no prior representation in previous studies in order to infer the deeper cladogenetic events within Gastropoda and, for the first time, infer the position of the deep-sea Neomphaliones using a phylogenomic approach. We explored the impact of missing data, homoplasy, and compositional heterogeneity on the inferred phylogenetic hypotheses. We recovered a highly supported backbone for gastropod relationships that is congruent with morphological and mitogenomic evidence, in which Patellogastropoda, true limpets, are the sister lineage to all other gastropods (Orthogastropoda) which are divided into two main clades (i) Vetigastropoda s.l. (including Pleurotomariida + Neomphaliones) and (ii) Neritimorpha + (Caenogastropoda + Heterobranchia). As such, our results support the recognition of five subclasses (or infraclasses) in Gastropoda: Patellogastropoda, Vetigastropoda, Neritimorpha, Caenogastropoda and Heterobranchia.
Affiliation(s)
- Juan E Uribe
  - Department of Invertebrate Zoology, MRC 163, National Museum of Natural History, Smithsonian Institution, P O Box 37012 Washington, DC 20013-7012, USA
- Vanessa L González
  - Global Genome Initiative, National Museum of Natural History, Smithsonian Institution, Washington, DC 20013, USA
- Iker Irisarri
  - Department of Applied Bioinformatics, Institute for Microbiology and Genetics, University of Göttingen, and Campus Institute Data Science (CIDAS), Göttingen, Germany
  - Leibniz Institute for the Analysis of Biodiversity Change (LIB), Zoological Museum Hamburg, Martin-Luther-King-Platz 3, 20146 Hamburg, Germany
- Yasunori Kano
  - Department of Marine Ecosystems Dynamics, Atmosphere and Ocean Research Institute, The University of Tokyo, Kashiwa, Chiba, Japan
- David G Herbert
  - Department of Natural Sciences, National Museum Wales, Cathays Park, Cardiff, CF10 3NP, UK
- Ellen E Strong
  - Department of Invertebrate Zoology, MRC 163, National Museum of Natural History, Smithsonian Institution, P O Box 37012 Washington, DC 20013-7012, USA
- M G Harasewych
  - Department of Invertebrate Zoology, MRC 163, National Museum of Natural History, Smithsonian Institution, P O Box 37012 Washington, DC 20013-7012, USA

23
Li Y, Chen S, Rapakoulia T, Kuwahara H, Yip KY, Gao X. Deep learning identifies and quantifies recombination hotspot determinants. Bioinformatics 2022; 38:2683-2691. [PMID: 35561158 PMCID: PMC9113300 DOI: 10.1093/bioinformatics/btac234] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/05/2021] [Revised: 03/08/2022] [Accepted: 04/08/2022] [Indexed: 11/30/2022] Open
Abstract
MOTIVATION: Recombination is one of the essential genetic processes for sexually reproducing organisms, which can happen more frequently in some regions, called recombination hotspots. Although several factors, such as PRDM9 binding motifs, are known to be related to the hotspots, their contributions to the recombination hotspots have not been quantified, and other determinants are yet to be elucidated. Here, we propose a computational method, RHSNet, based on deep learning and signal processing, to identify and quantify the hotspot determinants in a purely data-driven manner, utilizing datasets from various studies, populations, sexes and species.
RESULTS: RHSNet can significantly outperform other sequence-based methods on multiple datasets across different species, sexes and studies. In addition to being able to identify hotspot regions and the well-known determinants accurately, more importantly, RHSNet can quantify the determinants that contribute significantly to the recombination hotspot formation in the relation between PRDM9 binding motif, histone modification and GC content. Further cross-sex, cross-population and cross-species studies suggest that the proposed method has the generalization power and potential to identify and quantify the evolutionary determinant motifs.
AVAILABILITY AND IMPLEMENTATION: https://github.com/frankchen121212/RHSNet.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Yu Li
  - To whom correspondence should be addressed
- Hiroyuki Kuwahara
  - Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia
  - KAUST Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia
- Kevin Y Yip
  - Department of Computer Science and Engineering (CSE), The Chinese University of Hong Kong (CUHK), 999077, Hong Kong SAR, China
- Xin Gao
  - To whom correspondence should be addressed

24
Sandhu M, Spence MA, Jackson CJ. Evo-velocity: Protein language modeling accelerates the study of evolution. Cell Syst 2022; 13:271-273. [PMID: 35447077 DOI: 10.1016/j.cels.2022.03.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 01/25/2023]
Abstract
Understanding how protein sequences have evolved is one of the defining challenges in modern biology. In this issue of Cell Systems, Hie et al. describe a novel phylogenetic approach, dubbed "evo-velocity," that exploits protein language modeling to overcome many limitations of traditional phylogenetic analysis.
Affiliation(s)
- Mahakaran Sandhu
  - Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
- Matthew A Spence
  - Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
- Colin J Jackson
  - Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
  - ARC Centre of Excellence for Innovations in Peptide & Protein Science, Australian National University, Canberra, ACT 2601, Australia
  - ARC Centre of Excellence in Synthetic Biology, Australian National University, Canberra, ACT 2601, Australia

25
Hübner L, Kozlov AM, Hespe D, Sanders P, Stamatakis A. Exploring parallel MPI fault tolerance mechanisms for phylogenetic inference with RAxML-NG. Bioinformatics 2021; 37:4056-4063. [PMID: 34037680 PMCID: PMC9502163 DOI: 10.1093/bioinformatics/btab399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/19/2021] [Revised: 05/10/2021] [Accepted: 05/25/2021] [Indexed: 11/18/2022] Open
Abstract
MOTIVATION: Phylogenetic trees are now routinely inferred on large scale high performance computing systems with thousands of cores as the parallel scalability of phylogenetic inference tools has improved over the past years to cope with the molecular data avalanche. Thus, the parallel fault tolerance of phylogenetic inference tools has become a relevant challenge. To this end, we explore parallel fault tolerance mechanisms and algorithms, the software modifications required and the performance penalties induced via enabling parallel fault tolerance by example of RAxML-NG, the successor of the widely used RAxML tool for maximum likelihood-based phylogenetic tree inference.
RESULTS: We find that the slowdown induced by the necessary additional recovery mechanisms in RAxML-NG is on average 1.00 ± 0.04. The overall slowdown by using these recovery mechanisms in conjunction with a fault-tolerant Message Passing Interface implementation amounts to on average 1.7 ± 0.6 for large empirical datasets. Via failure simulations, we show that RAxML-NG can successfully recover from multiple simultaneous failures, subsequent failures, failures during recovery and failures during checkpointing. Recoveries are automatic and transparent to the user.
AVAILABILITY AND IMPLEMENTATION: The modified fault-tolerant RAxML-NG code is available under GNU GPL at https://github.com/lukashuebner/ft-raxml-ng.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Lukas Hübner
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Baden-Württemberg, Germany
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Baden-Württemberg, Germany
- Alexey M Kozlov
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Baden-Württemberg, Germany
- Demian Hespe
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Baden-Württemberg, Germany
- Peter Sanders
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Baden-Württemberg, Germany
- Alexandros Stamatakis
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Baden-Württemberg, Germany
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Baden-Württemberg, Germany
26
Abstract
Identifying our most distant animal relatives has emerged as one of the most challenging problems in phylogenetics. This debate has major implications for our understanding of the origin of multicellular animals and of the earliest events in animal evolution, including the origin of the nervous system. Some analyses identify sponges as our most distant animal relatives (Porifera-sister hypothesis), and others identify comb jellies (Ctenophora-sister hypothesis). These analyses vary in many respects, making it difficult to interpret previous tests of these hypotheses. To gain insight into why different studies yield different results, an important next step in the ongoing debate, we systematically test these hypotheses by synthesizing 15 previous phylogenomic studies and performing new standardized analyses under consistent conditions with additional models. We find that Ctenophora-sister is recovered across the full range of examined conditions, and Porifera-sister is recovered in some analyses under narrow conditions when most outgroups are excluded and site-heterogeneous CAT models are used. We additionally find that the number of categories in site-heterogeneous models is sufficient to explain the Porifera-sister results. Furthermore, our cross-validation analyses show CAT models that recover Porifera-sister have hundreds of additional categories and fail to fit significantly better than site-heterogeneous models with far fewer categories. Systematic and standardized testing of diverse phylogenetic models suggests that we should be skeptical of Porifera-sister results both because they are recovered under such narrow conditions and because the models in these conditions fit the data no better than other models that recover Ctenophora-sister.
Affiliation(s)
- Yuanning Li
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
- Xing-Xing Shen
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
- State Key Laboratory of Rice Biology and Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
- Benjamin Evans
- Yale Center for Research Computing, Yale University, New Haven, CT, USA
- Casey W Dunn
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
27
Li Y, Steenwyk JL, Chang Y, Wang Y, James TY, Stajich JE, Spatafora JW, Groenewald M, Dunn CW, Hittinger CT, Shen XX, Rokas A. A genome-scale phylogeny of the kingdom Fungi. Curr Biol 2021; 31:1653-1665.e5. [PMID: 33607033 PMCID: PMC8347878 DOI: 10.1016/j.cub.2021.01.074] [Citation(s) in RCA: 114] [Impact Index Per Article: 38.0] [Received: 10/11/2020] [Revised: 12/10/2020] [Accepted: 01/21/2021] [Indexed: 12/22/2022]
Abstract
Phylogenomic studies using genome-scale amounts of data have greatly improved understanding of the tree of life. Despite the diversity, ecological significance, and biomedical and industrial importance of fungi, evolutionary relationships among several major lineages remain poorly resolved, especially those near the base of the fungal phylogeny. To examine poorly resolved relationships and assess progress toward a genome-scale phylogeny of the fungal kingdom, we compiled a phylogenomic data matrix of 290 genes from the genomes of 1,644 species that includes representatives from most major fungal lineages. We also compiled 11 data matrices by subsampling genes or taxa from the full data matrix based on filtering criteria previously shown to improve phylogenomic inference. Analyses of these 12 data matrices using concatenation- and coalescent-based approaches yielded a robust phylogeny of the fungal kingdom, in which ∼85% of internal branches were congruent across data matrices and approaches used. We found support for several historically poorly resolved relationships as well as evidence for polytomies likely stemming from episodes of ancient diversification. By examining the relative evolutionary divergence of taxonomic groups of equivalent rank, we found that fungal taxonomy is broadly aligned with both genome sequence divergence and divergence time but also identified lineages where current taxonomic circumscription does not reflect their levels of evolutionary divergence. Our results provide a robust phylogenomic framework to explore the tempo and mode of fungal evolution and offer directions for future fungal phylogenetic and taxonomic studies.
Affiliation(s)
- Yuanning Li
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
- Jacob L Steenwyk
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
- Ying Chang
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
- Yan Wang
- Department of Microbiology and Plant Pathology, Institute for Integrative Genome Biology, University of California, Riverside, CA 92521, USA; Department of Biological Sciences, University of Toronto Scarborough and Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
- Timothy Y James
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Jason E Stajich
- Department of Microbiology and Plant Pathology, Institute for Integrative Genome Biology, University of California, Riverside, CA 92521, USA
- Joseph W Spatafora
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
- Marizeth Groenewald
- Westerdijk Fungal Biodiversity Institute, 3584 CT, Utrecht 85167, the Netherlands
- Casey W Dunn
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
- Chris Todd Hittinger
- Laboratory of Genetics, Center for Genomic Science Innovation, J.F. Crow Institute for the Study of Evolution, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53706, USA
- Xing-Xing Shen
- State Key Laboratory of Rice Biology and Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou 310058, China
- Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
28
Salomaki ED, Terpis KX, Rueckert S, Kotyk M, Varadínová ZK, Čepička I, Lane CE, Kolisko M. Gregarine single-cell transcriptomics reveals differential mitochondrial remodeling and adaptation in apicomplexans. BMC Biol 2021; 19:77. [PMID: 33863338 PMCID: PMC8051059 DOI: 10.1186/s12915-021-01007-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Received: 12/08/2020] [Accepted: 03/19/2021] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND: Apicomplexa is a diverse phylum comprising unicellular endobiotic animal parasites and contains some of the most well-studied microbial eukaryotes including the devastating human pathogens Plasmodium falciparum and Cryptosporidium hominis. In contrast, data on the invertebrate-infecting gregarines remains sparse and their evolutionary relationship to other apicomplexans remains obscure. Most apicomplexans retain a highly modified plastid, while their mitochondria remain metabolically conserved. Cryptosporidium spp. inhabit an anaerobic host-gut environment and represent the known exception, having completely lost their plastid while retaining an extremely reduced mitochondrion that has lost its genome. Recent advances in single-cell sequencing have enabled the first broad genome-scale explorations of gregarines, providing evidence of differential plastid retention throughout the group. However, little is known about the retention and metabolic capacity of gregarine mitochondria. RESULTS: Here, we sequenced transcriptomes from five species of gregarines isolated from cockroaches. We combined these data with those from other apicomplexans, performed detailed phylogenomic analyses, and characterized their mitochondrial metabolism. Our results support the placement of Cryptosporidium as the earliest diverging lineage of apicomplexans, which impacts our interpretation of evolutionary events within the phylum. By mapping in silico predictions of core mitochondrial pathways onto our phylogeny, we identified convergently reduced mitochondria. These data show that the electron transport chain has been independently lost three times across the phylum, twice within gregarines. CONCLUSIONS: Apicomplexan lineages show variable functional restructuring of mitochondrial metabolism that appears to have been driven by adaptations to parasitism and anaerobiosis. Our findings indicate that apicomplexans are rife with convergent adaptations, with shared features including morphology, energy metabolism, and intracellularity.
Affiliation(s)
- Eric D Salomaki
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
- Kristina X Terpis
- Department of Biological Sciences, University of Rhode Island, Kingston, RI, USA
- Sonja Rueckert
- School of Applied Sciences, Edinburgh Napier University, Edinburgh, Scotland, UK
- Michael Kotyk
- Department of Zoology, Faculty of Science, Charles University in Prague, Prague, Czech Republic
- Ivan Čepička
- Department of Zoology, Faculty of Science, Charles University in Prague, Prague, Czech Republic
- Christopher E Lane
- Department of Biological Sciences, University of Rhode Island, Kingston, RI, USA
- Martin Kolisko
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
- Department of Molecular Biology and Genetics, University of South Bohemia, České Budějovice, Czech Republic
29
Deng Q, Li M, He C, Lu Q, Gao Y, Li Q, Shi M, Wang P, Wei P. Genetic diversity of avian leukosis virus subgroup J (ALV-J): toward a unified phylogenetic classification and nomenclature system. Virus Evol 2021; 7:veab037. [PMID: 34026272 PMCID: PMC8129623 DOI: 10.1093/ve/veab037] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Indexed: 11/12/2022] Open
Abstract
Avian leukosis virus subgroup J (ALV-J) has infected a variety of birds, causing major economic losses in China. Understanding the comprehensive criteria of classification and nomenclature of ALV-J would be useful for the investigation of the viral evolution and also for the prevention and control of this infection. An in-depth analysis of the genetic diversity of ALV-J was performed in the present study. Four hundred and seventy-five sequences of the gp85 gene, including thirteen of avian endogenous retrovirus designated ev/J and 462 of ALV-J, were used in the phylogenetic and the evolutionary distance analysis for this classification. The study identified that the current ALV-J strains were divided into two first-order clades (Clades 1 and 2) and three second-order clades (Clades 1.1, 1.2 and 1.3). The current Chinese ALV-J strains are predominantly in Clade 1.3, and the Chinese and Egyptian chicken flocks have been facing the emerging Clade 2 viruses. This system pioneers the classification efforts for ALV-J, which uses Pilot tree for rapid classification of the new isolates and also the addition of possible new clades. The proposed unified classification system will facilitate future studies of ALV-J epidemiology and genetic evolution and of the comparison of sequences obtained across the world.
Affiliation(s)
- Qiaomu Deng
- Institute for Poultry Science and Health, Guangxi University, Nanning, Guangxi 530004, China
- Min Li
- Institute for Poultry Science and Health, Guangxi University, Nanning, Guangxi 530004, China
- Chengwei He
- Pingxiang Customs P. R. China, 341 Nanda Road, Pingxiang, Guangxi 532600, China
- Qiaoe Lu
- Institute for Poultry Science and Health, Guangxi University, Nanning, Guangxi 530004, China
- Yanli Gao
- Institute for Poultry Science and Health, Guangxi University, Nanning, Guangxi 530004, China
- Qiuhong Li
- Institute for Poultry Science and Health, Guangxi University, Nanning, Guangxi 530004, China
- Mengya Shi
- Institute for Poultry Science and Health, Guangxi University, Nanning, Guangxi 530004, China
- Peikun Wang
- Institute of Microbe and Host Health, Linyi University, Linyi, Shandong 276005, China
- Ping Wei
- Institute for Poultry Science and Health, Guangxi University, Nanning, Guangxi 530004, China
30
Shen XX, Steenwyk JL, Rokas A. Dissecting incongruence between concatenation- and quartet-based approaches in phylogenomic data. Syst Biol 2021; 70:997-1014. [PMID: 33616672 DOI: 10.1093/sysbio/syab011] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 01/08/2020] [Revised: 02/10/2021] [Accepted: 02/17/2021] [Indexed: 12/12/2022] Open
Abstract
Topological conflict or incongruence is widespread in phylogenomic data. Concatenation- and coalescent-based approaches often result in incongruent topologies, but the causes of this conflict can be difficult to characterize. We examined incongruence stemming from conflict between likelihood-based signal (quantified by the difference in gene-wise log likelihood score or ΔGLS) and quartet-based topological signal (quantified by the difference in gene-wise quartet score or ΔGQS) for every gene in three phylogenomic studies in animals, fungi, and plants, which were chosen because their concatenation-based IQ-TREE (T1) and quartet-based ASTRAL (T2) phylogenies are known to produce eight conflicting internal branches (bipartitions). By comparing the types of phylogenetic signal for all genes in these three data matrices, we found that 30%-36% of genes in each data matrix are inconsistent, that is, each of these genes has higher log likelihood score for T1 versus T2 (i.e., ΔGLS > 0) whereas its T1 topology has lower quartet score than its T2 topology (i.e., ΔGQS < 0) or vice versa. Comparison of inconsistent and consistent genes using a variety of metrics (e.g., evolutionary rate, gene tree topology, distribution of branch lengths, hidden paralogy, and gene tree discordance) showed that inconsistent genes are more likely to recover neither T1 nor T2 and have higher levels of gene tree discordance than consistent genes. Simulation analyses demonstrate that removal of inconsistent genes from datasets with low levels of incomplete lineage sorting (ILS) and low and medium levels of gene tree estimation error (GTEE) reduced incongruence and increased accuracy. In contrast, removal of inconsistent genes from datasets with medium and high ILS levels and high GTEE levels eliminated or extensively reduced incongruence, but the resulting congruent species phylogenies were not always topologically identical to the true species trees.
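The "inconsistent gene" criterion in the abstract above is a sign test on two per-gene quantities, which can be sketched as follows. This is only an illustration, not the authors' code: the `GeneSignal` type, the function names, and the gene values are hypothetical, and it assumes ΔGLS and ΔGQS are both computed as the T1 score minus the T2 score. Treating a gene with either difference exactly zero as not inconsistent is also an assumption, since the abstract only defines the strictly opposite-sign case.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class GeneSignal:
    """Per-gene signal differences (hypothetical container for illustration)."""
    name: str
    delta_gls: float  # assumed: log likelihood under T1 minus log likelihood under T2
    delta_gqs: float  # assumed: quartet score under T1 minus quartet score under T2


def is_inconsistent(gene: GeneSignal) -> bool:
    # Opposite signs mean the likelihood signal favors one topology
    # while the quartet signal favors the other.
    # Assumption: a zero difference is not counted as inconsistent.
    return gene.delta_gls * gene.delta_gqs < 0


def inconsistent_fraction(genes: Iterable[GeneSignal]) -> float:
    # Fraction of genes whose two signals disagree; 0.0 for an empty input.
    genes = list(genes)
    if not genes:
        return 0.0
    return sum(is_inconsistent(g) for g in genes) / len(genes)


if __name__ == "__main__":
    # Made-up values purely for demonstration, not data from the study.
    demo = [
        GeneSignal("gene_a", 2.5, -3.0),   # likelihood favors T1, quartets favor T2
        GeneSignal("gene_b", 1.0, 4.0),    # both favor T1
        GeneSignal("gene_c", -0.7, 1.0),   # likelihood favors T2, quartets favor T1
        GeneSignal("gene_d", 0.0, 2.0),    # no likelihood preference
    ]
    for g in demo:
        print(g.name, "inconsistent" if is_inconsistent(g) else "consistent")
    print("fraction inconsistent:", inconsistent_fraction(demo))
```

Using the product of the two differences keeps the test symmetric, so it covers both the "ΔGLS > 0 and ΔGQS < 0" case and the "vice versa" case in a single comparison.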
Affiliation(s)
- Xing-Xing Shen
- State Key Laboratory of Rice Biology and Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Zhejiang University, Hangzhou, China; Institute of Insect Sciences, Zhejiang University, Hangzhou, China
- Jacob L Steenwyk
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
- Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
31
Kusy D, Motyka M, Bocak L. Click Beetle Mitogenomics with the Definition of a New Subfamily Hapatesinae from Australasia (Coleoptera: Elateridae). Insects 2020; 12:17. [PMID: 33383651 PMCID: PMC7859858 DOI: 10.3390/insects12010017] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 12/08/2020] [Revised: 12/22/2020] [Accepted: 12/23/2020] [Indexed: 01/30/2023]
Abstract
Elateridae is a taxon with very unstable classification and a number of conflicting phylogenetic hypotheses have been based on morphology and molecular data. We assembled eight complete mitogenomes for seven elaterid subfamilies and merged these taxa with an additional 22 elaterids and an outgroup. The structure of the newly produced mitogenomes showed a very similar arrangement with regard to all earlier published mitogenomes for the Elateridae. The maximum likelihood and Bayesian analyses indicated that Hapatesus Candèze, 1863, is a sister of Parablacinae and Pityobiinae. Therefore, Hapatesinae, a new subfamily, is proposed for the Australian genera Hapatesus (21 spp.) and Toorongus Neboiss, 1957 (4 spp.). Parablacinae, Pityobiinae, and Hapatesinae have a putative Gondwanan origin as the constituent genera are known from the Australian region (9 genera) and Neotropical region (Tibionema Solier, 1851), and only Pityobius LeConte, 1853, occurs in the Nearctic region. Another putative Gondwanan lineage, the Afrotropical Morostomatinae, forms either a serial paraphylum with the clade of Parablacinae, Pityobiinae, and Hapatesinae or is rooted in a more terminal position, but always as an independent lineage. An Eudicronychinae lineage was either recovered as a sister to Melanotini or as a deep split inside Elaterinae and we herein transfer the group to Elaterinae as Eudicronychini, a new status. The mitochondrial genomes provide a sufficient signal for the placement of most lineages, but the deep bipartitions need to be compared with phylogenomic analyses.
Affiliation(s)
- Ladislav Bocak
- Laboratory of Biodiversity and Molecular Evolution, CATRIN-CRH, Palacky University, 17. listopadu 50, 771 46 Olomouc, Czech Republic; (D.K.); (M.M.)
32
Wang C, Li J, Ma M, Lin Z, Hu W, Lin W, Zhang P. Structural and Biochemical Insights Into Two BAHD Acyltransferases (AtSHT and AtSDT) Involved in Phenolamide Biosynthesis. Front Plant Sci 2020; 11:610118. [PMID: 33519864 PMCID: PMC7838080 DOI: 10.3389/fpls.2020.610118] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 09/25/2020] [Accepted: 12/17/2020] [Indexed: 05/18/2023]
Abstract
Phenolamides represent one of the largest classes of plant-specialized secondary metabolites and function in diverse physiological processes, including defense responses and development. The biosynthesis of phenolamides requires the BAHD-family acyltransferases, which transfer acyl-groups from different acyl-donors specifically to amines, the acyl-group acceptors. However, the mechanisms of substrate specificity and multisite-acylation of the BAHD-family acyltransferases remain poorly understood. In this study, we provide a structural and biochemical analysis of AtSHT and AtSDT, two representative BAHD-family members that catalyze the multisite acylation of spermidine but show different product profiles. By determining the structures of AtSHT and AtSDT and using structure-based mutagenesis, we identified the residues important for substrate recognition in AtSHT and AtSDT and hypothesized that the acyl acceptor spermidine might adopt a free-rotating conformation in AtSHT, which can undergo mono-, di-, or tri-acylation; while the spermidine molecule in AtSDT might adopt a linear conformation, which only allows mono- or di-acylation to take place. In addition, through sequence similarity network (SSN) and structural modeling analysis, we successfully predicted and verified the functions of two uncharacterized Arabidopsis BAHD acyltransferases, OAO95042.1 and NP_190301.2, which use putrescine as the main acyl-acceptor. Our work provides not only an excellent starting point for understanding multisite acylation in BAHD-family enzymes, but also a feasible methodology for predicting possible acyl acceptor specificity of uncharacterized BAHD-family acyltransferases.
Affiliation(s)
- Chengyuan Wang
- National Key Laboratory of Plant Molecular Genetics, Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
- Jianxu Li
- National Key Laboratory of Plant Molecular Genetics, Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
- Miaolian Ma
- National Key Laboratory of Plant Molecular Genetics, Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
- Zhaozhu Lin
- Department of Microbiology and Immunology, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China
- Wenli Hu
- National Key Laboratory of Plant Molecular Genetics, Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
- Wei Lin
- Department of Microbiology and Immunology, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China
- Wei Lin
- Peng Zhang
- National Key Laboratory of Plant Molecular Genetics, Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
- *Correspondence: Peng Zhang