Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kapli P, Kotari I, Telford MJ, Goldman N, Yang Z. DNA Sequences Are as Useful as Protein Sequences for Inferring Deep Phylogenies. Syst Biol 2023;72:1119-1135. [PMID: 37366056 PMCID: PMC10627555 DOI: 10.1093/sysbio/syad036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2022] [Indexed: 06/28/2023]



Cited by Other Article(s)

Manuel C, Sakalli E, Schmidt HA, Viñas C, von Haeseler A, Elgert C. When the Past Fades: Detecting Phylogenetic Signal with SatuTe. Mol Biol Evol 2025;42:msaf090. [PMID: 40423578 PMCID: PMC12108095 DOI: 10.1093/molbev/msaf090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/16/2024] [Revised: 03/10/2025] [Accepted: 03/26/2025] [Indexed: 05/28/2025]

Lavin AA, Rivas-Santisteban J. Limitations of sequence dissimilarity as a predictor of prokaryotic lineage. Open Biol 2025;15:240302. [PMID: 40101780 PMCID: PMC11919493 DOI: 10.1098/rsob.240302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/16/2024] [Revised: 01/15/2025] [Accepted: 02/09/2025] [Indexed: 03/20/2025]

Ren H, Wong TKF, Minh BQ, Lanfear R. MixtureFinder: Estimating DNA Mixture Models for Phylogenetic Analyses. Mol Biol Evol 2025;42:msae264. [PMID: 39715360 PMCID: PMC11704958 DOI: 10.1093/molbev/msae264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/19/2024] [Revised: 11/26/2024] [Accepted: 12/19/2024] [Indexed: 12/25/2024]

Höhna S, Hsiang AY. Sequential Bayesian Phylogenetic Inference. Syst Biol 2024;73:704-721. [PMID: 38771253 DOI: 10.1093/sysbio/syae020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/30/2023] [Revised: 04/15/2024] [Accepted: 05/04/2024] [Indexed: 05/22/2024]

Abstract

The ideal approach to Bayesian phylogenetic inference is to estimate all parameters of interest jointly in a single hierarchical model. However, this is often not feasible in practice due to the high computational cost. Instead, phylogenetic pipelines generally consist of sequential analyses, whereby a single point estimate from a given analysis is used as input for the next analysis (e.g., a single multiple sequence alignment is used to estimate a gene tree). In this framework, uncertainty is not propagated from step to step, which can lead to inaccurate or spuriously confident results. Here, we formally develop and test a sequential inference approach for Bayesian phylogenetic inference, which uses importance sampling to generate observations for the next step of an analysis pipeline from the posterior distribution produced in the previous step. The sequential inference approach presented here not only accounts for uncertainty between analysis steps but also allows for greater flexibility in software choice (and hence model availability) and can be computationally more efficient than the traditional joint inference approach when multiple models are being tested. We show that our sequential inference approach is identical in practice to the joint inference approach only if sufficient information in the data is present (a narrow posterior distribution) and/or sufficiently many importance samples are used. Conversely, we show that the common practice of using a single point estimate can be biased; for example, a single phylogeny estimate can transform an unrooted phylogeny into a time-calibrated phylogeny. We demonstrate the theory of sequential Bayesian inference using both a toy example and an empirical case study of divergence-time estimation in insects using a relaxed clock model from transcriptome data.
In the empirical example, we estimate 3 posterior distributions of branch lengths from the same data (a DNA character matrix with a GTR+Γ+I substitution model, an amino acid data matrix with empirical substitution models, and an amino acid data matrix with the PhyloBayes CAT-GTR model). Finally, we apply 3 different node-calibration strategies and show that divergence-time estimates are affected by the data source and the underlying substitution process used to estimate branch lengths, as well as by the node-calibration strategy. Thus, our new sequential Bayesian phylogenetic inference provides the opportunity to efficiently test different approaches for divergence-time estimation, including branch-length estimation from other software.
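
The contrast drawn in this abstract between propagating a full posterior and passing on a single point estimate can be illustrated with a small numerical sketch. This is not the authors' implementation. It assumes a made-up two-step normal model (observations x_j ~ Normal(b, 1) in step 1, and b ~ Normal(theta, tau) with a flat prior on theta in step 2), and it uses one standard importance-sampling identity: the step-2 likelihood is proportional to the average of p(b_i | theta) / p(b_i) over step-1 posterior samples b_i. All names, data values, and settings below are hypothetical.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of Normal(mean, sd) at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def step1_samples(data, prior_mean, prior_sd, n_samples, rng):
    """Step 1: draw b ~ p(b | data) for x_j ~ Normal(b, 1), b ~ Normal(prior_mean, prior_sd)."""
    precision = 1.0 / prior_sd ** 2 + len(data)
    mean = (prior_mean / prior_sd ** 2 + sum(data)) / precision
    sd = math.sqrt(1.0 / precision)
    return [rng.gauss(mean, sd) for _ in range(n_samples)]


def summarize(grid, weights):
    """Mean and standard deviation of an unnormalized posterior on a grid."""
    total = sum(weights)
    mean = sum(t * w for t, w in zip(grid, weights)) / total
    var = sum((t - mean) ** 2 * w for t, w in zip(grid, weights)) / total
    return mean, math.sqrt(var)


def run_demo(seed=1, n_samples=2000):
    rng = random.Random(seed)
    data = [1.8, 2.4, 2.1, 1.6, 2.6]   # made-up observations
    prior_mean, prior_sd = 0.0, 10.0    # assumed step-1 prior on b
    tau = 0.5                           # assumed step-2 model: b ~ Normal(theta, tau)
    xbar = sum(data) / len(data)
    grid = [xbar - 4.0 + 0.05 * i for i in range(161)]  # flat prior on theta

    b_samples = step1_samples(data, prior_mean, prior_sd, n_samples, rng)
    inv_prior = [1.0 / normal_pdf(b, prior_mean, prior_sd) for b in b_samples]

    # Sequential inference: average p(b_i | theta) / p(b_i) over step-1 samples.
    sequential = [
        sum(normal_pdf(b, t, tau) * w for b, w in zip(b_samples, inv_prior)) / n_samples
        for t in grid
    ]

    # Point-estimate pipeline: treat the step-1 posterior mean as if observed exactly.
    b_hat = sum(b_samples) / n_samples
    point = [normal_pdf(b_hat, t, tau) for t in grid]

    # Joint model, solved analytically: theta | data ~ Normal(xbar, sqrt(tau^2 + 1/n)).
    exact_sd = math.sqrt(tau ** 2 + 1.0 / len(data))
    return summarize(grid, sequential), summarize(grid, point), (xbar, exact_sd)


if __name__ == "__main__":
    seq, point, exact = run_demo()
    print("sequential (mean, sd):", seq)
    print("point estimate (mean, sd):", point)
    print("joint model (mean, sd):", exact)
```

In this toy setting the sequential estimate should track the joint-model standard deviation, while the point-estimate pipeline reports a narrower posterior because the step-1 uncertainty in b is discarded.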


Son A, Park J, Kim W, Yoon Y, Lee S, Park Y, Kim H. Revolutionizing Molecular Design for Innovative Therapeutic Applications through Artificial Intelligence. Molecules 2024;29:4626. [PMID: 39407556 PMCID: PMC11477718 DOI: 10.3390/molecules29194626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/31/2024] [Revised: 09/19/2024] [Accepted: 09/27/2024] [Indexed: 10/20/2024]

Middlebrook EA, Katani R, Fair JM. OrthoPhyl-streamlining large-scale, orthology-based phylogenomic studies of bacteria at broad evolutionary scales. G3 (Bethesda) 2024;14:jkae119. [PMID: 38839049 PMCID: PMC11304591 DOI: 10.1093/g3journal/jkae119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/03/2024] [Revised: 05/15/2024] [Accepted: 05/29/2024] [Indexed: 06/07/2024]

Tsuda K, Maeno A, Otake A, Kato K, Tanaka W, Hibara KI, Nonomura KI. YABBY and diverged KNOX1 genes shape nodes and internodes in the stem. Science 2024;384:1241-1247. [PMID: 38870308 DOI: 10.1126/science.adn6748] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Received: 12/22/2023] [Accepted: 05/03/2024] [Indexed: 06/15/2024]

Wang W, Dong Z, Du Z, Wu P. Genome-scale approach to reconstructing the phylogenetic tree of psyllids (superfamily Psylloidea) with account of systematic bias. Mol Phylogenet Evol 2023;189:107924. [PMID: 37699449 DOI: 10.1016/j.ympev.2023.107924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/17/2023] [Revised: 09/05/2023] [Accepted: 09/09/2023] [Indexed: 09/14/2023]

Abstract

Psyllids (class Insecta: order Hemiptera: superfamily Psylloidea) are a taxonomically and phylogenetically challenging clade. Recent studies have largely advanced the phylogeny of this group, yet the family-level relationships among Aphalaridae, Carsidaridae, and others remain unresolved. Genome-scale phylogenetic analysis is known to provide finer resolution for such problems. However, phylogenomics also introduces a new problem: incorrect trees recovered with high confidence because of systematic error (bias). Here we addressed these issues using hundreds of single-copy orthologous (SCO) genes in psyllid transcriptomes and genomes. Our analyses revealed conflicts between the nucleotide-based and amino-acid-based phylogenetic trees. While the nucleotide-based phylogeny strongly supported the (Aphalaridae + Carsidaridae) + Others relationship, the amino-acid-based one recovered Aphalaridae + (Carsidaridae + Others) with 100% support. Further inspection revealed significant compositional heterogeneity in nucleotide sequences for 67% of SCO genes, but not in the corresponding translated amino acid sequences. We then used different strategies to combat this compositional bias, and found that with the RY-coding strategy (coding the standard nucleotides as purines and pyrimidines) the nucleotide-based phylogeny became consistent with the amino-acid-based one. We further applied RY-coding to a published concatenated nucleotide dataset and recovered the Aphalaridae monophyly (which is refuted by the original literature on non-recoded sequences) at the base of the psyllid tree. Moreover, we found that variations in evolutionary rate could lead to errors in the nucleotide-based phylogeny. The fast-evolving Heteropsylla cubana (Psyllidae: Ciriacreminae) was incorrectly placed within the subfamily Psyllinae. This bias can be avoided by using data removal or RY-coding strategies.
Together, our results strongly support the family relationship of Aphalaridae + (Carsidaridae + Others), and show that the amino-acid-based concatenation analysis is more robust than the nucleotide-based one. Future phylogenomic analyses of psyllid nucleotide sequences should consider methods such as the RY-coding scheme to address potential systematic biases arising from composition and rate heterogeneities.
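
The RY-coding strategy mentioned in this abstract (recoding nucleotides as purines and pyrimidines) can be sketched in a few lines. This is an illustrative sketch only, not the authors' pipeline. The function names and the handling of gaps and ambiguity codes are assumptions: A and G become R, C, T, and U become Y, gaps and existing R/Y are kept, and every other symbol is mapped to N.

```python
# Purines (A, G) -> R; pyrimidines (C, T, U) -> Y.
RY_MAP = {"A": "R", "G": "R", "C": "Y", "T": "Y", "U": "Y"}
KEEP = {"R", "Y", "-", "?"}  # assumption: leave these symbols unchanged


def ry_recode(seq):
    """Return the RY-coded version of a nucleotide sequence."""
    out = []
    for base in seq.upper():
        if base in RY_MAP:
            out.append(RY_MAP[base])
        elif base in KEEP:
            out.append(base)
        else:
            out.append("N")  # assumption: all other ambiguity codes become N
    return "".join(out)


def ry_recode_alignment(alignment):
    """Recode every sequence in a {taxon: sequence} mapping."""
    return {taxon: ry_recode(seq) for taxon, seq in alignment.items()}


if __name__ == "__main__":
    # Hypothetical taxa: very different GC content, same purine/pyrimidine pattern.
    alignment = {"taxon_AT_rich": "ATGCAT", "taxon_GC_rich": "GCGCGC"}
    for taxon, seq in ry_recode_alignment(alignment).items():
        print(taxon, seq)
```

The two hypothetical sequences differ strongly in GC content but share the same purine/pyrimidine pattern, so they become identical after recoding. This shows why recoding can reduce compositional heterogeneity, at the cost of discarding transition information.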
