1
|
Huyghe CET, Fages A, Ronco F, Indermaur A, Schedel FDB, Kimirei I, Makasa L, Tschopp P, Salzburger W. Metagenomic Insights Into the Dietary Diversity of the Adaptive Radiation of Cichlid Fishes in Lake Tanganyika. Mol Ecol 2025; 34:e17743. [PMID: 40156205 DOI: 10.1111/mec.17743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 03/03/2025] [Accepted: 03/14/2025] [Indexed: 04/01/2025]
Abstract
Diet specialisation is a main driver of diversification in many adaptive radiations. Therefore, identifying diet items is essential to characterise trophic specialisations and to understand the dynamics of dietary adaptations. In this study, we explored the trophic niches of 56 species from the adaptive radiation of cichlid fishes in Lake Tanganyika, encompassing all major phylogenetic lineages and feeding specialisations. We employed a metagenomic sequencing approach to identify the food sources of the investigated species at high taxonomic resolution, sequencing over 400 digestive content samples from wild-caught individuals at around 50 million paired-end read depth per sample. Our analyses revealed Arthropoda, Chordata (fishes), Bacillariophyta and Streptophyta as the primary diet phyla of the Tanganyikan cichlids. Moreover, we confirmed the presence of other food sources and identified taxa not previously documented to be part of the cichlids' diet. Based on their dietary compositions, the Tanganyikan cichlids can be grouped into herbivores, invertivores, piscivores and mixed feeders. Further, we showed that trophic disparity in the radiation is shaped by rapid divergence and documented cases of dietary niche convergence. Diet composition correlated with carbon and nitrogen stable isotope values, gut length, and body morphology. Differences in diet-such as the consumption of diatoms, streptophytes and chlorophytes versus fish and arthropods-were associated with changes in body, upper oral jaw and lower pharyngeal jaw shape. Collectively, this study presents a comprehensive and detailed diet classification of the Tanganyikan cichlids, highlighting the power of metagenomic approaches in delineating dietary adaptations.
Collapse
Affiliation(s)
| | - Antoine Fages
- Zoological Institute, University of Basel, Basel, Switzerland
| | - Fabrizia Ronco
- Zoological Institute, University of Basel, Basel, Switzerland
| | | | - Frederic D B Schedel
- Zoological Institute, University of Basel, Basel, Switzerland
- Faculty of Biology, LMU Munich, Munich, Germany
| | - Ismael Kimirei
- Tanzania Fisheries Research Institute, Ministry of Livestock and Fisheries, Kigoma, Tanzania
| | - Lawrence Makasa
- Lake Tanganyika Research Unit, Department of Fisheries, Ministry of Fisheries and Livestock, Mpulungu, Zambia
| | - Patrick Tschopp
- Zoological Institute, University of Basel, Basel, Switzerland
| | | |
Collapse
|
2
|
Jensen EL, Marchisio C, Ochoa A, Gray R, Parra V, Miller JM, Çilingir FG, Caccone A. Synteny Enabled Upgrade of the Galapagos Giant Tortoise Genome Improves Inferences of Runs of Homozygosity. Ecol Evol 2025; 15:e71358. [PMID: 40290375 PMCID: PMC12032190 DOI: 10.1002/ece3.71358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2024] [Revised: 03/26/2025] [Accepted: 04/15/2025] [Indexed: 04/30/2025] Open
Abstract
The utility and importance of whole-genome sequences are recognized across various fields, including evolution and conservation. However, for some taxa, like extinct species, using methods to generate contiguous genomes that rely on high-quality DNA is impossible. In such cases, an alternative may be to employ synteny-based methods using a genome from a closely related taxon to generate more complete genomes. Here we update the reference genome for the Pinta Island Galapagos giant tortoise (Chelonoidis abingdonii) without conducting additional sequencing through rescaffolding against the most closely related chromosome-level genome assembly, the Aldabra giant tortoise (Aldabrachelys gigantea). This effort resulted in a much more contiguous genome, CheloAbing_2.0, with an N50 that is two orders of magnitude longer and large reductions in L50 and the number of gaps. We then examined the impact of the CheloAbing_2.0 genome on estimates of runs of homozygosity (ROH) using genome resequencing data from 37 individual Galapagos giant tortoises from the 13 extant lineages to test the mechanisms by which a fragmented assembly may over- or underestimate the number and extent of ROH. The use of CheloAbing_2.0 resulted in individual estimates of inbreeding, including ROH proportion (FROH), number (NROH), and cumulative length (SROH), that were statistically different from those derived from the earlier genome assembly. This improved genome will serve as a resource for future efforts focusing on the ecology, evolution, and conservation of this species group. More broadly, our results highlight that synteny-based scaffolding is promising for generating contiguous genomes without needing additional data types.
Collapse
Affiliation(s)
- Evelyn L. Jensen
- School of Natural and Environmental Sciences, Newcastle UniversityNewcastleUpon TyneUK
| | - Chiara Marchisio
- School of Natural and Environmental Sciences, Newcastle UniversityNewcastleUpon TyneUK
- Faculty of Health and Life SciencesUniversitat Pompeu FabraBarcelonaSpain
| | - Alexander Ochoa
- Department of Ecology and Evolutionary BiologyYale UniversityNew HavenConnecticutUSA
| | - Rachel Gray
- School of Natural and Environmental Sciences, Newcastle UniversityNewcastleUpon TyneUK
| | - Vanessa Parra
- Biology DepartmentUniversity of KentuckyLexingtonKentuckyUSA
| | - Joshua M. Miller
- Department of Biological SciencesMacEwan UniversityEdmontonCanada
| | - F. Gözde Çilingir
- Department of Evolutionary Biology and Environmental StudiesUniversity of ZurichZurichSwitzerland
- Swiss Federal Institute for Research WSLBirmensdorfSwitzerland
| | - Adalgisa Caccone
- Department of Ecology and Evolutionary BiologyYale UniversityNew HavenConnecticutUSA
| |
Collapse
|
3
|
Sommer M, Zimin A, Salzberg S. PSAURON: a tool for assessing protein annotation across a broad range of species. NAR Genom Bioinform 2025; 7:lqae189. [PMID: 39781514 PMCID: PMC11704789 DOI: 10.1093/nargab/lqae189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 12/10/2024] [Accepted: 12/23/2024] [Indexed: 01/12/2025] Open
Abstract
Evaluating the accuracy of protein-coding sequences in genome annotations is a challenging problem for which there is no broadly applicable solution. In this manuscript, we introduce PSAURON (Protein Sequence Assessment Using a Reference ORF Network), a novel software tool developed to help assess the quality of protein-coding gene annotations. Utilizing a machine learning model trained on a diverse dataset from over 1000 plant and animal genomes, PSAURON assigns a score to coding DNA or protein sequence that reflects the likelihood that the sequence is a genuine protein-coding region. PSAURON scores can be used for genome-wide protein annotation assessment as well as the rapid identification of potentially spurious annotated proteins. Validation against established benchmarks demonstrates PSAURON's effectiveness and correlation with recognized measures of protein quality, highlighting its potential use as a widely applicable method to evaluate precision in gene annotation. PSAURON is open source and freely available at https://github.com/salzberg-lab/PSAURON.
Collapse
Affiliation(s)
- Markus J Sommer
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Aleksey V Zimin
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Steven L Salzberg
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21218, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
- Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
4
|
Little J, Meyer GH, Grover A, Francette AM, Partha R, Arndt KM, Smith M, Clark N, Chikina M. ERC 2.0 - evolutionary rate covariation update improves inference of functional interactions across large phylogenies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.24.639970. [PMID: 40060623 PMCID: PMC11888306 DOI: 10.1101/2025.02.24.639970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/21/2025]
Abstract
Evolutionary Rate Covariation (ERC) is an established comparative genomics method that identifies sets of genes sharing patterns of sequence evolution, which suggests shared function. Whereas many functional predictions of ERC have been empirically validated, its predictive power has hitherto been limited by its inability to tackle the large numbers of species in contemporary comparative genomics datasets. This study introduces ERC2.0, an enhanced methodology for studying ERC across phylogenies with hundreds of species and tens of thousands of genes. ERC2.0 improves upon previous iterations of ERC in algorithm speed, normalizing for heteroskedasticity, and normalizing correlations via Fisher transformations. These improvements have resulted in greater statistical power to predict biological function. In exemplar yeast and mammalian datasets, we demonstrate that the predictive power of ERC2.0 is improved relative to the previous method, ERC1.0, and that further improvements are obtained by using larger yeast and mammalian phylogenies. We attribute the improvements to both the larger datasets and improved rate normalization. We demonstrate that ERC2.0 has high predictive accuracy for known annotations and can predict the functions of genes in non-model systems. Our findings underscore the potential for ERC2.0 to be used as a single-pass computational tool in candidate gene screening and functional predictions.
Collapse
Affiliation(s)
| | | | - Aakash Grover
- Department of Biological Sciences, University of Pittsburgh
| | - Alex Michael Francette
- Department of Cell Biology and Physiology, Washington University School of Medicine, St. Louis, MO
| | | | - Karen M Arndt
- Department of Biological Sciences, University of Pittsburgh
| | - Martin Smith
- Department of Earth Sciences, University of Durham
| | - Nathan Clark
- Department of Biological Sciences, University of Pittsburgh
| | - Maria Chikina
- Department of Computational and Systems Biology, University of Pittsburgh
| |
Collapse
|
5
|
Fischer D, Tapio M, Bitz O, Iso-Touru T, Kause A, Tapio I. Fine-tuning GBS data with comparison of reference and mock genome approaches for advancing genomic selection in less studied farmed species. BMC Genomics 2025; 26:111. [PMID: 39910437 PMCID: PMC11796084 DOI: 10.1186/s12864-025-11296-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 01/27/2025] [Indexed: 02/07/2025] Open
Abstract
BACKGROUND Diversifying animal cultivation demands efficient genotyping for enabling genomic selection, but non-model species lack efficient genotyping solutions. The aim of this study was to optimize a genotyping-by-sequencing (GBS) double-digest RAD-sequencing (ddRAD) pipeline. Bovine data was used to automate the bioinformatic analysis. The application of the optimization was demonstrated on non-model European whitefish data. RESULTS DdRAD data generation was designed for a reliable estimation of relatedness and is scalable to up to 384 samples. The GBS sequencing yielded approximately one million reads for each of the around 100 assessed samples. Optimizing various strategies to create a de-novo reference genome for variant calling (mock reference) showed that using three samples outperformed other building strategies with single or very large number of samples. Adjustments to most pipeline tuning parameters had limited impact on high-quality data, except for the identity criterion for merging mock reference genome clusters. For each species, over 15k GBS variants based on the mock reference were obtained and showed comparable results with the ones called using an existing reference genome. Repeatability analysis showed high concordance over replicates, particularly in bovine while in European whitefish data repeatability did not exceed earlier observations. CONCLUSIONS The proposed cost-effective ddRAD strategy, coupled with an efficient bioinformatics workflow, enables broad adoption of ddRAD GBS across diverse farmed species. While beneficial, a reference genome is not obligatory. The integration of Snakemake streamlines the pipeline usage on computer clusters and supports customization. This user-friendly solution facilitates genotyping for both model and non-model species.
Collapse
Affiliation(s)
- Daniel Fischer
- Applied Statistical Methods, Natural Resources, Natural Resources Institute Finland (Luke), Jokioinen, 31600, Finland.
| | - Miika Tapio
- Genomics and Breeding, Production Systems, Natural Resources Institute Finland (Luke), Jokioinen, 31600, Finland
| | - Oliver Bitz
- Genomics and Breeding, Production Systems, Natural Resources Institute Finland (Luke), Jokioinen, 31600, Finland
| | - Terhi Iso-Touru
- Genomics and Breeding, Production Systems, Natural Resources Institute Finland (Luke), Jokioinen, 31600, Finland
| | - Antti Kause
- Genomics and Breeding, Production Systems, Natural Resources Institute Finland (Luke), Jokioinen, 31600, Finland
| | - Ilma Tapio
- Genomics and Breeding, Production Systems, Natural Resources Institute Finland (Luke), Jokioinen, 31600, Finland
| |
Collapse
|
6
|
Hadjadji C, Devalloir Q, Gaillard C, van den Brink NW, Scheifler R. Evidence linking cadmium and/or lead exposure to immunomodulatory effects in mammals based upon an adverse outcome pathways approach, and research perspectives. CHEMOSPHERE 2025; 371:144056. [PMID: 39746483 DOI: 10.1016/j.chemosphere.2024.144056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Revised: 12/20/2024] [Accepted: 12/30/2024] [Indexed: 01/04/2025]
Abstract
For decades, studies have shown how exposure to non-essential trace metals such as lead (Pb) and cadmium (Cd) largely impact global wildlife. Ecoimmunotoxicology has emerged in the past two decades and focuses on the effects of pollutants on the immune system of free-ranging organisms. Adverse outcome pathways (AOPs) represent a conceptual approach to explore the mechanistic linkage between a molecular initiating event and adverse outcomes, potentially at all biological levels of organisation. The present paper proposes putative AOPs related to the effects of Cd, Pb, and the mixture Cd-Pb, on the immune system of mammals to address future questions in ecoimmunotoxicology. Molecular Initiating Events for both metals relate to entrance in cells through Ca2+ channels or bond to cell surfaces. Exposure to Cd, Pb and Cd-Pb share several similar Key Events (KEs), primarily an increase of oxidative stress (OS) in immune cells through production of reactive oxygen species. For both metals and the mixture, OS affects mitochondrial membranes, and induces apoptosis, ultimately decreasing immune cell number. Both metals affect innate immune system through nuclear factor kappa B (NF-κB) and mitogen-activated protein kinase (MAPK) inflammatory signalling pathways, leading to an upregulation of inflammatory markers and mediators. Adaptive immune system is also affected by the exposure to both metals though a decrease of CD4+/CD8+ ratio, a decrease of MHCII, an inactivation of TH1 and TH2 response, and an inhibition of the humoral response mediated by various Ig. Mixture effects of Cd-Pb are less documented resulting in a more speculative AOP, but potential synergic and antagonistic effects were identified. According to our AOPs, further research in ecoimmunotoxicology of metals in free-ranging mammals should focus on KEs related to NF-κB/MAPK inflammatory signalling pathways, changes in CD4+/CD8+ ratio and MHCII complexes, and on AOs related to auto-immune disorders and on the effective increase of infection rate, particularly in case of exposure to metal mixtures.
Collapse
Affiliation(s)
- Cloe Hadjadji
- Swiss Ornithological Institute, Seerose 1, CH-6204, Sempach, Switzerland; Laboratoire Chrono-Environnement, UMR 6249, CNRS/Université de Franche-Comté, 16 route de Gray, 25000, Besançon, France.
| | - Quentin Devalloir
- Laboratoire Chrono-Environnement, UMR 6249, CNRS/Université de Franche-Comté, 16 route de Gray, 25000, Besançon, France
| | - Colette Gaillard
- Laboratoire Chrono-Environnement, UMR 6249, CNRS/Université de Franche-Comté, 16 route de Gray, 25000, Besançon, France
| | - Nico W van den Brink
- Division of Toxicology, Wageningen University, Box 8000, 6700 EA, Wageningen, the Netherlands
| | - Renaud Scheifler
- Laboratoire Chrono-Environnement, UMR 6249, CNRS/Université de Franche-Comté, 16 route de Gray, 25000, Besançon, France
| |
Collapse
|
7
|
Dewar AE, Belcher LJ, West SA. A phylogenetic approach to comparative genomics. Nat Rev Genet 2025:10.1038/s41576-024-00803-0. [PMID: 39779997 PMCID: PMC7617348 DOI: 10.1038/s41576-024-00803-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/05/2024] [Indexed: 01/11/2025]
Abstract
Comparative genomics, whereby the genomes of different species are compared, has the potential to address broad and fundamental questions at the intersection of genetics and evolution. However, species, genomes and genes cannot be considered as independent data points within statistical tests. Closely related species tend to be similar because they share genes by common descent, which must be accounted for in analyses. This problem of non-independence may be exacerbated when examining genomes or genes but can be addressed by applying phylogeny-based methods to comparative genomic analyses. Here, we review how controlling for phylogeny can change the conclusions of comparative genomics studies. We address common questions on how to apply these methods and illustrate how they can be used to test causal hypotheses. The combination of rapidly expanding genomic datasets and phylogenetic comparative methods is set to revolutionize the biological insights possible from comparative genomic studies.
Collapse
Affiliation(s)
- Anna E Dewar
- Department of Biology, University of Oxford, Oxford, UK.
- St John's College, Oxford, UK.
| | | | - Stuart A West
- Department of Biology, University of Oxford, Oxford, UK
| |
Collapse
|
8
|
Kosch TA, Crawford AJ, Lockridge Mueller R, Wollenberg Valero KC, Power ML, Rodríguez A, O'Connell LA, Young ND, Skerratt LF. Comparative analysis of amphibian genomes: An emerging resource for basic and applied research. Mol Ecol Resour 2025; 25:e14025. [PMID: 39364691 DOI: 10.1111/1755-0998.14025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 08/26/2024] [Accepted: 09/12/2024] [Indexed: 10/05/2024]
Abstract
Amphibians are the most threatened group of vertebrates and are in dire need of conservation intervention to ensure their continued survival. They exhibit unique features including a high diversity of reproductive strategies, permeable and specialized skin capable of producing toxins and antimicrobial compounds, multiple genetic mechanisms of sex determination and in some lineages, the ability to regenerate limbs and organs. Although genomic approaches would shed light on these unique traits and aid conservation, sequencing and assembly of amphibian genomes has lagged behind other taxa due to their comparatively large genome sizes. Fortunately, the development of long-read sequencing technologies and initiatives has led to a recent burst of new amphibian genome assemblies. Although growing, the field of amphibian genomics suffers from the lack of annotation resources, tools for working with challenging genomes and lack of high-quality assemblies in multiple clades of amphibians. Here, we analyse 51 publicly available amphibian genomes to evaluate their usefulness for functional genomics research. We report considerable variation in genome assembly quality and completeness and report some of the highest transposable element and repeat contents of any vertebrate. Additionally, we detected an association between transposable element content and climatic variables. Our analysis provides evidence of conserved genome synteny despite the long divergence times of this group, but we also highlight inconsistencies in chromosome naming and orientation across genome assemblies. We discuss sequencing gaps in the phylogeny and suggest key targets for future sequencing endeavours. Finally, we propose increased investment in amphibian genomics research to promote their conservation.
Collapse
Affiliation(s)
- Tiffany A Kosch
- Faculty of Science, University of Melbourne, Melbourne, Victoria, Australia
| | - Andrew J Crawford
- Departamento de Ciencias Biológicas, Universidad de los Andes, Bogotá, Colombia
| | | | | | - Megan L Power
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Ariel Rodríguez
- Institute of Zoology, University of Veterinary Medicine of Hannover, Hannover, Germany
| | | | - Neil D Young
- Faculty of Science, University of Melbourne, Melbourne, Victoria, Australia
| | - Lee F Skerratt
- Faculty of Science, University of Melbourne, Melbourne, Victoria, Australia
| |
Collapse
|
9
|
Li F, Wang X, Zhou X. The Genomics Revolution Drives a New Era in Entomology. ANNUAL REVIEW OF ENTOMOLOGY 2025; 70:379-400. [PMID: 39874145 DOI: 10.1146/annurev-ento-013024-013420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2025]
Abstract
Thanks to the fast development of sequencing techniques and bioinformatics tools, sequencing the genome of an insect species for specific research purposes has become an increasingly popular practice. Insect genomes not only provide sets of gene sequences but also represent a change in focus from reductionism to systemic biology in the field of entomology. Using insect genomes, researchers are able to identify and study the functions of all members of a gene family, pathway, or gene network associated with a trait of interest. Comparative genomics studies provide new insights into insect evolution, addressing long-lasting controversies in taxonomy. It is also now feasible to uncover the genetic basis of important traits by identifying variants using genome resequencing data of individual insects, followed by genome-wide association analysis. Here, we review the current progress in insect genome sequencing projects and the application of insect genomes in uncovering the phylogenetic relationships between insects and unraveling the mechanisms of important life-history traits. We also summarize the challenges in genome data sharing and possible solutions. Finally, we provide guidance for fully and deeply mining insect genome data.
Collapse
Affiliation(s)
- Fei Li
- Zhejiang Key Laboratory of Biology and Ecological Regulation of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China;
| | - Xianhui Wang
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China;
| | - Xin Zhou
- Department of Entomology, China Agricultural University, Beijing, China;
| |
Collapse
|
10
|
Liu T, Conesa A. Profiling the epigenome using long-read sequencing. Nat Genet 2025; 57:27-41. [PMID: 39779955 DOI: 10.1038/s41588-024-02038-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Accepted: 11/19/2024] [Indexed: 01/11/2025]
Abstract
The advent of single-molecule, long-read sequencing (LRS) technologies by Oxford Nanopore Technologies and Pacific Biosciences has revolutionized genomics, transcriptomics and, more recently, epigenomics research. These technologies offer distinct advantages, including the direct detection of methylated DNA and simultaneous assessment of DNA sequences spanning multiple kilobases along with their modifications at the single-molecule level. This has enabled the development of new assays for analyzing chromatin states and made it possible to integrate data for DNA methylation, chromatin accessibility, transcription factor binding and histone modifications, thereby facilitating comprehensive epigenomic profiling. Owing to recent advancements, alternative, nascent and translating transcripts can be detected using LRS approaches. This Review discusses LRS-based experimental and computational strategies for characterizing chromatin states and highlights their advantages over short-read sequencing methods. Furthermore, we demonstrate how various long-read methods can be integrated to design multi-omics studies to investigate the relationship between chromatin states and transcriptional dynamics.
Collapse
Affiliation(s)
- Tianyuan Liu
- Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Spain
| | - Ana Conesa
- Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Spain.
| |
Collapse
|
11
|
Glick L, Castiglione S, Loewenthal G, Raia P, Pupko T, Mayrose I. Phylogenetic Analysis of 590 Species Reveals Distinct Evolutionary Patterns of Intron-Exon Gene Structures Across Eukaryotic Lineages. Mol Biol Evol 2024; 41:msae248. [PMID: 39657604 DOI: 10.1093/molbev/msae248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 11/16/2024] [Accepted: 11/18/2024] [Indexed: 12/12/2024] Open
Abstract
Introns are highly prevalent in most eukaryotic genomes. Despite the accumulating evidence for benefits conferred by the possession of introns, their specific roles and functions, as well as the processes shaping their evolution, are still only partially understood. Here, we explore the evolution of the eukaryotic intron-exon gene structure by focusing on several key features such as the intron length, the number of introns, and the intron-to-exon length ratio in protein-coding genes. We utilize whole-genome data from 590 species covering the main eukaryotic taxonomic groups and analyze them within a statistical phylogenetic framework. We found that the basic gene structure differs markedly among the main eukaryotic groups, with animals, and particularly chordates, displaying intron-rich genes, compared with plants and fungi. Reconstruction of gene structure evolution suggests that these differences evolved prior to the divergence of the main phyla and have remained mostly conserved within groups. We revisit the previously reported association between the genome size and the mean intron length and report that this association differs considerably among phyla. Analyzing a large and diverse dataset of species with whole-genome information while applying advanced modeling techniques allowed us to obtain a global evolutionary perspective. Our findings may indicate that introns play different molecular and evolutionary roles in different organisms.
Collapse
Affiliation(s)
- Lior Glick
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Silvia Castiglione
- Department of Earth Sciences, Environment and Resources, University of Naples Federico II, Naples, Italy
| | - Gil Loewenthal
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Pasquale Raia
- Department of Earth Sciences, Environment and Resources, University of Naples Federico II, Naples, Italy
| | - Tal Pupko
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Itay Mayrose
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
12
|
Lancaster MA. Unraveling mechanisms of human brain evolution. Cell 2024; 187:5838-5857. [PMID: 39423803 PMCID: PMC7617105 DOI: 10.1016/j.cell.2024.08.052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 06/19/2024] [Accepted: 08/28/2024] [Indexed: 10/21/2024]
Abstract
Evolutionary changes in human brain structure and function have enabled our specialized cognitive abilities. How these changes have come about genetically and functionally has remained an open question. However, new methods are providing a wealth of information about the genetic, epigenetic, and transcriptomic differences that set the human brain apart. Combined with in vitro models that allow access to developing brain tissue and the cells of our closest living relatives, the puzzle pieces are now coming together to yield a much more complete picture of what is actually unique about the human brain. The challenge now will be linking these observations and making the jump from correlation to causation. However, elegant genetic manipulations are now possible and, when combined with model systems such as organoids, will uncover a mechanistic understanding of how evolutionary changes at the genetic level have led to key differences in development and function that enable human cognition.
Collapse
Affiliation(s)
- Madeline A Lancaster
- MRC Laboratory of Molecular Biology, Cambridge, UK; Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK.
| |
Collapse
|
13
|
Cooper KL. The case against simplistic genetic explanations of evolution. Development 2024; 151:dev203077. [PMID: 39369308 PMCID: PMC11463953 DOI: 10.1242/dev.203077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/07/2024]
Abstract
Humans are curious to understand the causes of traits that distinguish us from other animals and that distinguish vastly different species from one another. We also have a proclivity for simple stories and sometimes tend toward seeking and accepting simple genetic explanations for large evolutionary shifts, even to a single gene. Here, I reveal how a biased expectation of mechanistic simplicity threads through the long history of evolutionary and developmental genetics. I argue, however, that expecting a simple mechanism threatens a deeper understanding of evolution, and I define the limitations for interpreting experimental evidence in evolutionary developmental genetics.
Collapse
Affiliation(s)
- Kimberly L. Cooper
- Department of Cell and Developmental Biology, University of California San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
14
|
Linck EB, Cadena CD. A Latitudinal Gradient of Reference Genomes. Mol Ecol 2024:e17551. [PMID: 39400919 DOI: 10.1111/mec.17551] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2024] [Revised: 09/17/2024] [Accepted: 09/26/2024] [Indexed: 10/15/2024]
Abstract
Global inequality rooted in legacies of colonialism and uneven development can lead to systematic biases in scientific knowledge. In ecology and evolutionary biology, findings, funding and research effort are disproportionately concentrated at high latitudes, while biological diversity is concentrated at low latitudes. This discrepancy may have a particular influence in fields like phylogeography, molecular ecology and conservation genetics, where the rise of genomics has increased the cost and technical expertise required to apply state-of-the-art methods. Here, we ask whether a fundamental biogeographic pattern-the latitudinal gradient of species richness in tetrapods-is reflected in the available reference genomes, an important data resource for various applications of molecular tools for biodiversity research and conservation. We also ask whether sequencing approaches differ between the Global South and Global North, reviewing the last 5 years of conservation genetics research in four leading journals. We find that extant reference genomes are scarce relative to species richness at low latitudes and that reduced representation and whole-genome sequencing are disproportionately applied to taxa in the Global North. We conclude with recommendations to close this gap and improve international collaborations in biodiversity genomics.
Collapse
Affiliation(s)
- Ethan B Linck
- Department of Ecology, Montana State University, Bozeman, Montana, USA
| | | |
Collapse
|
15
|
Gulhan AB, Burhans R, Harris R, Kandemir M, Haeussler M, Nekrutenko A. KegAlign: Optimizing pairwise alignments with diagonal partitioning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.02.610839. [PMID: 39282333 PMCID: PMC11398343 DOI: 10.1101/2024.09.02.610839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]
Abstract
Our ability to generate sequencing data and assemble it into high quality complete genomes has rapidly advanced in recent years. These data promise to advance our understanding of organismal biology and answer longstanding evolutionary questions. Multiple genome alignment is a key tool in this quest. It is also the area which is lagging: today we can generate genomes faster than we can construct and update multiple alignments containing them. The bottleneck is in considerable computational time required to generate accurate pairwise alignments between divergent genomes, an unavoidable precursor to multiple alignments. This step is typically performed with lastZ, a very sensitive and yet equally slow tool. Here we describe an optimized GPU-enabled pairwise aligner KegAlign. It incorporates a new parallelization strategy, diagonal partitioning, with the latest features of modern GPUs. With KegAlign a typical human/mouse alignment can be computed in under 6 hours on a machine containing a single NVidia A100 GPU and 80 CPU cores without the need for any pre-partitioning of input sequences: a ~150× improvement over lastZ. While other pairwise aligners can complete this task in a fraction of that time, none achieves the sensitivity of KegAlign's main alignment engine, lastZ, and thus may not be suitable for comparing divergent genomes. In addition to providing the source code and a Conda package for KegAlign we also provide a Galaxy workflow that can be readily used by anyone.
Collapse
Affiliation(s)
- A Burak Gulhan
- Department of Computer Science and Engineering, Penn State University
| | - Richard Burhans
- Department of Biochemistry and Molecular Biology, Penn State University
- Center for Computational Biology and Bioinformatics, Penn State University
- The Galaxy Project, Error! Hyperlink reference not valid
| | - Robert Harris
- Center for Computational Biology and Bioinformatics, Penn State University
| | - Mahmut Kandemir
- Department of Computer Science and Engineering, Penn State University
| | | | - Anton Nekrutenko
- Department of Biochemistry and Molecular Biology, Penn State University
- Center for Computational Biology and Bioinformatics, Penn State University
- The Galaxy Project, Error! Hyperlink reference not valid
| |
Collapse
|
16
|
Tang X, Ortner NJ, Nikonishyna YV, Fernández-Quintero ML, Kokot J, Striessnig J, Liedl KR. Pathogenicity of de novo CACNA1D Ca 2+ channel variants predicted from sequence co-variation. Eur J Hum Genet 2024; 32:1065-1073. [PMID: 38553610 PMCID: PMC11369236 DOI: 10.1038/s41431-024-01594-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 03/02/2024] [Accepted: 03/12/2024] [Indexed: 09/04/2024] Open
Abstract
Voltage-gated L-type Cav1.3 Ca2+ channels support numerous physiological functions including neuronal excitability, sinoatrial node pacemaking, hearing, and hormone secretion. De novo missense mutations in the gene of their pore-forming α1-subunit (CACNA1D) induce severe gating defects which lead to autism spectrum disorder and a more severe neurological disorder with and without endocrine symptoms. The number of CACNA1D variants reported is constantly rising, but their pathogenic potential often remains unclear, which complicates clinical decision-making. Since functional tests are time-consuming and not always available, bioinformatic tools further improving pathogenicity potential prediction of novel variants are needed. Here we employed evolutionary analysis considering sequences of the Cav1.3 α1-subunit throughout the animal kingdom to predict the pathogenicity of human disease-associated CACNA1D missense variants. Co-variation analyses of evolutionary information revealed residue-residue couplings and allowed to generate a score, which correctly predicted previously identified pathogenic variants, supported pathogenicity in variants previously classified as likely pathogenic and even led to the re-classification or re-examination of 18 out of 80 variants previously assessed with clinical and electrophysiological data. Based on the prediction score, we electrophysiologically tested one variant (V584I) and found significant gating changes associated with pathogenic risks. Thus, our co-variation model represents a valuable addition to complement the assessment of the pathogenicity of CACNA1D variants completely independent of clinical diagnoses, electrophysiology, structural or biophysical considerations, and solely based on evolutionary analyses.
Collapse
Affiliation(s)
- Xuechen Tang
- Department of General, Inorganic and Theoretical Chemistry, Center for Molecular Biosciences Innsbruck, University of Innsbruck, A-6020, Innsbruck, Austria
| | - Nadine J Ortner
- Department of Pharmacology and Toxicology, Center for Molecular Biosciences Innsbruck, University of Innsbruck, A-6020, Innsbruck, Austria
| | - Yuliia V Nikonishyna
- Department of Pharmacology and Toxicology, Center for Molecular Biosciences Innsbruck, University of Innsbruck, A-6020, Innsbruck, Austria
| | - Monica L Fernández-Quintero
- Department of General, Inorganic and Theoretical Chemistry, Center for Molecular Biosciences Innsbruck, University of Innsbruck, A-6020, Innsbruck, Austria
| | - Janik Kokot
- Department of General, Inorganic and Theoretical Chemistry, Center for Molecular Biosciences Innsbruck, University of Innsbruck, A-6020, Innsbruck, Austria
| | - Jörg Striessnig
- Department of Pharmacology and Toxicology, Center for Molecular Biosciences Innsbruck, University of Innsbruck, A-6020, Innsbruck, Austria.
| | - Klaus R Liedl
- Department of General, Inorganic and Theoretical Chemistry, Center for Molecular Biosciences Innsbruck, University of Innsbruck, A-6020, Innsbruck, Austria.
| |
Collapse
|
17
|
Fleming LE, Landrigan PJ, Ashford OS, Whitman EM, Swift A, Gerwick WH, Heymans JJ, Hicks CC, Morrissey K, White MP, Alcantara-Creencia L, Alexander KA, Astell-Burt T, Berlinck RGS, Cohen PJ, Hixson R, Islam MM, Iwasaki A, Praptiwi RA, Raps H, Remy JY, Sowman G, Ternon E, Thiele T, Thilsted SH, Uku J, Ockenden S, Kumar P. Enhancing Human Health and Wellbeing through Sustainably and Equitably Unlocking a Healthy Ocean's Potential. Ann Glob Health 2024; 90:41. [PMID: 39005643 PMCID: PMC11243763 DOI: 10.5334/aogh.4471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2024] [Accepted: 05/13/2024] [Indexed: 07/16/2024] Open
Abstract
A healthy ocean is essential for human health, and yet the links between the ocean and human health are often overlooked. By providing new medicines, technologies, energy, foods, recreation, and inspiration, the ocean has the potential to enhance human health and wellbeing. However, climate change, pollution, biodiversity loss, and inequity threaten both ocean and human health. Sustainable realisation of the ocean's health benefits will require overcoming these challenges through equitable partnerships, enforcement of laws and treaties, robust monitoring, and use of metrics that assess both the ocean's natural capital and human wellbeing. Achieving this will require an explicit focus on human rights, equity, sustainability, and social justice. In addition to highlighting the potential unique role of the healthcare sector, we offer science-based recommendations to protect both ocean health and human health, and we highlight the unique potential of the healthcare sector tolead this effort.
Collapse
Affiliation(s)
- Lora E Fleming
- European Centre for Environment and Human Health of the University of Exeter Medical School, Cornwall, UK
| | - Philip J Landrigan
- Instituto de Quimica de Sao Carlos, Universidade de Sao Paulo, Sao Carlos, SP, Brazil
| | | | - Ella M Whitman
- Program for Global Public Health and the Common Good, Boston College, Boston, Massachusetts, USA and the Centre Scientifique de Monaco, Monaco
| | - Amy Swift
- Ocean Program, at World Resources Institute, London, UK
| | - William H Gerwick
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, US
| | | | | | - Karyn Morrissey
- Department of Technology, Management and Economics, Technical University of Denmark, Denmark
| | | | - Lota Alcantara-Creencia
- College of Fisheries and Aquatic Sciences, Western Philippines University, Palawan, Philippines
| | - Karen A Alexander
- Marine Governance and Blue Economy at Heriot-Watt University, Orkney, UK
| | - Thomas Astell-Burt
- School of Architecture, Design and Planning, University of Sydney, Sydney, Australia
| | - Roberto G S Berlinck
- Instituto de Quimica de Sao Carlos, Universidade de Sao Paulo, Sao Carlos, SP, Brazil
| | - Philippa J Cohen
- Small-Scale Fisheries Research Program World Fish, Penang, Malaysia
| | - Richard Hixson
- Critical Care, County Durham and Darlington NHS Foundation Trust, Darlington, UK
| | - Mohammad Mahmudul Islam
- Department of Coastal and Marine Fisheries, Sylhet Agricultural University, Sylhet, Bangladesh
| | - Arihiro Iwasaki
- Department of Applied Chemistry, Faculty of Science and Engineering, Chuo University, Tokyo, Japan
| | - Radisti A Praptiwi
- Research Center for Ecology and Ethnobiology, National Research and Innovation Agency (BRIN), Jakarta, Indonesia
| | | | - Jan Yves Remy
- Shridath Ramphal Centre, the University of the West Indies, St. Lucia/Barbados
| | - Georgina Sowman
- Advanced Wellbeing Research Centre, Sheffield Hallam University, Sheffield, UK
| | - Eva Ternon
- Laboratoire d'Océanographie de Villefranche at Sorbonne Université, Paris, France
| | - Torsten Thiele
- Research Institute for Sustainability - Helmholtz Centre Potsdam (RIFS), Potsdam, Germany
| | - Shakuntala H Thilsted
- Nutrition, Health and Food Security Impact Area Platform Worldfish CGIAR, Penang, Malaysia
| | - Jacqueline Uku
- Kenya Marine and Fisheries Research Institute, Mombasa, Kenya
| | | | | |
Collapse
|
18
|
Eaton KM, Krabbenhoft TJ, Backenstose NJC, Bernal MA. The chromosome-scale reference genome for the pinfish (Lagodon rhomboides) provides insights into their evolutionary and demographic history. G3 (BETHESDA, MD.) 2024; 14:jkae096. [PMID: 38739549 PMCID: PMC11228864 DOI: 10.1093/g3journal/jkae096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 03/18/2024] [Accepted: 05/06/2024] [Indexed: 05/16/2024]
Abstract
The pinfish (Lagodon rhomboides) is an ecologically, economically, and culturally relevant member of the family Sparidae, playing crucial roles in the marine food webs of the western Atlantic Ocean and Gulf of Mexico. Despite their high abundance and ecological importance, there is a scarcity of genomic resources for this species. We assembled and annotated a chromosome-scale genome for the pinfish, resulting in a highly contiguous 785 Mb assembly of 24 scaffolded chromosomes. The high-quality assembly contains 98.9% complete BUSCOs and shows strong synteny to other chromosome-scale genomes of fish in the family Sparidae, with a limited number of large-scale genomic rearrangements. Leveraging this new genomic resource, we found evidence of significant expansions of dietary gene families over the evolutionary history of the pinfish, which may be associated with an ontogenetic shift from carnivory to herbivory seen in this species. Estimates of historical patterns of population demography using this new reference genome identified several periods of population growth and contraction which were associated with ancient climatic shifts and sea level changes. This genome serves as a valuable reference for future studies of population genomics and differentiation and provides a much-needed genomic resource for this western Atlantic sparid.
Collapse
Affiliation(s)
- Katherine M Eaton
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Trevor J Krabbenhoft
- Department of Biological Sciences, University at Buffalo, Buffalo, NY 14260, USA
- Research and Education in Energy, Environment, and Water (RENEW) Institute, University at Buffalo, Buffalo, NY 14260, USA
| | | | - Moisés A Bernal
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
- Smithsonian Tropical Research Institute (STRI), Panama City, 0843-03092, Panama
| |
Collapse
|
19
|
Jia H, Tan S, Cai Y, Guo Y, Shen J, Zhang Y, Ma H, Zhang Q, Chen J, Qiao G, Ruan J, Zhang YE. Low-input PacBio sequencing generates high-quality individual fly genomes and characterizes mutational processes. Nat Commun 2024; 15:5644. [PMID: 38969648 PMCID: PMC11226609 DOI: 10.1038/s41467-024-49992-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 06/20/2024] [Indexed: 07/07/2024] Open
Abstract
Long-read sequencing, exemplified by PacBio, revolutionizes genomics, overcoming challenges like repetitive sequences. However, the high DNA requirement ( > 1 µg) is prohibitive for small organisms. We develop a low-input (100 ng), low-cost, and amplification-free library-generation method for PacBio sequencing (LILAP) using Tn5-based tagmentation and DNA circularization within one tube. We test LILAP with two Drosophila melanogaster individuals, and generate near-complete genomes, surpassing preexisting single-fly genomes. By analyzing variations in these two genomes, we characterize mutational processes: complex transpositions (transposon insertions together with extra duplications and/or deletions) prefer regions characterized by non-B DNA structures, and gene conversion of transposons occurs on both DNA and RNA levels. Concurrently, we generate two complete assemblies for the endosymbiotic bacterium Wolbachia in these flies and similarly detect transposon conversion. Thus, LILAP promises a broad PacBio sequencing adoption for not only mutational studies of flies and their symbionts but also explorations of other small organisms or precious samples.
Collapse
Affiliation(s)
- Hangxing Jia
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Yingao Cai
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yanyan Guo
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jieyu Shen
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yaqiong Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Huijing Ma
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Qingzhu Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jinfeng Chen
- University of Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Gexia Qiao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
20
|
Crowley LM, Hotaling S. The genome sequence of the oak bush-cricket, Meconema thalassinum (De Geer, 1773). Wellcome Open Res 2024; 9:346. [PMID: 39398939 PMCID: PMC11467643 DOI: 10.12688/wellcomeopenres.21500.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/17/2024] [Indexed: 10/15/2024] Open
Abstract
We present a genome assembly from one male Meconema thalassinum (the oak bush-cricket; Arthropoda; Insecta; Orthoptera; Tettigoniidae). The genome sequence is 9,039.1 megabases in span. Most of the assembly is scaffolded into 15 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled and is 15.63 kilobases in length.
Collapse
Affiliation(s)
- Liam M. Crowley
- Department of Biology, University of Oxford, Oxford, England, UK
| | - Scott Hotaling
- School of Biological Sciences, Washington State University, Pullman, Washington, USA
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Grant AR, Johnson KP, Stanley EL, Baldwin-Brown J, Kolenčík S, Allen JM. Rapid Targeted Assembly of the Proteome Reveals Evolutionary Variation of GC Content in Avian Lice. Bioinform Biol Insights 2024; 18:11779322241257991. [PMID: 38860163 PMCID: PMC11163934 DOI: 10.1177/11779322241257991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 05/02/2024] [Indexed: 06/12/2024] Open
Abstract
Nucleotide base composition plays an influential role in the molecular mechanisms involved in gene function, phenotype, and amino acid composition. GC content (proportion of guanine and cytosine in DNA sequences) shows a high level of variation within and among species. Many studies measure GC content in a small number of genes, which may not be representative of genome-wide GC variation. One challenge when assembling extensive genomic data sets for these studies is the significant amount of resources (monetary and computational) associated with data processing, and many bioinformatic tools have not been optimized for resource efficiency. Using a high-performance computing (HPC) cluster, we manipulated resources provided to the targeted gene assembly program, automated target restricted assembly method (aTRAM), to determine an optimum way to run the program to maximize resource use. Using our optimum assembly approach, we assembled and measured GC content of all of the protein-coding genes of a diverse group of parasitic feather lice. Of the 499 426 genes assembled across 57 species, feather lice were GC-poor (mean GC = 42.96%) with a significant amount of variation within and between species (GC range = 19.57%-73.33%). We found a significant correlation between GC content and standard deviation per taxon for overall GC and GC3, which could indicate selection for G and C nucleotides in some species. Phylogenetic signal of GC content was detected in both GC and GC3. This research provides a large-scale investigation of GC content in parasitic lice laying the foundation for understanding the basis of variation in base composition across species.
Collapse
Affiliation(s)
- Avery R Grant
- Department of Biology, University of Nevada, Reno, Reno, NV, USA
| | - Kevin P Johnson
- Illinois Natural History Survey, Prairie Research Institute, University of Illinois at Urbana-Champaign, Champaign, IL, USA
| | - Edward L Stanley
- Department of Natural History, Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
| | | | - Stanislav Kolenčík
- Faculty of Mathematics, Natural Sciences, and Information Technologies, University of Primorska, Koper, Slovenia
| | - Julie M Allen
- Department of Biological Sciences, Virginia Tech, Blacksburg, VA, USA
| |
Collapse
|
22
|
Dittrich C, Hoelzl F, Smith S, Fouilloux CA, Parker DJ, O’Connell LA, Knowles LS, Hughes M, Fewings A, Morgan R, Rojas B, Comeault AA. Genome Assembly of the Dyeing Poison Frog Provides Insights into the Dynamics of Transposable Element and Genome-Size Evolution. Genome Biol Evol 2024; 16:evae109. [PMID: 38753031 PMCID: PMC11152451 DOI: 10.1093/gbe/evae109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2024] [Indexed: 06/07/2024] Open
Abstract
Genome size varies greatly across the tree of life and transposable elements are an important contributor to this variation. Among vertebrates, amphibians display the greatest variation in genome size, making them ideal models to explore the causes and consequences of genome size variation. However, high-quality genome assemblies for amphibians have, until recently, been rare. Here, we generate a high-quality genome assembly for the dyeing poison frog, Dendrobates tinctorius. We compare this assembly to publicly available frog genomes and find evidence for both large-scale conserved synteny and widespread rearrangements between frog lineages. Comparing conserved orthologs annotated in these genomes revealed a strong correlation between genome size and gene size. To explore the cause of gene-size variation, we quantified the location of transposable elements relative to gene features and find that the accumulation of transposable elements in introns has played an important role in the evolution of gene size in D. tinctorius, while estimates of insertion times suggest that many insertion events are recent and species-specific. Finally, we carry out population-scale mobile-element sequencing and show that the diversity and abundance of transposable elements in poison frog genomes can complicate genotyping from repetitive element sequence anchors. Our results show that transposable elements have clearly played an important role in the evolution of large genome size in D. tinctorius. Future studies are needed to fully understand the dynamics of transposable element evolution and to optimize primer or bait design for cost-effective population-level genotyping in species with large, repetitive genomes.
Collapse
Affiliation(s)
- Carolin Dittrich
- Department of Biology and Environmental Sciences, University of Jyväskylä, Jyväskylä, Finland
- Department of Interdisciplinary Life Sciences, Konrad Lorenz Institute of Ethology, University of Veterinary Medicine, Vienna, Austria
| | - Franz Hoelzl
- Department of Interdisciplinary Life Sciences, Konrad Lorenz Institute of Ethology, University of Veterinary Medicine, Vienna, Austria
| | - Steve Smith
- Department of Interdisciplinary Life Sciences, Konrad Lorenz Institute of Ethology, University of Veterinary Medicine, Vienna, Austria
| | - Chloe A Fouilloux
- Department of Biology and Environmental Sciences, University of Jyväskylä, Jyväskylä, Finland
| | - Darren J Parker
- School of Environmental and Natural Sciences, Molecular Ecology & Evolution Group, Bangor University, Bangor, UK
| | | | - Lucy S Knowles
- NERC Environmental Omics Facility, University of Sheffield, Sheffield, UK
| | - Margaret Hughes
- Centre for Genomic Research, University of Liverpool, Liverpool, UK
| | - Ade Fewings
- Supercomputing Wales, Digital Services, Bangor University, Bangor, UK
| | - Rhys Morgan
- School of Environmental and Natural Sciences, Molecular Ecology & Evolution Group, Bangor University, Bangor, UK
| | - Bibiana Rojas
- Department of Biology and Environmental Sciences, University of Jyväskylä, Jyväskylä, Finland
- Department of Interdisciplinary Life Sciences, Konrad Lorenz Institute of Ethology, University of Veterinary Medicine, Vienna, Austria
| | - Aaron A Comeault
- School of Environmental and Natural Sciences, Molecular Ecology & Evolution Group, Bangor University, Bangor, UK
| |
Collapse
|
23
|
Rick JA, Brock CD, Lewanski AL, Golcher-Benavides J, Wagner CE. Reference Genome Choice and Filtering Thresholds Jointly Influence Phylogenomic Analyses. Syst Biol 2024; 73:76-101. [PMID: 37881861 DOI: 10.1093/sysbio/syad065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 09/20/2023] [Accepted: 10/20/2023] [Indexed: 10/27/2023] Open
Abstract
Molecular phylogenies are a cornerstone of modern comparative biology and are commonly employed to investigate a range of biological phenomena, such as diversification rates, patterns in trait evolution, biogeography, and community assembly. Recent work has demonstrated that significant biases may be introduced into downstream phylogenetic analyses from processing genomic data; however, it remains unclear whether there are interactions among bioinformatic parameters or biases introduced through the choice of reference genome for sequence alignment and variant calling. We address these knowledge gaps by employing a combination of simulated and empirical data sets to investigate the extent to which the choice of reference genome in upstream bioinformatic processing of genomic data influences phylogenetic inference, as well as the way that reference genome choice interacts with bioinformatic filtering choices and phylogenetic inference method. We demonstrate that more stringent minor allele filters bias inferred trees away from the true species tree topology, and that these biased trees tend to be more imbalanced and have a higher center of gravity than the true trees. We find the greatest topological accuracy when filtering sites for minor allele count (MAC) >3-4 in our 51-taxa data sets, while tree center of gravity was closest to the true value when filtering for sites with MAC >1-2. In contrast, filtering for missing data increased accuracy in the inferred topologies; however, this effect was small in comparison to the effect of minor allele filters and may be undesirable due to a subsequent mutation spectrum distortion. The bias introduced by these filters differs based on the reference genome used in short read alignment, providing further support that choosing a reference genome for alignment is an important bioinformatic decision with implications for downstream analyses. These results demonstrate that attributes of the study system and dataset (and their interaction) add important nuance for how best to assemble and filter short-read genomic data for phylogenetic inference.
Collapse
Affiliation(s)
- Jessica A Rick
- School of Natural Resources & the Environment, University of Arizona, Tucson, AZ 85719, USA
| | - Chad D Brock
- Department of Biological Sciences, Tarleton State University, Stephenville, TX 76401, USA
| | - Alexander L Lewanski
- Department of Integrative Biology and W.K. Kellogg Biological Station, Michigan State University, East Lansing, MI 48824, USA
| | - Jimena Golcher-Benavides
- Department of Natural Resource Ecology and Management, Iowa State University, Ames, IA 50011, USA
| | - Catherine E Wagner
- Program in Ecology and Evolution, University of Wyoming, Laramie, WY 82071, USA
- Department of Botany, University of Wyoming, Laramie, WY 82071, USA
| |
Collapse
|
24
|
Sommer MJ, Zimin AV, Salzberg SL. PSAURON: a tool for assessing protein annotation across a broad range of species. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.15.594385. [PMID: 38798674 PMCID: PMC11118318 DOI: 10.1101/2024.05.15.594385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Evaluating the accuracy of protein-coding sequences in genome annotations is a challenging problem for which there is no broadly applicable solution. In this manuscript we introduce PSAURON (Protein Sequence Assessment Using a Reference ORF Network), a novel software tool developed to assess the quality of protein-coding gene annotations. Utilizing a machine learning model trained on a diverse dataset from over 1000 plant and animal genomes, PSAURON assigns a score to coding DNA or protein sequence that reflects the likelihood that the sequence is a genuine protein coding region. PSAURON scores can be used for genome-wide protein annotation assessment as well as the rapid identification of potentially spurious annotated proteins. Validation against established benchmarks demonstrates PSAURON's effectiveness and correlation with recognized measures of protein quality, highlighting its potential use as a general-purpose method to evaluate gene annotation. PSAURON is open source and freely available at https://github.com/salzberg-lab/PSAURON . One-Sentence Summary PSAURON is a machine learning-based tool for rapid assessment of protein coding gene annotation.
Collapse
|
25
|
Galià-Camps C, Pegueroles C, Turon X, Carreras C, Pascual M. Genome composition and GC content influence loci distribution in reduced representation genomic studies. BMC Genomics 2024; 25:410. [PMID: 38664648 PMCID: PMC11046876 DOI: 10.1186/s12864-024-10312-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 04/15/2024] [Indexed: 04/28/2024] Open
Abstract
BACKGROUND Genomic architecture is a key evolutionary trait for living organisms. Due to multiple complex adaptive and neutral forces which impose evolutionary pressures on genomes, there is a huge variability of genomic features. However, their variability and the extent to which genomic content determines the distribution of recovered loci in reduced representation sequencing studies is largely unexplored. RESULTS Here, by using 80 genome assemblies, we observed that whereas plants primarily increase their genome size by expanding their intergenic regions, animals expand both intergenic and intronic regions, although the expansion patterns differ between deuterostomes and protostomes. Loci mapping in introns, exons, and intergenic categories obtained by in silico digestion using 2b-enzymes are positively correlated with the percentage of these regions in the corresponding genomes, suggesting that loci distribution mostly mirrors genomic architecture of the selected taxon. However, exonic regions showed a significant enrichment of loci in all groups regardless of the used enzyme. Moreover, when using selective adaptors to obtain a secondarily reduced loci dataset, the percentage and distribution of retained loci also varied. Adaptors with G/C terminals recovered a lower percentage of selected loci, with a further enrichment of exonic regions, while adaptors with A/T terminals retained a higher percentage of loci and slightly selected more intronic regions than expected. CONCLUSIONS Our results highlight how genome composition, genome GC content, RAD enzyme choice and use of base-selective adaptors influence reduced genome representation techniques. This is important to acknowledge in population and conservation genomic studies, as it determines the abundance and distribution of loci.
Collapse
Affiliation(s)
- Carles Galià-Camps
- Departament de Genètica, Microbiologia i Estadística, Universitat de Barcelona, Avinguda Diagonal 643, Barcelona, 08028, Spain.
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), Barcelona, Spain.
- Department of Marine Ecology, Centre d'Estudis Avançats de Blanes (CEAB-CSIC), Accés Cala Sant Francesc 14, Blanes, 17300, Spain.
| | - Cinta Pegueroles
- Departament de Genètica, Microbiologia i Estadística, Universitat de Barcelona, Avinguda Diagonal 643, Barcelona, 08028, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), Barcelona, Spain
| | - Xavier Turon
- Department of Marine Ecology, Centre d'Estudis Avançats de Blanes (CEAB-CSIC), Accés Cala Sant Francesc 14, Blanes, 17300, Spain
| | - Carlos Carreras
- Departament de Genètica, Microbiologia i Estadística, Universitat de Barcelona, Avinguda Diagonal 643, Barcelona, 08028, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), Barcelona, Spain
| | - Marta Pascual
- Departament de Genètica, Microbiologia i Estadística, Universitat de Barcelona, Avinguda Diagonal 643, Barcelona, 08028, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), Barcelona, Spain
| |
Collapse
|
26
|
Lawson DJ, Howard-McCombe J, Beaumont M, Senn H. How admixed captive breeding populations could be rescued using local ancestry information. Mol Ecol 2024:e17349. [PMID: 38634332 DOI: 10.1111/mec.17349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 12/21/2023] [Accepted: 02/26/2024] [Indexed: 04/19/2024]
Abstract
This paper asks the question: can genomic information be used to recover a species that is already on the pathway to extinction due to genetic swamping from a related and more numerous population? We show that a breeding strategy in a captive breeding program can use whole genome sequencing to identify and remove segments of DNA introgressed through hybridisation. The proposed policy uses a generalized measure of kinship or heterozygosity accounting for local ancestry, that is, whether a specific genetic location was inherited from the target of conservation. We then show that optimizing these measures would minimize undesired ancestry while also controlling kinship and/or heterozygosity, in a simulated breeding population. The process is applied to real data representing the hybridized Scottish wildcat breeding population, with the result that it should be possible to breed out domestic cat ancestry. The ability to reverse introgression is a powerful tool brought about through the combination of sequencing with computational advances in ancestry estimation. Since it works best when applied early in the process, important decisions need to be made about which genetically distinct populations should benefit from it and which should be left to reform into a single population.
Collapse
Affiliation(s)
- Daniel J Lawson
- Institute of Statistical Sciences, School of Mathematics, University of Bristol, Bristol, UK
| | - Jo Howard-McCombe
- RZSS WildGenes Laboratory, Conservation Department, Royal Zoological Society of Scotland, Edinburgh, UK
| | - Mark Beaumont
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Helen Senn
- RZSS WildGenes Laboratory, Conservation Department, Royal Zoological Society of Scotland, Edinburgh, UK
| |
Collapse
|
27
|
Gomes-Dos-Santos A, Fonseca E, Riccardi N, Hinzmann M, Lopes-Lima M, Froufe E. The transcriptome assembly of the European freshwater mussel Unio elongatulus C. Pfeiffer, 1825. Sci Data 2024; 11:377. [PMID: 38609426 PMCID: PMC11014934 DOI: 10.1038/s41597-024-03226-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 04/04/2024] [Indexed: 04/14/2024] Open
Abstract
Freshwater mussels of the order Unionida are a global conservation concern. Species of this group are strictly freshwater, sessile, slow-growing animals and, extremely sensitive to environmental changes. Human-mediated changes in freshwater habitats are imposing enormous pressure on the survival of freshwater mussels. Although a few flagship species are protected in Europe, other highly imperilled species receive much less attention. Moreover, knowledge about biology, ecology, and evolution and proper conservation assessments of many European species are still sparse. This knowledge gap is further aggravated by the lack of genomic resources available, which are key tools for conservation. Here we present the transcriptome assembly of Unio elongatulus C. Pfeiffer, 1825, one of the least studied European freshwater mussels. Using the individual sequencing outputs from eight physiologically representative mussel tissues, we provide an annotated panel of tissue-specific Relative Gene Expression profiles. These resources are pivotal to studying the species' biological and ecological features, as well as helping to understand its vulnerability to current and future threats.
Collapse
Affiliation(s)
- André Gomes-Dos-Santos
- CIIMAR/CIMAR - Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208, Matosinhos, Portugal.
| | - Elza Fonseca
- CIIMAR/CIMAR - Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208, Matosinhos, Portugal
| | | | - Mariana Hinzmann
- CIIMAR/CIMAR - Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208, Matosinhos, Portugal
| | - Manuel Lopes-Lima
- BIOPOLIS Program in Genomics, Biodiversity and Ecosystems, CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, Vairão, Portugal
- IUCN SSC Mollusc Specialist Group, c/o IUCN, Cambridge, UK
| | - Elsa Froufe
- CIIMAR/CIMAR - Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208, Matosinhos, Portugal.
| |
Collapse
|
28
|
Halstead-Nussloch G, Signorini SG, Giulio M, Crocetta F, Munari M, Della Torre C, Weber AAT. The genome of the rayed Mediterranean limpet Patella caerulea (Linnaeus, 1758). Genome Biol Evol 2024; 16:evae070. [PMID: 38546725 PMCID: PMC11003540 DOI: 10.1093/gbe/evae070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/23/2024] [Indexed: 04/11/2024] Open
Abstract
Patella caerulea (Linnaeus, 1758) is a mollusc limpet species of the class Gastropoda. Endemic to the Mediterranean Sea, it is considered a keystone species due to its primary role in structuring and regulating the ecological balance of tidal and subtidal habitats. It is currently being used as a bioindicator to assess the environmental quality of coastal marine waters and as a model species to understand adaptation to ocean acidification. Here, we provide a high-quality reference genome assembly and annotation for P. caerulea. We generated ∼30 Gb of Pacific Biosciences high-fidelity data from a single individual and provide a final 749.8 Mb assembly containing 62 contigs, including the mitochondrial genome (14,938 bp). With an N50 of 48.8 Mb and 98% of the assembly contained in the 18 largest contigs, this assembly is near chromosome-scale. Benchmarking Universal Single-Copy Orthologs scores were high (Mollusca, 87.8% complete; Metazoa, 97.2% complete) and similar to metrics observed for other chromosome-level Patella genomes, highlighting a possible bias in the Mollusca database for Patellids. We generated transcriptomic Illumina data from a second individual collected at the same locality and used it together with protein evidence to annotate the genome. A total of 23,938 protein-coding gene models were found. By comparing this annotation with other published Patella annotations, we found that the distribution and median values of exon and gene lengths was comparable with other Patella species despite different annotation approaches. The present high-quality P. caerulea reference genome, available on GenBank (BioProject: PRJNA1045377; assembly: GCA_036850965.1), is an important resource for future ecological and evolutionary studies.
Collapse
Affiliation(s)
| | - Silvia Giorgia Signorini
- Department of Aquatic Ecology, Swiss Federal Institute of Aquatic Science and Technology (Eawag), Dübendorf, Switzerland
- Department of Biosciences, University of Milan, Milan, Italy
- Department of Integrative Marine Ecology, Stazione Zoologica Anton Dohrn, Naples, Italy
| | - Marco Giulio
- Department of Aquatic Ecology, Swiss Federal Institute of Aquatic Science and Technology (Eawag), Dübendorf, Switzerland
| | - Fabio Crocetta
- Department of Integrative Marine Ecology, Stazione Zoologica Anton Dohrn, Naples, Italy
- National Biodiversity Future Center (NBFC), Palermo, Italy
| | - Marco Munari
- Department of Integrative Marine Ecology, Stazione Zoologica Anton Dohrn, Naples, Italy
- Department of Biology, Stazione Idrobiologica ‘Umberto d’Ancona’, University of Padova, Chioggia, Italy
| | - Camilla Della Torre
- Department of Biosciences, University of Milan, Milan, Italy
- Department of Integrative Marine Ecology, Stazione Zoologica Anton Dohrn, Naples, Italy
| | - Alexandra Anh-Thu Weber
- Department of Aquatic Ecology, Swiss Federal Institute of Aquatic Science and Technology (Eawag), Dübendorf, Switzerland
| |
Collapse
|
29
|
Linn C, O’Malley A, Khatri K, Wright EM, Sebagh D, Grbić M, Kowal K, Chruszcz M. Microscopic Menaces: The Impact of Mites on Human Health. Int J Mol Sci 2024; 25:3675. [PMID: 38612486 PMCID: PMC11011512 DOI: 10.3390/ijms25073675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 03/20/2024] [Accepted: 03/21/2024] [Indexed: 04/14/2024] Open
Abstract
Mites are highly prevalent arthropods that infest diverse ecological niches globally. Approximately 55,000 species of mites have been identified but many more are yet to be discovered. Of the ones we do know about, most go unnoticed by humans and animals. However, there are several species from the Acariformes superorder that exert a significant impact on global human health. House dust mites are a major source of inhaled allergens, affecting 10-20% of the world's population; storage mites also cause a significant allergy in susceptible individuals; chiggers are the sole vectors for the bacterium that causes scrub typhus; Demodex mites are part of the normal microfauna of humans and their pets, but under certain conditions populations grow out of control and affect the integrity of the integumentary system; and scabies mites cause one of the most common dermatological diseases worldwide. On the other hand, recent genome sequences of mites provide novel tools for mite control and the development of new biomaterial with applications in biomedicine. Despite the palpable disease burden, mites remain understudied in parasitological research. By better understanding mite biology and disease processes, researchers can identify new ways to diagnose, manage, and prevent common mite-induced afflictions. This knowledge can lead to improved clinical outcomes and reduced disease burden from these remarkably widespread yet understudied creatures.
Collapse
Affiliation(s)
- Christina Linn
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA; (C.L.); (A.O.); (K.K.); (E.M.W.); (D.S.)
| | - Andrea O’Malley
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA; (C.L.); (A.O.); (K.K.); (E.M.W.); (D.S.)
| | - Kriti Khatri
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA; (C.L.); (A.O.); (K.K.); (E.M.W.); (D.S.)
| | - Elaine M. Wright
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA; (C.L.); (A.O.); (K.K.); (E.M.W.); (D.S.)
| | - Dylan Sebagh
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA; (C.L.); (A.O.); (K.K.); (E.M.W.); (D.S.)
| | - Miodrag Grbić
- Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada;
| | - Krzysztof Kowal
- Department of Allergology and Internal Medicine, Medical University of Bialystok, 15-276 Bialystok, Poland;
- Department of Experimental Allergology and Immunology, Medical University of Bialystok, 15-276 Bialystok, Poland
| | - Maksymilian Chruszcz
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA; (C.L.); (A.O.); (K.K.); (E.M.W.); (D.S.)
| |
Collapse
|
30
|
Awkerman JA, Glinski DA, Henderson WM, Van Meter R, Purucker ST. Framework for multi-stressor physiological response evaluation in amphibian risk assessment and conservation. Front Ecol Evol 2024; 12:1-16. [PMID: 39679000 PMCID: PMC11636185 DOI: 10.3389/fevo.2024.1336747] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2024] Open
Abstract
Controlled laboratory experiments are often performed on amphibians to establish causality between stressor presence and an adverse outcome. However, in the field, identification of lab-generated biomarkers from single stressors and the interactions of multiple impacts are difficult to discern in an ecological context. The ubiquity of some pesticides and anthropogenic contaminants results in potentially cryptic sublethal effects or synergistic effects among multiple stressors. Although biochemical pathways regulating physiological responses to toxic stressors are often well-conserved among vertebrates, different exposure regimes and life stage vulnerabilities can yield variable ecological risk among species. Here we examine stress-related biomarkers, highlight endpoints commonly linked to apical effects, and discuss differences in ontogeny and ecology that could limit interpretation of biomarkers across species. Further we identify promising field-based physiological measures indicative of potential impacts to health and development of amphibians that could be useful to anuran conservation. We outline the physiological responses to common stressors in the context of altered functional pathways, presenting useful stage-specific endpoints for anuran species, and discussing multi-stressor vulnerability in the larger framework of amphibian life history and ecology. This overview identifies points of physiological, ecological, and demographic vulnerability to provide context in evaluating the multiple stressors impacting amphibian populations worldwide for strategic conservation planning.
Collapse
Affiliation(s)
- Jill A. Awkerman
- Center for Ecosystem Measurement and Modeling, Office of Research and Development, US Environmental Protection Agency, Gulf Breeze, FL, United States
| | - Donna A. Glinski
- Center for Ecosystem Measurement and Modeling, Office of Research and Development, US Environmental Protection Agency, Athens, GA, United States
| | - W. Matthew Henderson
- Center for Ecosystem Measurement and Modeling, Office of Research and Development, US Environmental Protection Agency, Athens, GA, United States
| | - Robin Van Meter
- Environmental Science and Studies, Washington College, Chestertown, MD, United States
| | - S. Thomas Purucker
- Center for Computational Toxicology and Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC, United States
| |
Collapse
|
31
|
Margiotta-Casaluci L, Owen SF, Winter MJ. Cross-Species Extrapolation of Biological Data to Guide the Environmental Safety Assessment of Pharmaceuticals-The State of the Art and Future Priorities. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2024; 43:513-525. [PMID: 37067359 DOI: 10.1002/etc.5634] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/23/2023] [Accepted: 04/13/2023] [Indexed: 05/27/2023]
Abstract
The extrapolation of biological data across species is a key aspect of biomedical research and drug development. In this context, comparative biology considerations are applied with the goal of understanding human disease and guiding the development of effective and safe medicines. However, the widespread occurrence of pharmaceuticals in the environment and the need to assess the risk posed to wildlife have prompted a renewed interest in the extrapolation of pharmacological and toxicological data across the entire tree of life. To address this challenge, a biological "read-across" approach, based on the use of mammalian data to inform toxicity predictions in wildlife species, has been proposed as an effective way to streamline the environmental safety assessment of pharmaceuticals. Yet, how effective has this approach been, and are we any closer to being able to accurately predict environmental risk based on known human risk? We discuss the main theoretical and experimental advancements achieved in the last 10 years of research in this field. We propose that a better understanding of the functional conservation of drug targets across species and of the quantitative relationship between target modulation and adverse effects should be considered as future research priorities. This pharmacodynamic focus should be complemented with the application of higher-throughput experimental and computational approaches to accelerate the prediction of internal exposure dynamics. The translation of comparative (eco)toxicology research into real-world applications, however, relies on the (limited) availability of experts with the skill set needed to navigate the complexity of the problem; hence, we also call for synergistic multistakeholder efforts to support and strengthen comparative toxicology research and education at a global level. Environ Toxicol Chem 2024;43:513-525. © 2023 The Authors. Environmental Toxicology and Chemistry published by Wiley Periodicals LLC on behalf of SETAC.
Collapse
Affiliation(s)
- Luigi Margiotta-Casaluci
- Institute of Pharmaceutical Science, Faculty of Life Sciences & Medicine, King's College London, London, United Kingdom
| | - Stewart F Owen
- Global Sustainability, AstraZeneca, Macclesfield, Cheshire, United Kingdom
| | - Matthew J Winter
- Biosciences, Faculty of Health and Life Sciences, University of Exeter, Exeter, Devon, United Kingdom
| |
Collapse
|
32
|
Larivière D, Abueg L, Brajuka N, Gallardo-Alba C, Grüning B, Ko BJ, Ostrovsky A, Palmada-Flores M, Pickett BD, Rabbani K, Antunes A, Balacco JR, Chaisson MJP, Cheng H, Collins J, Couture M, Denisova A, Fedrigo O, Gallo GR, Giani AM, Gooder GM, Horan K, Jain N, Johnson C, Kim H, Lee C, Marques-Bonet T, O'Toole B, Rhie A, Secomandi S, Sozzoni M, Tilley T, Uliano-Silva M, van den Beek M, Williams RW, Waterhouse RM, Phillippy AM, Jarvis ED, Schatz MC, Nekrutenko A, Formenti G. Scalable, accessible and reproducible reference genome assembly and evaluation in Galaxy. Nat Biotechnol 2024; 42:367-370. [PMID: 38278971 PMCID: PMC11462542 DOI: 10.1038/s41587-023-02100-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2024]
Affiliation(s)
- Delphine Larivière
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, USA
| | - Linelle Abueg
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Nadolina Brajuka
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Cristóbal Gallardo-Alba
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs University Freiburg, Freiburg, Germany
| | - Bjorn Grüning
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs University Freiburg, Freiburg, Germany
| | - Byung June Ko
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
| | - Alex Ostrovsky
- Departments of Biology and Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Marc Palmada-Flores
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona, Spain
| | - Brandon D Pickett
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Keon Rabbani
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, Porto, Portugal
| | - Jennifer R Balacco
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Haoyu Cheng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | | | - Melanie Couture
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Alexandra Denisova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | | | | | | | - Kathleen Horan
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Nivesh Jain
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Cassidy Johnson
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Heebal Kim
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
- eGnome, Inc., Seoul, Republic of Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Chul Lee
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Tomas Marques-Bonet
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain
- CNAG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Brian O'Toole
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Simona Secomandi
- Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
| | | | - Tatiana Tilley
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | | | - Marius van den Beek
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, USA
| | - Robert W Williams
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Robert M Waterhouse
- Department of Ecology & Evolution and Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Erich D Jarvis
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA.
| | - Michael C Schatz
- Departments of Biology and Computer Science, Johns Hopkins University, Baltimore, MD, USA.
| | - Anton Nekrutenko
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, USA.
| | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA.
| |
Collapse
|
33
|
Tang S, Peel E, Belov K, Hogg CJ, Farquharson KA. Multi-omics resources for the Australian southern stuttering frog (Mixophyes australis) reveal assorted antimicrobial peptides. Sci Rep 2024; 14:3991. [PMID: 38368484 PMCID: PMC10874372 DOI: 10.1038/s41598-024-54522-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 02/13/2024] [Indexed: 02/19/2024] Open
Abstract
The number of genome-level resources for non-model species continues to rapidly expand. However, frog species remain underrepresented, with up to 90% of frog genera having no genomic or transcriptomic data. Here, we assemble the first genomic and transcriptomic resources for the recently described southern stuttering frog (Mixophyes australis). The southern stuttering frog is ground-dwelling, inhabiting naturally vegetated riverbanks in south-eastern Australia. Using PacBio HiFi long-read sequencing and Hi-C scaffolding, we generated a high-quality genome assembly, with a scaffold N50 of 369.3 Mb and 95.1% of the genome contained in twelve scaffolds. Using this assembly, we identified the mitochondrial genome, and assembled six tissue-specific transcriptomes. We also bioinformatically characterised novel sequences of two families of antimicrobial peptides (AMPs) in the southern stuttering frog, the cathelicidins and β-defensins. While traditional peptidomic approaches to peptide discovery have typically identified one or two AMPs in a frog species from skin secretions, our bioinformatic approach discovered 12 cathelicidins and two β-defensins that were expressed in a range of tissues. We investigated the novelty of the peptides and found diverse predicted activities. Our bioinformatic approach highlights the benefits of multi-omics resources in peptide discovery and contributes valuable genomic resources in an under-represented taxon.
Collapse
Affiliation(s)
- Simon Tang
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia
| | - Emma Peel
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia
| | - Katherine Belov
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia
| | - Carolyn J Hogg
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia.
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia.
| | - Katherine A Farquharson
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia
| |
Collapse
|
34
|
Badger JH, Giordano R, Zimin A, Wappel R, Eskipehlivan SM, Muller S, Donthu R, Soto-Adames F, Vieira P, Zasada I, Goodwin S. Direct sequencing of insect symbionts via nanopore adaptive sampling. CURRENT OPINION IN INSECT SCIENCE 2024; 61:101135. [PMID: 37926187 PMCID: PMC11793409 DOI: 10.1016/j.cois.2023.101135] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 10/16/2023] [Accepted: 10/17/2023] [Indexed: 11/07/2023]
Abstract
Insect symbionts can alter their host phenotype and their effects can range from beneficial to pathogenic. Moreover, many insects exhibit co-infections, making their study more challenging. Less than 1% of insect species have high-quality referenced genomes available and fewer still also have their symbionts sequenced. Two methods are commonly used to sequence symbionts: whole-genome sequencing to concomitantly capture the host and bacterial genomes, or isolation of the symbiont's genome before sequencing. These methods are limited when dealing with rare or poorly characterized symbionts. Long-read technology is an important tool to generate high-quality genomes as they can overcome high levels of heterozygosity, repeat content, and transposable elements that confound short-read methods. Oxford Nanopore (ONT) adaptive sampling allows a sequencing instrument to select or reject sequences in real time. We describe a method based on ONT adaptive sampling (subtractive) approach that readily permitted the sequencing of the complete genomes of mitochondria, Buchnera and its plasmids (pLeu, pTrp), and Wolbachia genomes in two aphid species, Aphis glycines and Pentalonia nigronervosa. Adaptive sampling is able to retrieve organelles such as mitochondria and symbionts that have high representation in their hosts such as Buchnera and Wolbachia, but is less successful at retrieving symbionts in low concentrations.
Collapse
Affiliation(s)
- Jonathan H Badger
- Genetics and Microbiome Core, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Rosanna Giordano
- Institute of Environment, Florida International University, Miami, FL, USA
| | - Aleksey Zimin
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Robert Wappel
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | | | | | - Ravikiran Donthu
- Centre for Life Sciences, Mahindra University, Bahadurpally, Hyderabad 500043, India
| | - Felipe Soto-Adames
- Florida Department of Agriculture and Consumer Services, Department of Plant Industry, Gainesville, FL 32614, USA
| | - Paulo Vieira
- USDA-ARS Agricultural Research Center, Mycology & Nematology Genetic Diversity & Biology Laboratory, Beltsville, MD, USA
| | - Inga Zasada
- USDA-ARS Horticultural Crops Research Laboratory, Corvallis, OR, USA
| | - Sara Goodwin
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| |
Collapse
|
35
|
Schindel DE, Page RMP. Creating Virtuous Cycles for DNA Barcoding: A Case Study in Science Innovation, Entrepreneurship, and Diplomacy. Methods Mol Biol 2024; 2744:7-32. [PMID: 38683309 DOI: 10.1007/978-1-0716-3581-0_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2024]
Abstract
This chapter on the history of the DNA barcoding enterprise attempts to set the stage for the more scholarly contributions in this volume by addressing the following questions. How did the DNA barcoding enterprise begin? What were its goals, how did it develop, and to what degree are its goals being realized? We have taken a keen interest in the barcoding movement and its relationship to taxonomy, collections, and biodiversity informatics more broadly considered. This chapter integrates our two different perspectives on barcoding. DES was the Executive Secretary of the Consortium for the Barcode of Life from 2004 to 2017, with the mission to support the success of DNA barcoding without being directly involved in generating barcode data. RDMP viewed barcoding as an important entry into the landscape of biodiversity data, with many potential linkages to other components of that landscape. We also saw it as a critical step toward the era of international genomic research that was sure to follow. Like the Mercury Program that paved the way for lunar landings by the Apollo Program, we saw DNA barcoding as the proving grounds for the interdisciplinary and international cooperation that would be needed for success of whole-genome research.
Collapse
Affiliation(s)
| | - Roderic M P Page
- School of Biodiversity, One Health, and Veterinary Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK
| |
Collapse
|
36
|
Feldmeyer B, Bornberg-Bauer E, Dohmen E, Fouks B, Heckenhauer J, Huylmans AK, Jones ARC, Stolle E, Harrison MC. Comparative Evolutionary Genomics in Insects. Methods Mol Biol 2024; 2802:473-514. [PMID: 38819569 DOI: 10.1007/978-1-0716-3838-5_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Genome sequencing quality, in terms of both read length and accuracy, is constantly improving. By combining long-read sequencing technologies with various scaffolding techniques, chromosome-level genome assemblies are now achievable at an affordable price for non-model organisms. Insects represent an exciting taxon for studying the genomic underpinnings of evolutionary innovations, due to ancient origins, immense species-richness, and broad phenotypic diversity. Here we summarize some of the most important methods for carrying out a comparative genomics study on insects. We describe available tools and offer concrete tips on all stages of such an endeavor from DNA extraction through genome sequencing, annotation, and several evolutionary analyses. Along the way we describe important insect-specific aspects, such as DNA extraction difficulties or gene families that are particularly difficult to annotate, and offer solutions. We describe results from several examples of comparative genomics analyses on insects to illustrate the fascinating questions that can now be addressed in this new age of genomics research.
Collapse
Affiliation(s)
- Barbara Feldmeyer
- Senckenberg Biodiversity and Climate Research Centre (SBiK-F), Molecular Ecology, Frankfurt, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Elias Dohmen
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Bertrand Fouks
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Jacqueline Heckenhauer
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Germany
- Department of Terrestrial Zoology, Senckenberg Research Institute and Natural History Museum Frankfurt, Frankfurt, Germany
| | - Ann Kathrin Huylmans
- Institute of Organismic and Molecular Evolution, Johannes Gutenberg University, Mainz, Germany
| | - Alun R C Jones
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Eckart Stolle
- Museum Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB), Bonn, Germany
| | - Mark C Harrison
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany.
| |
Collapse
|
37
|
Chen X, Wang Z, Zhang C, Hu J, Lu Y, Zhou H, Mei Y, Cong Y, Guo F, Wang Y, He K, Liu Y, Li F. Unraveling the complex evolutionary history of lepidopteran chromosomes through ancestral chromosome reconstruction and novel chromosome nomenclature. BMC Biol 2023; 21:265. [PMID: 37981687 PMCID: PMC10658929 DOI: 10.1186/s12915-023-01762-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 11/06/2023] [Indexed: 11/21/2023] Open
Abstract
BACKGROUND Lepidoptera is one of the most species-rich animal groups, with substantial karyotype variations among species due to chromosomal rearrangements. Knowledge of the evolutionary patterns of lepidopteran chromosomes still needs to be improved. RESULTS Here, we used chromosome-level genome assemblies of 185 lepidopteran insects to reconstruct an ancestral reference genome and proposed a new chromosome nomenclature. Thus, we renamed over 5000 extant chromosomes with this system, revealing the historical events of chromosomal rearrangements and their features. Additionally, our findings indicate that, compared with autosomes, the Z chromosome in Lepidoptera underwent a fast loss of conserved genes, rapid acquisition of lineage-specific genes, and a low rate of gene duplication. Moreover, we presented evidence that all available 67 W chromosomes originated from a common ancestor chromosome, with four neo-W chromosomes identified, including one generated by fusion with an autosome and three derived through horizontal gene transfer. We also detected nearly 4000 inter-chromosomal gene movement events. Notably, Geminin is transferred from the autosome to the Z chromosome. When located on the autosome, Geminin shows female-biased expression, but on the Z chromosome, it exhibits male-biased expression. This contributes to the sexual dimorphism of body size in silkworms. CONCLUSIONS Our study sheds light on the complex evolutionary history of lepidopteran chromosomes based on ancestral chromosome reconstruction and novel chromosome nomenclature.
Collapse
Affiliation(s)
- Xi Chen
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Zuoqi Wang
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Chaowei Zhang
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Jingheng Hu
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Yueqi Lu
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Hang Zhou
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Yang Mei
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Yuyang Cong
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Fangyuan Guo
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Yaqin Wang
- State Key Laboratory of Rice Biology, Institute of Biotechnology, Zhejiang University, Hangzhou, China
| | - Kang He
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Ying Liu
- Key Laboratory of Green Prevention and Control of Agricultural Transboundary Pests of Yunnan Province and Agricultural Environment/ Agriculture Environment and Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming, China
| | - Fei Li
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China.
| |
Collapse
|
38
|
Liao IJY, Lu TM, Chen ME, Luo YJ. Spiralian genomics and the evolution of animal genome architecture. Brief Funct Genomics 2023; 22:498-508. [PMID: 37507111 DOI: 10.1093/bfgp/elad029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 06/27/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023] Open
Abstract
Recent developments in sequencing technologies have greatly improved our knowledge of phylogenetic relationships and genomic architectures throughout the tree of life. Spiralia, a diverse clade within Protostomia, is essential for understanding the evolutionary history of parasitism, gene conversion, nervous systems and animal body plans. In this review, we focus on the current hypotheses of spiralian phylogeny and investigate the impact of long-read sequencing on the quality of genome assemblies. We examine chromosome-level assemblies to highlight key genomic features that have driven spiralian evolution, including karyotype, synteny and the Hox gene organization. In addition, we show how chromosome rearrangement has influenced spiralian genomic structures. Although spiralian genomes have undergone substantial changes, they exhibit both conserved and lineage-specific features. We recommend increasing sequencing efforts and expanding functional genomics research to deepen insights into spiralian biology.
Collapse
|
39
|
de Jersey AM, Lavers JL, Zosky GR, Rivers-Auty J. The understudied global experiment of pollution's impacts on wildlife and human health: The ethical imperative for interdisciplinary research. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023; 336:122459. [PMID: 37633432 DOI: 10.1016/j.envpol.2023.122459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 08/02/2023] [Accepted: 08/23/2023] [Indexed: 08/28/2023]
Abstract
The global impact of pollution on human and wildlife health is a growing concern. The health impacts of pollution are significant and far-reaching yet poorly understood as no one field of research has the practices and methodologies required to encapsulate the diversity of these consequences. This paper advocates that interdisciplinary research is essential to comprehend the full extent of the impact of pollution. Medical and ecological research play a key role in investigating the health consequences of the pollution crisis, yet the wildlife experience is often neglected. This paper outlines how applying advanced techniques and expertise adapted in medical research to wildlife exposed to pollutants offers a unique perspective to understanding the full diversity of impacts to health. The challenges that impede the progress of this research include the lack of support for interdisciplinary research among funding streams, limitations in field-specific techniques, and a lack of communication between researchers from different disciplines. Of awarded funding from major national research councils across Australia, Europe, and the United States of America, only 0.5% is dedicated to pollution focused research. This is inclusive of laboratory equipment, mitigation strategies, quantification of environmental samples and health consequences research. Of that, 0.03% of funding is awarded to explaining the wildlife experience and documenting the health consequences observed despite being model organisms to environmentally and biologically relevant models for pollution exposure. This calls for a coordinated effort to overcome these hurdles and to promote interdisciplinary research in order to fully comprehend the consequences of pollution exposure and protect the health of humans, wildlife, and the environment. An interdisciplinary approach to this problem is timely given the magnitude of negative health consequences associated with exposure, the number of pollutants already present within the environment and the continual development of new compounds.
Collapse
Affiliation(s)
- Alix M de Jersey
- Tasmanian School of Medicine, College of Health and Medicine, University of Tasmania, Hobart, Tasmania, 7000, Australia
| | - Jennifer L Lavers
- Bird Group, The Natural History Museum, Akeman Street, Tring, Hertfordshire, HP23 6AP, United Kingdom; Esperance Tjaltjraak Native Title Aboriginal Corporation, 11A Shelden Road, Esperance, Western Australia, 6450, Australia.
| | - Graeme R Zosky
- Menzies Institute for Medical Research, College of Health and Medicine, University of Tasmania, Hobart, Tasmania, 7000, Australia
| | - Jack Rivers-Auty
- Tasmanian School of Medicine, College of Health and Medicine, University of Tasmania, Hobart, Tasmania, 7000, Australia
| |
Collapse
|
40
|
Fourreau CJL, Kise H, Santander MD, Pirro S, Maronna MM, Poliseno A, Santos ME, Reimer JD. Genome sizes and repeatome evolution in zoantharians (Cnidaria: Hexacorallia: Zoantharia). PeerJ 2023; 11:e16188. [PMID: 37868064 PMCID: PMC10586311 DOI: 10.7717/peerj.16188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 09/06/2023] [Indexed: 10/24/2023] Open
Abstract
Across eukaryotes, large variations of genome sizes have been observed even between closely related species. Transposable elements as part of the repeated DNA have been proposed and confirmed as one of the most important contributors to genome size variation. However, the evolutionary implications of genome size variation and transposable element dynamics are not well understood. Together with phenotypic traits, they are commonly referred to as the "C-value enigma". The order Zoantharia are benthic cnidarians found from intertidal zones to the deep sea, and some species are particularly abundant in coral reefs. Despite their high ecological relevance, zoantharians have yet to be largely studied from the genomic point of view. This study aims at investigating the role of the repeatome (total content of repeated elements) in genome size variations across the order Zoantharia. To this end, whole-genomes of 32 zoantharian species representing five families were sequenced. Genome sizes were estimated and the abundances of different repeat classes were assessed. In addition, the repeat overlap between species was assessed by a sequence clustering method. The genome sizes in the dataset varied up to 2.4 fold magnitude. Significant correlations between genome size, repeated DNA content and transposable elements, respectively (Pearson's correlation test R2 = 0.47, p = 0.0016; R2 = 0.22, p = 0.05) were found, suggesting their involvement in the dynamics of genome expansion and reduction. In all species, long interspersed nuclear elements and DNA transposons were the most abundant identified elements. These transposable elements also appeared to have had a recent expansion event. This was in contrast to the comparative clustering analysis which revealed species-specific patterns of satellite elements' amplification. In summary, the genome sizes of zoantharians likely result from the complex dynamics of repeated elements. Finally, the majority of repeated elements (up to 70%) could not be annotated to a known repeat class, highlighting the need to further investigate non-model cnidarian genomes. More research is needed to understand how repeated DNA dynamics relate to zoantharian evolution and their biology.
Collapse
Affiliation(s)
- Chloé Julie Loïs Fourreau
- Molecular Invertebrate Systematics and Ecology (MISE) Lab, Graduate School of Engineering and Science, University of the Ryukyus, Nishihara, Okinawa, Japan
| | - Hiroki Kise
- Molecular Invertebrate Systematics and Ecology (MISE) Lab, Graduate School of Engineering and Science, University of the Ryukyus, Nishihara, Okinawa, Japan
- AIST Tsukuba Central, Geological Survey of Japan, National Institute of Advanced Industrial Science and Technology, Tsukuba, Ibaraki, Japan
| | - Mylena Daiana Santander
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | - Stacy Pirro
- Iridian Genomes, Bethesda, United States of America
| | - Maximiliano M. Maronna
- Molecular Invertebrate Systematics and Ecology (MISE) Lab, Graduate School of Engineering and Science, University of the Ryukyus, Nishihara, Okinawa, Japan
- Faculdade de Ciências, Universidade Estadual Paulista (UNESP), Bauru, Brazil
| | - Angelo Poliseno
- Molecular Invertebrate Systematics and Ecology (MISE) Lab, Graduate School of Engineering and Science, University of the Ryukyus, Nishihara, Okinawa, Japan
| | - Maria E.A. Santos
- Molecular Invertebrate Systematics and Ecology (MISE) Lab, Graduate School of Engineering and Science, University of the Ryukyus, Nishihara, Okinawa, Japan
- Okinawa Institute of Science and Technology, Onna, Okinawa, Japan
| | - James Davis Reimer
- Molecular Invertebrate Systematics and Ecology (MISE) Lab, Graduate School of Engineering and Science, University of the Ryukyus, Nishihara, Okinawa, Japan
- Tropical Biosphere Research Center, University of the Ryukyus, Nishihara, Okinawa, United States of America
| |
Collapse
|
41
|
Marks RA, Farrant JM. Editorial: Women in plant biotechnology 2022. FRONTIERS IN PLANT SCIENCE 2023; 14:1292441. [PMID: 37900758 PMCID: PMC10613046 DOI: 10.3389/fpls.2023.1292441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 09/14/2023] [Indexed: 10/31/2023]
Affiliation(s)
- Rose A. Marks
- Department of Horticulture, Michigan State University, East Lansing, MI, United States
- Plant Resilience Institute, Michigan State University, East Lansing, MI, United States
| | - Jill M. Farrant
- Department of Molecular and Cell Biology, University of Cape Town, Rondebosch, South Africa
| |
Collapse
|
42
|
Vertacnik KL, Herrig DK, Godfrey RK, Hill T, Geib SM, Unckless RL, Nelson DR, Linnen CR. Evolution of five environmentally responsive gene families in a pine-feeding sawfly, Neodiprion lecontei (Hymenoptera: Diprionidae). Ecol Evol 2023; 13:e10506. [PMID: 37791292 PMCID: PMC10542623 DOI: 10.1002/ece3.10506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 07/17/2023] [Accepted: 07/21/2023] [Indexed: 10/05/2023] Open
Abstract
A central goal in evolutionary biology is to determine the predictability of adaptive genetic changes. Despite many documented cases of convergent evolution at individual loci, little is known about the repeatability of gene family expansions and contractions. To address this void, we examined gene family evolution in the redheaded pine sawfly Neodiprion lecontei, a noneusocial hymenopteran and exemplar of a pine-specialized lineage evolved from angiosperm-feeding ancestors. After assembling and annotating a draft genome, we manually annotated multiple gene families with chemosensory, detoxification, or immunity functions before characterizing their genomic distributions and molecular evolution. We find evidence of recent expansions of bitter gustatory receptor, clan 3 cytochrome P450, olfactory receptor, and antimicrobial peptide subfamilies, with strong evidence of positive selection among paralogs in a clade of gustatory receptors possibly involved in the detection of bitter compounds. In contrast, these gene families had little evidence of recent contraction via pseudogenization. Overall, our results are consistent with the hypothesis that in response to novel selection pressures, gene families that mediate ecological interactions may expand and contract predictably. Testing this hypothesis will require the comparative analysis of high-quality annotation data from phylogenetically and ecologically diverse insect species and functionally diverse gene families. To this end, increasing sampling in under-sampled hymenopteran lineages and environmentally responsive gene families and standardizing manual annotation methods should be prioritized.
Collapse
Affiliation(s)
- Kim L. Vertacnik
- Department of EntomologyUniversity of KentuckyLexingtonKentuckyUSA
| | | | - R. Keating Godfrey
- McGuire Center for Lepidoptera and Biodiversity, University of FloridaGainesvilleFloridaUSA
| | - Tom Hill
- National Institute of Allergy and Infectious DiseasesBethesdaMarylandUSA
| | - Scott M. Geib
- Tropical Crop and Commodity Protection Research UnitUnited States Department of Agriculture: Agriculture Research Service Pacific Basin Agricultural Research CenterHiloHawaiiUSA
| | - Robert L. Unckless
- Department of Molecular BiosciencesUniversity of KansasLawrenceKansasUSA
| | - David R. Nelson
- Department of Microbiology, Immunology and BiochemistryUniversity of Tennessee Health Science CenterMemphisTennesseeUSA
| | | |
Collapse
|
43
|
Sproul JS, Hotaling S, Heckenhauer J, Powell A, Marshall D, Larracuente AM, Kelley JL, Pauls SU, Frandsen PB. Analyses of 600+ insect genomes reveal repetitive element dynamics and highlight biodiversity-scale repeat annotation challenges. Genome Res 2023; 33:1708-1717. [PMID: 37739812 PMCID: PMC10691545 DOI: 10.1101/gr.277387.122] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 09/20/2023] [Indexed: 09/24/2023]
Abstract
Repetitive elements (REs) are integral to the composition, structure, and function of eukaryotic genomes, yet remain understudied in most taxonomic groups. We investigated REs across 601 insect species and report wide variation in RE dynamics across groups. Analysis of associations between REs and protein-coding genes revealed dynamic evolution at the interface between REs and coding regions across insects, including notably elevated RE-gene associations in lineages with abundant long interspersed nuclear elements (LINEs). We leveraged this large, empirical data set to quantify impacts of long-read technology on RE detection and investigate fundamental challenges to RE annotation in diverse groups. In long-read assemblies, we detected ∼36% more REs than short-read assemblies, with long terminal repeats (LTRs) showing 162% increased detection, whereas DNA transposons and LINEs showed less respective technology-related bias. In most insect lineages, 25%-85% of repetitive sequences were "unclassified" following automated annotation, compared with only ∼13% in Drosophila species. Although the diversity of available insect genomes has rapidly expanded, we show the rate of community contributions to RE databases has not kept pace, preventing efficient annotation and high-resolution study of REs in most groups. We highlight the tremendous opportunity and need for the biodiversity genomics field to embrace REs and suggest collective steps for making progress toward this goal.
Collapse
Affiliation(s)
- John S Sproul
- Department of Biology, Brigham Young University, Provo, Utah 84602, USA;
- Department of Biology, University of Nebraska Omaha, Omaha, Nebraska 68182, USA
- Department of Biology, University of Rochester, Rochester, New York 14627, USA
| | - Scott Hotaling
- School of Biological Sciences, Washington State University, Pullman, Washington 99163, USA
- Department of Watershed Sciences, Utah State University, Logan, Utah 84322, USA
| | - Jacqueline Heckenhauer
- LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
- Senckenberg Research Institute and Natural History Museum Frankfurt, 60325 Frankfurt, Germany
| | - Ashlyn Powell
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah 84602, USA
| | - Dez Marshall
- Department of Biology, University of Nebraska Omaha, Omaha, Nebraska 68182, USA
| | | | - Joanna L Kelley
- School of Biological Sciences, Washington State University, Pullman, Washington 99163, USA
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Steffen U Pauls
- LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
- Senckenberg Research Institute and Natural History Museum Frankfurt, 60325 Frankfurt, Germany
- Department of Insect Biotechnology, Justus-Liebig-University Gießen, 35392 Gießen, Germany
| | - Paul B Frandsen
- LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah 84602, USA
- Data Science Lab, Smithsonian Institution, Washington, District of Columbia 20560, USA
| |
Collapse
|
44
|
Montanari S, Deng C, Koot E, Bassil NV, Zurn JD, Morrison-Whittle P, Worthington ML, Aryal R, Ashrafi H, Pradelles J, Wellenreuther M, Chagné D. A multiplexed plant-animal SNP array for selective breeding and species conservation applications. G3 (BETHESDA, MD.) 2023; 13:jkad170. [PMID: 37565490 PMCID: PMC10542201 DOI: 10.1093/g3journal/jkad170] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 05/15/2023] [Accepted: 06/30/2023] [Indexed: 08/12/2023]
Abstract
Reliable and high-throughput genotyping platforms are of immense importance for identifying and dissecting genomic regions controlling important phenotypes, supporting selection processes in breeding programs, and managing wild populations and germplasm collections. Amongst available genotyping tools, single nucleotide polymorphism arrays have been shown to be comparatively easy to use and generate highly accurate genotypic data. Single-species arrays are the most commonly used type so far; however, some multi-species arrays have been developed for closely related species that share single nucleotide polymorphism markers, exploiting inter-species cross-amplification. In this study, the suitability of a multiplexed plant-animal single nucleotide polymorphism array, including both closely and distantly related species, was explored. The performance of the single nucleotide polymorphism array across species for diverse applications, ranging from intra-species diversity assessments to parentage analysis, was assessed. Moreover, the value of genotyping pooled DNA of distantly related species on the single nucleotide polymorphism array as a technique to further reduce costs was evaluated. Single nucleotide polymorphism performance was generally high, and species-specific single nucleotide polymorphisms proved suitable for diverse applications. The multi-species single nucleotide polymorphism array approach reported here could be transferred to other species to achieve cost savings resulting from the increased throughput when several projects use the same array, and the pooling technique adds another highly promising advancement to additionally decrease genotyping costs by half.
Collapse
Affiliation(s)
- Sara Montanari
- The New Zealand Institute for Plant and Food Research Ltd, Motueka 7198, New Zealand
| | - Cecilia Deng
- The New Zealand Institute for Plant and Food Research Ltd, Auckland 1025, New Zealand
| | - Emily Koot
- The New Zealand Institute for Plant and Food Research Ltd, Palmerston North 4410, New Zealand
| | - Nahla V Bassil
- USDA-ARS National Clonal Germplasm Repository, Corvallis, OR 97333, USA
| | - Jason D Zurn
- Department of Plant Pathology, Kansas State University, Manhattan, KS 66506, USA
| | | | | | - Rishi Aryal
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27695, USA
| | - Hamid Ashrafi
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27695, USA
| | | | - Maren Wellenreuther
- The New Zealand Institute for Plant and Food Research Ltd, Nelson 7010, New Zealand
- School of Biological Sciences, University of Auckland, Auckland 1010, New Zealand
| | - David Chagné
- The New Zealand Institute for Plant and Food Research Ltd, Palmerston North 4410, New Zealand
| |
Collapse
|
45
|
Bornstein K, Gryan G, Chang ES, Marchler-Bauer A, Schneider VA. The NIH Comparative Genomics Resource: addressing the promises and challenges of comparative genomics on human health. BMC Genomics 2023; 24:575. [PMID: 37759191 PMCID: PMC10523801 DOI: 10.1186/s12864-023-09643-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Accepted: 08/31/2023] [Indexed: 09/29/2023] Open
Abstract
Comparative genomics is the comparison of genetic information within and across organisms to understand the evolution, structure, and function of genes, proteins, and non-coding regions (Sivashankari and Shanmughavel, Bioinformation 1:376-8, 2007). Advances in sequencing technology and assembly algorithms have resulted in the ability to sequence large genomes and provided a wealth of data that are being used in comparative genomic analyses. Comparative analysis can be leveraged to systematically explore and evaluate the biological relationships and evolution between species, aid in understanding the structure and function of genes, and gain a better understanding of disease and potential drug targets. As our knowledge of genetics expands, comparative genomics can help identify emerging model organisms among a broader span of the tree of life, positively impacting human health. This impact includes, but is not limited to, zoonotic disease research, therapeutics development, microbiome research, xenotransplantation, oncology, and toxicology. Despite advancements in comparative genomics, new challenges have arisen around the quantity, quality assurance, annotation, and interoperability of genomic data and metadata. New tools and approaches are required to meet these challenges and fulfill the needs of researchers. This paper focuses on how the National Institutes of Health (NIH) Comparative Genomics Resource (CGR) can address both the opportunities for comparative genomics to further impact human health and confront an increasingly complex set of challenges facing researchers.
Collapse
Affiliation(s)
| | - Gary Gryan
- The MITRE Corporation, 7525 Colshire Dr, McLean, VA, USA
| | - E Sally Chang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Aron Marchler-Bauer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Valerie A Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
| |
Collapse
|
46
|
Pinto BJ, Gamble T, Smith CH, Keating SE, Havird JC, Chiari Y. The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly. J Hered 2023; 114:513-520. [PMID: 36869788 PMCID: PMC10445513 DOI: 10.1093/jhered/esad016] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 02/28/2023] [Indexed: 03/05/2023] Open
Abstract
Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. Of the 23 chromosome-scale reference genomes across the order, only 12 of the ~60 squamate families are represented. Within geckos (infraorder Gekkota), a species-rich clade of lizards, chromosome-level genomes are exceptionally sparse representing only two of the seven extant families. Using the latest advances in genome sequencing and assembly methods, we generated one of the highest-quality squamate genomes to date for the leopard gecko, Eublepharis macularius (Eublepharidae). We compared this assembly to the previous, short-read only, E. macularius reference genome published in 2016 and examined potential factors within the assembly influencing contiguity of genome assemblies using PacBio HiFi data. Briefly, the read N50 of the PacBio HiFi reads generated for this study was equal to the contig N50 of the previous E. macularius reference genome at 20.4 kilobases. The HiFi reads were assembled into a total of 132 contigs, which was further scaffolded using HiC data into 75 total sequences representing all 19 chromosomes. We identified 9 of the 19 chromosomal scaffolds were assembled as a near-single contig, whereas the other 10 chromosomes were each scaffolded together from multiple contigs. We qualitatively identified that the percent repeat content within a chromosome broadly affects its assembly contiguity prior to scaffolding. This genome assembly signifies a new age for squamate genomics where high-quality reference genomes rivaling some of the best vertebrate genome assemblies can be generated for a fraction of previous cost estimates. This new E. macularius reference assembly is available on NCBI at JAOPLA010000000.
Collapse
Affiliation(s)
- Brendan J Pinto
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA
- Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, USA
| | - Tony Gamble
- Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, USA
- Department of Biological Sciences, Marquette University, Milwaukee WI, USA
- Bell Museum of Natural History, University of Minnesota, St Paul, MN, USA
| | - Chase H Smith
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| | - Shannon E Keating
- Department of Biological Sciences, Marquette University, Milwaukee WI, USA
| | - Justin C Havird
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| | - Ylenia Chiari
- Department of Biology, George Mason University, Fairfax, VA, USA
| |
Collapse
|
47
|
Pinto BJ, Gamble T, Smith CH, Wilson MA. A lizard is never late: Squamate genomics as a recent catalyst for understanding sex chromosome and microchromosome evolution. J Hered 2023; 114:445-458. [PMID: 37018459 PMCID: PMC10445521 DOI: 10.1093/jhered/esad023] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 04/03/2023] [Indexed: 04/07/2023] Open
Abstract
In 2011, the first high-quality genome assembly of a squamate reptile (lizard or snake) was published for the green anole. Dozens of genome assemblies were subsequently published over the next decade, yet these assemblies were largely inadequate for answering fundamental questions regarding genome evolution in squamates due to their lack of contiguity or annotation. As the "genomics age" was beginning to hit its stride in many organismal study systems, progress in squamates was largely stagnant following the publication of the green anole genome. In fact, zero high-quality (chromosome-level) squamate genomes were published between the years 2012 and 2017. However, since 2018, an exponential increase in high-quality genome assemblies has materialized with 24 additional high-quality genomes published for species across the squamate tree of life. As the field of squamate genomics is rapidly evolving, we provide a systematic review from an evolutionary genomics perspective. We collated a near-complete list of publicly available squamate genome assemblies from more than half-a-dozen international and third-party repositories and systematically evaluated them with regard to their overall quality, phylogenetic breadth, and usefulness for continuing to provide accurate and efficient insights into genome evolution across squamate reptiles. This review both highlights and catalogs the currently available genomic resources in squamates and their ability to address broader questions in vertebrates, specifically sex chromosome and microchromosome evolution, while addressing why squamates may have received less historical focus and has caused their progress in genomics to lag behind peer taxa.
Collapse
Affiliation(s)
- Brendan J Pinto
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, United States
- Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, United States
| | - Tony Gamble
- Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, United States
- Department of Biological Sciences, Marquette University, Milwaukee, WI, United States
- Bell Museum of Natural History, University of Minnesota, St Paul, MN, United States
| | - Chase H Smith
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, United States
| | - Melissa A Wilson
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, United States
- Center for Mechanisms of Evolution, Biodesign Institute, Tempe, AZ, United States
| |
Collapse
|
48
|
Cunha TJ, de Medeiros BAS, Lord A, Sørensen MV, Giribet G. Rampant loss of universal metazoan genes revealed by a chromosome-level genome assembly of the parasitic Nematomorpha. Curr Biol 2023; 33:3514-3521.e4. [PMID: 37467752 DOI: 10.1016/j.cub.2023.07.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 05/21/2023] [Accepted: 07/03/2023] [Indexed: 07/21/2023]
Abstract
Parasites may manipulate host behavior to increase the odds of transmission or to reach the proper environment to complete their life cycle.1,2 Members of the phylum Nematomorpha (known as horsehair worms, hairworms, or Gordian worms) are large endoparasites that affect the behavior of their arthropod hosts. In terrestrial hosts, they cause erratic movements toward bodies of water,3,4,5,6 where the adult worm emerges from the host to find mates for reproduction. We present a chromosome-level genome assembly for the freshwater Acutogordius australiensis and a draft assembly for one of the few known marine species, Nectonema munidae. The assemblies span 201 Mbp and 213 Mbp in length (N50: 38 Mbp and 716 Kbp), respectively, and reveal four chromosomes in Acutogordius, which are largely rearranged compared to the inferred ancestral condition in animals. Both nematomorph genomes have a relatively low number of genes (11,114 and 8,717, respectively) and lack a high proportion (∼30%) of universal single-copy metazoan orthologs (BUSCO genes7). We demonstrate that missing genes are not an artifact of the assembly process, with the majority of missing orthologs being shared by the two independent assemblies. Missing BUSCOs are enriched for Gene Ontology (GO) terms associated with the organization of cilia and cell projections in other animals. We show that most cilium-related genes conserved across eukaryotes have been lost in Nematomorpha, providing a molecular basis for the suspected absence of ciliary structures in these animals.
Collapse
Affiliation(s)
- Tauana J Cunha
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA; Field Museum of Natural History, 1400 S DuSable Lake Shore Drive, Chicago, IL 60605, USA.
| | - Bruno A S de Medeiros
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA; Field Museum of Natural History, 1400 S DuSable Lake Shore Drive, Chicago, IL 60605, USA
| | - Arianna Lord
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| | - Martin V Sørensen
- Natural History Museum of Denmark, University of Copenhagen, Universitetsparken 15, DK-2100 Copenhagen, Denmark
| | - Gonzalo Giribet
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| |
Collapse
|
49
|
Umu SU, Paynter VM, Trondsen H, Buschmann T, Rounge TB, Peterson KJ, Fromm B. Accurate microRNA annotation of animal genomes using trained covariance models of curated microRNA complements in MirMachine. CELL GENOMICS 2023; 3:100348. [PMID: 37601971 PMCID: PMC10435380 DOI: 10.1016/j.xgen.2023.100348] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 03/15/2023] [Accepted: 05/26/2023] [Indexed: 08/22/2023]
Abstract
The annotation of microRNAs depends on the availability of transcriptomics data and expert knowledge. This has led to a gap between the availability of novel genomes and high-quality microRNA complements. Using >16,000 microRNAs from the manually curated microRNA gene database MirGeneDB, we generated trained covariance models for all conserved microRNA families. These models are available in our tool MirMachine, which annotates conserved microRNAs within genomes. We successfully applied MirMachine to a range of animal species, including those with large genomes and genome duplications and extinct species, where small RNA sequencing is hard to achieve. We further describe a microRNA score of expected microRNAs that can be used to assess the completeness of genome assemblies. MirMachine closes a long-persisting gap in the microRNA field by facilitating automated genome annotation pipelines and deeper studies into the evolution of genome regulation, even in extinct organisms.
Collapse
Affiliation(s)
- Sinan Uğur Umu
- Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Vanessa M. Paynter
- The Arctic University Museum of Norway, UiT - The Arctic University of Norway, Tromsø, Norway
| | - Håvard Trondsen
- Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | | | - Trine B. Rounge
- Department of Research, Cancer Registry of Norway, Oslo, Norway
- Centre for Bioinformatics, Department of Pharmacy, University of Oslo, Oslo, Norway
| | - Kevin J. Peterson
- Department of Biological Sciences, Dartmouth College, Hanover, NH, USA
| | - Bastian Fromm
- The Arctic University Museum of Norway, UiT - The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
50
|
Xie H, Chen Z, Pang S, Du W. Efficient and Highly Continuous Chromosome-Level Genome Assembly of the First Chameleon Genome. Genome Biol Evol 2023; 15:evad131. [PMID: 37481259 PMCID: PMC10410292 DOI: 10.1093/gbe/evad131] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 07/06/2023] [Accepted: 07/10/2023] [Indexed: 07/24/2023] Open
Abstract
Most amniote genomes are diploid and moderate in size (∼1-6 Gbp) and contain a large proportion of repetitive sequences. The development of next-generation sequencing technology, especially the emergence of high-fidelity (HiFi) long-read data, has made it feasible to resolve high-quality genome assembly for nonmodel species efficiently. However, reference genomes for squamate reptiles have lagged behind other amniote lineages. Here, we de novo assembled the first genome from the Chameleonidae family, the panther chameleon (Furcifer pardalis). We obtained telomere-to-telomere contigs using only HiFi data, reaching a contig N50 of 158.72 Mbp. The final chromosome-level assembly is 1.61 Gbp in size, and 100% of primary contigs were placed to pseudochromosomes using Hi-C interaction data. We also found that sequencing depth > 30 folds can ensure both the integrity and accuracy of the genome, whereas insufficient depth led to false increase in genome size and proportion of duplicated genes. We provide a high-quality reference genome valuable for evolutionary and ecological studies in chameleons as well as provide comparative genomic resources for squamate reptiles.
Collapse
Affiliation(s)
- Hongxin Xie
- Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Zixuan Chen
- College of Biological Sciences, China Agricultural University, Beijing, China
| | | | - Weiguo Du
- Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|